Abstract
The promise of Self-Service Business Intelligence (BI) is its ability to give business users access to selection, analysis, and reporting tools without requiring intervention from IT. This is essential if BI is to maximize its contribution by radically transforming how people make decisions. However, while some progress has been made through tools such as SAS Enterprise Miner, IBM SPSS Modeler, and RapidMiner, analytical modeling remains firmly in the domain of IT departments and data scientists. The development of tools that mitigate the need for modeling expertise remains the “missing link” in self-service BI, but prior attempts at developing modeling languages for non-technical audiences have not been widely implemented. By introducing a structured methodology for model formulation specifically designed for practitioners, this paper fills the unmet need to bring model-building to a mainstream business audience. The paper also shows how to build a dimensional Model Management Warehouse that supports the proposed methodology, and demonstrates the viability of this approach by applying it to a problem faced by the Division of Fiscal and Actuarial Services of the US Department of Labor. The paper concludes by outlining several areas for future research.
Similar content being viewed by others
Notes
There are 53 UI jurisdictions. They include the 50 states plus the District of Columbia, the Virgin Islands, and Puerto Rico. State is used here to refer to a UI jurisdiction.
References
Blair, D. C. (2002). The data-document distinction revisited. The DATA BASE for Advances in Information Systems, 37(1), 77–96.
Box, G. E. P., & Draper, N. R. (1987). Empirical model building and response surfaces. New York: Wiley.
Brooke, A., Kendrick, D., & Meeraus, A. (1988). GAMS: a user's guide. Redwood City: Scientific Press.
Burnham, K. P., & Anderson, D. R. (2002). Model selection and multimodel inference: a practical information-theoretic approach (2nd ed.). New York: Springer.
Choobineh, J. (1991). SQLMP: a data sublanguage for representation and formulation of linear mathematical models. ORSA Journal on Computing, 3(4), 358–375.
Cox, D. R. (1995). Comment on “model uncertainty, data mining and statistical inference”. Journal of the Royal Statistical Society: Series A (Statistics in Society), 158(3), 455–456.
Cunningham, K., & Schrage, L. (2004). The LINGO Algebraic Modeling Language. In J. Kallrath (Ed.), Modeling languages in mathematical optimization (pp. 159-171). Kluwer Academic Publishers.
Davenport, T. H. (2013). Telling a Story with Data. Deloitte Review, 12.
Davenport, T. H., & Kim, J. (2013). Keeping up with the quants. Harvard Business Review Press.
Davenport, T. H., Harris, J. G., De Long, D. W., & Jacobson, A. L. (2001). Data to knowledge to results: building an analytic capability. California Management Review, 43(2), 116–138.
Deokar, A., & El-Gayar, O. F. (2011). Decision-enabled dynamic process management for networked enterprises. Information Systems Frontiers, 13(5), 655–688.
El-Gayar, O. F., & Deokar, A. (2013). A semantic service-oriented architecture for distributed model management systems. Decision Support Systems, 55(1), 374–384.
Finlay, P. N., & Mitchell, A. C. (1994). Perceptions of the benefits from the introduction of CASE: an empirical study. MIS Quarterly, 19(4), 353–370.
Fourer, R., Gay, D. M., & Kernighan, B. W. (1990). A modeling language for mathematical programming. Management Science, 36(5), 519–554.
Geoffrion, A. M. (1987). An introduction to structured modeling. Management Science, 33(5), 547–588.
Geoffrion, A. M. (1989). The formal aspects of structured modeling. Operations Research, 37(1), 30–51.
Guazzelli, A., Zeller, M., Lin, W-C., & Williams, G. (2009). PMML: an open standard for sharing models. The R Journal, 1(1), 60. http://journal.r-project.org
Guinan, P. J., Cooprider, J. G., & Sawyer, S. (1997). The effective use of automated application development tools. IBM Systems Journal, 36(1), 124–139.
Hampton, J. (2011). SEMMA and CRISP-DM: data mining methodologies. JessHampton.com http://jesshampton.com/2011/02/16/semma-and-crisp-dm-data-mining-methodologies. Accessed 21 July 2016.
HBR Analytic Services (2012). The evolution of decision making: how leading organizations are adopting a data-driven culture. Harvard Business Review. https://hbr.org/resources/pdfs/tools/17568_HBR_SAS%20Report_webview.pdf. Accessed 18 September 2015.
Henschen, D. (2014). IBM Watson analytics goes public. InformationWeek. http://www.informationweek.com/big-data/big-data-analytics/ibm-watson-analytics-goes-public/d/d-id/1317887. Accessed 14 February 2015.
Hevner, A. R., March, S. T., Park, J., & Ram, S. (2004). Design science in information systems research. MIS Quarterly, 28(1), 77–105.
Iivari, J. (1996). Why Are CASE tools not used? Communications of the ACM, 39(10), 94–103.
Kimball, R. (1997). A dimensional modeling manifesto. Database Magazine, 10(9), 59–78.
Kottemann, J. E., & Dolk, D. R. (1992). Model integration and modeling languages. Information Systems Research, 3(1), 1–16.
Kridel, D., & Dolk, D. (2013). Automated self-service modeling: predictive analytics as a service. Information Systems and E-Business Management, 11(1), 119–140.
Lending, D., & Chervany, H. L. (1998). CASE tools: understanding the reasons for non-use. Computer Personnel, 19(2), 13–26.
Lin, E., Schuff, D., & St. Louis, R. (2000). Subscript-free modeling languages: a tool for facilitating the formulation and use of models. European Journal of Operational Research, 123(3), 614–627.
Logi Analytics (2014) State of Self-Service BI Report. http://images.learn.logixml.com/Web/LogiAnalyticsInc/%7B7c21cd62-221c-44af-9ecd-a35265bc8e34%7D_LogiAnalytics-2014StateOfSelfService-Artwork-1028.pdf. Accessed 8 February 2015.
Lundell, B., & Lings, B. (2004). Changing perceptions of CASE technology. Journal of Systems and Software, 72(2), 271–280.
Madhusudan, T. (2007). A web services framework for distributed model management. Information Systems Frontiers, 9(1), 9–27.
Maslow, A. H. (1966). The psychology of science. Chicago: J. Dewey Society.
McMurtrey, M. E., Grover, V., Teng, J. T. C., & Lightner, N. J. (2002). Job satisfaction of information technology workers: the impact of career orientation and task automation in a CASE environment. Journal of Management Information Systems, 19(2), 273–302.
Murphy, F. H., Stohr, E. A., & Asthana, A. (1992). Representation schemes for linear programming models. Management Science, 38(7), 964–991.
Object Management Group (2003). Common Warehouse Metamodel (CWM) Specification. http://www.omg.org/spec/CWM/1.1/PDF/. Accessed 24 April 2015.
Pack, D. J. (1987). A practical overview of ARIMA models for time series forecasting. In S. G. Makridakis & S. C. Wheelwright (Eds.), The handbook of forecasting: a managers guide (pp. 196–218). New York: Wiley.
Pechter, R. (2011). PMML conformance progress report – five years later. In Proceedings of PMML’11 (pp. 6–15). New York: ACM Press.
Powell, S. R. (2015). Summary of state models. Unpublished working paper. New Brunswick, NJ: John J. Heldrich Center for Workforce Development.
Power, D., & Sharda, R. (2007). Model-driven decision support systems: concepts and research directions. Decision Support Systems, 43(3), 1044–1061.
Rohanizadeh, S., & Moghadam, M. (2009). A proposed data mining methodology and its application to industrial procedures. Journal of Industrial Engineering, 4, 37–50.
SAS Institute (1998). Data Mining and the Case for Sampling. SAS Institute Best Practices Paper, Carey, NC.
Senn, J. A., & Wynekoop, J. L. (1995). The other side of CASE implementation. Information Systems Management, 12(4), 7.
Sottara, D., Mello, P., Sartori, C., & Fry, E. (2011). Enhancing a production rule engine with predictive models using PMML. In Proceedings of PMML’11 (pp. 39–47). New York: ACM Press.
Wit, E., van den Heuvel, E., & Romeijn, J.-W. (2012). ‘All models are wrong…’: an introduction to model uncertainty. Statistica Neerlandica, 66(3), 217–236.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Schuff, D., Corral, K., St. Louis, R.D. et al. Enabling self-service BI: A methodology and a case study for a model management warehouse. Inf Syst Front 20, 275–288 (2018). https://doi.org/10.1007/s10796-016-9722-2
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10796-016-9722-2