Skip to main content
Log in

Enabling self-service BI: A methodology and a case study for a model management warehouse

  • Published:
Information Systems Frontiers Aims and scope Submit manuscript

Abstract

The promise of Self-Service Business Intelligence (BI) is its ability to give business users access to selection, analysis, and reporting tools without requiring intervention from IT. This is essential if BI is to maximize its contribution by radically transforming how people make decisions. However, while some progress has been made through tools such as SAS Enterprise Miner, IBM SPSS Modeler, and RapidMiner, analytical modeling remains firmly in the domain of IT departments and data scientists. The development of tools that mitigate the need for modeling expertise remains the “missing link” in self-service BI, but prior attempts at developing modeling languages for non-technical audiences have not been widely implemented. By introducing a structured methodology for model formulation specifically designed for practitioners, this paper fills the unmet need to bring model-building to a mainstream business audience. The paper also shows how to build a dimensional Model Management Warehouse that supports the proposed methodology, and demonstrates the viability of this approach by applying it to a problem faced by the Division of Fiscal and Actuarial Services of the US Department of Labor. The paper concludes by outlining several areas for future research.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

Notes

  1. There are 53 UI jurisdictions. They include the 50 states plus the District of Columbia, the Virgin Islands, and Puerto Rico. State is used here to refer to a UI jurisdiction.

References

  • Blair, D. C. (2002). The data-document distinction revisited. The DATA BASE for Advances in Information Systems, 37(1), 77–96.

    Article  Google Scholar 

  • Box, G. E. P., & Draper, N. R. (1987). Empirical model building and response surfaces. New York: Wiley.

    Google Scholar 

  • Brooke, A., Kendrick, D., & Meeraus, A. (1988). GAMS: a user's guide. Redwood City: Scientific Press.

    Google Scholar 

  • Burnham, K. P., & Anderson, D. R. (2002). Model selection and multimodel inference: a practical information-theoretic approach (2nd ed.). New York: Springer.

    Google Scholar 

  • Choobineh, J. (1991). SQLMP: a data sublanguage for representation and formulation of linear mathematical models. ORSA Journal on Computing, 3(4), 358–375.

    Article  Google Scholar 

  • Cox, D. R. (1995). Comment on “model uncertainty, data mining and statistical inference”. Journal of the Royal Statistical Society: Series A (Statistics in Society), 158(3), 455–456.

    Google Scholar 

  • Cunningham, K., & Schrage, L. (2004). The LINGO Algebraic Modeling Language. In J. Kallrath (Ed.), Modeling languages in mathematical optimization (pp. 159-171). Kluwer Academic Publishers.

  • Davenport, T. H. (2013). Telling a Story with Data. Deloitte Review, 12.

  • Davenport, T. H., & Kim, J. (2013). Keeping up with the quants. Harvard Business Review Press.

  • Davenport, T. H., Harris, J. G., De Long, D. W., & Jacobson, A. L. (2001). Data to knowledge to results: building an analytic capability. California Management Review, 43(2), 116–138.

    Article  Google Scholar 

  • Deokar, A., & El-Gayar, O. F. (2011). Decision-enabled dynamic process management for networked enterprises. Information Systems Frontiers, 13(5), 655–688.

    Article  Google Scholar 

  • El-Gayar, O. F., & Deokar, A. (2013). A semantic service-oriented architecture for distributed model management systems. Decision Support Systems, 55(1), 374–384.

    Article  Google Scholar 

  • Finlay, P. N., & Mitchell, A. C. (1994). Perceptions of the benefits from the introduction of CASE: an empirical study. MIS Quarterly, 19(4), 353–370.

    Article  Google Scholar 

  • Fourer, R., Gay, D. M., & Kernighan, B. W. (1990). A modeling language for mathematical programming. Management Science, 36(5), 519–554.

    Article  Google Scholar 

  • Geoffrion, A. M. (1987). An introduction to structured modeling. Management Science, 33(5), 547–588.

    Article  Google Scholar 

  • Geoffrion, A. M. (1989). The formal aspects of structured modeling. Operations Research, 37(1), 30–51.

    Article  Google Scholar 

  • Guazzelli, A., Zeller, M., Lin, W-C., & Williams, G. (2009). PMML: an open standard for sharing models. The R Journal, 1(1), 60. http://journal.r-project.org

  • Guinan, P. J., Cooprider, J. G., & Sawyer, S. (1997). The effective use of automated application development tools. IBM Systems Journal, 36(1), 124–139.

    Article  Google Scholar 

  • Hampton, J. (2011). SEMMA and CRISP-DM: data mining methodologies. JessHampton.com http://jesshampton.com/2011/02/16/semma-and-crisp-dm-data-mining-methodologies. Accessed 21 July 2016.

  • HBR Analytic Services (2012). The evolution of decision making: how leading organizations are adopting a data-driven culture. Harvard Business Review. https://hbr.org/resources/pdfs/tools/17568_HBR_SAS%20Report_webview.pdf. Accessed 18 September 2015.

  • Henschen, D. (2014). IBM Watson analytics goes public. InformationWeek. http://www.informationweek.com/big-data/big-data-analytics/ibm-watson-analytics-goes-public/d/d-id/1317887. Accessed 14 February 2015.

  • Hevner, A. R., March, S. T., Park, J., & Ram, S. (2004). Design science in information systems research. MIS Quarterly, 28(1), 77–105.

    Article  Google Scholar 

  • Iivari, J. (1996). Why Are CASE tools not used? Communications of the ACM, 39(10), 94–103.

    Article  Google Scholar 

  • Kimball, R. (1997). A dimensional modeling manifesto. Database Magazine, 10(9), 59–78.

    Google Scholar 

  • Kottemann, J. E., & Dolk, D. R. (1992). Model integration and modeling languages. Information Systems Research, 3(1), 1–16.

    Article  Google Scholar 

  • Kridel, D., & Dolk, D. (2013). Automated self-service modeling: predictive analytics as a service. Information Systems and E-Business Management, 11(1), 119–140.

    Article  Google Scholar 

  • Lending, D., & Chervany, H. L. (1998). CASE tools: understanding the reasons for non-use. Computer Personnel, 19(2), 13–26.

    Article  Google Scholar 

  • Lin, E., Schuff, D., & St. Louis, R. (2000). Subscript-free modeling languages: a tool for facilitating the formulation and use of models. European Journal of Operational Research, 123(3), 614–627.

    Article  Google Scholar 

  • Logi Analytics (2014) State of Self-Service BI Report. http://images.learn.logixml.com/Web/LogiAnalyticsInc/%7B7c21cd62-221c-44af-9ecd-a35265bc8e34%7D_LogiAnalytics-2014StateOfSelfService-Artwork-1028.pdf. Accessed 8 February 2015.

  • Lundell, B., & Lings, B. (2004). Changing perceptions of CASE technology. Journal of Systems and Software, 72(2), 271–280.

    Article  Google Scholar 

  • Madhusudan, T. (2007). A web services framework for distributed model management. Information Systems Frontiers, 9(1), 9–27.

    Article  Google Scholar 

  • Maslow, A. H. (1966). The psychology of science. Chicago: J. Dewey Society.

    Google Scholar 

  • McMurtrey, M. E., Grover, V., Teng, J. T. C., & Lightner, N. J. (2002). Job satisfaction of information technology workers: the impact of career orientation and task automation in a CASE environment. Journal of Management Information Systems, 19(2), 273–302.

    Article  Google Scholar 

  • Murphy, F. H., Stohr, E. A., & Asthana, A. (1992). Representation schemes for linear programming models. Management Science, 38(7), 964–991.

    Article  Google Scholar 

  • Object Management Group (2003). Common Warehouse Metamodel (CWM) Specification. http://www.omg.org/spec/CWM/1.1/PDF/. Accessed 24 April 2015.

  • Pack, D. J. (1987). A practical overview of ARIMA models for time series forecasting. In S. G. Makridakis & S. C. Wheelwright (Eds.), The handbook of forecasting: a managers guide (pp. 196–218). New York: Wiley.

    Google Scholar 

  • Pechter, R. (2011). PMML conformance progress report – five years later. In Proceedings of PMML’11 (pp. 6–15). New York: ACM Press.

    Google Scholar 

  • Powell, S. R. (2015). Summary of state models. Unpublished working paper. New Brunswick, NJ: John J. Heldrich Center for Workforce Development.

  • Power, D., & Sharda, R. (2007). Model-driven decision support systems: concepts and research directions. Decision Support Systems, 43(3), 1044–1061.

    Article  Google Scholar 

  • Rohanizadeh, S., & Moghadam, M. (2009). A proposed data mining methodology and its application to industrial procedures. Journal of Industrial Engineering, 4, 37–50.

    Google Scholar 

  • SAS Institute (1998). Data Mining and the Case for Sampling. SAS Institute Best Practices Paper, Carey, NC.

  • Senn, J. A., & Wynekoop, J. L. (1995). The other side of CASE implementation. Information Systems Management, 12(4), 7.

    Article  Google Scholar 

  • Sottara, D., Mello, P., Sartori, C., & Fry, E. (2011). Enhancing a production rule engine with predictive models using PMML. In Proceedings of PMML’11 (pp. 39–47). New York: ACM Press.

    Google Scholar 

  • Wit, E., van den Heuvel, E., & Romeijn, J.-W. (2012). ‘All models are wrong…’: an introduction to model uncertainty. Statistica Neerlandica, 66(3), 217–236.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Karen Corral.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Schuff, D., Corral, K., St. Louis, R.D. et al. Enabling self-service BI: A methodology and a case study for a model management warehouse. Inf Syst Front 20, 275–288 (2018). https://doi.org/10.1007/s10796-016-9722-2

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10796-016-9722-2

Keywords

Navigation