Skip to main content
Log in

An approach for incorporating quality-based cost–benefit analysis in data warehouse design

  • Published:
Information Systems Frontiers Aims and scope Submit manuscript

Abstract

Data are considered to be important organizational assets because of their assumed value, including their potential to improve the organizational decision-making processes. Such potential value, however, comes with various costs, including those of acquiring, storing, securing and maintaining the given assets at appropriate quality levels. Clearly, if these costs outweigh the value that results from using the data, it would be counterproductive to acquire, store, secure and maintain the data. Thus cost–benefit assessment is particularly important in data warehouse (DW) development; yet very few techniques are available for determining the value that the organization will derive from storing a particular data table and hence determining which data set should be loaded in the DW. This research seeks to address the issue of identifying the set of data with the potential for producing the greatest net value for the organization by offering a model that can be used to perform a cost–benefit analysis on the decision support views that the warehouse can support and by providing techniques for estimating the parameters necessary for this model.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2

Similar content being viewed by others

References

  • Ballou, D. P., & Tayi, G. K. (1999). Enhancing data quality in data warehouse environments. Communications of the ACM, 42(1), 73–78.

    Article  Google Scholar 

  • Barquin, R., & Edelstein, H. (1997). Planning and designing the data warehouse. Prentice Hall.

  • Brijs, T., Vanhoof, K., & Wets, G. (2000). Reducing redundancy in characteristic rule discovery by using integer programming techniques. Intelligent Data Analysis, 4, 229–240.

    Google Scholar 

  • Celko, J., & McDonald, J. (1995). Don’t warehouse dirty data. Datamation, 41(19), 42–53.

    Google Scholar 

  • Chengalur-Smith, I., Ballou, D., & Pazer, H. (1999). The impact of data quality information on decision making: An exploratory analysis. IEEE Transactions on Knowledge and Data Engineering, 11, 6.

    Article  Google Scholar 

  • Cowie, J., & Burstein, F. (2007). Quality of data model for supporting mobile decision making.. Decision Support Systems, 43, 1675–1683.

    Article  Google Scholar 

  • Davidson, I., Grover, A., Satyanarayana, A., & Tayi, G. K. (2004). A general approach to incorporate data quality matrices into data mining algorithms. Paper presented at the Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining, Seattle, Washington, USA.

  • Ewen, E. F., Medsker, C. E., & Dusterhoft, L. E. (1998). Data warehousing in an integrated health system: building the business case. Paper presented at the 1st ACM international workshop on Data warehousing and OLAP, Washington, D.C.

  • Gardner, S. R. (1998). Building the data warehouse. Communications of the ACM, 41, 9.

    Article  Google Scholar 

  • Gupta, H., & Mumick, I. S. (2005). Selection of views to materialize in a data warehouse. IEEE Transactions on Knowledge and Data Engineering, 17(1), 24–43.

    Article  Google Scholar 

  • Hwang, H.-G., Ku, C.-Y., Yen, D. C., & Cheng, C.-C. (2004). Critical factors influencing the adoption of data warehouse technology: A study of the banking industry in Taiwan. Decision Support Systems, 37, 1–21.

    Article  Google Scholar 

  • Jarke, M., Jeusfeld, M., Quix, C., & Vassiliadis, P. (1999). Architecture and quality in data warehouses: An extended repository approach. Information Systems, 24(3), 229–253.

    Article  Google Scholar 

  • Kalfus, O., Ronen, B., & Spiegler, I. (2004). A selective data retention approach in massive databases. Omega, 32, 87–95.

    Article  Google Scholar 

  • Kwan, M., & Balasubramanian, P. (2003). KnowledgeScope: Managing knowledge in context. Decision Support Systems, 35, 467 LINDO Software. from www.lindo.com.

    Article  Google Scholar 

  • Martinsons, M., Davison, R., & Tse, D. (1999). The balanced scorecard: A foundation for the strategic management of information systems. Decision Support Systems, 25, 71–88.

    Article  Google Scholar 

  • Moody, D., & Walsh, P. (1999). Measuring the value of information: An asset valuation approach. Paper presented at the Seventh European Conference on Information Systems (ECIS), Frederiksberg, Denmark.

  • Nazareth, D. L., & Rothenberger, M. A. (2004). Assessing the cost-effectiveness of software reuse: A model for planned reuse. The Journal of Systems and Software, 73, 245–255.

    Article  Google Scholar 

  • Osei-Bryson, K.-M., & Ngwenyama, O. K. (2006). Managing risks in information systems outsourcing: An approach to analyzing outsourcing risks and structuring incentive contracts. European Journal of Operational Research, 174, 245–264.

    Article  Google Scholar 

  • Pipino, L. L., Lee, Y. W., & Wang, R. Y. (2002). Data quality assessment. Communications of the ACM, 45(4), 211–218.

    Article  Google Scholar 

  • Prakash, N., & Gosain, A. (2003). Requirements driven data warehouse development. Paper presented at the CAiSE Short Paper Proceedings.

  • Rahm, E., & Do, H. (2000). Data cleaning: Problems and current approaches. IEEE Data Engineering Bulletin, 23(4), 1–11.

    Google Scholar 

  • Saaty, T. (1980). The analytic hierarchy process: Planning, Priority Setting, Resource Allocation. McGraw-Hill.

  • Solomon, M. (2005). Ensuring a successful data warehouse initiative. Information Systems Management, 22(1), 26–36.

    Article  Google Scholar 

  • Tayi, G. K., & Ballou, D. (1998). Examining data quality. Communications of the ACM, 41, 2.

    Article  Google Scholar 

  • Vassiliadis, P. (2000). Gulliver in the land of data warehousing: practical experiences and observations of a researcher. Paper presented at the International Workshop on Design and Management of Data Warehouses (DMDW), Stockholm, Sweden.

  • Vassiliadis, P., Bouzeghoub, M., & Quix, C. (2000). Towards quality-oriented data warehouse usage and evolution. Informations Systems, 25(2), 89–115.

    Article  Google Scholar 

  • Wang, R. Y., & Strong, D. (1996). Beyond accuracy: What data quality means to data consumers. Journal of Management Information Systems, 12(4), 5–34.

    Google Scholar 

  • Watson, H., & Haley, B. (1998). Managerial considerations. Communications of the ACM, 41, 9.

    Article  Google Scholar 

  • Winter, R., & Strauch, B. (2003). A method for demand-driven information requirements analysis in data warehousing projects. Paper presented at the 36th Hawaii International Conference on System Sciences, Hawaii.

  • Wixom, B. H., & Watson, H. J. (2001). An empirical investigation of the factors affecting data warehousing success. MIS Quarterly.

  • Yang, J., Karlapalem, K., & Li, Q. (1997). Algorithms for materialized view design in data warehousing environment. Paper presented at the VLDB, Athens, Greece.

  • Zhang, H., Kishore, R., & Ramesh, R. (2007). Semantics of the MIBML conceptual modeling grammar: An ontological analysis using the Bunge Wang–Weber Framework. Journal of Database Management, 18(1), 1–19.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Lila Rao.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Rao, L., Osei-Bryson, KM. An approach for incorporating quality-based cost–benefit analysis in data warehouse design. Inf Syst Front 10, 361–373 (2008). https://doi.org/10.1007/s10796-008-9077-4

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10796-008-9077-4

Keywords

Navigation