Skip to main content

Comparative Study of Data Quality Dimensions for Data Warehouse Development: A Survey

  • Conference paper
Advanced Machine Learning Technologies and Applications (AMLTA 2012)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 322))

Abstract

Due to the increasing complexity of data warehouse (DW), continuous attention must be paid for evaluation of their quality throughout their design and development. DW quality depends on the quality of all requirements, conceptual, logical and physical models used for DW design. Therefore, identification of various data quality (DQ) dimensions in those phase of DW development are very much needed.

In this paper, we surveyed and evaluated the literature related to the DQ dimension in every phase of DW development and proposed an integrated approach for incorporating DQ in DW development in order to minimize risk of DW project failure.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Ballou, D.P., Tayi, G.K.: Enhancing Data Quality in Data Warehouse Environments. Communications of the ACM 42(1), 73–78 (1999)

    Article  Google Scholar 

  2. Cowie, J., Burstein, F.: Quality of data model for supporting mobile decision making. Decision Support Systems 43, 1675–1683 (2007)

    Article  Google Scholar 

  3. Dubielewicz, I., Hnatkowska, B., Huzar, Z., Tuzinkiewicz, L.: Feasibility Analysis of MDA-based Database Design. In: Proceedings of the International Conference on Dependability of Computer Systems (DEPCOS-RELCOMEX 2006). IEEE (2006) 0-7695-2565-2/06

    Google Scholar 

  4. ElGamal, N., Bastawissy, Galal-Edeen, G.: Towards a Data Warehouse Testing Framework. In: 2011 Ninth International Conference on ICT and Knowledge Engineering. IEEE (2011)

    Google Scholar 

  5. English, L.P.: Improving Data Warehouse and Business Information Quality (Methods for Reducing Costs and Increasing Profits). John Wiley and Sons, Inc., New York (1999)

    Google Scholar 

  6. Eppler, M.J.: Managing Information Quality: Increasing the Value of Information in Knowledge-Intensive Products and Processes, 2nd edn. Springer (2006)

    Google Scholar 

  7. Foshay, N., Mukherjee, A., Taylor, A.: Does Data Warehouse End-User Metadata Add Value? Communications of the ACM 50, 70–77 (2007)

    Article  Google Scholar 

  8. Giannoccaro, A., Shanks, G., Darke, P.: Stakeholder Perceptions of Data Quality in a Data Warehouse Environment. In: Proc. 10th Australian Conference on Information Systems (1999)

    Google Scholar 

  9. Gosain, A., Singh, J.: Towards Data warehouse Business Quality through Requirements Elicitation. IEEE 978-1-4244-2624-9/08 (2008)

    Google Scholar 

  10. Golfarelli, M.: From User Requirements to Conceptual Design in Data warehouse Design. IGI Global (2010), doi:10.4018/978-1-60566-756-0.ch001

    Google Scholar 

  11. Hadley, L.: Data Warehouse Quality Management (1998), http://www.users.qwest.net/~lauramh/resume/dwqual.htm

  12. Helfert, M., von Maur, E.: A Strategy for Managing Data Quality in Data Warehouse Systems. In: The Proceedings of the International Conference on Information Quality, Boston, MA (2001)

    Google Scholar 

  13. Jarke, M., Jeusfeld, M., Quix, C., Vassiliadis, P.: – Architecture and Quality in Data Warehouses: An Extended Repository Approach. Information Systems 24(3), 229–253 (1999)

    Article  Google Scholar 

  14. Jarke, M., Vassiliou, Y.: Data Warehouse Quality: A Review of the DWQ Project. In: Proceedings of the 2nd Conference on Information Quality, Cambridge, MA (1997)

    Google Scholar 

  15. Jeusfeld, M.A., Quix, C., Jarke, M.: Design and Analysis of Quality Information for Data Warehouses. In: Ling, T.-W., Ram, S., Li Lee, M. (eds.) ER 1998. LNCS, vol. 1507, pp. 349–362. Springer, Heidelberg (1998)

    Google Scholar 

  16. Kimball, R., Reeves, L., Thornthwaite, W., Ross, M., Thornwaite, W.: The Data Warehouse Lifecycle Toolkit: Expert Method for Designing, Developing, and Deploying Data Warehouses. John Wiley & Sons, Inc. (1998)

    Google Scholar 

  17. Kimball, R., Caserta, J.: The Datawarehouse ETL Toolkit: Practical techniques for Extracting, Cleansing, Conforming and Delivering Data. Wiley Publishing, Inc., IN (2004) ISBN: 0-764-56757-8

    Google Scholar 

  18. March, S., Hevner, A.: Integrated decision support systems: A data warehousing perspective. Decision Support Systems 43(3), 1031–1043 (2007)

    Article  Google Scholar 

  19. Marco, D.: Building and Managing the Meta Data Repository: A Full Lifecycle Guide. Willey and Sons, Inc., New York (2000)

    Google Scholar 

  20. Marotta, A., Ruggia, R.: Data Warehouse Design: A schema-transformation approach. In: SCCC 2002, Chile (2002)

    Google Scholar 

  21. Munawar, Salim, N., Ibrahim, R.: Toward Data Quality Integration into the Data Warehouse Development. In: Ninth IEEE International Conference on Dependable, Autonomic and Secure Computing. IEEE Computer Sociaty (2011a), 978-0-7695-4612-4/11, doi:10.1109/DASC.2011.194

    Google Scholar 

  22. Munawar, Salim, N., Ibrahim, R.: Toward Data Warehouse Quality through Integrated Requirements Analysis. In: ICACSIS 2011 (2011b) ISBN: 978-979-1421-11-9

    Google Scholar 

  23. Nemani, R.R., Konda, R.: A Framework for Data Quality in Data Warehousing. In: Yang, J., Ginige, A., Mayr, H.C., Kutsche, R.-D. (eds.) UNISCON 2009. LNBIP, vol. 20, pp. 292–297. Springer, Heidelberg (2009)

    Chapter  Google Scholar 

  24. Paim, F.R.S., Castro, J.B.: DWARF: An Approach for Requirements Definition and Management of Datawarehouse Systems. In: Proceedings of the 11th IEEE International Conference on Requirement Engineering, pp. 75–78 (2003)

    Google Scholar 

  25. Paim, F.R.S., Castro, J.B.: Enhancing Data Warehouse Design with the NFR Framework. In: 5th Workshop on Requirements Engineering (WER 2002), Valencia, Spain, November 11-12, pp. 40–57 (2002)

    Google Scholar 

  26. Prakash, N., Singh, Y., Gosain, A.: Informational Scenarios for Data Warehouse Requirements Elicitation. In: Atzeni, P., Chu, W., Lu, H., Zhou, S., Ling, T.-W. (eds.) ER 2004. LNCS, vol. 3288, pp. 205–216. Springer, Heidelberg (2004)

    Chapter  Google Scholar 

  27. Prakash, N., Gosanin, A.: An Approach to Engineering the Requirements of data Warehouses. Springer, London (2007), doi:10.1007/s00766-007-0057-x

    Google Scholar 

  28. Pipino, L., Lee, Y., Wang, R.: Data Quality Assessment. Commun. ACM 45, 4 (2002)

    Google Scholar 

  29. Piprani, B., Ernst, D.: A Model for Data Quality Assessment. In: Meersman, R., Tari, Z., Herrero, P. (eds.) OTM-WS 2008. LNCS, vol. 5333, pp. 750–759. Springer, Heidelberg (2008)

    Chapter  Google Scholar 

  30. Singh, R., Singh, K.: A Descriptive Classification of Causes of Data Quality Problems in Data Warehousing. IJCSI International Journal of Computer Science Issues 7(3(2)) (May 2010) ISSN : 1694-0784

    Google Scholar 

  31. Solodovnikova, D.: Metadata to support Data Warehouse Evolution. Information System Development, 627–635 (2010) 10.1007/b137171_65

    Google Scholar 

  32. Solomon, M.: Ensuring a successful data warehouse initiative. IS Management 22(1), 26–36 (2005)

    Google Scholar 

  33. Sumathi, S., Sivanandam, S.N.: Data Marts and Data Warehouse: Information Architecture for the Millennium. SCI, vol. 29, pp. 75–150. Springer, Heidelberg (2006)

    Google Scholar 

  34. Thomann, J., Wells, D.: Data warehouse quality management. In: The Data Warehousing Institute’s Fourth Annual Implementation Conference, Anaheim, CA, February 14-19 (1999)

    Google Scholar 

  35. Van, L.A.: Goal-oriented requirements engineering: a guided tour. Invited paper for RE 2001, Proceedings of 5th IEEE International Symposium on Requirements Engineering, Toronto, pp. 249–263 (August 2001)

    Google Scholar 

  36. Vetterli, T., Vaduva, A., Staudt, M.: Metadata Standards for Datawarehousing: Open Information Model vs Common Warehouse Metamodel (2000)

    Google Scholar 

  37. Celko, J., McDonald, J.: Don’t warehouse dirty data. Datamation 41(19), 42–53 (1995)

    Google Scholar 

  38. Giblett, P.B.: Data Quality: The Key to Managing the Successful Data Warehouse Project (2002), http://www.ontariocio.com/Data_Quality_key_to_successful_BI_project_20020501.pdf (retrieved 20 April, 2010)

  39. Rizzi, S., Abelló, A., Lechtenbörger, J., Trujillo, J.: Research in data warehouse modeling and design: Dead or alive? In: Proceedings of the 9th ACM Int. Workshop on Data Warehousing and OLAP (DOLAP 2006), pp. 3–10. ACM Press (2006)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Munawar, Salim, N., Ibrahim, R. (2012). Comparative Study of Data Quality Dimensions for Data Warehouse Development: A Survey. In: Hassanien, A.E., Salem, AB.M., Ramadan, R., Kim, Th. (eds) Advanced Machine Learning Technologies and Applications. AMLTA 2012. Communications in Computer and Information Science, vol 322. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35326-0_46

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-35326-0_46

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-35325-3

  • Online ISBN: 978-3-642-35326-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics