Abstract
Due to the increasing complexity of data warehouse (DW), continuous attention must be paid for evaluation of their quality throughout their design and development. DW quality depends on the quality of all requirements, conceptual, logical and physical models used for DW design. Therefore, identification of various data quality (DQ) dimensions in those phase of DW development are very much needed.
In this paper, we surveyed and evaluated the literature related to the DQ dimension in every phase of DW development and proposed an integrated approach for incorporating DQ in DW development in order to minimize risk of DW project failure.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Ballou, D.P., Tayi, G.K.: Enhancing Data Quality in Data Warehouse Environments. Communications of the ACM 42(1), 73–78 (1999)
Cowie, J., Burstein, F.: Quality of data model for supporting mobile decision making. Decision Support Systems 43, 1675–1683 (2007)
Dubielewicz, I., Hnatkowska, B., Huzar, Z., Tuzinkiewicz, L.: Feasibility Analysis of MDA-based Database Design. In: Proceedings of the International Conference on Dependability of Computer Systems (DEPCOS-RELCOMEX 2006). IEEE (2006) 0-7695-2565-2/06
ElGamal, N., Bastawissy, Galal-Edeen, G.: Towards a Data Warehouse Testing Framework. In: 2011 Ninth International Conference on ICT and Knowledge Engineering. IEEE (2011)
English, L.P.: Improving Data Warehouse and Business Information Quality (Methods for Reducing Costs and Increasing Profits). John Wiley and Sons, Inc., New York (1999)
Eppler, M.J.: Managing Information Quality: Increasing the Value of Information in Knowledge-Intensive Products and Processes, 2nd edn. Springer (2006)
Foshay, N., Mukherjee, A., Taylor, A.: Does Data Warehouse End-User Metadata Add Value? Communications of the ACM 50, 70–77 (2007)
Giannoccaro, A., Shanks, G., Darke, P.: Stakeholder Perceptions of Data Quality in a Data Warehouse Environment. In: Proc. 10th Australian Conference on Information Systems (1999)
Gosain, A., Singh, J.: Towards Data warehouse Business Quality through Requirements Elicitation. IEEE 978-1-4244-2624-9/08 (2008)
Golfarelli, M.: From User Requirements to Conceptual Design in Data warehouse Design. IGI Global (2010), doi:10.4018/978-1-60566-756-0.ch001
Hadley, L.: Data Warehouse Quality Management (1998), http://www.users.qwest.net/~lauramh/resume/dwqual.htm
Helfert, M., von Maur, E.: A Strategy for Managing Data Quality in Data Warehouse Systems. In: The Proceedings of the International Conference on Information Quality, Boston, MA (2001)
Jarke, M., Jeusfeld, M., Quix, C., Vassiliadis, P.: – Architecture and Quality in Data Warehouses: An Extended Repository Approach. Information Systems 24(3), 229–253 (1999)
Jarke, M., Vassiliou, Y.: Data Warehouse Quality: A Review of the DWQ Project. In: Proceedings of the 2nd Conference on Information Quality, Cambridge, MA (1997)
Jeusfeld, M.A., Quix, C., Jarke, M.: Design and Analysis of Quality Information for Data Warehouses. In: Ling, T.-W., Ram, S., Li Lee, M. (eds.) ER 1998. LNCS, vol. 1507, pp. 349–362. Springer, Heidelberg (1998)
Kimball, R., Reeves, L., Thornthwaite, W., Ross, M., Thornwaite, W.: The Data Warehouse Lifecycle Toolkit: Expert Method for Designing, Developing, and Deploying Data Warehouses. John Wiley & Sons, Inc. (1998)
Kimball, R., Caserta, J.: The Datawarehouse ETL Toolkit: Practical techniques for Extracting, Cleansing, Conforming and Delivering Data. Wiley Publishing, Inc., IN (2004) ISBN: 0-764-56757-8
March, S., Hevner, A.: Integrated decision support systems: A data warehousing perspective. Decision Support Systems 43(3), 1031–1043 (2007)
Marco, D.: Building and Managing the Meta Data Repository: A Full Lifecycle Guide. Willey and Sons, Inc., New York (2000)
Marotta, A., Ruggia, R.: Data Warehouse Design: A schema-transformation approach. In: SCCC 2002, Chile (2002)
Munawar, Salim, N., Ibrahim, R.: Toward Data Quality Integration into the Data Warehouse Development. In: Ninth IEEE International Conference on Dependable, Autonomic and Secure Computing. IEEE Computer Sociaty (2011a), 978-0-7695-4612-4/11, doi:10.1109/DASC.2011.194
Munawar, Salim, N., Ibrahim, R.: Toward Data Warehouse Quality through Integrated Requirements Analysis. In: ICACSIS 2011 (2011b) ISBN: 978-979-1421-11-9
Nemani, R.R., Konda, R.: A Framework for Data Quality in Data Warehousing. In: Yang, J., Ginige, A., Mayr, H.C., Kutsche, R.-D. (eds.) UNISCON 2009. LNBIP, vol. 20, pp. 292–297. Springer, Heidelberg (2009)
Paim, F.R.S., Castro, J.B.: DWARF: An Approach for Requirements Definition and Management of Datawarehouse Systems. In: Proceedings of the 11th IEEE International Conference on Requirement Engineering, pp. 75–78 (2003)
Paim, F.R.S., Castro, J.B.: Enhancing Data Warehouse Design with the NFR Framework. In: 5th Workshop on Requirements Engineering (WER 2002), Valencia, Spain, November 11-12, pp. 40–57 (2002)
Prakash, N., Singh, Y., Gosain, A.: Informational Scenarios for Data Warehouse Requirements Elicitation. In: Atzeni, P., Chu, W., Lu, H., Zhou, S., Ling, T.-W. (eds.) ER 2004. LNCS, vol. 3288, pp. 205–216. Springer, Heidelberg (2004)
Prakash, N., Gosanin, A.: An Approach to Engineering the Requirements of data Warehouses. Springer, London (2007), doi:10.1007/s00766-007-0057-x
Pipino, L., Lee, Y., Wang, R.: Data Quality Assessment. Commun. ACM 45, 4 (2002)
Piprani, B., Ernst, D.: A Model for Data Quality Assessment. In: Meersman, R., Tari, Z., Herrero, P. (eds.) OTM-WS 2008. LNCS, vol. 5333, pp. 750–759. Springer, Heidelberg (2008)
Singh, R., Singh, K.: A Descriptive Classification of Causes of Data Quality Problems in Data Warehousing. IJCSI International Journal of Computer Science Issues 7(3(2)) (May 2010) ISSN : 1694-0784
Solodovnikova, D.: Metadata to support Data Warehouse Evolution. Information System Development, 627–635 (2010) 10.1007/b137171_65
Solomon, M.: Ensuring a successful data warehouse initiative. IS Management 22(1), 26–36 (2005)
Sumathi, S., Sivanandam, S.N.: Data Marts and Data Warehouse: Information Architecture for the Millennium. SCI, vol. 29, pp. 75–150. Springer, Heidelberg (2006)
Thomann, J., Wells, D.: Data warehouse quality management. In: The Data Warehousing Institute’s Fourth Annual Implementation Conference, Anaheim, CA, February 14-19 (1999)
Van, L.A.: Goal-oriented requirements engineering: a guided tour. Invited paper for RE 2001, Proceedings of 5th IEEE International Symposium on Requirements Engineering, Toronto, pp. 249–263 (August 2001)
Vetterli, T., Vaduva, A., Staudt, M.: Metadata Standards for Datawarehousing: Open Information Model vs Common Warehouse Metamodel (2000)
Celko, J., McDonald, J.: Don’t warehouse dirty data. Datamation 41(19), 42–53 (1995)
Giblett, P.B.: Data Quality: The Key to Managing the Successful Data Warehouse Project (2002), http://www.ontariocio.com/Data_Quality_key_to_successful_BI_project_20020501.pdf (retrieved 20 April, 2010)
Rizzi, S., Abelló, A., Lechtenbörger, J., Trujillo, J.: Research in data warehouse modeling and design: Dead or alive? In: Proceedings of the 9th ACM Int. Workshop on Data Warehousing and OLAP (DOLAP 2006), pp. 3–10. ACM Press (2006)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Munawar, Salim, N., Ibrahim, R. (2012). Comparative Study of Data Quality Dimensions for Data Warehouse Development: A Survey. In: Hassanien, A.E., Salem, AB.M., Ramadan, R., Kim, Th. (eds) Advanced Machine Learning Technologies and Applications. AMLTA 2012. Communications in Computer and Information Science, vol 322. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35326-0_46
Download citation
DOI: https://doi.org/10.1007/978-3-642-35326-0_46
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-35325-3
Online ISBN: 978-3-642-35326-0
eBook Packages: Computer ScienceComputer Science (R0)