Abstract
Data warehousing is a new technology which provides a software infrastructure for decision support systems and OLAP applications. Data warehouse systems collect data from heterogeneous and distributed sources, transform and reconcile this data in order to aggregate it and customize it with respect to business and organizational criteria required by decision makers. High level aggregated data is organized by subjects and stored as a multidimensional structure into a data mart. Data quality is very important in database applications in general and very crucial in data warehousing in particular. Indeed, data warehouse systems provide aggregated data to decision makers whose actions and decisions should be very strategic to the enterprise. Providing dirty data, imprecise data or non coherent data may lead to the rejection of the decision support system or may result into non productive decisions. This chapter provides a general framework for data warehouse design based on quality.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Baralis E., Paraboschi S., Teniente E. “Materialized View Selection in a Multidimensional Environment”, Proceedings of the VLDB’97, Athens, GR, 1997.
Basili, V. R., Caldiera, G., Rombach. H. D. The Goal Question Metric Approach.Encyclopedia of Software Engineering - 2 Volume Setpp 528–532, John Wiley & Sons, Inc. (1994). Available athttp://www.cs.umd.edu/users/basili/papers.html
Bouzeghoub M., Fabret F., Matulovic M., “Modeling the Data Warehouse Refreshment Process as a Workflow Application ”, Proceedings of the International Workshop on Design and Management of Data Warehouses (DMDW’99), Heidelberg, Germany, June 1999.
Bouzeghoub, M., Kedad, Z. “A Quality-Based Framework for Data Warehouse Design ”, Proceedings of the International Workshop on Design and Management of Data Warehouses(DMDW’2000), Stockholm, June 2000.
Bouzeghoub, M., Lenzerini, M. (guest eds) “Data extraction, cleaning and reconciliation, a special issue of Information Systems Journal, Elsevier Sce, to be published end 2001
Calvanese, D., De Giacomo, G., Lenzerini, M., Nardi, D., Rosati, R., “ Source Integration integration: conceptual modeling and reasoning support ”, Proceedings of the 6thInternat. Conf. On Cooperative Information Systems, (CoopIS’98), 1998.
Galhardas, H., Florescu, D., Shasha, D., Simon, E., Saita, C.A. “Declarative data cleaning: language, model and algorithms, proceed. Of the 27`h VLDB Conf., Rome, 2001.
Gupta A., Mumick I.S., “Maintenance of materialized views: Problems, Techniques, and applications.” IEEE Data Engineering Bulletin, Special Issue on Materialised Views and Data Warehousing, 18(2), june 1995
Hull R., Zhou G. “A Framework for Supporting Data Integration Using the Materialized and Virtual Approches”Procedding of theSIGMOD 96. 1996a
Hull, R., Zhou, G., Toward the study of performance trade-off between materialized and virtual integrated views, Proeed. Of the Workshop on Materialized Views, I.S. Mumick & A. Gupta Edits, Montreal, June 1996b.
Jarke M, Jeusfeld M.A., Quix C., Vassiliadis P., “Architecture and Quality in Data Warehouses”, Proceeding of the 10th International Conference on Advanced Information Systems Engineering (CAiSE’98), Pisa, Italy, June 1998.
Jarke, M., Lenzerini, M., Vassiliou, Y., Vassiliadis, P., (edits), “ Fundamentals of Data Warehouses ”, Springer, 1999
Jeusfeld, M. A., Quix C., Jarke M., “Design and Analysis of Quality Information for Data Warehouses”, Proceedings of the 17th Internat. Conf. on Conceptual Modeling (ER’98), Singapore, november 1998.
Kedad Z. “Techniques d’int¨¦gration dans les syst¨¨mes d’information multi-source”, PhD Thesis, Laboratoire PriSM, University of Versailles, January 1999.
Kedad Z., Bouzeghoub M. “Conception de syst¨¨mes d’information multi-source”, to appear in the Proceedings of INFORSID’99, Toulon, France, June 1999.
Levy A.Y., Rajaraman A., Ordille J.J. “Querying Heterogeneous Information Sources Using Source Description”, Proceedings of the VLDB’96.
Ligoudistianos S., Sellis T., Theodoratos D., Vassiliou Y. “Heuristic Algorithms For Designing The Data Warehouse with SPJ Views”, Proceedings of the DAWAK’99.
D. Theodoratos, M. Bouzeghoub. Data Currency Quality Factors in Data Warehouse Design. InProc. of the International Workshop on Design and Management of Data Warehouses (DMDW’99)Heidelberg, Germany (1999). Available at http://www.dbnet.ece.ntua.gr/--dwq/
Theodoratos D., Sellis T. “The Data Warehouse Configuration”, Proceedings of the VLDB’97, Athens, GR, 1997.
Theodoratos D., Sellis T. “Data Warehouse Schema and Instance Design”, Proceedings of ER’98, syngapour, Nov. 1998.
Theodoratos D., Ligoudistianos S., Sellis T. “Designing the Global Data Warehouse with SPJ Views”, Proceedings of CAISE’99.
Vassiliadis, P., Bouzeghoub, M., Quix, C., “Towards quality-oriented data warehouse usage and evolution”, Proceedings of the 11th Conference on Advanced Information Systems Engineering (CAiSE’99), Heidelberg, Germany, June 1999, also published in Information Systems Journal, vol. 25, No. 2, 2000.
R.Y. Wang, V.C. Storey, C.P. Firth. A Framework for Analysis of Data Quality Research.IEEE Transactions on Knowledge and Data Engineering7(4): 623–640 (1995).
Wang, R. Y., A product perspective on total data quality management, Com. Of the ACM,Vo141, N2, February 1998.
Yang J., Karlapalem K., Li S. “Algorithms for materialized view design in data warehousing Environment”, Proceedings of VLDB 97, Athens, GR, 1997.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer Science+Business Media New York
About this chapter
Cite this chapter
Bouzeghoub, M., Kedad, Z. (2002). Quality in Data Warehousing. In: Piattini, M.G., Calero, C., Genero, M. (eds) Information and Database Quality. Advances in Database Systems, vol 25. Springer, Boston, MA. https://doi.org/10.1007/978-1-4615-0831-1_8
Download citation
DOI: https://doi.org/10.1007/978-1-4615-0831-1_8
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4613-5260-0
Online ISBN: 978-1-4615-0831-1
eBook Packages: Springer Book Archive