Reference Hub5
On Handling the Evolution of External Data Sources in a Data Warehouse Architecture

On Handling the Evolution of External Data Sources in a Data Warehouse Architecture

Robert Wrembel
ISBN13: 9781609605377|ISBN10: 1609605373|EISBN13: 9781609605384
DOI: 10.4018/978-1-60960-537-7.ch006
Cite Chapter Cite Chapter

MLA

Wrembel, Robert. "On Handling the Evolution of External Data Sources in a Data Warehouse Architecture." Integrations of Data Warehousing, Data Mining and Database Technologies: Innovative Approaches, edited by David Taniar and Li Chen, IGI Global, 2011, pp. 106-147. https://doi.org/10.4018/978-1-60960-537-7.ch006

APA

Wrembel, R. (2011). On Handling the Evolution of External Data Sources in a Data Warehouse Architecture. In D. Taniar & L. Chen (Eds.), Integrations of Data Warehousing, Data Mining and Database Technologies: Innovative Approaches (pp. 106-147). IGI Global. https://doi.org/10.4018/978-1-60960-537-7.ch006

Chicago

Wrembel, Robert. "On Handling the Evolution of External Data Sources in a Data Warehouse Architecture." In Integrations of Data Warehousing, Data Mining and Database Technologies: Innovative Approaches, edited by David Taniar and Li Chen, 106-147. Hershey, PA: IGI Global, 2011. https://doi.org/10.4018/978-1-60960-537-7.ch006

Export Reference

Mendeley
Favorite

Abstract

A data warehouse architecture (DWA) has been developed for the purpose of integrating data from multiple heterogeneous, distributed, and autonomous external data sources (EDSs) as well as for providing means for advanced analysis of integrated data. The major components of this architecture include: an external data source (EDS) layer, and extraction-transformation-loading (ETL) layer, a data warehouse (DW) layer, and an on-line analytical processing (OLAP) layer. Methods of designing a DWA, research developments, and most of the commercially available DW technologies tacitly assumed that a DWA is static. In practice, however, a DWA requires changes among others as the result of the evolution of EDSs, changes of the real world represented in a DW, and new user requirements. Changes in the structures of EDSs impact the ETL, DW, and OLAP layers. Since such changes are frequent, developing a technology for handling them automatically or semi-automatically in a DWA is of high practical importance. This chapter discusses challenges in designing, building, and managing a DWA that supports the evolution of structures of EDSs, evolution of an ETL layer, and evolution of a DW. The challenges and their solutions presented here are based on an experience of building a prototype Evolving-ETL and a prototype Multiversion Data Warehouse (MVDW). In details, this chapter presents the following issues: the concept of the MVDW, an approach to querying the MVDW, an approach to handling the evolution of an ETL layer, a technique for sharing data between multiple DW versions, and two index structures for the MVDW.

Request Access

You do not own this content. Please login to recommend this title to your institution's librarian or purchase it from the IGI Global bookstore.