Abstract
Data warehousing is becoming an increasingly important technology for information integration and data analysis. Given the dynamic nature of modern distributed environments, both source data and schema changes are likely to occur autonomously and even concurrently in different sources. We have thus developed a comprehensive solution approach, called TxnWrap, that successfully maintains the warehouse views under any type of concurrent source updates. In this work, we overcome TxnWrap’s restriction that maintenance is processed one source update at a time, which limits performance. To this end, we exploit the transactional approach of TxnWrap to achieve parallel data warehouse maintenance. We first identify the read/write conflicts among the different warehouse maintenance processes. We then propose a parallel maintenance scheduler (PMS) that generates legal schedules that resolve these conflicts. PMS has been implemented and incorporated into our TxnWrap system. The experimental results confirm that our parallel maintenance scheduler significantly improves the performance of data warehouse maintenance.
This work was supported in part by several grants from NSF, namely, the NSF NYI grant #IRI 97.96264, the NSF CISE Instrumentation grant #IRIS 97.29878, and the NSF grant #IIS 9988776.
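The two steps named in the abstract, detecting read/write conflicts among maintenance processes and producing a schedule that respects them, can be illustrated with a minimal sketch. It is not the PMS algorithm itself, whose details are not given here. It assumes a hypothetical model in which each maintenance process is described by the set of sources it reads and the set it writes, and it uses a generic conflict-ordered batching strategy. The names `MaintenanceProcess`, `conflicts`, and `schedule` are illustrative only.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class MaintenanceProcess:
    """Hypothetical model of one maintenance process (an assumption, not the paper's definition)."""
    pid: int                         # arrival order of the triggering source update
    reads: frozenset = frozenset()   # sources the process queries
    writes: frozenset = frozenset()  # sources whose state the process changes


def conflicts(a, b):
    """True if one process writes a source that the other reads or writes."""
    return bool(a.writes & (b.reads | b.writes) or b.writes & a.reads)


def schedule(processes):
    """Group processes into batches; members of one batch may run in parallel.

    Each process is placed one batch after the latest earlier process it
    conflicts with, so conflicting processes keep their arrival order.
    """
    batches = []
    placed = []  # (process, batch index) pairs for already scheduled processes
    for p in sorted(processes, key=lambda x: x.pid):
        level = 0
        for q, q_level in placed:
            if conflicts(p, q):
                level = max(level, q_level + 1)
        if level == len(batches):
            batches.append([])
        batches[level].append(p.pid)
        placed.append((p, level))
    return batches


demo = [
    MaintenanceProcess(1, reads=frozenset({"B"}), writes=frozenset({"A"})),
    MaintenanceProcess(2, reads=frozenset({"D"}), writes=frozenset({"C"})),
    MaintenanceProcess(3, reads=frozenset({"A"}), writes=frozenset({"B"})),
]
print(schedule(demo))  # [[1, 2], [3]]
```

In this example, processes 1 and 2 touch disjoint sources and can run together, while process 3 conflicts with process 1 and is deferred to the next batch.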
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Liu, B., Chen, S., Rundensteiner, E.A. (2002). A Transactional Approach to Parallel Data Warehouse Maintenance. In: Kambayashi, Y., Winiwarter, W., Arikawa, M. (eds) Data Warehousing and Knowledge Discovery. DaWaK 2002. Lecture Notes in Computer Science, vol 2454. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-46145-0_30
DOI: https://doi.org/10.1007/3-540-46145-0_30
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44123-6
Online ISBN: 978-3-540-46145-6
eBook Packages: Springer Book Archive