Abstract
Data warehouses (DW) are built by gathering information from distributed information sources (ISs) and integrating it into one customized repository. In recent years, work has begun to address the problem of view maintenance of DWs under concurrent data updates of different ISs. The SWEEP solution is one solution that does not require the ISs to be quiescence, as required by previous strategies, by employing a local compensation strategy. SWEEP however processes all update messages in a sequential manner. To optimize upon this sequential processing, we now propose a parallel view maintenance algorithm, called PVM, that incorporates all benefits of previous maintenance approaches while offering improved performance due to parallelism. We have identified two issues critical for supporting parallel view maintenance: (1) detecting maintenance-concurrent data updates in a parallel mode, and (2) correcting the problem that the DW commit order may not correspond to the DW update processing order due to parallel maintenance handling. In this work, we provide solutions to both issues. We have implemented both SWEEP and PVM in our EVE data warehousing system, and our studies confirm the multi-fold performance improvement of PVM over SWEEP.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
D. Agrawal, A. El Abbadi, A. Singh, and T. Yurek. Efficient View Maintenance at Data Warehouses. In Proceedings of SIGMOD, pages 417–427, 1997.
E. Baralis, S. Ceri, and S. Paraboschi. Conservative TimeStamp Revised for Materialized View Maintenance in a Data Warehouse. In Workshop on Materialized Views, pages 1–9, 1996.
J. Chen, X. Zhang, S. Chen, A. Koeller, and E.A. Rundensteiner. DyDa: Data Warehouse Maintenance in Fully Concurrent Environments. In Proceedings of SIGMOD’01 Demo Session, May 2001.
H. García-Molina, W. Labio, J.L. Wiener, and Y. Zhuge. Distributed and Parallel Computing Issues in Data Warehousing. In Symposium on Principles of Distributed Computing, page 7, 1998. Abstract.
A. Gupta, H.V. Jagadish, and I.S. Mumick. Maintenance and Self-Maintenance of Outer-Join Views. In Next Generation Information Technologies and Systems, 1997.
A. Gupta and I. Mumick. Maintenance of Materialized Views: Problems, Techniques, and Applications. IEEE Data Engineering Bulletin, Special Issue on Materialized Views and Warehousing, 18(2):3–19, 1995.
A. Gupta, I.S. Mumick, and V.S. Subrahmanian. Maintaining Views Incrementally. In Proceedings of SIGMOD, pages 157–166, 1993.
M.K. Mohania, S. Konomi, and Y. Kambayashi. Incremental Maintenance of Materialized Views. In Database and Expert Systems Applications (DEXA), pages 551–560, 1997.
A. Nica and E.A. Rundensteiner. View Maintenance after View Synchronization. In International Database Engineering and Applications Symposium (IDEAS’99), pages 213–215, August, Montreal, Canada 1999.
E.A. Rundensteiner, A. Koeller, X. Zhang, A. Lee, A. Nica, A. VanWyk, and Y. Li. Evolvable View Environment. In Proceedings of SIGMOD’99 Demo Session, pages 553–555, May 1999.
X. Zhang, L. Ding, and E.A. Rundensteiner. PSWEEP: Parallel View Maintenance Under Concurrent Data Updates of Distributed Sources. Technical Report WPI-CS-TR-99-14, Worcester Polytechnic Institute, Computer Science Department, May 1999.
Y. Zhuge, H. García-Molina, J. Hammer, and J. Widom. View Maintenance in a Warehousing Environment. In Proceedings of SIGMOD, pages 316–327, May 1995.
Y. Zhuge, H. García-Molina, and J.L. Wiener. The Strobe Algorithms for Multi-Source Warehouse Consistency. In International Conference on Parallel and Distributed Information Systems, pages 146–157, December 1996.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2001 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zhang, X., Ding, L., Rundensteiner, E.A. (2001). PVM: Parallel View Maintenance under Concurrent Data Updates of Distributed Sources⋆. In: Kambayashi, Y., Winiwarter, W., Arikawa, M. (eds) Data Warehousing and Knowledge Discovery. DaWaK 2001. Lecture Notes in Computer Science, vol 2114. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44801-2_23
Download citation
DOI: https://doi.org/10.1007/3-540-44801-2_23
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-42553-3
Online ISBN: 978-3-540-44801-3
eBook Packages: Springer Book Archive