Skip to main content

A Transactional Approach to Parallel Data Warehouse Maintenance

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2454))

Abstract

Data Warehousing is becoming an increasingly important technology for information integration and data analysis.Given the dynamic nature of modern distributed environments, both source data and schema changes are likely to occur autonomously and even concurrently in different sources.We have thus developed a comprehensive solution approach, called TxnWrap,that successfully maintains the warehouse views under any type of concurrent source updates.In this work, we now overcome TxnWrap’s restriction that the maintenance is processed one by one for each source update, since that limits the performance. To overcome this limitation, we exploit the transactional approach of TxnWrap to achieve parallel data warehouse maintenance. For this, we first identify the read/write conflicts among the different warehouse maintenance processes. We then propose a parallel maintenance scheduler (PMS)that generates legal schedules that resolve these conflicts.PMS has been implemented and incorporated into our TxnWrap system.The experimental results confirm that our parallel maintenance scheduler significantly improves the performance of data warehouse maintenance.

This work was supported in part by several grants from NSF, namely, the NSF NYI grant #IRI 97.96264, the NSF CISE Instrumentation grant #IRIS 97.29878, and the NSF grant #IIS 9988776.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. D. Agrawal, A.E. Abbadi, A. Singh, and T. Yurek. Efficient View Maintenance at Data Warehouses.In Proceedings of SIGMOD, pages 417–427,1997.

    Google Scholar 

  2. P.A. Bernstein, V. Hadzilacos, and N. Goodman.Concurrency Control and Recovery in Database System.Addison-Wesley Pub.,1987.

    Google Scholar 

  3. J. Chen and E.A. Rundensteiner. Txnwrap:A transactional approach to data warehouse maintenance.Technical ReportWPI-CS-TR-00-26,Worcester Polytechnic Institute, November2000.

    Google Scholar 

  4. J. Chen, X. Zhang, S. Chen, K. Andreas, and E.A. Rundensteiner. DyDa:Data Warehouse Maintenance under Fully Concurrent Environments.In Proceedings of SIGMOD Demo Session, page 619, Santa Barbara,CA,May 2001.

    Google Scholar 

  5. H. García-Molina, W.L., J.L. Wiener, and Y. Zhuge.Distributed and Parallel Computing Issues in Data Warehousing.In Symposium on Principles of Distributed Computing,page 7,1998.Abstract.

    Google Scholar 

  6. A.M. Lee, A. Nica, and E.A. Rundensteiner.The EVE Approach:View Synchronization in Dynamic Distributed Environments.IEEE Transactions on Knowledge and Data Engineering (TKDE),2001.

    Google Scholar 

  7. B. Liu. Optimization Strategies for Data Warehouse Maintenance in Distributed Environments.Master’s thesis,Worcester Polytechnic Institute,May2002.

    Google Scholar 

  8. K. Salem, K.S. Beyer, R. Cochrane, and B.G. Lindsay.How To Roll a Join: Asynchronous Incremental View Maintenance.In Proceedings of SIGMOD,pages 129–140,2000.

    Google Scholar 

  9. X. Zhang, E.A. Rundensteiner, and L. Ding.PVM:Parallel View Maintenance Under Concurrent Data Updates of Distributed Sources.In Data Warehousing and Knowledge Discovery, Proceedings,September, Munich, Germany 2001.

    Google Scholar 

  10. Y. Zhuge, H. García-Molina, J. Hammer, and J. Widom.View Maintenance in a Warehousing Environment. In Proceedings of SIGMOD, pages 316–327,May 1995.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2002 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Liu, B., Chen, S., Rundensteiner, E.A. (2002). A Transactional Approach to Parallel Data Warehouse Maintenance. In: Kambayashi, Y., Winiwarter, W., Arikawa, M. (eds) Data Warehousing and Knowledge Discovery. DaWaK 2002. Lecture Notes in Computer Science, vol 2454. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-46145-0_30

Download citation

  • DOI: https://doi.org/10.1007/3-540-46145-0_30

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-44123-6

  • Online ISBN: 978-3-540-46145-6

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics