skip to main content
article

Multiversion-based view maintenance over distributed data sources

Published:12 December 2004Publication History
Skip Abstract Section

Abstract

Materialized views can be maintained by submitting maintenance queries to the data sources. However, the query results may be erroneous due to concurrent source updates. State-of-the-art maintenance strategies typically apply compensations to resolve such conflicts and assume all source schemata remain stable over time. In a loosely coupled dynamic environment, the sources may autonomously change not only their data but also their schema or semantics. Consequently, either the maintenance or the compensation queries may be broken. Unlike compensation-based approaches found in the literature, we instead model the complete materialized view maintenance process as a view maintenance transaction (VM_Transaction). This way, the anomaly problem can be rephrased as the serializability of VM_Transactions. To achieve VM_Transaction serializability, we propose a multiversion concurrency control algorithm, called TxnWrap, which is shown to be the appropriate design for loosely coupled environments with autonomous data sources. TxnWrap is complementary to the maintenance algorithms proposed in the literature, since it removes concurrency issues from consideration allowing the designer to focus on the maintenance logic. We show several optimizations of TxnWrap, in particular, (1) space optimizations on versioned data materialization and (2) parallel maintenance scheduling. With these optimizations, TxnWrap even outperforms state-of-the-art view maintenance solutions in terms of refresh time. Further, several design choices of TxnWrap are studied each having its respective advantages for certain environmental settings. A correctness proof based on transaction theory for TxnWrap is also provided. Last, we have implemented TxnWrap. The experimental results confirm that TxnWrap achieves predictable performance under a varying rate of concurrency.

Skip Supplemental Material Section

Supplemental Material

References

  1. Agrawal, D., Abbadi, A. E., Singh, A., and Yurek, T. 1997. Efficient view maintenance at data warehouses. In Proceedings of SIGMOD. 417--427. Google ScholarGoogle Scholar
  2. Agrawal, D. and Sengupta, S. 1989. Modular synchronization in multiversion databases. In Proceedings of SIGMOD. 408--417. Google ScholarGoogle Scholar
  3. Bernstein, P. A., Hadzilacos, V., and Goodman, N. 1987. Concurrency Control and Recovery in Database System. Addison-Wesley, Reading, M.A. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Chan, A. and Gray, R. 1985. Implementing distributed read-only transactions. IEEE Trans. on Softw. Eng. 11, 205--212.Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Chen, J., Chen, S., and Rundensteiner, E. A. 2002. A transactional model for data warehouse maintenance. In Proceedings of the Conference on Conceptual Modeling. 247--262. Google ScholarGoogle Scholar
  6. Chen, J., Zhang, X., Chen, S., Andreas, K., and Rundensteiner, E. A. 2001. DyDa: Data warehouse maintenance under fully concurrent environments. In Proceedings of SIGMOD Demo Session. 619. Google ScholarGoogle Scholar
  7. Chen, S., Chen, J., Zhang, X., and Rundensteiner, E. A. 2004. Detection and correction of conflicting source updates for view maintenace. In Proceedings of ICDE. 436--448. Google ScholarGoogle Scholar
  8. Colby, L. S., Griffin, T., Libkin, L., Mumick, I. S., and Trickey, H. 1996. Algorithms for deferred view maintenance. In Proceedings of SIGMOD. 469--480. Google ScholarGoogle Scholar
  9. Gray, J. and Reuter, A. 1992. Transaction Processing: Concepts and Techniques. Morgan Kaufmann, San Francisco, CA. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Gupta, A. and Mumick, I. 1995. Maintenance of materialized views: Problems, techniques, and applications. IEEE Data Eng. Bull. 18, 2, 3--19.Google ScholarGoogle Scholar
  11. Gupta, A., Mumick, I., and Ross, K. 1995. Adapting materialized views after redefinition. In Proceedings of SIGMOD. 211--222. Google ScholarGoogle Scholar
  12. Lee, A. M., Nica, A., and Rundensteiner, E. A. 2002. The EVE approach: View synchronization in dynamic distributed environments. IEEE Trans. Knowl. Data Eng. 14, 5, 931--954. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Liu, B., Chen, S., and Rundensteiner, E. A. 2002a. A transactional approach to parallel data warehouse maintenance. In Proceedings of DaWaK. 307--317. Google ScholarGoogle Scholar
  14. Liu, B., Chen, S., and Rundensteiner, E. A. 2002b. Batch data warehouse maintenance in dynamic environments. In Proceedings of CIKM. 68--75. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Lomet, D. B. and Salzberg, B. 1989. Access methods for multiversion data. In Proceedings of SIGMOD. 315--324. Google ScholarGoogle Scholar
  16. Lu, J. J., Moerkotte, G., Schue, J., and Subrahmanian, V. S. 1995. Efficient maintenance of materialized mediated views. In Proceedings of SIGMOD. 340--351. Google ScholarGoogle Scholar
  17. Madhavan, J., Bernstein, P. A., and Rahm, E. 2001. Generic schema matching with cupid. In Proceedings of VLDB. 49--58. Google ScholarGoogle Scholar
  18. Marche, S. 1993. Measuring the stability of data models. European J. Inform. Syst. 2, 1, 37--47.Google ScholarGoogle ScholarCross RefCross Ref
  19. Miller, R. J., Haas, L. M., and Hernández, M. A. 2000. Schema mapping as query discovery. In Proceedings of VLDB. 77--88. Google ScholarGoogle Scholar
  20. Mohan, C., Pirahesh, H., and Lorie, R. 1992. Efficient and flexible methods for transient versioning of records to avoid locking by read-only transactions. In Proceedings of SIGMOD. 124--133. Google ScholarGoogle Scholar
  21. Nica, A., Lee, A. J., and Rundensteiner, E. A. 1998. The CVS algorithm for view synchronization in evolvable large-scale information systems. In Proceedings of EDBT. 359--373. Google ScholarGoogle Scholar
  22. Nica, A. and Rundensteiner, E. A. 1999. View maintenance after view synchronization. In Proceedings of the Conference on International Database Engineering and Applications. 213--215. Google ScholarGoogle Scholar
  23. Quass, D. Gupta, A. Mumick, I. S., and Widom, J. 1996. Making view self-maintainable for data warehousing. In Proceedings of the Conference on Parallel and Distributed Information Systems. 158--169. Google ScholarGoogle Scholar
  24. Quass, D. and Widom, J. 1997. On-line warehouse view maintenance. In Proceedings of SIGMOD. 393--400. Google ScholarGoogle Scholar
  25. Salem, K., Beyer, K. S., Cochrane, R., and Lindsay, B. G. 2000. How to roll a join: Asynchronous incremental view maintenance. In Proceedings of SIGMOD. 129--140. Google ScholarGoogle Scholar
  26. Sjoberg, D. 1993. Quantifying schema evolution. Inform. Softw. Tech. 35, 1, 35--54.Google ScholarGoogle ScholarCross RefCross Ref
  27. Varde, A. S. and Rundensteiner, E. A. 2002. MEDWRAP: Consistent view maintenance over distributed multi-relation sources. In Proceedings of DEXA. 341--350. Google ScholarGoogle Scholar
  28. Velegrakis, Y., Miller, R. J., and Popa, L. 2003. Mapping adaptation under evolving schemas. In Proceedings of VLDB. 584--595. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Widom, J. 1995. Research problems in data warehousing. In Proceedings of CIKM. 25--30. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Zaniolo, C., Ceri, S., Faloursos, C., Snodgrass, R. T., Subrahmanian, V. S., and Zicari, R. 1997. Advanced Database Systems. Morgan Kaufmann, San Francisco, CA. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. Zhang, X., Ding, L., and Rundensteiner, E. A. 2004. Parallel multi-source view maintenance. VLDB J. 13, 1, 22--48. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. Zhang, X. and Rundensteiner, E. A. 2000. DyDa: Dynamic data warehouse maintenance in a fully concurrent environment. In Proceedings of DaWaK. 94--103. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. Zhuge, Y., Gracía-Molina, H., Hammer, J., and Widom, J. 1995. View maintenance in a warehousing environment. In Proceedings of SIGMOD. 316--327. Google ScholarGoogle Scholar
  34. Zhuge, Y., Gracía-Molina, H., and Wiener, J. L. 1996. The strobe algorithms for multi-source warehouse consistency. In Proceedings of the Conference on Parallel and Distributed Information Systems. 146--157. Google ScholarGoogle Scholar

Index Terms

  1. Multiversion-based view maintenance over distributed data sources

            Recommendations

            Reviews

            Elizabeth A. Unger

            The integration of information from many distributed databases is a challenge of this decade. Success in this endeavor will determine the ability of many organizations to continue in the future. This paper offers one approach to meeting the various challenges of current distributed data organization architectures, based on a model for the complete materialized view maintenance process, by expressing it as a view maintenance problem, and reducing the transaction serialization space. The paper is well written, provides a solution model for the problem of maintaining distributed views of a conceptual database, accounts for differences in the schemas, and describes a physical implementation in terms of its performance against conventional view maintenance algorithms. The single processor algorithm implementation is supplemented by a parallel processor solution, and formal correctness proofs are provided. The major problems of data update and schema change anomalies are shown to be solvable, by reformulating the maintenance problem of data as a virtual transaction model. Reformulated transactions are mapped from the distributed anomalies to the problem of serializability of transactions. Serializability theory provides the formal system in which view maintenance, view adaptation, and view synchronization can be solved for the materialized view maintenance problem. In essence, the concurrency problems of multiple updates for a record value are no longer a component of the problem. This enables simplification of the problem, and also of the conceptualization of it. A pivotal idea for this is the introduction of a local identifier within each data source's scope for version management, and global identifiers for the materialized view. These two types of identifiers are related, in that the global identifier is composed from a vector of the local identifiers. The authors bring an interesting new perspective to the challenge of providing users with a consistent view of the data of an enterprise, by approaching the complex view maintenance problem through the reconceptualization of the transactions into a serialized order. This may or may not be the ultimate solution for the overall problem, but approaching it with this perspective is very thought provoking. This paper is recommended to those interested in integrating distributed databases by providing a view of them. Online Computing Reviews Service

            Access critical reviews of Computing literature here

            Become a reviewer for Computing Reviews.

            Comments

            Login options

            Check if you have access through your login credentials or your institution to get full access on this article.

            Sign in

            Full Access

            • Published in

              cover image ACM Transactions on Database Systems
              ACM Transactions on Database Systems  Volume 29, Issue 4
              December 2004
              250 pages
              ISSN:0362-5915
              EISSN:1557-4644
              DOI:10.1145/1042046
              Issue’s Table of Contents

              Copyright © 2004 ACM

              Publisher

              Association for Computing Machinery

              New York, NY, United States

              Publication History

              • Published: 12 December 2004
              Published in tods Volume 29, Issue 4

              Permissions

              Request permissions about this article.

              Request Permissions

              Check for updates

              Qualifiers

              • article

            PDF Format

            View or Download as a PDF file.

            PDF

            eReader

            View online with eReader.

            eReader