Abstract
A warehouse is a data repository containing integrated information for efficient querying and analysis. Maintaining the consistency of warehouse data is challenging, especially if the data sources are autonomous and views of the data at the warehouse span multiple sources. Transactions containing multiple updates at one or more sources, e.g., batch updates, complicate the consistency problem. In this paper we identify and discuss three fundamental transaction processing scenarios for data warehousing. We define four levels of consistency for warehouse data and present a new family of algorithms, the Strobe family, that maintain consistency as the warehouse is updated, under the various warehousing scenarios. All of the algorithms are incremental and can handle a continuous and overlapping stream of updates from the sources. Our implementation shows that the algorithms are practical and realistic choices for a wide variety of update scenarios.
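To make the consistency problem concrete, the following minimal Python sketch shows how a naive incremental scheme can leave a multi-source view in an incorrect state when updates overlap. It is not one of the Strobe algorithms. The relations `R(a, b)` and `S(b, c)`, the join view, the bag semantics, and the particular interleaving of updates are all assumptions chosen for illustration.

```python
from collections import Counter


def join(r_rows, s_rows):
    """Join R(a, b) with S(b, c) on b; returns a bag (multiset) of (a, b, c)."""
    return Counter(
        (a, b, c)
        for (a, b) in r_rows
        for (b2, c) in s_rows
        if b == b2
    )


def naive_maintenance():
    """Replay two overlapping updates with a naive incremental scheme.

    Hypothetical setup: R lives at source 1, S at source 2, and the
    warehouse materializes the view R join S.
    """
    r = [(1, 2)]        # contents of R at source 1
    s = []              # contents of S at source 2
    view = join(r, s)   # warehouse view, initially empty

    # U1: source 2 inserts (2, 3) into S and notifies the warehouse.
    s.append((2, 3))
    # U2: source 1 inserts (4, 2) into R before it has evaluated the
    # warehouse's query for U1.
    r.append((4, 2))

    # The query for U1 (R join {(2, 3)}) is evaluated on the current R,
    # which already reflects U2.
    view.update(join(r, [(2, 3)]))
    # The warehouse then handles U2 by asking source 2 for {(4, 2)} join S.
    view.update(join([(4, 2)], s))

    # Return the maintained view and the view recomputed from scratch.
    return view, join(r, s)


maintained, correct = naive_maintenance()
print("maintained:", dict(maintained))
print("correct:   ", dict(correct))
```

In this interleaving the tuple `(4, 2, 3)` is added twice, once by each query answer, so the maintained view disagrees with a recomputation from the final source states. Handling such overlapping updates without leaving the warehouse in an invalid state is the kind of problem the abstract describes.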
Cite this article
Zhuge, Y., Garcia-Molina, H. & Wiener, J.L. Consistency Algorithms for Multi-Source Warehouse View Maintenance. Distributed and Parallel Databases 6, 7–40 (1998). https://doi.org/10.1023/A:1008698814840
DOI: https://doi.org/10.1023/A:1008698814840