Abstract.
Personalization, advertising, and the sheer volume of online data generate a staggering amount of dynamic Web content. In addition to Web caching, view materialization has been shown to accelerate the generation of dynamic Web content. View materialization is an attractive solution as it decouples the serving of access requests from the handling of updates. In the context of the Web, selecting which views to materialize must be decided online and needs to consider both performance and data freshness, which we refer to as the online view selection problem. In this paper, we define data freshness metrics, provide an adaptive algorithm for the online view selection problem that is based on user-specified data freshness requirements, and present experimental results. Furthermore, we examine alternative metrics for data freshness and extend our proposed algorithm to handle multiple users and alternative definitions of data freshness.
Similar content being viewed by others
References
Altinel M, Bornhovd C, Krishnamurthy S, Mohan C, Pirahesh H, Reinwald B (2003) Cache tables: paving the way for an adaptive database cache. In: Proceedings of the 29th conference on very large data bases (VLDB), Berlin, Germany, pp 718-729
Bornovd C, Altinel M, Krishnamurthy S, Mohan C, Pirahesh H, Reinwald B (2003) DBCache: Middle-tier database caching for highly scalable e-business architectures. In: Proceedings of ACM SIGMOD, San Diego, p 662
Breslau L, Cao P, Fan L, Phillips G, Shenker S (1999) Web caching and Zipf-like distributions: evidence and implications. In: Proceedings of INFOCOM 1999, New York, pp 126-134
Carney D, Cetintemel U, Cherniack M, Convey C, Lee S, Seidman G, Stonebraker M, Tatbul N, Zdonik S (2002) Monitoring streams: a new class of data management applications. In: Proceedings of the 28th international conference on very large data bases (VLDB), Hong Kong, pp 215-226
Challenger J, Iyengar A, Witting K, Ferstat C, Reed P (2000) A publishing system for efficiently creating dynamic Web content. In: Proceedings of INFOCOM 2000, Tel Aviv, Israel, pp 844-853
Cho J, Garcia-Molina H (2003) Effective page refresh policies for Web crawlers. ACM Trans Database Sys 28(4):390-426
Datta A, Dutta K, Thomas HM, VanderMeer DE, Ramamritham K, Fishman D (2001) A comparative study of alternative middle tier caching solutions to support dynamic Web content acceleration. In: Proceedings of the 27th international conference on very large data bases (VLDB), Rome, pp 667-670
Datta A, Dutta K, Thomas HM, VanderMeer DE, Suresha, Ramamritham K (2002) Proxy-based acceleration of dynamically generated content on the World Wide Web: an approach and implementation. In: Proceedings of ACM SIGMOD, Madison, WI, pp 97-108
Gupta A, Mumick IS (1999) Materialized views: techniques, implementations, and applications, MIT Press, Cambridge, MA
Gupta H (1997) Selection of views to materialize in a data warehouse. In: Proceedings of ICDT, Delphi, Greece, pp 98-112
Iyengar A, Challenger J (1997) Improving Web server performance by caching dynamic data. In: Proceedings of the USENIX symposium on Internet technologies and systems, Monterey, CA
Jacobson V (1988) Congestion avoidance and control. In: Proceedings of ACM SIGCOMM, Stanford, CA, pp 314-329
Jain R (1991) The art of computer systems performance analysis. Wiley, New York
Kotidis Y, Roussopoulos N (1999) DynaMat: A dynamic view management system for data warehouses. In: Proceedings of ACM SIGMOD, Philadelphia, pp 371-382
Larson P, Goldstein J, Zhou J (2003) Transparent mid-tier database caching in SQL Server. In: Proceedings of ACM SIGMOD, San Diego, p 661
Larson P, Goldstein J, Zhou J (2004) Transparent mid-tier database caching in SQL Server. In: Proceedings of the 20th international conference on data engineering, Boston, pp 177-189
Luo Q, Krishnamurthy S, Mohan C, Pirahesh H, Woo H, Lindsay BG, Naughton JF (2002) Middle-tier database caching for e-business. In: Proceedings of ACM SIGMOD, Madison, WI, p 662
Labrinidis A, Roussopoulos N (2000) WebView materialization. In: Proceedings of ACM SIGMOD, Dallas, TX, pp 367-378
Labrinidis A, Roussopoulos N (2001) Update propagation strategies for improving the quality of data on the Web. In: Proceedings of the 27th international conference on very large data bases (VLDB), Rome, pp 391-400
Ligoudistianos S, Sellis TK, Theodoratos D, Vassiliou Y (1999) Heuristic algorithms for designing a data warehouse with SPJ views. In: Proceedings of the 1st international conference on data warehousing and knowledge discovery (DaWaK), Florence, Italy, pp 96-105
Mistry H, Roy P, Sudarshan S, Ramamritham K (2001) Materialized view selection and maintenance using multi-query optimization. In: Proceedings of ACM SIGMOD, Santa Barbara, CA, pp 307-318
Olston C, Widom J (2002) Best-effort cache synchronization with source cooperation. In: Proceedings of ACM SIGMOD, Madison, WI, pp 73-84
Rohm U, Bohm K, Schek H-J, Schuldt H (2002) FAS - A freshness-sensitive coordination middleware for a cluster of OLAP components. In: Proceedings of the 28th international conference on very large data bases (VLDB), Hong Kong
Roussopoulos N (1982) View indexing in relational databases. ACM Trans Database Sys 7(2):258-290
Shah S, Dharmarajan S, Ramamritham K (2003) An efficient and resilient approach to filtering and disseminating streaming data. In: Proceedings of the 29th international conference on very large data bases (VLDB), Berlin, Germany, pp 57-68
Sellis T (1988) Intelligent caching and indexing techniques for relational database systems, Inf Sys 13(2):175-185
Shah S, Ramamritham K, Shenoy PJ (2002) Maintaining coherency of dynamic data in cooperating repositories. In: Proceedings of the 28th international conference on very large data bases (VLDB), Hong Kong, pp 526-537
Scheuermann P, Shim J, Vingralek R (1996) WATCHMAN: A data warehouse intelligent cache manager. In: Proceedings of the international conference on very large data bases, Bombay, India, pp 51-62
Theodoratos D, Ligoudistianos S, Sellis TK (2001) View selection for designing the global data warehouse. Data Knowl Eng 39(3):219-240
Yagoub K, Florescu D, Issarny V, Valduriez P (2000) Caching strategies for data-intensive Web sites. In: Proceedings of the 26th international conference on very large data bases (VLDB), Cairo, Egypt, pp 188-199
Yu H, Vahdat A (2000) Design and evaluation of a continuous consistency model for replicated services. In: Proceedings of the 4th symposium on operating systems design and implementation, pp 305-318
Author information
Authors and Affiliations
Corresponding author
Additional information
Received: 17 January 2004, Accepted: 23 March 2004, Published online: 19 August 2004
Edited by: S. Abiteboul
Rights and permissions
About this article
Cite this article
Labrinidis, A., Roussopoulos, N. Exploring the tradeoff between performance and data freshness in database-driven Web servers. VLDB 13, 240–255 (2004). https://doi.org/10.1007/s00778-004-0131-7
Issue Date:
DOI: https://doi.org/10.1007/s00778-004-0131-7