Skip to main content
Log in

Exploring the tradeoff between performance and data freshness in database-driven Web servers

  • Published:
The VLDB Journal Aims and scope Submit manuscript

Abstract.

Personalization, advertising, and the sheer volume of online data generate a staggering amount of dynamic Web content. In addition to Web caching, view materialization has been shown to accelerate the generation of dynamic Web content. View materialization is an attractive solution as it decouples the serving of access requests from the handling of updates. In the context of the Web, selecting which views to materialize must be decided online and needs to consider both performance and data freshness, which we refer to as the online view selection problem. In this paper, we define data freshness metrics, provide an adaptive algorithm for the online view selection problem that is based on user-specified data freshness requirements, and present experimental results. Furthermore, we examine alternative metrics for data freshness and extend our proposed algorithm to handle multiple users and alternative definitions of data freshness.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Altinel M, Bornhovd C, Krishnamurthy S, Mohan C, Pirahesh H, Reinwald B (2003) Cache tables: paving the way for an adaptive database cache. In: Proceedings of the 29th conference on very large data bases (VLDB), Berlin, Germany, pp 718-729

  2. Bornovd C, Altinel M, Krishnamurthy S, Mohan C, Pirahesh H, Reinwald B (2003) DBCache: Middle-tier database caching for highly scalable e-business architectures. In: Proceedings of ACM SIGMOD, San Diego, p 662

  3. Breslau L, Cao P, Fan L, Phillips G, Shenker S (1999) Web caching and Zipf-like distributions: evidence and implications. In: Proceedings of INFOCOM 1999, New York, pp 126-134

  4. Carney D, Cetintemel U, Cherniack M, Convey C, Lee S, Seidman G, Stonebraker M, Tatbul N, Zdonik S (2002) Monitoring streams: a new class of data management applications. In: Proceedings of the 28th international conference on very large data bases (VLDB), Hong Kong, pp 215-226

  5. Challenger J, Iyengar A, Witting K, Ferstat C, Reed P (2000) A publishing system for efficiently creating dynamic Web content. In: Proceedings of INFOCOM 2000, Tel Aviv, Israel, pp 844-853

  6. Cho J, Garcia-Molina H (2003) Effective page refresh policies for Web crawlers. ACM Trans Database Sys 28(4):390-426

    Article  Google Scholar 

  7. Datta A, Dutta K, Thomas HM, VanderMeer DE, Ramamritham K, Fishman D (2001) A comparative study of alternative middle tier caching solutions to support dynamic Web content acceleration. In: Proceedings of the 27th international conference on very large data bases (VLDB), Rome, pp 667-670

  8. Datta A, Dutta K, Thomas HM, VanderMeer DE, Suresha, Ramamritham K (2002) Proxy-based acceleration of dynamically generated content on the World Wide Web: an approach and implementation. In: Proceedings of ACM SIGMOD, Madison, WI, pp 97-108

  9. Gupta A, Mumick IS (1999) Materialized views: techniques, implementations, and applications, MIT Press, Cambridge, MA

    Google Scholar 

  10. Gupta H (1997) Selection of views to materialize in a data warehouse. In: Proceedings of ICDT, Delphi, Greece, pp 98-112

  11. Iyengar A, Challenger J (1997) Improving Web server performance by caching dynamic data. In: Proceedings of the USENIX symposium on Internet technologies and systems, Monterey, CA

  12. Jacobson V (1988) Congestion avoidance and control. In: Proceedings of ACM SIGCOMM, Stanford, CA, pp 314-329

  13. Jain R (1991) The art of computer systems performance analysis. Wiley, New York

  14. Kotidis Y, Roussopoulos N (1999) DynaMat: A dynamic view management system for data warehouses. In: Proceedings of ACM SIGMOD, Philadelphia, pp 371-382

  15. Larson P, Goldstein J, Zhou J (2003) Transparent mid-tier database caching in SQL Server. In: Proceedings of ACM SIGMOD, San Diego, p 661

  16. Larson P, Goldstein J, Zhou J (2004) Transparent mid-tier database caching in SQL Server. In: Proceedings of the 20th international conference on data engineering, Boston, pp 177-189

  17. Luo Q, Krishnamurthy S, Mohan C, Pirahesh H, Woo H, Lindsay BG, Naughton JF (2002) Middle-tier database caching for e-business. In: Proceedings of ACM SIGMOD, Madison, WI, p 662

  18. Labrinidis A, Roussopoulos N (2000) WebView materialization. In: Proceedings of ACM SIGMOD, Dallas, TX, pp 367-378

  19. Labrinidis A, Roussopoulos N (2001) Update propagation strategies for improving the quality of data on the Web. In: Proceedings of the 27th international conference on very large data bases (VLDB), Rome, pp 391-400

  20. Ligoudistianos S, Sellis TK, Theodoratos D, Vassiliou Y (1999) Heuristic algorithms for designing a data warehouse with SPJ views. In: Proceedings of the 1st international conference on data warehousing and knowledge discovery (DaWaK), Florence, Italy, pp 96-105

  21. Mistry H, Roy P, Sudarshan S, Ramamritham K (2001) Materialized view selection and maintenance using multi-query optimization. In: Proceedings of ACM SIGMOD, Santa Barbara, CA, pp 307-318

  22. Olston C, Widom J (2002) Best-effort cache synchronization with source cooperation. In: Proceedings of ACM SIGMOD, Madison, WI, pp 73-84

  23. Rohm U, Bohm K, Schek H-J, Schuldt H (2002) FAS - A freshness-sensitive coordination middleware for a cluster of OLAP components. In: Proceedings of the 28th international conference on very large data bases (VLDB), Hong Kong

  24. Roussopoulos N (1982) View indexing in relational databases. ACM Trans Database Sys 7(2):258-290

    Article  MATH  Google Scholar 

  25. Shah S, Dharmarajan S, Ramamritham K (2003) An efficient and resilient approach to filtering and disseminating streaming data. In: Proceedings of the 29th international conference on very large data bases (VLDB), Berlin, Germany, pp 57-68

  26. Sellis T (1988) Intelligent caching and indexing techniques for relational database systems, Inf Sys 13(2):175-185

    Google Scholar 

  27. Shah S, Ramamritham K, Shenoy PJ (2002) Maintaining coherency of dynamic data in cooperating repositories. In: Proceedings of the 28th international conference on very large data bases (VLDB), Hong Kong, pp 526-537

  28. Scheuermann P, Shim J, Vingralek R (1996) WATCHMAN: A data warehouse intelligent cache manager. In: Proceedings of the international conference on very large data bases, Bombay, India, pp 51-62

  29. Theodoratos D, Ligoudistianos S, Sellis TK (2001) View selection for designing the global data warehouse. Data Knowl Eng 39(3):219-240

    Article  MATH  Google Scholar 

  30. Yagoub K, Florescu D, Issarny V, Valduriez P (2000) Caching strategies for data-intensive Web sites. In: Proceedings of the 26th international conference on very large data bases (VLDB), Cairo, Egypt, pp 188-199

  31. Yu H, Vahdat A (2000) Design and evaluation of a continuous consistency model for replicated services. In: Proceedings of the 4th symposium on operating systems design and implementation, pp 305-318

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Alexandros Labrinidis.

Additional information

Received: 17 January 2004, Accepted: 23 March 2004, Published online: 19 August 2004

Edited by: S. Abiteboul

Rights and permissions

Reprints and permissions

About this article

Cite this article

Labrinidis, A., Roussopoulos, N. Exploring the tradeoff between performance and data freshness in database-driven Web servers. VLDB 13, 240–255 (2004). https://doi.org/10.1007/s00778-004-0131-7

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00778-004-0131-7

Keywords

Navigation