Abstract
The issues of performance, response efficiency and data consistency are among the most important for data intensive Web sites. In order to deal with these issues we analyze and evaluate a hybrid run-time management policy that may be applied to data intensive Web sites. Our research relies on the performance evaluation of experimental client/server configurations. We propose a hybrid Web site run-time management policy that may apply to different Web site request patterns and data update frequencies. A run-time management policy is viewed as a Web page materialization policy that can adapt to different conditions at run-time. We define a concept that we have named the Compromise Factor (CF), to achieve the relationship between current server conditions and the materialization policy. The issue of Web and database data consistency is the driving force behind our approach. In some cases though, we prove that certain compromises to consistency can be beneficial to Web server performance and at the same time be unnoticeable to users. We first present a general a comparative cost model for the hybrid management policy and three other related and popular Web management policies. We then evaluate the performance of all the approaches. The results of our evaluation show that the concept of the CF may be beneficial to Web servers in terms of performance.
Similar content being viewed by others
References
M. Arlitt, R. Friedrich, and T. Jin, “Performance evaluation of Web proxy cache replacement policies,” in Proceedings of Performance Tools'98, Lecture Notes in Computer Science, Vol. 1469, 1998, pp. 193-206.
E. Baralis, S Paraboschi, and E. Teniente, “Materialized views selection in a multidimensional database,” in VLDB, 1997.
C. Bouras and A. Konidaris, “An algorithm for handling hybrid run-time management policies in data intensive Web sites,” work in progress.
L. Breslau, P. Cao, L. Fan, G. Phillips, and S. Shenker, “On the implications of Zipf's law forWeb caching,” in Proceedings of IEEE INFOCOM'99, New York, March 1999.
J. Challenger, P. Dantzig, D. Dias, and N. Mills, “Engineering highly accessed Web sites for performance,” Web Engineering, Y. Deshpande and S. Murugesan, Eds., Springer-Verlag.
J. Challenger, A. Iyengar, and P. Dantzig, “A scalable system for consistently caching dynamic Web data,” in Proceedings of IEEE INFOCOM'99, New York, NY, March 1999.
J. Challenger, A. Iyengar, and K. Witting, “A publishing system for efficiently creating dynamic Web content,” in Proceedings of INFOCOM 2000, Tel Aviv, Israel, March 26-30, 2000.
D. Florescu, M. Fernandez, J. Kang, A. Levy, and D. Suciu, “Catching the boat with Strudel: Experience with a Web-site management system,” in Proceedings of ACM SIGMOD Conference on Management of Data, Seattle, WA, 1998.
D. Florescu, A. Levy, and A. Mendelzon, “Database techniques for the World-Wide Web: A survey,” SIG-MOD Record 27(3), 1998, 59-74.
D. Florescu, A. Levy, D. Saciu, and K. Yakoub, “Optimization of run-time management of data intensive Web sites,” in Proceedings of the 25th VLDB Conference, Edinburgh, Scotland, 1999.
H. Gupta, “Selection of views to materialize in a data warehouse” in ICDT, 1997.
H. Gupta and I. S. Mumick, “Selection of views to materialize under a maintenance cost constraint,” in ICDT, 1999.
M. A. Habib and M. Abrams, “Analysis of sources of latency in downloading Web pages,” in WebNet 2000, San Antonio, TX, October 30-November 4, 2000.
K. Yagoub, D. Florescu, V. Issarny, and P. Valduriez, “Caching strategies for data-intensive Web sites,” in Proceedings of the International Conference on Very Large Data Bases (VLDB 2000), Cairo, Egypt, September 10-14, 2000.
A. Iyengar and J. Challenger, “Improving Web server performance by caching dynamic data,” in Proceedings of the USENIX Symposium on Internet Technologies and Systems, Monterey, CA, December 1997.
A. Lambrinidis and N. Roussopoulos, “WebView Materialization,” in Proceedings of the ACM SIGMOD 2000, Dallas, TX, May 2000.
Y. Li and K. Lu, “Performance Issues of a Web Database,” in Proceedings of Eleventh International Workshop on Database and Expert Systems Applications, Greenwich, London, UK, 4-8 September 2000.
B. Liu, “Characterizing Web response time,” Master of Science in Computer Science Thesis, Virginia Polytechnic Institute and State University, 1998.
G. Mecca, P. Atzeni, A. Masci, P. Merialdo, and G. Sindoni, “The Araneus Web-based, management system,” in Exhibits Program of ACM SIGMOD'98, 1998.
C. Olston and J. Widom, “Offering a precision-performance tradeoff for aggregation queries over replicated data,” in Proceedings of the 26th International Conference on Very Large Data Bases, Cairo, Egypt, September 10-14, 2000.
T. Palpanas and B. Krishnamurthy, “Reducing retrieval latencies in the Web: the past, the present, and the future,” Technical Report CSRG-378, Graduate Department of Computer Science, University of Toronto.
B. Proll, H. Starck, W. Retschitzegger, and H. Sighart, “Ready for prime time pre-generation of Web pages in TIScover,” in WebDB'99, Philadelphia, PA, June 3-4, 1999.
E. A. Rundensteiner, A. Koeller, and X. Zhang, “Maintaining data warehouses over changing information sources,” Communications of the ACM 43(6), June 2000.
The ARANEUS project home page, http://www.dia.uniroma3.it/Araneus/
J. Wang, “A survey of Web caching schemes for the Internet,” ACM Computer Communication Review 29(5), October 1999, 36-46.
C. E. Wills and M. Mikhailov, “Studying the impact of more complete server information on Web caching,” 5th International Web Caching and Content Delivery Workshop, Lisbon, Portugal, May 22-24, 2000.
G. K. Zipf, Human Behavior and the Principle of Least Effort, Addison-Wesley, 1949.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Bouras, C., Konidaris, A. Performance Evaluation of a Hybrid Run-Time Management Policy for Data Intensive Web Sites. World Wide Web 6, 23–47 (2003). https://doi.org/10.1023/A:1022356415823
Issue Date:
DOI: https://doi.org/10.1023/A:1022356415823