Skip to main content

Abstract

Web traffic measurement and modeling have contributed to understanding the effect of Web traffic on Internet resources since the 1990s. In the past years, a number of new Web features have gained more and more importance, e.g. content delivery networks (CDNs), increased amount of advertisement, personalization, usage tracking, client scripting and Web 2.0 style “mashups”. This paper uses active Web measurements to assess the efficiency of client side caching for modern Web sites, investigating some Web features in detail. As expected, we see that more than 50 % of the average downstream traffic volume is saved when loading a page using client side caching. More unexpected results comprise the actual distribution of cache effectiveness, varying between extreme and no reduction of traffic, the cachability of “Web bugs” and the variance between sites in cachable image pixels and CDN based files.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Alexa: top 1,000,000 sites updated daily, http://s3.amazonaws.com/alexa-static/top-1m.csv.zip (visited February 18, 2009)

  2. OIX Route Views, http://archive.routeviews.org/oix-route-views/2008.05/ (visited 28 May 2008)

  3. Akamai home page, http://www.akamai.com/ (visited September 26, 2008)

  4. Balamash, A., Krunz, M.: Performance Analysis of a Client-Side Caching/Prefetching System for Web Traffic. Computer Networks 51(13), 3673–3692 (2007)

    Article  MATH  Google Scholar 

  5. Barford, P., Crovella, M.: Measuring Web Performance in the Wide Area. ACM Performance Evaluation Review 27(2), 37–48 (1999)

    Article  Google Scholar 

  6. Bent, L., Rabinovich, M., Voelker, G.M., Xiao, Z.: Characterization of a Large Web Site Population with Implications for Content Delivery. In: Proc. WWW 2004, New York, NY, USA (May 2004)

    Google Scholar 

  7. Bolot, J.C.: End-to-end packet delay and loss behavior in the internet. In: SIGCOMM 1993: Conference proceedings on Communications architectures, protocols and applications, pp. 289–298. ACM, New York (1993)

    Chapter  Google Scholar 

  8. Charzinski, J.: Locality Analysis of Today’s Internet Web Services. In: Proc. 19th ITC Specialist Seminar, Berlin, Germany (October 2008)

    Google Scholar 

  9. Charzinski, J.: Traffic, structure and locality characteristics of the Web’s most popular services’ home pages. In: Proc. KiVS 2009, Kassel, Germany (March 2009)

    Google Scholar 

  10. Duarte, F., Mattos, B., Bestavros, A., Almeida, V., Almeida, J.: Traffic Characteristics and Communication Patterns in Blogosphere. In: Proc. international conf. on Weblogs and Social Media (2007)

    Google Scholar 

  11. Duska, B.M., Marwood, D., Feeley, M.J.: The Measured Access Characteristics of World-Wide-Web Client Proxy Caches. In: Proc. Usenix Symp. on Internet Techn. and Systems, Monterey, CA (December 1997)

    Google Scholar 

  12. Eden, A.N., Joh, B.W., Mudge, T.: Web latency reduction via client-side prefetching. In: Proc. IEEE Int. Symp. on Perf. Analysis of Systems and Softw., ISPASS 2000, Austin, TX, USA, pp. 193–200 (2000)

    Google Scholar 

  13. Greg Barish, K.O.: World Wide Web Caching: Trends and Techniques. IEEE Communications Magazine 38, 178–184 (2000), http://citeseer.ist.psu.edu/454956.html

    Article  Google Scholar 

  14. Jackson, C., Boneh, D., Bortz, A., Mitchell, J.C.: Protecting Browser State from Web Privacy Attacks. In: Proc. WWW 2006, Edinburgh, Scotland (May 2006)

    Google Scholar 

  15. Kiciman, E., Livshits, B.: AjaxScope: A Platform for Remotely Monitoring the Client-Side Behavior of Web 2.0 Applications. In: Proc. SOSP 2007, Stevenson, WA, USA (October 2007)

    Google Scholar 

  16. Krishnamurthy, B., Wills, C.E.: Analyzing factors that influence end-to-end Web performance. Computer Networks 33(1), 17–32 (2000)

    Article  Google Scholar 

  17. Lightfoot, C.: driftnet, http://www.ex-parrot.com/~chris/driftnet/ (visited January 27, 2009)

  18. Mahanti, A., Williamson, C., Eager, D.: Traffic Analysis of a Web Proxy Caching Hierarchy. IEEE Network, 16–23 (May/June 2000)

    Google Scholar 

  19. McCanne, S., Leres, C., Jacobson, V.: tcpdump. LBNL Network Research Group, ftp://ftp.ee.lbl.gov/tcpdump.tar.Z

  20. Rabinovich, M., Spatschek, O.: Web Caching and Replication. Addison-Wesley, Reading (2002)

    Google Scholar 

  21. Saroiu, S., Gummadi, K.P., Dunn, R.J., Gribble, S.D., Levy, H.M.: An Analysis of Internet Content Delivery Systems. In: Proc. Usenix OSDI 2002, Boston, MA, USA, December 2002, pp. 315–328 (2002)

    Google Scholar 

  22. Sherman, A., Lisiecki, P.A., Berkheimer, A., Wein, J.: ACMS: The Akamai configuration management system. In: Proc. USENIX NSDI 2005 (May 2005)

    Google Scholar 

  23. Williams, A., Arlitt, M., Williamson, C., Barker, K.: Web workload characterization: Ten years later. Springer, Heidelberg (2005)

    Google Scholar 

  24. Yee, R.: Pro Web 2.0 Mashups: Remixing Data and Web Services. Apress (2008)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Charzinski, J. (2010). Traffic Properties, Client Side Cachability and CDN Usage of Popular Web Sites. In: Müller-Clostermann, B., Echtle, K., Rathgeb, E.P. (eds) Measurement, Modelling, and Evaluation of Computing Systems and Dependability and Fault Tolerance. MMB&DFT 2010. Lecture Notes in Computer Science, vol 5987. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-12104-3_12

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-12104-3_12

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-12103-6

  • Online ISBN: 978-3-642-12104-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics