Exploiting Web Document Structure to Improve Storage Management in Proxy Caches

  • Conference paper
High Performance Computing — HiPC 2002 (HiPC 2002)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2552))

Abstract

Proxy caches are essential for improving the performance of the World Wide Web and reducing user-perceived latency. In this paper, we propose a new Web-object-based policy for managing the storage system of a proxy cache, built on two techniques. The first prefetches the related files belonging to a Web object from disk into main memory, so that most of those files can be served from main memory instead of the proxy disk. The second stores the members of a Web object in contiguous disk blocks to reduce disk access time, which in turn reduces the disk response time. We use trace-driven simulations to study the performance improvements obtainable with these two techniques.
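
The two techniques can be illustrated with a toy model. The sketch below is not the paper's simulator or policy; it assumes a Web object is a page together with its embedded files, treats the disk as a list of one-file blocks, and uses hypothetical names (`ToyProxyStore`, `store_object`, `get`, `disk_reads`) chosen only for illustration. Members of an object are written to consecutive blocks, and a miss on any member reads the whole extent in one access and places every sibling in memory.

```python
from dataclasses import dataclass, field


@dataclass
class ToyProxyStore:
    """Toy model (illustrative assumption): one cached file per disk block."""
    disk: list = field(default_factory=list)     # block index -> (url, body)
    extents: dict = field(default_factory=dict)  # object id -> (start, length)
    owner: dict = field(default_factory=dict)    # url -> object id
    memory: dict = field(default_factory=dict)   # url -> body (main memory)
    disk_reads: int = 0                          # separate disk accesses so far

    def store_object(self, object_id, members):
        """Write all members of one Web object into consecutive blocks."""
        start = len(self.disk)
        for url, body in members.items():
            self.disk.append((url, body))
            self.owner[url] = object_id
        self.extents[object_id] = (start, len(members))

    def get(self, url):
        """Serve from memory if present; otherwise read the whole extent once."""
        if url in self.memory:
            return self.memory[url]
        start, length = self.extents[self.owner[url]]
        self.disk_reads += 1  # one sequential read covers every member
        for member_url, body in self.disk[start:start + length]:
            self.memory[member_url] = body  # prefetch siblings into memory
        return self.memory[url]


store = ToyProxyStore()
store.store_object("page1", {
    "/index.html": b"<html>",
    "/logo.gif": b"GIF",
    "/style.css": b"css",
})
for url in ("/index.html", "/logo.gif", "/style.css"):
    store.get(url)
print(store.disk_reads)  # 1: the two later requests are served from memory
```

In this model the first request for the page costs one disk access and the requests for its embedded files cost none. A real proxy would also need memory eviction and variable-size files, both of which the sketch omits.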




Copyright information

© 2002 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Abhari, A., Dandamudi, S.P., Majumdar, S. (2002). Exploiting Web Document Structure to Improve Storage Management in Proxy Caches. In: Sahni, S., Prasanna, V.K., Shukla, U. (eds) High Performance Computing — HiPC 2002. HiPC 2002. Lecture Notes in Computer Science, vol 2552. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36265-7_9

  • DOI: https://doi.org/10.1007/3-540-36265-7_9

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-00303-8

  • Online ISBN: 978-3-540-36265-4

  • eBook Packages: Springer Book Archive
