Web Caching Replacement Algorithm Based on Web Usage Data

New Generation Computing

Abstract

Web caching is one of the fundamental techniques for reducing bandwidth usage and download time while browsing the World Wide Web. In this research, we improve web caching by combining the results of web usage mining with traditional web caching techniques. A web cache replacement policy selects which object should be removed from the cache when the cache is full and which new object should be put into the cache. Several attributes are used for selecting the object to be removed, such as the size of the object, the number of times the object was used, and the time when the object was added to the cache. However, the flaw in these previous approaches is that each object is treated separately, without considering the relations among objects. We have developed a system that can record users’ browsing behavior at the resource level. By using information gathered from this system, we can improve the web cache replacement policy so that the number of cache hits increases, resulting in a faster web browsing experience and lower bandwidth consumption, especially in environments with limited cache storage, such as smartphones.
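
To make the idea of relation-aware replacement concrete, the following is a minimal sketch and not the algorithm from the article, whose details are not given in the abstract. It assumes a hypothetical `RelationAwareCache` class, a byte-based capacity, and a list of related URL pairs standing in for the output of web usage mining. Eviction is plain least-recently-used, except that objects related to the incoming object are evicted last.

```python
class RelationAwareCache:
    """Toy size-bounded cache (hypothetical, for illustration only).

    Eviction is least-recently-used, but objects related to the
    incoming object are kept as long as an unrelated one can go.
    """

    def __init__(self, capacity_bytes, related_pairs=()):
        self.capacity = capacity_bytes
        self.used = 0
        self.clock = 0        # logical time, bumped on every access
        self.entries = {}     # url -> (size, last_access_time)
        # Assumed input: pairs of URLs that usage data shows are requested together.
        self.relations = {}
        for a, b in related_pairs:
            self.relations.setdefault(a, set()).add(b)
            self.relations.setdefault(b, set()).add(a)

    def get(self, url):
        """Return True on a cache hit and refresh the object's recency."""
        self.clock += 1
        if url in self.entries:
            size, _ = self.entries[url]
            self.entries[url] = (size, self.clock)
            return True
        return False

    def put(self, url, size):
        """Insert an object, evicting others until it fits."""
        if size > self.capacity:
            return  # object can never fit; do not cache it
        self.clock += 1
        if url in self.entries:
            self.used -= self.entries.pop(url)[0]
        while self.used + size > self.capacity:
            victim = self._pick_victim(url)
            self.used -= self.entries.pop(victim)[0]
        self.entries[url] = (size, self.clock)
        self.used += size

    def _pick_victim(self, incoming):
        related = self.relations.get(incoming, set())
        # Unrelated objects sort first (False < True), then oldest access first.
        return min(self.entries, key=lambda u: (u in related, self.entries[u][1]))


cache = RelationAwareCache(30, related_pairs=[("page.html", "style.css")])
cache.put("style.css", 10)
cache.put("other.png", 10)
cache.put("x.js", 10)
cache.put("page.html", 10)  # cache is full: evicts other.png, not the older style.css
```

Plain LRU would have evicted `style.css` here because it is the oldest entry. The sketch keeps it because the assumed usage data links it to `page.html`, which is the kind of relation among objects that the abstract says earlier policies ignore.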



Author information

Corresponding author

Correspondence to Sorn Jarukasemratana.

About this article

Cite this article

Jarukasemratana, S., Murata, T. Web Caching Replacement Algorithm Based on Web Usage Data. New Gener. Comput. 31, 311–329 (2013). https://doi.org/10.1007/s00354-013-0404-z


  • DOI: https://doi.org/10.1007/s00354-013-0404-z
