Web prefetching through efficient prediction by partial matching

World Wide Web

Abstract

In this work we propose a prediction by partial matching (PPM) technique to anticipate and prefetch web pages and files accessed via browsers. The goal is to reduce the delay incurred when loading the web pages and files that users visit. Since the number of visited web pages can be high, tree-based and table-based implementations can be inefficient in terms of representation. We therefore present an efficient way to implement prediction by partial matching as simple searches in the observation sequence. This allows a large number of states, long web page access histories, and higher-order Markov chains at low complexity. Timing evaluations show that the proposed PPM implementation is significantly more efficient than previous implementations. We have enhanced the predictor with a confidence mechanism, implemented as saturating counters, which dynamically classifies web pages as predictable or unpredictable. Predictions are generated selectively, only from web pages classified as predictable, thus improving accuracy. The experiments show that prediction by partial matching of order 4 with a history of 500 web pages is the optimal configuration.
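The two ideas in the abstract, PPM realised as searches in the observation sequence and a saturating-counter confidence filter, can be illustrated with a minimal Python sketch. It is not the authors' implementation. The names `ppm_predict` and `SelectivePredictor`, the 2-bit counter range (`max_count=3`), the `threshold` of 2, and keeping one counter per current page are all assumptions made for illustration; only the order (4) and the history length (500) come from the abstract.

```python
from collections import Counter, defaultdict, deque


def ppm_predict(history, order):
    """Predict the next page by searching `history` for its most recent context.

    Tries the longest context first (up to `order` pages) and falls back to
    shorter contexts when the longer one never occurred earlier in the history.
    """
    seq = list(history)
    for k in range(min(order, len(seq) - 1), 0, -1):
        context = seq[-k:]
        followers = Counter()
        # Scan every earlier position where the same k pages occur and
        # count the page that came right after them.
        for i in range(len(seq) - k):
            if seq[i:i + k] == context:
                followers[seq[i + k]] += 1
        if followers:
            return followers.most_common(1)[0][0]
    return None  # no context of any length has been seen before


class SelectivePredictor:
    """PPM predictor gated by per-page saturating counters (hypothetical design)."""

    def __init__(self, order=4, history_size=500, max_count=3, threshold=2):
        self.order = order
        self.history = deque(maxlen=history_size)  # bounded access history
        self.max_count = max_count                 # assumed counter ceiling
        self.threshold = threshold                 # assumed "predictable" level
        self.confidence = defaultdict(int)         # page -> saturating counter
        self._pending = None                       # (page predicted from, raw guess)

    def predict(self):
        """Return a page to prefetch, or None if the current page is unpredictable."""
        if not self.history:
            return None
        current = self.history[-1]
        guess = ppm_predict(self.history, self.order)
        self._pending = (current, guess)
        if guess is not None and self.confidence[current] >= self.threshold:
            return guess
        return None

    def observe(self, page):
        """Record the page actually visited and update the relevant counter."""
        if self._pending is not None:
            current, guess = self._pending
            if guess is not None:
                if guess == page:
                    self.confidence[current] = min(self.max_count, self.confidence[current] + 1)
                else:
                    self.confidence[current] = max(0, self.confidence[current] - 1)
            self._pending = None
        self.history.append(page)
```

In use, a client would call `predict()` before each navigation and `observe(page)` once the actual page is known. No tree or table of contexts is stored: the only state is the bounded history and one small counter per page, which is the representational saving the abstract points to. The linear scan shown here is the simplest form of such a search and makes no claim about the cost of the published implementation.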

Author information

Corresponding author

Correspondence to Arpad Gellert.

About this article

Cite this article

Gellert, A., Florea, A. Web prefetching through efficient prediction by partial matching. World Wide Web 19, 921–932 (2016). https://doi.org/10.1007/s11280-015-0367-8

  • DOI: https://doi.org/10.1007/s11280-015-0367-8
