Skip to main content

Scout: A Point of Presence Recommendation System Using Real User Monitoring Data

  • Conference paper
  • First Online:
Passive and Active Measurement (PAM 2016)

Part of the book series: Lecture Notes in Computer Science ((LNCCN,volume 9631))

Included in the following conference series:

Abstract

This paper describes, Scout, a statistical modeling driven approach to automatically recommend new Point of Presence (PoP) centers for web sites. PoPs help reduce a website’s page download time dramatically. However, where to build the new PoP centers given the current assets of existing ones is a problem that has rarely been studied in a quantitative and principled way before; it was mainly done through empirical studies or through applying industry experience and intuitions. In this paper, we propose a novel approach that estimates the impact of the PoP centers by building a statistical model using the real user monitoring data collected by the web sites and recommend the next PoPs to build. We also consider the problem of recommending PoPs using other metrics such as user’s number of page views. We show empirically that our approach works well, by experiments that use real data collected from millions of user visits in a major social network site.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    Another strategy is to use CDNs to deliver dynamic content, but it is less common due to security, privacy and cost concerns.

References

  1. Brutlag, J.: Speed matters for google web search. Google, June 2009

    Google Scholar 

  2. www.peeringdb.com

  3. Dabek, F., Cox, R., Kaashoek, F., Morris, R.: Vivaldi: a decentralized network coordinate system. In: ACM SIGCOMM Computer Communication Review, vol. 34, pp. 15–26. ACM (2004)

    Google Scholar 

  4. Dellaert, B.G., Kahn, B.E.: How tolerable is delay?: Consumers’ evaluations of internet web sites after waiting. J. Interact. Mark. 13(1), 41–54 (1999)

    Article  Google Scholar 

  5. Francis, P., Jamin, S., Jin, C., Jin, Y., Raz, D., Shavitt, Y., Zhang, L.: Idmaps: a global internet host distance estimation service. IEEE/ACM Trans. Network. 9(5), 525–540 (2001)

    Article  Google Scholar 

  6. Gummadi, K.P., Saroiu, S., Gribble, S.D.: King: estimating latency between arbitrary internet end hosts. In: Proceedings of the 2nd ACM SIGCOMM Workshop on Internet Measurment, pp. 5–18. ACM (2002)

    Google Scholar 

  7. Hastie, T., Tibshirani, R., Friedman, J., Hastie, T., Friedman, J., Tibshirani, R.: The elements of statistical learning, vol. 2. Springer, New York (2009)

    Google Scholar 

  8. Iyengar, A.K., Squillante, M.S., Zhang, L.: Analysis and characterization of large-scale web server access patterns and performance. World Wide Web 2(1–2), 85–100 (1999)

    Article  Google Scholar 

  9. Koenker, R.: Quantile Regression. Cambridge University Press, Cambridge (2005)

    Book  MATH  Google Scholar 

  10. Krishnan, R., Madhyastha, H.V., Srinivasan, S., Jain, S., Krishnamurthy, A., Anderson, T., Gao, J.: Moving beyond end-to-end path information to optimize cdn performance. In: Proceedings of the 9th ACM SIGCOMM Conference on Internet Measurement Conference, pp. 190–201. ACM (2009)

    Google Scholar 

  11. Madhyastha, H.V., Anderson, T., Krishnamurthy, A., Spring, N., Venkataramani, A.: A structural approach to latency prediction. In Proceedings of the 6th ACM SIGCOMM Conference on Internet Measurement, pp. 99–104. ACM (2006)

    Google Scholar 

  12. Maheshwari, R.: How LinkedIn used PoPs and RUM to make dynamic content download 25% faster. LinkedIn Engineering Blog (2014)

    Google Scholar 

  13. Ramsay, J., Barbesi, A., Preece, J.: A psychological investigation of long retrieval times on the world wide web. Interact. Comput. 10(1), 77–86 (1998)

    Article  Google Scholar 

  14. Sears, A., Jacko, J.A., Borella, M.S.: Internet delay effects: how users perceive quality, organization, and ease of use of information. In: CHI 1997 Extended Abstracts on Human Factors in Computing Systems, pp. 353–354. ACM (1997)

    Google Scholar 

  15. Squillante, M.S., Yao, D.D., Zhang, L.: Web traffic modeling and web server performance analysis. ACM SIGMETRICS Perform. Eval. Rev. 27(3), 24–27 (1999). IBM TJ Watson Research Center

    Article  Google Scholar 

Download references

Acknowledgments

We are grateful to Samir R. Das for his valuable feedback on an earlier draft of this paper. We would also like to thank the anonymous reviewers for their insightful comments.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yang Yang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Yang, Y., Zhang, L., Maheshwari, R., Kahn, Z.A., Agarwal, D., Dubey, S. (2016). Scout: A Point of Presence Recommendation System Using Real User Monitoring Data. In: Karagiannis, T., Dimitropoulos, X. (eds) Passive and Active Measurement. PAM 2016. Lecture Notes in Computer Science(), vol 9631. Springer, Cham. https://doi.org/10.1007/978-3-319-30505-9_16

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-30505-9_16

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-30504-2

  • Online ISBN: 978-3-319-30505-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics