skip to main content
10.1145/2488388.2488450acmotherconferencesArticle/Chapter ViewAbstractPublication PageswwwConference Proceedingsconference-collections
research-article

Understanding latency variations of black box services

Published:13 May 2013Publication History

ABSTRACT

Data centers run many services that impact millions of users daily. In reality, the latency of each service varies from one request to another. Existing tools allow to monitor services for performance glitches or service disruptions, but typically they do not help understanding the variations in latency.

We propose a general framework for understanding performance of arbitrary black box services. We consider a stream of requests to a given service with their monitored attributes, as well as latencies of serving each request. We propose what we call the multi-dimensional f-measure, that helps for a given interval to identify the subset of monitored attributes that explains it. We design algorithms that use this measure not only for a fixed latency interval, but also to explain the entire range of latencies of the service by segmenting it into smaller intervals.

We perform a detailed experimental study with synthetic data, as well as real data from a large search engine. Our experiments show that our methods automatically identify significant latency intervals together with request attributes that explain them, and are robust.

References

  1. R. Agrawal and R. Srikant. Fast algorithms for mining association rules. In Proc 20th Int Conf Very Large Data Bases VLDB, volume 1215, pages 487--499. Citeseer, 1994. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. T. Ball and J. Larus. Efficient path profiling. In Proceedings of the 29th Annual International Symposium on Microarchitecture, pages 46--57, 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. P. Barham, R. Isaacs, R. Mortier, and D. Narayanan. Magpie: Online modelling and performance-aware systems. In Proceedings of the 9th conference on Hot Topics in Operating Systems-Volume 9, pages 15--15. USENIX Association, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. A. Destrero, S. Mosci, C. De Mol, A. Verri, and F. Odone. Feature selection for high-dimensional data. Computational management science, 6(1):25--40, 2009.Google ScholarGoogle Scholar
  5. P. Devijver and J. Kittler. Pattern recognition: A statistical approach. Prentice/Hall International, 1982.Google ScholarGoogle Scholar
  6. R. Fonseca, M. Freedman, and G. Porter. Experiences with tracing causality in networked services. In Proceedings of the 2010 internet network management conference on Research on enterprise networking, pages 10--10. USENIX Association, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. R. Fonseca, G. Porter, R. Katz, S. Shenker, and I. Stoica. X-trace: A pervasive network tracing framework. In Proceedings of the 4th USENIX conference on Networked systems design & implementation, pages 20--20. USENIX Association, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. G. Forman. An extensive empirical study of feature selection metrics for text classification. The Journal of Machine Learning Research, 3:1289--1305, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. I. Guyon. Practical feature selection: from correlation to causality. Mining Massive Data Sets for Security: Advances in Data Mining, Search, Social Networks and Text Mining, and their Applications to Security, pages 27--43, 2008.Google ScholarGoogle Scholar
  10. R. Kohavi and G. John. Wrappers for feature subset selection. Artificial intelligence, 97(1):273--324, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. A. Land and A. Doig. An automatic method of solving discrete programming problems. Econometrica: Journal of the Econometric Society, pages 497--520, 1960.Google ScholarGoogle ScholarCross RefCross Ref
  12. M. Mahoney and P. Drineas. Cur matrix decompositions for improved data analysis. Proceedings of the National Academy of Sciences, 106(3):697--702, 2009.Google ScholarGoogle ScholarCross RefCross Ref
  13. G. Mann, M. Sandler, D. Kruschevskaja, S. Guha, and E. Even-dar. Modeling the parallel execution of black-box services. USENIX/HotCloud, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. M. Marshak and H. Levy. Evaluating web user perceived latency using server side measurements. Computer Communications, 26:2003, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. A. Miller. Subset selection in regression. Chapman & Hall/CRC, 2002.Google ScholarGoogle ScholarCross RefCross Ref
  16. D. Musicant, V. Kumar, A. Ozgur, et al. Optimizing f-measure with support vector machines. In Proceedings of the Sixteenth International Florida Artificial Intelligence Research Society Conference, pages 356--360, 2003.Google ScholarGoogle Scholar
  17. P. Narendra and K. Fukunaga. A branch and bound algorithm for feature subset selection. Computers, IEEE Transactions on, 100(9):917--922, 1977. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. K. Ostrowski, G. Mann, and M. Sandler. Diagnosing latency in multi-tier black-box services. 2011.Google ScholarGoogle Scholar
  19. R. R. Sambasivan, A. X. Zheng, M. De Rosa, E. Krevat, S. Whitman, M. Stroucken, W. Wang, L. Xu, and G. R. Ganger. Diagnosing performance changes by comparing request flows. In Proceedings of the 8th USENIX conference on Networked systems design and implementation, NSDI'11, pages 4--4, Berkeley, CA, USA, 2011. USENIX Association. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. B. Sigelman, L. Barroso, M. Burrows, P. Stephenson, M. Plakal, D. Beaver, S. Jaspan, and C. Shanbhag. Dapper, a large-scale distributed systems tracing infrastructure. Google Research, 2010.Google ScholarGoogle Scholar
  21. C. Van Rijsbergen. Information retrieval, 1979. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. N. Ye, K. Chai, W. Lee, and H. Chieu. Optimizing f-measures: A tale of two approaches. 2012.Google ScholarGoogle Scholar

Index Terms

  1. Understanding latency variations of black box services

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Other conferences
          WWW '13: Proceedings of the 22nd international conference on World Wide Web
          May 2013
          1628 pages
          ISBN:9781450320351
          DOI:10.1145/2488388

          Copyright © 2013 Copyright is held by the International World Wide Web Conference Committee (IW3C2).

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 13 May 2013

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article

          Acceptance Rates

          WWW '13 Paper Acceptance Rate125of831submissions,15%Overall Acceptance Rate1,899of8,196submissions,23%

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader