skip to main content
10.1145/3127479.3129255acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
research-article
Public Access

Workload analysis and caching strategies for search advertising systems

Published:24 September 2017Publication History

ABSTRACT

Search advertising depends on accurate predictions of user behavior and interest, accomplished today using complex and computationally expensive machine learning algorithms that estimate the potential revenue gain of thousands of candidate advertisements per search query. The accuracy of this estimation is important for revenue, but the cost of these computations represents a substantial expense, e.g., 10% to 30% of the total gross revenue. Caching the results of previous computations is a potential path to reducing this expense, but traditional domain-agnostic and revenue-agnostic approaches to do so result in substantial revenue loss. This paper presents three domain-specific caching mechanisms that successfully optimize for both factors. Simulations on a trace from the Bing advertising system show that a traditional cache can reduce cost by up to 27.7% but has negative revenue impact as bad as -14.1%. On the other hand, the proposed mechanisms can reduce cost by up to 20.6% while capping revenue impact between -1.3% and 0%. Based on Microsoft's earnings release for FY16 Q4, the traditional cache would reduce the net profit of Bing Ads by $84.9 to $166.1 million in the quarter, while our proposed cache could increase the net profit by $11.1 to $71.5 million.

References

  1. 2016. Average CTR (Click-Through Rate): Learn How Your CTR Compares. http://www.wordstream.com/average-ctr. (2016).Google ScholarGoogle Scholar
  2. 2017. Microsoft Earnings Release FY16 Q4. https://www.microsoft.com/en-us/Investor/earnings/FY-2016-Q4/. (2017).Google ScholarGoogle Scholar
  3. Sadiye Alici, Ismail Sengor Altingovde, Rifat Ozcan, Berkant Barla Cambazoglu, and Özgür Ulusoy. 2011. Timestamp-based Result Cache Invalidation for Web Search Engines. In Proceedings of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Sadiye Alici, Ismail Sengor Altingovde, Rifat Ozcan, B. Barla Cambazoglu, and Özgür Ulusoy. 2012. Adaptive Time-to-Live Strategies for Query Result Caching in Web Search Engines. In Proceedings of the 34th European Conference on Information Retrieval. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Ricardo Baeza-Yates, Aristides Gionis, Flavio P Junqueira, Vanessa Murdock, Vassilis Plachouras, and Fabrizio Silvestri. 2008. Design Trade-Offs for Search Engine Caching. ACM Trans. Web 2, 4 (Oct. 2008).Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Xiao Bai and Flavio P. Junqueira. 2012. Online Result Cache Invalidation for Real-time Web Search. In Proceedings of the 35th International ACM SIGIR Conference on Research and Development in Information Retrieval. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Edward Bortnikov, Ronny Lempel, and Kolman Vornovitsky. 2011. Caching for Realtime Search. In Proceedings of the 33rd European Conference on Information Retrieval. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. B Barla Cambazoglu and Ismail Sengor Altingovde. 2012. Impact of Regionalization on Performance of Web Search Engine Result Caches. In Proceedings of the 19th Symposium on String Processing and Information Retrieval. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Berkant Barla Cambazoglu, Flavio P. Junqueira, Vassilis Plachouras, Scott Banachowski, Baoqiu Cui, Swee Lim, and Bill Bridge. 2010. A Refreshing Perspective of Search Engine Caching. In Proceedings of the 19th International Conference on World Wide Web. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Pei Cao and Sandy Irani. 1997. Cost-aware WWW Proxy Caching Algorithms. In Proceedings of the USENIX Symposium on Internet Technologies and Systems.Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Tiziano Fagni, Raffaele Perego, Fabrizio Silvestri, and Salvatore Orlando. 2006. Boosting the Performance of Web Search Engines: Caching and Prefetching Query Results by Exploiting Historical Usage Data. ACM Trans. Information Systems 24, 1 (Jan. 2006). Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Qingqing Gan and Torsten Suel. 2009. Improved Techniques for Result Caching in Web Search Engines. In Proceedings of the 18th International Conference on World Wide Web. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Thore Graepel, Joaquin Q. Candela, Thomas Borchert, and Ralf Herbrich. 2010. Web-Scale Bayesian Click-Through Rate Prediction for Sponsored Search Advertising in Microsoft's Bing Search Engine. In Proceedings of the 27th international conference on machine learning.Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Xinran He, Junfeng Pan, Ou Jin, Tianbing Xu, Bo Liu, Tao Xu, Yanxin Shi, Antoine Atallah, Ralf Herbrich, Stuart Bowers, and Joaquin Quiñonero Candela. 2014. Practical Lessons from Predicting Clicks on Ads at Facebook. In Proceedings of the Eighth International Workshop on Data Mining for Online Advertising. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Conglong Li and Alan L. Cox. 2015. GD-Wheel: A Cost-aware Replacement Policy for Key-value Stores. In Proceedings of the Tenth European Conference on Computer Systems. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. H. Brendan McMahan, Gary Holt, D. Sculley, Michael Young, Dietmar Ebner, Julian Grady, Lan Nie, Todd Phillips, Eugene Davydov, Daniel Golovin, Sharat Chikkerur, Dan Liu, Martin Wattenberg, Arnar Mar Hrafnkelsson, Tom Boulos, and Jeremy Kubica. 2013. Ad Click Prediction: A View from the Trenches. In Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Rifat Ozcan, Ismail Sengor Altingovde, B. Barla Cambazoglu, and Özgür Ulusoy. 2013. Second Chance: A Hybrid Approach for Dynamic Result Caching and Prefetching in Search Engines. ACM Trans. Web 8, 1 (Dec. 2013). Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Rifat Ozcan, Ismail Sengor Altingovde, and Özgür Ulusoy. 2011. Cost-Aware Strategies for Query Result Caching in Web Search Engines. ACM Trans. Web 5, 2 (May 2011). Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Fethi Burak Sazoglu, B. Barla Cambazoglu, Rifat Ozcan, Ismail Sengor Altingovde, and Özgür Ulusoy. 2013. A Financial Cost Metric for Result Caching. In Proceedings of the 36th International ACM SIGIR Conference on Research and Development in Information Retrieval. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Workload analysis and caching strategies for search advertising systems

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in
          • Published in

            cover image ACM Conferences
            SoCC '17: Proceedings of the 2017 Symposium on Cloud Computing
            September 2017
            672 pages
            ISBN:9781450350280
            DOI:10.1145/3127479

            Copyright © 2017 ACM

            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 24 September 2017

            Permissions

            Request permissions about this article.

            Request Permissions

            Check for updates

            Qualifiers

            • research-article

            Acceptance Rates

            Overall Acceptance Rate169of722submissions,23%

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader