skip to main content
10.1145/3580305.3599183acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
abstract
Free Access

Practical Design of Performant Recommender Systems using Large-scale Linear Programming-based Global Inference

Published:04 August 2023Publication History

ABSTRACT

Several key problems in web-scale recommender systems, such as optimal matching and allocation, can be formulated as large-scale linear programs (LPs) [4, 1]. These LPs take predictions from ML models such as probabilities of click, like, etc. as inputs and optimize recommendations made to users. In recent years, there has been an explosion in the research and development of large-scale recommender systems, but effective optimization of business objectives using the output of those systems remains a challenge. Although LPs can help optimize such business objectives, and algorithms for solving LPs have existed since the 1950s [5, 8], generic LP solvers cannot handle the scale of these problems. At LinkedIn, we have developed algorithms that can solve LPs of various forms with trillions of variables in a Spark-based library called "DuaLip" [7], a novel distributed solver that solves a perturbation of the LP problem at scale via gradient-based algorithms on the smooth dual of the perturbed LP. DuaLip has been deployed in production at LinkedIn and powers several very large-scale recommender systems. DuaLip is open-sourced and extensible in terms of features and algorithms.

In this first-of-its-kind tutorial, we will motivate the application of LPs to improve recommender systems, cover the theory of key LP algorithms [8, 6], and introduce DuaLip (https://github.com/linkedin/DuaLip), a highly performant Spark-based library that solves extreme-scale LPs for a large variety of recommender system problems. We will describe practical successes of large-scale LP in the industry [3, 2, 9] followed by a hands-on exercise to run DuaLip.

References

  1. Deepak Agarwal et al. 2015. Personalizing linkedin feed. In Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 1651--1660.Google ScholarGoogle Scholar
  2. Eduardo M Azevedo and E Glen Weyl. 2016. Matching markets in the digital age. Science, 352, 6289, 1056--1057.Google ScholarGoogle Scholar
  3. Rupesh Gupta, Guangde Chen, and Shipeng Yu. 2019. Internal promotion optimization. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2358--2366.Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Rupesh Gupta, Guanfeng Liang, and Rómer Rosales. 2017. Optimizing email volume for sitewide engagement. In Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, 1947--1955.Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Olvi L Mangasarian and RR Meyer. 1979. Nonlinear perturbation of linear programs. SIAM Journal on Control and Optimization, 17, 6, 745--752.Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Brendan O'donoghue, Eric Chu, Neal Parikh, and Stephen Boyd. 2016. Conic op- timization via operator splitting and homogeneous self-dual embedding. Journal of Optimization Theory and Applications, 169, 1042--1068.Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Rohan Ramanath, S Sathiya Keerthi, Yao Pan, Konstantin Salomatin, and Kinjal Basu. 2022. Efficient vertex-oriented polytopic projection for web-scale applications. In Proceedings of the AAAI Conference on Artificial Intelligence number 4. Vol. 36, 3821--3829.Google ScholarGoogle ScholarCross RefCross Ref
  8. Philip Wolfe. 1976. Finding the nearest point in a polytope. Mathematical Pro-gramming, 11, 128--149.Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Huanyang Zheng and Jie Wu. 2017. Online to offline business: urban taxi dis- patching with passenger-driver matching stability. In 2017 IEEE 37th International Conference on Distributed Computing Systems (ICDCS). IEEE, 816--825Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Practical Design of Performant Recommender Systems using Large-scale Linear Programming-based Global Inference

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        KDD '23: Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining
        August 2023
        5996 pages
        ISBN:9798400701030
        DOI:10.1145/3580305

        Copyright © 2023 Owner/Author

        Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 4 August 2023

        Check for updates

        Qualifiers

        • abstract

        Acceptance Rates

        Overall Acceptance Rate1,133of8,635submissions,13%

        Upcoming Conference

        KDD '24
      • Article Metrics

        • Downloads (Last 12 months)236
        • Downloads (Last 6 weeks)24

        Other Metrics

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Access Granted

      The conference sponsors are committed to making content openly accessible in a timely manner.
      This article is provided by ACM and the conference, through the ACM OpenTOC service.