Practical Design of Performant Recommender Systems using Large-scale Linear Programming-based Global Inference

Authors:
Aman Gupta

LinkedIn, Sunnyvale, CA, USA

LinkedIn, Sunnyvale, CA, USA

0009-0007-6620-841X
View Profile

,
S. Sathiya Keerthi

LinkedIn, Sunnyvale, CA, USA

LinkedIn, Sunnyvale, CA, USA

0000-0002-6065-1732
View Profile

,
Ayan Acharya

LinkedIn, Sunnyvale, CA, USA

LinkedIn, Sunnyvale, CA, USA

0000-0003-3023-4337
View Profile

,
Miao Cheng

LinkedIn, Sunnyvale, CA, USA

LinkedIn, Sunnyvale, CA, USA

0009-0008-1505-4403
View Profile

,
Borja Ocejo Elizondo

LinkedIn, Sunnyvale, CA, USA

LinkedIn, Sunnyvale, CA, USA

0000-0001-6706-9940
View Profile

,
Rohan Ramanath

Chico AI & LinkedIn, Sunnyvale, CA, USA

Chico AI & LinkedIn, Sunnyvale, CA, USA

0009-0007-4493-8139
View Profile

,
Rahul Mazumder

LinkedIn, Sunnyvale, CA, USA

LinkedIn, Sunnyvale, CA, USA

0000-0003-4285-7400
View Profile

,
Kinjal Basu

Aliveo AI & LinkedIn, Sunnyvale, CA, USA

Aliveo AI & LinkedIn, Sunnyvale, CA, USA

0000-0002-4091-0119
View Profile

,
J. Kenneth Tay

LinkedIn, Sunnyvale, CA, USA

LinkedIn, Sunnyvale, CA, USA

0000-0003-3046-1820
View Profile

,
Rupesh Gupta

LinkedIn, Sunnyvale, CA, USA

LinkedIn, Sunnyvale, CA, USA

0009-0006-4395-1262
View Profile

KDD '23: Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data MiningAugust 2023Pages 5781–5782https://doi.org/10.1145/3580305.3599183

Published:04 August 2023Publication History

KDD '23: Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining

Pages 5781–5782

ABSTRACT

Several key problems in web-scale recommender systems, such as optimal matching and allocation, can be formulated as large-scale linear programs (LPs) [4, 1]. These LPs take predictions from ML models such as probabilities of click, like, etc. as inputs and optimize recommendations made to users. In recent years, there has been an explosion in the research and development of large-scale recommender systems, but effective optimization of business objectives using the output of those systems remains a challenge. Although LPs can help optimize such business objectives, and algorithms for solving LPs have existed since the 1950s [5, 8], generic LP solvers cannot handle the scale of these problems. At LinkedIn, we have developed algorithms that can solve LPs of various forms with trillions of variables in a Spark-based library called "DuaLip" [7], a novel distributed solver that solves a perturbation of the LP problem at scale via gradient-based algorithms on the smooth dual of the perturbed LP. DuaLip has been deployed in production at LinkedIn and powers several very large-scale recommender systems. DuaLip is open-sourced and extensible in terms of features and algorithms.

In this first-of-its-kind tutorial, we will motivate the application of LPs to improve recommender systems, cover the theory of key LP algorithms [8, 6], and introduce DuaLip (https://github.com/linkedin/DuaLip), a highly performant Spark-based library that solves extreme-scale LPs for a large variety of recommender system problems. We will describe practical successes of large-scale LP in the industry [3, 2, 9] followed by a hands-on exercise to run DuaLip.

References

Deepak Agarwal et al. 2015. Personalizing linkedin feed. In Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 1651--1660.Google Scholar
Eduardo M Azevedo and E Glen Weyl. 2016. Matching markets in the digital age. Science, 352, 6289, 1056--1057.Google Scholar
Rupesh Gupta, Guangde Chen, and Shipeng Yu. 2019. Internal promotion optimization. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2358--2366.Google ScholarDigital Library
Rupesh Gupta, Guanfeng Liang, and Rómer Rosales. 2017. Optimizing email volume for sitewide engagement. In Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, 1947--1955.Google ScholarDigital Library
Olvi L Mangasarian and RR Meyer. 1979. Nonlinear perturbation of linear programs. SIAM Journal on Control and Optimization, 17, 6, 745--752.Google ScholarDigital Library
Brendan O'donoghue, Eric Chu, Neal Parikh, and Stephen Boyd. 2016. Conic op- timization via operator splitting and homogeneous self-dual embedding. Journal of Optimization Theory and Applications, 169, 1042--1068.Google ScholarDigital Library
Rohan Ramanath, S Sathiya Keerthi, Yao Pan, Konstantin Salomatin, and Kinjal Basu. 2022. Efficient vertex-oriented polytopic projection for web-scale applications. In Proceedings of the AAAI Conference on Artificial Intelligence number 4. Vol. 36, 3821--3829.Google ScholarCross Ref
Philip Wolfe. 1976. Finding the nearest point in a polytope. Mathematical Pro-gramming, 11, 128--149.Google ScholarDigital Library
Huanyang Zheng and Jie Wu. 2017. Online to offline business: urban taxi dis- patching with passenger-driver matching stability. In 2017 IEEE 37th International Conference on Distributed Computing Systems (ICDCS). IEEE, 816--825Google ScholarCross Ref

Index Terms

Practical Design of Performant Recommender Systems using Large-scale Linear Programming-based Global Inference
1. Information systems
  1. Information retrieval
    1. Retrieval tasks and goals
      1. Recommender systems
2. Theory of computation
  1. Design and analysis of algorithms
    1. Mathematical optimization
      1. Continuous optimization
        Linear programming

Recommendations

Branching on hyperplane methods for mixed integer linear and convex programming using adjoint lattices

We present branching-on-hyperplane methods for solving mixed integer linear and mixed integer convex programs. In particular, we formulate the problem of finding a good branching hyperplane using a novel concept of adjoint lattice. We also reformulate ...
Read More
Very Large-Scale Linear Programming: A Case Study in Combining Interior Point and Simplex Methods

Experience with solving a 12.753.313 variable linear program is described. This problem is the linear programming relaxation of a set partitioning problem arising from an airline crew scheduling application. A scheme is described that requires ...
Read More
Transformation of a multi-choice linear programming problem

The aim of this paper is to transform a multi-choice linear programming problem to a standard mathematical programming problem where the right hand side goals of some constraints are 'multi-choice' in nature. For each of the constraint there may exist ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
KDD '23: Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining
August 2023
5996 pages
ISBN:9798400701030
DOI:10.1145/3580305
General Chairs:
Ambuj Singh
UC Santa Barbara, USA
,
Yizhou Sun
UC Los Angeles, USA
,
Program Chairs:
Leman Akoglu
Carnegie Mellon University, USA
,
Dimitrios Gunopulos
University of Athens, Greece
,
Xifeng Yan
UC Santa Barbara, USA
,
Ravi Kumar
Google, USA
,
Fatma Ozcan
Google, USA
,
Jieping Ye
Alibaba DAMO Academy
Copyright © 2023 Owner/Author
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 4 August 2023
Check for updates
Author Tags
linear programming
recommender systems
Qualifiers
- abstract
Conference

Acceptance Rates
Overall Acceptance Rate1,133of8,635submissions,13%
Upcoming Conference
KDD '24

Sponsor:

sigkdd

sigkdd

The 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining

August 25 - 29, 2024

Barcelona , Spain
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 0
  Total Citations
  View Citations
- 236
  Total Downloads
- Downloads (Last 12 months)236
- Downloads (Last 6 weeks)24
Other Metrics
View Author Metrics
Cited By
This publication has not been cited yet

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Practical Design of Performant Recommender Systems using Large-scale Linear Programming-based Global Inference

KDD '23: Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining

ABSTRACT

References

Cited By

Index Terms

Recommendations

Branching on hyperplane methods for mixed integer linear and convex programming using adjoint lattices

Very Large-Scale Linear Programming: A Case Study in Combining Interior Point and Simplex Methods

Transformation of a multi-choice linear programming problem