research-article

Sketched Follow-The-Regularized-Leader for Online Factorization Machine

Authors:

Jian PeiAuthors Info & Claims

KDD '18: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining

Pages 1900 - 1909

https://doi.org/10.1145/3219819.3220044

Published: 19 July 2018 Publication History

Abstract

Factorization Machine (FM) is a supervised machine learning model for feature engineering, which is widely used in many real-world applications. In this paper, we consider the case that the data samples arrive sequentially. The existing convex formulation for online FM has the strong theoretical guarantee and stable performance in practice, but the computational cost is typically expensive when the data is high-dimensional. To address this weakness, we devise a novel online learning algorithm called Sketched Follow-The-Regularizer-Leader (SFTRL). SFTRL presents the parameters of FM implicitly by maintaining low-rank matrices and updates the parameters via sketching. More specifically, we propose Generalized Frequent Directions to approximate indefinite symmetric matrices in a streaming way, making that the sum of historical gradients for FM could be estimated with tighter error bound efficiently. With mild assumptions, we prove that the regret bound of SFTRL is close to that of the standard FTRL. Experimental results show that SFTRL has better prediction quality than the state-of-the-art online FM algorithms in much lower time and space complexities.

References

[1]

Nir Ailon and Bernard Chazelle . 2006. Approximate nearest neighbors and the fast Johnson-Lindenstrauss transform STOC.

Digital Library

[2]

Nir Ailon and Edo Liberty . 2009. Fast dimension reduction using Rademacher series on dual BCH codes. Discrete & Computational Geometry Vol. 42, 4 (2009), 615.

Digital Library

[3]

Nir Ailon and Edo Liberty . 2013. An almost optimal unrestricted fast Johnson-Lindenstrauss transform. ACM Transactions on Algorithms (TALG) Vol. 9, 3 (2013), 21.

Digital Library

[4]

Peter L. Bartlett, Elad Hazan, and Alexander Rakhlin . 2007. Adaptive Online Gradient Descent. In NIPS.

Digital Library

[5]

Mathieu Blondel, Akinori Fujino, and Naonori Ueda . 2015. Convex Factorization Machines. In ECML/PKDD.

[6]

Mathieu Blondel, Akinori Fujino, Naonori Ueda, and Masakazu Ishihata . 2016. Higher-Order Factorization Machines. In NIPS.

Digital Library

[7]

Daniele Calandriello, Alessandro Lazaric, and Michal Valko . 2017. Second-Order Kernel Online Convex Optimization with Adaptive Sketching ICML.

[8]

Chen Cheng, Fen Xia, Tong Zhang, Irwin King, and Michael R. Lyu . 2014. Gradient boosting factorization machines. In RecSys.

Digital Library

[9]

Kenneth L Clarkson and David P Woodruff . 2013. Low rank approximation and regression in input sparsity time STOC.

Digital Library

[10]

Amey Desai, Mina Ghashami, and Jeff M Phillips . 2016. Improved practical matrix sketching with guarantees. IEEE Transactions on Knowledge and Data Engineering Vol. 28, 7 (2016), 1678--1690.

[11]

Petros Drineas, Ravi Kannan, and Michael W Mahoney . 2006 a. Fast Monte Carlo algorithms for matrices I: Approximating matrix multiplication. SIAM J. Comput. Vol. 36, 1 (2006), 132--157.

Digital Library

[12]

Petros Drineas, Ravi Kannan, and Michael W Mahoney . 2006 b. Fast Monte Carlo algorithms for matrices II: Computing a low-rank approximation to a matrix. SIAM Journal on computing Vol. 36, 1 (2006), 158--183.

Digital Library

[13]

Petros Drineas, Malik Magdon-Ismail, Michael W Mahoney, and David P Woodruff . 2012. Fast approximation of matrix coherence and statistical leverage. Journal of Machine Learning Research Vol. 13, Dec (2012), 3475--3506.

Digital Library

[14]

Petros Drineas, Michael W Mahoney, and S Muthukrishnan . 2008. Relative-error CUR matrix decompositions. SIAM J. Matrix Anal. Appl. Vol. 30, 2 (2008), 844--881.

Digital Library

[15]

John Duchi, Elad Hazan, and Yoram Singer . 2011. Adaptive subgradient methods for online learning and stochastic optimization. Journal of Machine Learning Research Vol. 12, Jul (2011), 2121--2159.

Digital Library

[16]

Alan Frieze, Ravi Kannan, and Santosh Vempala . 2004. Fast Monte-Carlo algorithms for finding low-rank approximations. Journal of the ACM (JACM) Vol. 51, 6 (2004), 1025--1041.

Digital Library

[17]

Mina Ghashami, Edo Liberty, and Jeff M Phillips . 2016 a. Efficient frequent directions algorithm for sparse matrices SIGKDD.

Digital Library

[18]

Mina Ghashami, Edo Liberty, Jeff M Phillips, and David P Woodruff . 2016 b. Frequent directions: Simple and deterministic matrix sketching. SIAM J. Comput. Vol. 45, 5 (2016), 1762--1792.

[19]

Ming Gu and Stanley C Eisenstat . 1993. A stable and fast algorithm for updating the singular value decomposition. Technical Report YALEU/DCS/RR-966 (1993).

[20]

Weiyu Guo, Shu Wu, Liang Wang, and Tieniu Tan . 2016. Personalized ranking with pairwise Factorization Machines. Neurocomputing Vol. 214 (2016), 191--200.

Digital Library

[21]

Elad Hazan et almbox. . 2016. Introduction to online convex optimization. Foundations and Trends® in Optimization Vol. 2, 3--4 (2016), 157--325.

Digital Library

[22]

William B Johnson and Joram Lindenstrauss . 1984. Extensions of Lipschitz mappings into a Hilbert space. Contemporary mathematics Vol. 26, 189--206 (1984), 1.

[23]

Yuchin Juan, Damien Lefortier, and Olivier Chapelle . 2017. Field-aware factorization machines in a real-world online advertising system WWW Companion.

Digital Library

[24]

Yuchin Juan, Yong Zhuang, Wei-Sheng Chin, and Chih-Jen Lin . 2016. Field-aware factorization machines for CTR prediction RecSys.

Digital Library

[25]

Daniel M Kane and Jelani Nelson . 2014. Sparser johnson-lindenstrauss transforms. Journal of the ACM (JACM) Vol. 61, 1 (2014), 4.

Digital Library

[26]

Takuya Kitazawa . 2016. Incremental Factorization Machines for Persistently Cold-starting Online Item Recommendation. arXiv preprint arXiv:1607.02858 (2016). deftempurl%https://arxiv.org/abs/1607.02858 tempurl

[27]

Edo Liberty . 2013. Simple and deterministic matrix sketching. In SIGKDD.

Digital Library

[28]

Xiao Lin, Wenpeng Zhang, Min Zhang, Wenwu Zhu, Jian Pei, Peilin Zhao, and Junzhou Huang . 2018. Online Compact Convexified Factorization Machine. In WWW.

Digital Library

[29]

Chun-Ta Lu, Lifang He, Weixiang Shao, Bokai Cao, and Philip S. Yu . 2017 a. Multilinear Factorization Machines for Multi-Task Multi-View Learning WSDM.

Digital Library

[30]

Chun-Ta Lu, Lifang He, Weixiang Shao, Bokai Cao, and Philip S Yu . 2017 b. Multilinear factorization machines for multi-task multi-view learning WSDM.

Digital Library

[31]

Haipeng Luo, Alekh Agarwal, Nicolò Cesa-Bianchi, and John Langford . 2016. Efficient second order online learning by sketching NIPS.

Digital Library

[32]

Luo Luo, Cheng Chen, Zhihua Zhang, Wu-Jun Li, and Tong Zhang . 2017. Robust Frequent Directions with Application in Online Learning. arXiv preprint arXiv:1705.05067 (2017). deftempurl%https://arxiv.org/abs/1705.05067 tempurl

[33]

H. Brendan McMahan . 2011. Follow-the-Regularized-Leader and Mirror Descent: Equivalence Theorems and L1 Regularization. In AISTATS.

[34]

Youssef Mroueh, Etienne Marcheret, and Vaibhava Goel . 2017. Co-Occurring Directions Sketching for Approximate Matrix Multiply AISTATS.

[35]

Jelani Nelson and Huy L Nguyên . 2013. OSNAP: Faster numerical linear algebra algorithms via sparser subspace embeddings FOCS.

Digital Library

[36]

Trung V Nguyen, Alexandros Karatzoglou, and Linas Baltrunas . 2014. Gaussian process factorization machines for context-aware recommendations SIGIR.

Digital Library

[37]

Dimitris Papailiopoulos, Anastasios Kyrillidis, and Christos Boutsidis . 2014. Provable deterministic leverage score sampling. In SIGKDD.

Digital Library

[38]

Steffen Rendle . 2010. Factorization machines. In ICDM.

Digital Library

[39]

Steffen Rendle, Zeno Gantner, Christoph Freudenthaler, and Lars Schmidt-Thieme . 2011. Fast context-aware recommendations with factorization machines SIGIR.

Digital Library

[40]

Tamas Sarlos . 2006. Improved approximation algorithms for large matrices via random projections FOCS.

Digital Library

[41]

Shai Shalev-Shwartz . 2011. Online learning and online convex optimization. Foundations and Trends in Machine Learning Vol. 4, 2 (2011), 107--194.

Digital Library

[42]

Lloyd N Trefethen and David Bau III . 1997. Numerical linear algebra. Vol. Vol. 50. Society for Industrial and Applied Mathematics.

[43]

David P. Woodruff . 2014. Sketching as a Tool for Numerical Linear Algebra. Foundations and Trends in Theoretical Computer Science Vol. 10, 1--2 (2014), 1--157.

Digital Library

[44]

Makoto Yamada, Wenzhao Lian, Amit Goyal, Jianhui Chen, Kishan Wimalawarne, Suleiman A Khan, Samuel Kaski, Hiroshi Mamitsuka, and Yi Chang . 2017. Convex factorization machine for toxicogenomics prediction SIGKDD.

Digital Library

[45]

Qiaomin Ye, Luo Luo, and Zhihua Zhang . 2016. Frequent Direction Algorithms for Approximate Matrix Multiplication with Applications in CCA. In IJCAI.

Digital Library

[46]

Wenpeng Zhang, Xiao Lin, Tong Zhang, Peilin Zhao, Wenwu Zhu, Min Zhang, and Jian Pei . 2018. Compact convexified factorization machine: formulation and online algorithms. arXiv preprint arXiv:1802.01379 (2018). deftempurl%https://arxiv.org/abs/1802.01379 tempurl

[47]

Martin Zinkevich . 2003. Online Convex Programming and Generalized Infinitesimal Gradient Ascent ICML.

Digital Library

Cited By

Chen Z(2024)Robust Sparse Online Learning through Adversarial Sparsity Constraints2024 9th IEEE International Conference on Smart Cloud (SmartCloud)10.1109/SmartCloud62736.2024.00015(42-47)Online publication date: 10-May-2024
https://doi.org/10.1109/SmartCloud62736.2024.00015
Chen Z(2024)Adaptive Sparse Online Learning through Asymmetric Truncated Gradient2024 IEEE 10th International Conference on Big Data Computing Service and Machine Learning Applications (BigDataService)10.1109/BigDataService62917.2024.00013(44-51)Online publication date: 15-Jul-2024
https://doi.org/10.1109/BigDataService62917.2024.00013
Hu ZPeng CHe CCai H(2020)IO-aware Factorization Machine for User Response Prediction2020 International Joint Conference on Neural Networks (IJCNN)10.1109/IJCNN48605.2020.9207424(1-8)Online publication date: Jul-2020
https://doi.org/10.1109/IJCNN48605.2020.9207424

Index Terms

Sketched Follow-The-Regularized-Leader for Online Factorization Machine

Recommendations

Online Compact Convexified Factorization Machine
WWW '18: Proceedings of the 2018 World Wide Web Conference

Factorization Machine (FM) is a supervised learning approach with a powerful capability of feature engineering. It yields state-of-the-art performances in various batch learning tasks where all the training data is made available prior to the training. ...
Factorization Machines
ICDM '10: Proceedings of the 2010 IEEE International Conference on Data Mining

In this paper, we introduce Factorization Machines (FM) which are a new model class that combines the advantages of Support Vector Machines (SVM) with factorization models. Like SVMs, FMs are a general predictor working with any real valued feature ...
A survey of algorithms and analysis for adaptive online learning

We present tools for the analysis of Follow-The-Regularized-Leader (FTRL), Dual Averaging, and Mirror Descent algorithms when the regularizer (equivalently, proxfunction or learning rate schedule) is chosen adaptively based on the data. Adaptivity can ...

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences

KDD '18: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining

July 2018

2925 pages

ISBN:9781450355520

DOI:10.1145/3219819

General Chairs:
Yike Guo
Imperial College London
,
Faisal Farooq
IBM

Copyright © 2018 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 19 July 2018

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

the NSERC Discovery Grant program
the Canada Research Chair program
National Program on Key Basic Research Project
National Natural Science Foundation of China
National Natural Science Foundation of China Major Project
the NSERC Strategic Grant program

Conference

KDD '18

Sponsor:

KDD '18: The 24th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

August 19 - 23, 2018

London, United Kingdom

Acceptance Rates

KDD '18 Paper Acceptance Rate 107 of 983 submissions, 11%;

Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

3
Total Citations
View Citations
1,039
Total Downloads

Downloads (Last 12 months)22
Downloads (Last 6 weeks)2

Reflects downloads up to 16 Feb 2025

Other Metrics

View Author Metrics

Citations

Cited By

Chen Z(2024)Robust Sparse Online Learning through Adversarial Sparsity Constraints2024 9th IEEE International Conference on Smart Cloud (SmartCloud)10.1109/SmartCloud62736.2024.00015(42-47)Online publication date: 10-May-2024
https://doi.org/10.1109/SmartCloud62736.2024.00015
Chen Z(2024)Adaptive Sparse Online Learning through Asymmetric Truncated Gradient2024 IEEE 10th International Conference on Big Data Computing Service and Machine Learning Applications (BigDataService)10.1109/BigDataService62917.2024.00013(44-51)Online publication date: 15-Jul-2024
https://doi.org/10.1109/BigDataService62917.2024.00013
Hu ZPeng CHe CCai H(2020)IO-aware Factorization Machine for User Response Prediction2020 International Joint Conference on Neural Networks (IJCNN)10.1109/IJCNN48605.2020.9207424(1-8)Online publication date: Jul-2020
https://doi.org/10.1109/IJCNN48605.2020.9207424

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Figures

Tables

Media

View Table of Conten