research-article

A Scalable, Adaptive and Sound Nonconvex Regularizer for Low-rank Matrix Learning

Authors:

James KwokAuthors Info & Claims

WWW '21: Proceedings of the Web Conference 2021

Pages 1798 - 1808

https://doi.org/10.1145/3442381.3450142

Published: 03 June 2021 Publication History

Abstract

Matrix learning is at the core of many machine learning problems. A number of real-world applications such as collaborative filtering and text mining can be formulated as a low-rank matrix completion problems, which recovers incomplete matrix using low-rank assumptions. To ensure that the matrix solution has a low rank, a recent trend is to use nonconvex regularizers that adaptively penalize singular values. They offer good recovery performance and have nice theoretical properties, but are computationally expensive due to repeated access to individual singular values. In this paper, based on the key insight that adaptive shrinkage on singular values improve empirical performance, we propose a new nonconvex low-rank regularizer called ”nuclear norm minus Frobenius norm” regularizer, which is scalable, adaptive and sound. We first show it provably holds the adaptive shrinkage property. Further, we discover its factored form which bypasses the computation of singular values and allows fast optimization by general optimization algorithms. Stable recovery and convergence are guaranteed. Extensive low-rank matrix completion experiments on a number of synthetic and real-world data sets show that the proposed method obtains state-of-the-art recovery performance while being the fastest in comparison to existing low-rank matrix learning methods. 1

References

[1]

Mohammad Taha Bahadori, Qi Rose Yu, and Yan Liu. 2014. Fast multivariate spatio-temporal analysis via low rank tensor learning. In Advances in Neural Information Processing Systems. 3491–3499.

[2]

Jérôme Bolte, Shoham Sabach, and Marc Teboulle. 2014. Proximal alternating linearized minimization for nonconvex and nonsmooth problems. Mathematical Programming 146, 1-2 (2014), 459–494.

Digital Library

[3]

Nicolas Boumal and P-A Absil. 2015. Low-rank matrix completion via preconditioned optimization on the Grassmann manifold. Linear Algebra Appl. 475(2015), 200–239.

[4]

Stephen Boyd, Stephen P Boyd, and Lieven Vandenberghe. 2004. Convex Optimization. Cambridge University Press.

Digital Library

[5]

Stephen Boyd, Neal Parikh, and Eric Chu. 2011. Distributed optimization and statistical learning via the alternating direction method of multipliers. Foundations and Trends in Machine Learning 3, 1 (2011), 1–122.

Digital Library

[6]

Jian-Feng Cai, Emmanuel J Candès, and Zuowei Shen. 2010. A singular value thresholding algorithm for matrix completion. SIAM Journal on Optimization 20, 4 (2010), 1956–1982.

[7]

Emmanuel J Candès and Benjamin Recht. 2009. Exact matrix completion via convex optimization. Foundations of Computational Mathematics 9, 6 (2009), 717–772.

[8]

Emmanuel J Candes and Terence Tao. 2005. Decoding by linear programming. IEEE Transactions on Information Theory 51, 12 (2005), 4203–4215.

Digital Library

[9]

Emmanuel J Candes, Michael B Wakin, and Stephen P Boyd. 2008. Enhancing sparsity by reweighted ℓ1 minimization. Journal of Fourier Analysis and Applications 14, 5-6(2008), 877–905.

[10]

Jianhui Chen, Jiayu Zhou, and Jieping Ye. 2011. Integrating low-rank and group-sparse structures for robust multi-task learning. In International Conference on Knowledge Discovery and Data Mining. 42–50.

Digital Library

[11]

Jicong Fan, Lijun Ding, Yudong Chen, and Madeleine Udell. 2019. Factor group-sparse regularization for efficient low-rank matrix recovery. In Advances in Neural Information Processing Systems. 5104–5114.

[12]

Maryam Fazel. 2002. Matrix Rank Minimization with Applications. Ph.D. Dissertation. Stanford.

[13]

Rong Ge, Jason D Lee, and Tengyu Ma. 2016. Matrix completion has no spurious local minimum. In Advances in Neural Information Processing Systems. 2973–2981.

[14]

Shuhang Gu, Lei Zhang, Wangmeng Zuo, and Xiangchu Feng. 2014. Weighted nuclear norm minimization with application to image denoising. In Conference on Computer Vision and Pattern Recognition. 2862–2869.

Digital Library

[15]

Huan Gui, Jiawei Han, and Quanquan Gu. 2016. Towards faster rates and oracle property for low-rank matrix estimation. In International Conference on Machine Learning. 2300–2309.

[16]

Suriya Gunasekar, Blake Woodworth, Srinadh Bhojanapalli, Behnam Neyshabur, and Nathan Srebro. 2017. Implicit regularization in matrix factorization. In Advances in Neural Information Processing Systems. 6151–6159.

[17]

F Maxwell Harper and Joseph A Konstan. 2015. The movielens datasets: History and context. ACM Transactions on Interactive Intelligent Systems 5, 4 (2015), 1–19.

Digital Library

[18]

Trevor Hastie, Rahul Mazumder, Jason D Lee, and Reza Zadeh. 2015. Matrix completion and low-rank SVD via fast alternating least squares. Journal of Machine Learning Research 16, 1 (2015), 3367–3402.

Digital Library

[19]

J-B Hiriart-Urruty. 1985. Generalized differentiability, duality and optimization for problems dealing with differences of convex functions. In Proceedings of the Symposium on Convexity and Duality in Optimization. 37–70.

[20]

Yao Hu, Debing Zhang, Jieping Ye, Xuelong Li, and Xiaofei He. 2012. Fast and accurate matrix completion via truncated nuclear norm regularization. IEEE Transactions on Pattern Analysis and Machine Intelligence 35, 9(2012), 2117–2130.

Digital Library

[21]

Alan Jennings and John J McKeown. 1992. Matrix Computation. John Wiley & Sons.

[22]

Yehuda Koren, Robert Bell, and Chris Volinsky. 2009. Matrix factorization techniques for recommender systems. Computer 42, 8 (2009), 30–37.

Digital Library

[23]

Dongsheng Li, Chao Chen, Qin Lv, Hansu Gu, Tun Lu, Li Shang, Ning Gu, and Stephen M Chu. 2018. AdaError: An adaptive learning rate method for matrix approximation-based collaborative filtering. In The World Wide Web Conference. 741–751.

Digital Library

[24]

Yifei Lou and Ming Yan. 2018. Fast L1-L2 minimization via a proximal operator. Journal of Scientific Computing 74, 2 (2018), 767–785.

Digital Library

[25]

Canyi Lu, Jinhui Tang, Shuicheng Yan, and Zhouchen Lin. 2015. Nonconvex nonsmooth low rank minimization via iteratively reweighted nuclear norm. IEEE Transactions on Image Processing 25, 2 (2015), 829–839.

Digital Library

[26]

Canyi Lu, Changbo Zhu, Chunyan Xu, Shuicheng Yan, and Zhouchen Lin. 2015. Generalized singular value thresholding. In AAAI Conference on Artificial Intelligence. 1805–1811.

[27]

Tian-Hui Ma, Yifei Lou, and Ting-Zhu Huang. 2017. Truncated ℓ1 − 2 models for sparse recovery and rank minimization. SIAM Journal on Imaging Sciences 10, 3 (2017), 1346–1380.

Digital Library

[28]

Rahul Mazumder, Diego Saldana, and Haolei Weng. 2020. Matrix completion with nonconvex regularization: Spectral operators and scalable algorithms. Statistics and Computing(2020), 1–26.

[29]

Andriy Mnih and Russ R Salakhutdinov. 2008. Probabilistic matrix factorization. In Advances in Neural Information Processing Systems. 1257–1264.

[30]

Feiping Nie, Heng Huang, and Chris Ding. 2012. Low-rank matrix recovery via efficient schatten p-norm minimization. In AAAI Conference on Artificial Intelligence.

[31]

Jorge Nocedal and Stephen Wright. 2006. Numerical Optimization. Springer Science & Business Media.

[32]

Neal Parikh and Stephen Boyd. 2014. Proximal algorithms. Foundations and Trends in Optimization 1, 3 (2014), 127–239.

Digital Library

[33]

Jeffrey Pennington, Richard Socher, and Christopher D Manning. 2014. Glove: Global vectors for word representation. In Conference on Empirical Methods in Natural Language Processing. 1532–1543.

[34]

Nikhil Rao, Hsiang-Fu Yu, Pradeep Ravikumar, and Inderjit S Dhillon. 2015. Collaborative filtering with graph information: Consistency and scalable methods. In Advances in Neural Information Processing Systems. 2107–2115.

[35]

Ruslan Salakhutdinov and Andriy Mnih. 2008. Bayesian probabilistic matrix factorization using Markov chain Monte Carlo. In International Conference on Machine Learning. 880–887.

Digital Library

[36]

Fanhua Shang, Yuanyuan Liu, and James Cheng. 2016. Tractable and scalable Schatten quasi-norm approximations for rank minimization. In Artificial Intelligence and Statistics. 620–629.

[37]

Mohit Sharma and George Karypis. 2019. Adaptive matrix completion for the users and the items in tail. In The World Wide Web Conference. 3223–3229.

Digital Library

[38]

Tian Shi, Kyeongpil Kang, Jaegul Choo, and Chandan K Reddy. 2018. Short-text topic modeling via non-negative matrix factorization enriched with local word-context correlations. In The World Wide Web Conference. 1105–1114.

Digital Library

[39]

Nathan Srebro, Jason Rennie, and Tommi S Jaakkola. 2005. Maximum-margin matrix factorization. In Advances in Neural Information Processing Systems. 1329–1336.

[40]

Stephen Tu, Ross Boczar, Max Simchowitz, Mahdi Soltanolkotabi, and Ben Recht. 2016. Low-rank solutions of linear matrix equations via procrustes flow. In International Conference on Machine Learning. 964–973.

[41]

Bart Vandereycken. 2013. Low-rank matrix completion by Riemannian optimization. SIAM Journal on Optimization 23, 2 (2013), 1214–1236.

Digital Library

[42]

Lingxiao Wang, Xiao Zhang, and Quanquan Gu. 2017. A unified computational and statistical framework for nonconvex low-rank matrix estimation. In Artificial Intelligence and Statistics. 981–990.

[43]

Zaiwen Wen, Wotao Yin, and Yin Zhang. 2012. Solving a low-rank factorization model for matrix completion by a nonlinear successive over-relaxation algorithm. Mathematical Programming Computation 4, 4 (2012), 333–361.

[44]

Quanming Yao, James T Kwok, Taifeng Wang, and Tie-Yan Liu. 2019. Large-scale low-rank matrix learning with nonconvex regularizers. IEEE Transactions on Pattern Analysis and Machine Intelligence 41, 11(2019), 2628–2643.

[45]

Cun-Hui Zhang. 2010. Nearly unbiased variable selection under minimax concave penalty. The Annals of Statistics 38, 2 (2010), 894–942.

[46]

Tong Zhang. 2010. Analysis of multi-stage convex relaxation for sparse regularization. Journal of Machine Learning Research 11, Mar (2010), 1081–1107.

Cited By

Guo YChen GZeng TJin QNg M(2025)Quaternion Nuclear Norm Minus Frobenius Norm Minimization for color image reconstructionPattern Recognition10.1016/j.patcog.2024.110986158:COnline publication date: 1-Feb-2025
https://dl.acm.org/doi/10.1016/j.patcog.2024.110986
HUANG HSHAN YLI CWANG Z(2024)Nuclear Norm Minus Frobenius Norm Minimization with Rank Residual Constraint for Image DenoisingIEICE Transactions on Information and Systems10.1587/transinf.2023EDP7265E107.D:8(992-1006)Online publication date: 1-Aug-2024
https://doi.org/10.1587/transinf.2023EDP7265
Bo FMa XCen YHu S(2024)SAR Image Speckle Reduction Based on Nuclear Norm Minus Frobenius Norm RegularizationIEEE Transactions on Geoscience and Remote Sensing10.1109/TGRS.2024.350131462(1-15)Online publication date: 2024
https://doi.org/10.1109/TGRS.2024.3501314
Show More Cited By

A Scalable, Adaptive and Sound Nonconvex Regularizer for Low-rank Matrix Learning
1. Computing methodologies

Recommendations

Low-rank matrix completion using alternating minimization
STOC '13: Proceedings of the forty-fifth annual ACM symposium on Theory of Computing

Alternating minimization represents a widely applicable and empirically successful approach for finding low-rank matrices that best fit the given data. For example, for the problem of low-rank matrix completion, this method is believed to be one of the ...
Efficient learning with a family of nonconvex regularizers by redistributing nonconvexity

The use of convex regularizers allows for easy optimization, though they often produce biased estimation and inferior prediction performance. Recently, nonconvex regularizers have attracted a lot of attention and outperformed convex ones. However, the ...
A fast tri-factorization method for low-rank matrix recovery and completion

In recent years, matrix rank minimization problems have received a significant amount of attention in machine learning, data mining and computer vision communities. And these problems can be solved by a convex relaxation of the rank minimization problem ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

WWW '21: Proceedings of the Web Conference 2021

April 2021

4054 pages

ISBN:9781450383127

DOI:10.1145/3442381

Editors:
Jure Leskovec
Stanford
,
Marko Grobelnik
Jožef Stefan Institute
,
Marc Najork
Google
,
Jie Tang
Tsinghua University
,
Leila Zia
Wikimedia Foundation

Copyright © 2021 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SIGWEB: ACM Special Interest Group on Hypertext, Hypermedia, and Web

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 03 June 2021

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article
Research
Refereed limited

Conference

WWW '21

Sponsor:

SIGWEB

WWW '21: The Web Conference 2021

April 19 - 23, 2021

Ljubljana, Slovenia

Acceptance Rates

Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

11
Total Citations
View Citations
261
Total Downloads

Downloads (Last 12 months)32
Downloads (Last 6 weeks)7

Reflects downloads up to 28 Feb 2025

Other Metrics

View Author Metrics

Citations

Cited By

Guo YChen GZeng TJin QNg M(2025)Quaternion Nuclear Norm Minus Frobenius Norm Minimization for color image reconstructionPattern Recognition10.1016/j.patcog.2024.110986158:COnline publication date: 1-Feb-2025
https://dl.acm.org/doi/10.1016/j.patcog.2024.110986
HUANG HSHAN YLI CWANG Z(2024)Nuclear Norm Minus Frobenius Norm Minimization with Rank Residual Constraint for Image DenoisingIEICE Transactions on Information and Systems10.1587/transinf.2023EDP7265E107.D:8(992-1006)Online publication date: 1-Aug-2024
https://doi.org/10.1587/transinf.2023EDP7265
Bo FMa XCen YHu S(2024)SAR Image Speckle Reduction Based on Nuclear Norm Minus Frobenius Norm RegularizationIEEE Transactions on Geoscience and Remote Sensing10.1109/TGRS.2024.350131462(1-15)Online publication date: 2024
https://doi.org/10.1109/TGRS.2024.3501314
Shan YHu DWang Z(2024)A Novel Truncated Norm Regularization Method for Multi-Channel Color Image DenoisingIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2024.338230634:9(8427-8441)Online publication date: Sep-2024
https://doi.org/10.1109/TCSVT.2024.3382306
Zhang HWen BZha ZZhang BTang YYu GDu W(2024)Accelerated PALM for Nonconvex Low-Rank Matrix Recovery With Theoretical AnalysisIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2023.330681134:4(2304-2317)Online publication date: Apr-2024
https://doi.org/10.1109/TCSVT.2023.3306811
Miyata T(2024)Completion of Traffic Matrix by Tensor Nuclear Norm Minus Frobenius Norm Minimization and Time SlicingNOMS 2024-2024 IEEE Network Operations and Management Symposium10.1109/NOMS59830.2024.10575433(1-5)Online publication date: 6-May-2024
https://doi.org/10.1109/NOMS59830.2024.10575433
Dong DWu SWang YZhou JWang H(2024)ColdU: User Cold-start Recommendation with User-specific Modulation2024 IEEE Conference on Artificial Intelligence (CAI)10.1109/CAI59869.2024.00069(326-331)Online publication date: 25-Jun-2024
https://doi.org/10.1109/CAI59869.2024.00069
Wu WLv GDuan YLiang PZhang YXia Y(2024)Dual convolutional neural network with attention for image blind denoisingMultimedia Systems10.1007/s00530-024-01469-830:5Online publication date: 8-Sep-2024
https://doi.org/10.1007/s00530-024-01469-8
Zhang SLou JXiong LZhang XLiu JFrommholz IHopfgartner FLee MOakes MLalmas MZhang MSantos R(2023)Closed-form Machine Unlearning for Matrix FactorizationProceedings of the 32nd ACM International Conference on Information and Knowledge Management10.1145/3583780.3614811(3278-3287)Online publication date: 21-Oct-2023
https://dl.acm.org/doi/10.1145/3583780.3614811
Ahmad Hassan Yar GAhmed Pasha S(2022)Convex and Nonconvex Approaches for the Matrix Completion Problem2022 19th International Bhurban Conference on Applied Sciences and Technology (IBCAST)10.1109/IBCAST54850.2022.9990400(451-456)Online publication date: 16-Aug-2022
https://doi.org/10.1109/IBCAST54850.2022.9990400
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

HTML Format

View this article in HTML Format.

Figures

Tables

Media

View Table of Conten