skip to main content
10.1145/3442381.3450142acmconferencesArticle/Chapter ViewAbstractPublication PagesthewebconfConference Proceedingsconference-collections
research-article

A Scalable, Adaptive and Sound Nonconvex Regularizer for Low-rank Matrix Learning

Published: 03 June 2021 Publication History

Abstract

Matrix learning is at the core of many machine learning problems. A number of real-world applications such as collaborative filtering and text mining can be formulated as a low-rank matrix completion problems, which recovers incomplete matrix using low-rank assumptions. To ensure that the matrix solution has a low rank, a recent trend is to use nonconvex regularizers that adaptively penalize singular values. They offer good recovery performance and have nice theoretical properties, but are computationally expensive due to repeated access to individual singular values. In this paper, based on the key insight that adaptive shrinkage on singular values improve empirical performance, we propose a new nonconvex low-rank regularizer called ”nuclear norm minus Frobenius norm” regularizer, which is scalable, adaptive and sound. We first show it provably holds the adaptive shrinkage property. Further, we discover its factored form which bypasses the computation of singular values and allows fast optimization by general optimization algorithms. Stable recovery and convergence are guaranteed. Extensive low-rank matrix completion experiments on a number of synthetic and real-world data sets show that the proposed method obtains state-of-the-art recovery performance while being the fastest in comparison to existing low-rank matrix learning methods. 1

References

[1]
Mohammad Taha Bahadori, Qi Rose Yu, and Yan Liu. 2014. Fast multivariate spatio-temporal analysis via low rank tensor learning. In Advances in Neural Information Processing Systems. 3491–3499.
[2]
Jérôme Bolte, Shoham Sabach, and Marc Teboulle. 2014. Proximal alternating linearized minimization for nonconvex and nonsmooth problems. Mathematical Programming 146, 1-2 (2014), 459–494.
[3]
Nicolas Boumal and P-A Absil. 2015. Low-rank matrix completion via preconditioned optimization on the Grassmann manifold. Linear Algebra Appl. 475(2015), 200–239.
[4]
Stephen Boyd, Stephen P Boyd, and Lieven Vandenberghe. 2004. Convex Optimization. Cambridge University Press.
[5]
Stephen Boyd, Neal Parikh, and Eric Chu. 2011. Distributed optimization and statistical learning via the alternating direction method of multipliers. Foundations and Trends in Machine Learning 3, 1 (2011), 1–122.
[6]
Jian-Feng Cai, Emmanuel J Candès, and Zuowei Shen. 2010. A singular value thresholding algorithm for matrix completion. SIAM Journal on Optimization 20, 4 (2010), 1956–1982.
[7]
Emmanuel J Candès and Benjamin Recht. 2009. Exact matrix completion via convex optimization. Foundations of Computational Mathematics 9, 6 (2009), 717–772.
[8]
Emmanuel J Candes and Terence Tao. 2005. Decoding by linear programming. IEEE Transactions on Information Theory 51, 12 (2005), 4203–4215.
[9]
Emmanuel J Candes, Michael B Wakin, and Stephen P Boyd. 2008. Enhancing sparsity by reweighted ℓ1 minimization. Journal of Fourier Analysis and Applications 14, 5-6(2008), 877–905.
[10]
Jianhui Chen, Jiayu Zhou, and Jieping Ye. 2011. Integrating low-rank and group-sparse structures for robust multi-task learning. In International Conference on Knowledge Discovery and Data Mining. 42–50.
[11]
Jicong Fan, Lijun Ding, Yudong Chen, and Madeleine Udell. 2019. Factor group-sparse regularization for efficient low-rank matrix recovery. In Advances in Neural Information Processing Systems. 5104–5114.
[12]
Maryam Fazel. 2002. Matrix Rank Minimization with Applications. Ph.D. Dissertation. Stanford.
[13]
Rong Ge, Jason D Lee, and Tengyu Ma. 2016. Matrix completion has no spurious local minimum. In Advances in Neural Information Processing Systems. 2973–2981.
[14]
Shuhang Gu, Lei Zhang, Wangmeng Zuo, and Xiangchu Feng. 2014. Weighted nuclear norm minimization with application to image denoising. In Conference on Computer Vision and Pattern Recognition. 2862–2869.
[15]
Huan Gui, Jiawei Han, and Quanquan Gu. 2016. Towards faster rates and oracle property for low-rank matrix estimation. In International Conference on Machine Learning. 2300–2309.
[16]
Suriya Gunasekar, Blake Woodworth, Srinadh Bhojanapalli, Behnam Neyshabur, and Nathan Srebro. 2017. Implicit regularization in matrix factorization. In Advances in Neural Information Processing Systems. 6151–6159.
[17]
F Maxwell Harper and Joseph A Konstan. 2015. The movielens datasets: History and context. ACM Transactions on Interactive Intelligent Systems 5, 4 (2015), 1–19.
[18]
Trevor Hastie, Rahul Mazumder, Jason D Lee, and Reza Zadeh. 2015. Matrix completion and low-rank SVD via fast alternating least squares. Journal of Machine Learning Research 16, 1 (2015), 3367–3402.
[19]
J-B Hiriart-Urruty. 1985. Generalized differentiability, duality and optimization for problems dealing with differences of convex functions. In Proceedings of the Symposium on Convexity and Duality in Optimization. 37–70.
[20]
Yao Hu, Debing Zhang, Jieping Ye, Xuelong Li, and Xiaofei He. 2012. Fast and accurate matrix completion via truncated nuclear norm regularization. IEEE Transactions on Pattern Analysis and Machine Intelligence 35, 9(2012), 2117–2130.
[21]
Alan Jennings and John J McKeown. 1992. Matrix Computation. John Wiley & Sons.
[22]
Yehuda Koren, Robert Bell, and Chris Volinsky. 2009. Matrix factorization techniques for recommender systems. Computer 42, 8 (2009), 30–37.
[23]
Dongsheng Li, Chao Chen, Qin Lv, Hansu Gu, Tun Lu, Li Shang, Ning Gu, and Stephen M Chu. 2018. AdaError: An adaptive learning rate method for matrix approximation-based collaborative filtering. In The World Wide Web Conference. 741–751.
[24]
Yifei Lou and Ming Yan. 2018. Fast L1-L2 minimization via a proximal operator. Journal of Scientific Computing 74, 2 (2018), 767–785.
[25]
Canyi Lu, Jinhui Tang, Shuicheng Yan, and Zhouchen Lin. 2015. Nonconvex nonsmooth low rank minimization via iteratively reweighted nuclear norm. IEEE Transactions on Image Processing 25, 2 (2015), 829–839.
[26]
Canyi Lu, Changbo Zhu, Chunyan Xu, Shuicheng Yan, and Zhouchen Lin. 2015. Generalized singular value thresholding. In AAAI Conference on Artificial Intelligence. 1805–1811.
[27]
Tian-Hui Ma, Yifei Lou, and Ting-Zhu Huang. 2017. Truncated ℓ1 − 2 models for sparse recovery and rank minimization. SIAM Journal on Imaging Sciences 10, 3 (2017), 1346–1380.
[28]
Rahul Mazumder, Diego Saldana, and Haolei Weng. 2020. Matrix completion with nonconvex regularization: Spectral operators and scalable algorithms. Statistics and Computing(2020), 1–26.
[29]
Andriy Mnih and Russ R Salakhutdinov. 2008. Probabilistic matrix factorization. In Advances in Neural Information Processing Systems. 1257–1264.
[30]
Feiping Nie, Heng Huang, and Chris Ding. 2012. Low-rank matrix recovery via efficient schatten p-norm minimization. In AAAI Conference on Artificial Intelligence.
[31]
Jorge Nocedal and Stephen Wright. 2006. Numerical Optimization. Springer Science & Business Media.
[32]
Neal Parikh and Stephen Boyd. 2014. Proximal algorithms. Foundations and Trends in Optimization 1, 3 (2014), 127–239.
[33]
Jeffrey Pennington, Richard Socher, and Christopher D Manning. 2014. Glove: Global vectors for word representation. In Conference on Empirical Methods in Natural Language Processing. 1532–1543.
[34]
Nikhil Rao, Hsiang-Fu Yu, Pradeep Ravikumar, and Inderjit S Dhillon. 2015. Collaborative filtering with graph information: Consistency and scalable methods. In Advances in Neural Information Processing Systems. 2107–2115.
[35]
Ruslan Salakhutdinov and Andriy Mnih. 2008. Bayesian probabilistic matrix factorization using Markov chain Monte Carlo. In International Conference on Machine Learning. 880–887.
[36]
Fanhua Shang, Yuanyuan Liu, and James Cheng. 2016. Tractable and scalable Schatten quasi-norm approximations for rank minimization. In Artificial Intelligence and Statistics. 620–629.
[37]
Mohit Sharma and George Karypis. 2019. Adaptive matrix completion for the users and the items in tail. In The World Wide Web Conference. 3223–3229.
[38]
Tian Shi, Kyeongpil Kang, Jaegul Choo, and Chandan K Reddy. 2018. Short-text topic modeling via non-negative matrix factorization enriched with local word-context correlations. In The World Wide Web Conference. 1105–1114.
[39]
Nathan Srebro, Jason Rennie, and Tommi S Jaakkola. 2005. Maximum-margin matrix factorization. In Advances in Neural Information Processing Systems. 1329–1336.
[40]
Stephen Tu, Ross Boczar, Max Simchowitz, Mahdi Soltanolkotabi, and Ben Recht. 2016. Low-rank solutions of linear matrix equations via procrustes flow. In International Conference on Machine Learning. 964–973.
[41]
Bart Vandereycken. 2013. Low-rank matrix completion by Riemannian optimization. SIAM Journal on Optimization 23, 2 (2013), 1214–1236.
[42]
Lingxiao Wang, Xiao Zhang, and Quanquan Gu. 2017. A unified computational and statistical framework for nonconvex low-rank matrix estimation. In Artificial Intelligence and Statistics. 981–990.
[43]
Zaiwen Wen, Wotao Yin, and Yin Zhang. 2012. Solving a low-rank factorization model for matrix completion by a nonlinear successive over-relaxation algorithm. Mathematical Programming Computation 4, 4 (2012), 333–361.
[44]
Quanming Yao, James T Kwok, Taifeng Wang, and Tie-Yan Liu. 2019. Large-scale low-rank matrix learning with nonconvex regularizers. IEEE Transactions on Pattern Analysis and Machine Intelligence 41, 11(2019), 2628–2643.
[45]
Cun-Hui Zhang. 2010. Nearly unbiased variable selection under minimax concave penalty. The Annals of Statistics 38, 2 (2010), 894–942.
[46]
Tong Zhang. 2010. Analysis of multi-stage convex relaxation for sparse regularization. Journal of Machine Learning Research 11, Mar (2010), 1081–1107.

Cited By

View all
  • (2025)Quaternion Nuclear Norm Minus Frobenius Norm Minimization for color image reconstructionPattern Recognition10.1016/j.patcog.2024.110986158:COnline publication date: 1-Feb-2025
  • (2024)Nuclear Norm Minus Frobenius Norm Minimization with Rank Residual Constraint for Image DenoisingIEICE Transactions on Information and Systems10.1587/transinf.2023EDP7265E107.D:8(992-1006)Online publication date: 1-Aug-2024
  • (2024)SAR Image Speckle Reduction Based on Nuclear Norm Minus Frobenius Norm RegularizationIEEE Transactions on Geoscience and Remote Sensing10.1109/TGRS.2024.350131462(1-15)Online publication date: 2024
  • Show More Cited By
  1. A Scalable, Adaptive and Sound Nonconvex Regularizer for Low-rank Matrix Learning

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    WWW '21: Proceedings of the Web Conference 2021
    April 2021
    4054 pages
    ISBN:9781450383127
    DOI:10.1145/3442381
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 03 June 2021

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Collaborative Filtering
    2. Low-rank Matrix Learning
    3. Matrix Completion
    4. Nonconvex Regularization
    5. Recommender Systems

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    WWW '21
    Sponsor:
    WWW '21: The Web Conference 2021
    April 19 - 23, 2021
    Ljubljana, Slovenia

    Acceptance Rates

    Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)32
    • Downloads (Last 6 weeks)7
    Reflects downloads up to 28 Feb 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2025)Quaternion Nuclear Norm Minus Frobenius Norm Minimization for color image reconstructionPattern Recognition10.1016/j.patcog.2024.110986158:COnline publication date: 1-Feb-2025
    • (2024)Nuclear Norm Minus Frobenius Norm Minimization with Rank Residual Constraint for Image DenoisingIEICE Transactions on Information and Systems10.1587/transinf.2023EDP7265E107.D:8(992-1006)Online publication date: 1-Aug-2024
    • (2024)SAR Image Speckle Reduction Based on Nuclear Norm Minus Frobenius Norm RegularizationIEEE Transactions on Geoscience and Remote Sensing10.1109/TGRS.2024.350131462(1-15)Online publication date: 2024
    • (2024)A Novel Truncated Norm Regularization Method for Multi-Channel Color Image DenoisingIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2024.338230634:9(8427-8441)Online publication date: Sep-2024
    • (2024)Accelerated PALM for Nonconvex Low-Rank Matrix Recovery With Theoretical AnalysisIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2023.330681134:4(2304-2317)Online publication date: Apr-2024
    • (2024)Completion of Traffic Matrix by Tensor Nuclear Norm Minus Frobenius Norm Minimization and Time SlicingNOMS 2024-2024 IEEE Network Operations and Management Symposium10.1109/NOMS59830.2024.10575433(1-5)Online publication date: 6-May-2024
    • (2024)ColdU: User Cold-start Recommendation with User-specific Modulation2024 IEEE Conference on Artificial Intelligence (CAI)10.1109/CAI59869.2024.00069(326-331)Online publication date: 25-Jun-2024
    • (2024)Dual convolutional neural network with attention for image blind denoisingMultimedia Systems10.1007/s00530-024-01469-830:5Online publication date: 8-Sep-2024
    • (2023)Closed-form Machine Unlearning for Matrix FactorizationProceedings of the 32nd ACM International Conference on Information and Knowledge Management10.1145/3583780.3614811(3278-3287)Online publication date: 21-Oct-2023
    • (2022)Convex and Nonconvex Approaches for the Matrix Completion Problem2022 19th International Bhurban Conference on Applied Sciences and Technology (IBCAST)10.1109/IBCAST54850.2022.9990400(451-456)Online publication date: 16-Aug-2022
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format.

    HTML Format

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media