Proximal Quasi-Newton Method for Composite Optimization over the Stiefel Manifold

Wang, Qinsi; Yang, Wei Hong

doi:10.1007/s10915-023-02165-x

Proximal Quasi-Newton Method for Composite Optimization over the Stiefel Manifold

Published: 16 March 2023

Volume 95, article number 39, (2023)
Cite this article

Journal of Scientific Computing Aims and scope Submit manuscript

622 Accesses
2 Citations
1 Altmetric
Explore all metrics

Abstract

In this paper, we consider the composite optimization problems over the Stiefel manifold. A successful method to solve this class of problems is the proximal gradient method proposed by Chen et al. (SIAM J Optim 30:210–239, 2020. https://doi.org/10.1137/18M122457X). Motivated by the proximal Newton-type techniques in the Euclidean space, we present a Riemannian proximal quasi-Newton method, named ManPQN, to solve the composite optimization problems. The global convergence of the ManPQN method is proved and iteration complexity for obtaining an \(\epsilon \)-stationary point is analyzed. Under some mild conditions, we also establish the local linear convergence result of the ManPQN method. Numerical results are encouraging, which shows that the proximal quasi-Newton technique can be used to accelerate the proximal gradient method.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Efficiency of higher-order algorithms for minimizing composite functions

Article 10 October 2023

Yassine Nabou & Ion Necoara

A new optimization approach to solving split equality problems in Hilbert spaces

Article 13 April 2024

Simeon Reich, Truong Minh Tuyen & Nguyen Song Ha

On convergence of iterative thresholding algorithms to approximate sparse solution for composite nonconvex optimization

Article Open access 06 March 2024

Yaohua Hu, Xinlin Hu & Xiaoqi Yang

Data Availability

Enquiries about data availability should be directed to the authors.

Notes

Our MATLAB code is available at https://github.com/QinsiWang2022/ManPQN.

References

Absil, P.-A., Hosseini, S.: A collection of nonsmooth Riemannian optimization problems. Int. Ser. Numer. Math. 170, 1–15 (2019). https://doi.org/10.1007/978-3-030-11370-4_1
Absil, P.-A., Mahony, R., Sepulchre, R.: Optimization Algorithms on Matrix Manifolds. Princeton University Press, Princeton (2008)
Book MATH Google Scholar
Beck, A., Teboulle, M.: A fast iterative shrinkage-threshold algorithm for linear inverse problems. SIAM J. Imaging Sci. 2, 183–202 (2009). https://doi.org/10.1137/080716542
Article MathSciNet MATH Google Scholar
Bento, G., Cruz Neto, J., Oliveira, P.: A new approach to the proximal point method: convergence on general Riemannian manifolds. J. Optim. Theory Appl. 168, 743–755 (2016). https://doi.org/10.1007/s10957-015-0861-2
Article MathSciNet MATH Google Scholar
Boumal, N., Absil, P.-A., Cartis, C.: Global rates of convergence for nonconvex optimization on manifolds. IMA J. Numer. Anal. 39(1), 1–33 (2019). https://doi.org/10.1093/imanum/drx080
Article MathSciNet MATH Google Scholar
Chen, S., Ma, S., So, A.M.-C., Zhang, T.: Proximal gradient method for nonsmooth optimization over the Stiefel manifold. SIAM J. Optim. 30(1), 210–239 (2020). https://doi.org/10.1137/18M122457X
Article MathSciNet MATH Google Scholar
Dai, Y.-H.: A nonmonotone conjugate gradient algorithm for unconstrained optimization. J. Syst. Sci. Complex. 15(2), 139–145 (2002)
MathSciNet MATH Google Scholar
Davis, T.A., Hu, Y.: The University of Florida sparse matrix collection. ACM Trans. Math. Softw. (TOMS) 38(1), 1–25 (2011)
MathSciNet MATH Google Scholar
Ferreira, O., Oliveira, P.: Subgradient algorithm on Riemannian manifolds. J. Optim. Theory Appl. 97, 93–104 (1998). https://doi.org/10.1023/A:1022675100677
Article MathSciNet MATH Google Scholar
Ferreira, O., Oliveira, P.: Proximal point algorithm on Riemannian manifolds. Optimization 51, 257–270 (2002). https://doi.org/10.1080/02331930290019413
Article MathSciNet MATH Google Scholar
Fukushima, M., Qi, L.: A globally and superlinearly convergent algorithm for nonsmooth convex minimization. SIAM J. Optim. 6(4), 1106–1120 (1996). https://doi.org/10.1137/S1052623494278839
Article MathSciNet MATH Google Scholar
Gao, B., Liu, X., Yuan, Y.-X.: Parallelizable algorithms for optimization problems with orthogonality constraints. SIAM J. Sci. Comput. 41(3), 1949–1983 (2019). https://doi.org/10.1137/18M1221679
Article MathSciNet MATH Google Scholar
Grippo, L., Lampariello, F., Lucidi, S.: A nonmonotone line search technique for Newton’s method. SIAM J. Numer. Anal. 23(4), 707–716 (1986). https://doi.org/10.1137/0723046
Article MathSciNet MATH Google Scholar
Grohs, P., Hosseini, S.: \(\varepsilon \)-subgradient algorithms for locally Lipschitz functions on Riemannian manifolds. Adv. Comput. Math. 42(2), 333–360 (2016). https://doi.org/10.1007/s10444-015-9426-z
Article MathSciNet MATH Google Scholar
Hosseini, S., Grohs, P.: Nonsmooth trust region algorithms for locally Lipschitz functions on Riemannian manifolds. IMA J. Numer. Anal. 36(3), 1167–1192 (2016). https://doi.org/10.1093/imanum/drv043
Article MathSciNet MATH Google Scholar
Hu, J., Milzarek, A., Wen, Z., Yuan, Y.-X.: Adaptive quadratically regularized Newton method for Riemannian optimization. SIAM J. Matrix Anal. Appl. 39(3), 1181–1207 (2018). https://doi.org/10.1137/17M1142478
Article MathSciNet MATH Google Scholar
Huang, W., Wei, K.: Riemannian proximal gradient methods. Math. Program. 194, 371–413 (2022). https://doi.org/10.1007/s10107-021-01632-3
Article MathSciNet MATH Google Scholar
Huang, W., Wei, K.: An extension of fast iterative shrinkage-thresholding to Riemannian optimization for sparse principal component analysis. Numer. Linear Algebra Appl. 29, e2409 (2022). https://doi.org/10.1002/nla.2409
Article MathSciNet MATH Google Scholar
Huang, Y., Liu, H.: On the rate of convergence of projected Barzilai–Borwein methods. Optim. Methods Softw. 30(4), 880–892 (2015). https://doi.org/10.1080/10556788.2015.1004064
Article MathSciNet MATH Google Scholar
Lee, J.D., Sun, Y., Saunders, M.A.: Proximal Newton-type methods for minimizing composite functions. SIAM J. Optim. 24(3), 1420–1443 (2014). https://doi.org/10.1137/130921428
Article MathSciNet MATH Google Scholar
Li, D., Wang, X., Huang, J.: Diagonal BFGS updates and applications to the limited memory BFGS method. Comput. Optim. Appl. 81, 829–856 (2022). https://doi.org/10.1007/s10589-022-00353-3
Article MathSciNet MATH Google Scholar
Liu, H., Wu, W., So, A.M.-C.: Quadratic optimization with orthogonality constraints: explicit Łojasiewicz exponent and linear convergence of retraction-based line-search and stochastic variance-reduced gradient methods. Math. Program. 178, 215–262 (2019). https://doi.org/10.1007/s10107-018-1285-1
Article MathSciNet MATH Google Scholar
Mordukhovich, B., Yuan, X., Zeng, S., Zhang, J.: A globally convergent proximal Newton-type method in nonsmooth convex optimization. Math. Program. 198, 899–936 (2023). https://doi.org/10.1007/s10107-022-01797-5
Article MathSciNet MATH Google Scholar
Moré, J.J., Sorensen, D.C.: Computing a trust region step. SIAM J. Sci. Stat. Comput. 4(3), 553–572 (1983). https://doi.org/10.1137/0904038
Article MathSciNet MATH Google Scholar
Nakayama, S., Narushima, Y., Yabe, H.: Inexact proximal memoryless quasi-Newton methods based on the Broyden family for minimizing composite functions. Comput. Optim. Appl. 79, 127–154 (2021). https://doi.org/10.1007/s10589-021-00264-9
Article MathSciNet MATH Google Scholar
Nesterov, Y.: Gradient methods for minimizing composite functions. Math. Program. Ser. B 140, 125–161 (2013). https://doi.org/10.1007/s10107-012-0629-5
Article MathSciNet MATH Google Scholar
Nesterov, Y.: Lectures on Convex Optimization, 2nd edn. Springer, Cham (2018)
Book MATH Google Scholar
Nocedal, J.: Updating quasi-Newton matrices with limited storage. Math. Comput. 35, 773–782 (1980). https://doi.org/10.2307/2006193
Article MathSciNet MATH Google Scholar
Oviedo, H.: Proximal point algorithm on the Stiefel manifold. Preprint in Optimization. http://www.optimization-online.org/DB_FILE/2021/05/ 8401.pdf (2021)
Ozoliņš, V., Lai, R., Caflisch, R., Osher, S.: Compressed modes for variational problems in mathematics and physics. Proc. Natl. Acad. Sci. USA 110(46), 18368–18373 (2013). https://doi.org/10.1073/pnas.1318679110
Article MathSciNet MATH Google Scholar
Park, Y., Dhar, S., Boyd, S., Shah, M.: Variable metric proximal gradient method with diagonal Barzilai–Borwein stepsize. In: ICASSP 2020—2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 3597–3601 (2020). https://doi.org/10.1109/ICASSP40776.2020.9054193
Powell, M.J.: Algorithms for nonlinear constraints that use Lagrangian functions. Math. Program. 14(1), 224–248 (1978). https://doi.org/10.1007/BF01588967
Article MathSciNet MATH Google Scholar
Ring, W., Wirth, B.: Optimization methods on Riemannian manifolds and their application to shape space. SIAM J. Optim. 22(2), 596–627 (2012). https://doi.org/10.1137/11082885X
Article MathSciNet MATH Google Scholar
Sato, H.: Riemannian Newton-type methods for joint diagonalization on the Stiefel manifold with application to independent component analysis. Optimization 66(12), 2211–2231 (2017). https://doi.org/10.1080/02331934.2017.1359592
Sherman, J., Morrison, W.J.: Adjustment of an inverse matrix corresponding to a change in one element of a given matrix. Ann. Math. Stat. 21, 124–127 (1950)
Article MathSciNet MATH Google Scholar
Shor, N.Z.: Application of generalized gradient descent in block programming. Cybernetics 3(3), 43–45 (1967). https://doi.org/10.1007/BF01120005
Article MathSciNet Google Scholar
Wang, X., Ma, S., Goldfarb, D., Liu, W.: Stochastic quasi-Newton methods for nonconvex stochastic optimization. SIAM J. Optim. 27(2), 927–956 (2017). https://doi.org/10.1137/15M1053141
Article MathSciNet MATH Google Scholar
Wen, Z., Yin, W.: A feasible method for optimization with orthogonality constraints. Math. Program. 142, 397–434 (2013). https://doi.org/10.1007/s10107-012-0584-1
Article MathSciNet MATH Google Scholar
Wright, S., Nowak, R., Figueiredo, M.: Sparse reconstruction by separable approximation. IEEE Trans. Signal Process. 57(7), 2479–2493 (2009). https://doi.org/10.1109/TSP.2009.2016892
Article MathSciNet MATH Google Scholar
Xiao, X., Li, Y., Wen, Z., Zhang, L.W.: A regularized semi-smooth Newton method with projection steps for composite convex programs. J. Sci. Comput. 76, 364–389 (2018). https://doi.org/10.1007/s10915-017-0624-3
Article MathSciNet MATH Google Scholar
Yang, W.H., Zhang, L., Song, R.: Optimality conditions for the nonlinear programming problems on Riemannian manifolds. Pac. J. Optim. 10, 415–434 (2013)
MathSciNet MATH Google Scholar
Zhang, C., Chen, X., Ma, S.: A Riemannian smoothing steepest descent method for non-Lipschitz optimization on submanifolds (2021). ArXiv arXiv:2104.04199

Download references

Acknowledgements

The authors are grateful to the associate editor and the two anonymous referees for their valuable comments and suggestions.

Funding

The work of Wei Hong Yang was supported by the National Natural Science Foundation of China Grant 11971118.

Author information

Authors and Affiliations

School of Mathematical Sciences, Fudan University, 220 Handan Street, Shanghai, 200433, China
Qinsi Wang & Wei Hong Yang

Authors

Qinsi Wang
View author publications
You can also search for this author in PubMed Google Scholar
Wei Hong Yang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Wei Hong Yang.

Ethics declarations

Conflict of interest

The authors have no relevant financial or non-financial interests to disclose.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Wang, Q., Yang, W.H. Proximal Quasi-Newton Method for Composite Optimization over the Stiefel Manifold. J Sci Comput 95, 39 (2023). https://doi.org/10.1007/s10915-023-02165-x

Download citation

Received: 24 May 2022
Revised: 12 November 2022
Accepted: 12 February 2023
Published: 16 March 2023
DOI: https://doi.org/10.1007/s10915-023-02165-x

Keywords

Mathematics Subject Classification

90C30

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Proximal Quasi-Newton Method for Composite Optimization over the Stiefel Manifold

Abstract

Access this article

Similar content being viewed by others

Efficiency of higher-order algorithms for minimizing composite functions

A new optimization approach to solving split equality problems in Hilbert spaces

On convergence of iterative thresholding algorithms to approximate sparse solution for composite nonconvex optimization

Data Availability

Notes

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Mathematics Subject Classification

Navigation

Proximal Quasi-Newton Method for Composite Optimization over the Stiefel Manifold

Abstract

Access this article

Similar content being viewed by others

Efficiency of higher-order algorithms for minimizing composite functions

A new optimization approach to solving split equality problems in Hilbert spaces

On convergence of iterative thresholding algorithms to approximate sparse solution for composite nonconvex optimization

Data Availability

Notes

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Mathematics Subject Classification

Search

Navigation