An extrapolated iteratively reweighted $$\ell _1$$ method with complexity analysis

Wang, Hao; Zeng, Hao; Wang, Jiashan

doi:10.1007/s10589-022-00416-5

An extrapolated iteratively reweighted $\ell _1$ method with complexity analysis

Published: 05 October 2022

Volume 83, pages 967–997, (2022)
Cite this article

Computational Optimization and Applications Aims and scope Submit manuscript

393 Accesses
1 Altmetric
Explore all metrics

Abstract

The iteratively reweighted $\ell _1$ algorithm is a widely used method for solving various regularization problems, which generally minimize a differentiable loss function combined with a convex/nonconvex regularizer to induce sparsity in the solution. However, the convergence and the complexity of iteratively reweighted $\ell _1$ algorithms is generally difficult to analyze, especially for non-Lipschitz differentiable regularizers such as $\ell _p$ norm regularization with $0<p<1$. In this paper, we propose, analyze and test a reweighted $\ell _1$ algorithm combined with the extrapolation technique under the assumption of Kurdyka-Łojasiewicz (KL) property on the proximal function of the perturbed objective. Our method does not require the Lipschitz differentiability on the regularizers nor the smoothing parameters in the weights bounded away from 0. We show the proposed algorithm converges uniquely to a stationary point of the regularization problem and has local linear convergence for KL exponent at most 1/2 and local sublinear convergence for KL exponent greater than 1/2. We also provide results on calculating the KL exponents and discuss the cases when the KL exponent is at most 1/2. Numerical experiments show the efficiency of our proposed method.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Convergence rate analysis of proximal iteratively reweighted $$\ell _1$$ methods for $$\ell _p$$ regularization problems

Article 26 July 2022

Relating $$\ell _p$$ regularization and reweighted $$\ell _1$$ regularization

Article 29 January 2021

Iteratively reweighted $$\ell _1$$ algorithms with extrapolation

Article 25 February 2019

Data Availability

The data that support the findings of this study are available from the corresponding author upon request.

References

Attouch, Hedy, Bolte, Jérôme.: On the convergence of the proximal algorithm for nonsmooth functions involving analytic features. Math. Program. 116(1), 5–16 (2009)
Article MathSciNet MATH Google Scholar
Attouch, Hédy., Bolte, Jérôme., Redont, Patrick, Soubeyran, Antoine: Proximal alternating minimization and projection methods for nonconvex problems: an approach based on the kurdyka-łojasiewicz inequality. Math. Oper. Res. 35(2), 438–457 (2010)
Article MathSciNet MATH Google Scholar
Attouch, H., Bolte, J., Svaiter, B.F.: Convergence of descent methods for semi-algebraic and tame problems: proximal algorithms, forward-backward splitting, and regularized gauss-seidel methods. Math. Program. 137(1–2), 91–129 (2013)
Article MathSciNet MATH Google Scholar
Auslender, Alfred, Teboulle, Marc: Interior gradient and proximal methods for convex and conic optimization. SIAM J. Optim. 16(3), 697–725 (2006)
Article MathSciNet MATH Google Scholar
Bauschke,H.H., Dao, M.N., Moursi, W.M.: On fej$\backslash $’er monotone sequences and nonexpansive mappings. arXiv preprint arXiv:1507.05585, 2015
Beck, Amir, Teboulle, Marc: A fast iterative shrinkage-thresholding algorithm for linear inverse problems. SIAM J. Imag. Sci. 2(1), 183–202 (2009)
Article MathSciNet MATH Google Scholar
Becker, S.R., Candès, E.J., Grant, M.C.: Templates for convex cone problems with applications to sparse signal recovery. Math. Program. comput. 3(3), 165 (2011)
Article MathSciNet MATH Google Scholar
Bolte, J., Daniilidis, A., Lewis, A.: The łojasiewicz inequality for nonsmooth subanalytic functions with applications to subgradient dynamical systems. SIAM J. Optim. 17(4), 1205–1223 (2007)
Article MATH Google Scholar
Bolte, Jérôme., Sabach, Shoham, Teboulle, Marc: Proximal alternating linearized minimization for nonconvex and nonsmooth problems. Math. Program. 146(1), 459–494 (2014)
Article MathSciNet MATH Google Scholar
Candes, E.J., Wakin, M.B., Boyd, S.P.: Enhancing sparsity by reweighted $\ell _1$ minimization. J. Fourier Anal. Appl. 14(5–6), 877–905 (2008)
Article MathSciNet MATH Google Scholar
Chartrand, R., Yin, W.: Iteratively reweighted algorithms for compressive sensing. In 2008 IEEE International conference on acoustics, speech and signal processing, pp. 3869–3872. IEEE, 2008
Chen, Xiaojun, Zhou, Weijun: Convergence of reweighted $\ell _1$ minimization algorithms and unique solution of truncated lp minimization. The Hong Kong Polytechnic University, Department of Applied Mathematics (2010)
Google Scholar
Fan, Jianqing, Li, Runze: Variable selection via nonconcave penalized likelihood and its oracle properties. J. Am. Stat. Assoc. 96(456), 1348–1360 (2001)
Article MathSciNet MATH Google Scholar
Figueiredo, M.A.T., Nowak, R.D., Wright, S.J.: Gradient projection for sparse reconstruction: application to compressed sensing and other inverse problems. IEEE J. Selected Topics Signal Process. 1(4), 586–597 (2007)
Article Google Scholar
Frankel, Pierre, Garrigos, Guillaume, Peypouquet, Juan: Splitting methods with variable metric for kurdyka-łojasiewicz functions and general convergence rates. J. Optim. Theory Appl. 165(3), 874–900 (2015)
Article MathSciNet MATH Google Scholar
Ge, Dongdong, Jiang, Xiaoye, Ye, Yinyu: A note on the complexity of $\ell _p$ minimization. Math. Program. 129(2), 285–299 (2011)
Article MathSciNet MATH Google Scholar
Yaohua, Hu., Li, Chong, Meng, Kaiwen, Yang, Xiaoqi: Linear convergence of inexact descent method and inexact proximal gradient algorithms for lower-order regularization problems. J. Global Optim. 79(4), 853–883 (2021)
Article MathSciNet MATH Google Scholar
Jaggi, M.: Sparse convex optimization methods for machine learning. PhD Thesis, ETH Zurich, 2011
Lai, Ming-Jun., Wang, Jingyue: An unconstrained $ \ell _q $ minimization with $0<q\le 1$ for sparse solution of underdetermined linear systems. SIAM J. Optim. 21(1), 82–101 (2011)
Article MathSciNet Google Scholar
Lan, G., Lu, Z., Monteiro, R.D.C.: Primal-dual first-order methods with $o(1/\epsilon )$ iteration-complexity for cone programming. Math. Program. 126(1), 1–29 (2011)
Article MathSciNet MATH Google Scholar
Guoyin Li and Ting Kei Pong: Douglas-rachford splitting for nonconvex optimization with application to nonconvex feasibility problems. Math. Program. 159(1), 371–401 (2016)
MathSciNet MATH Google Scholar
Guoyin Li and Ting Kei Pong: Calculus of the exponent of kurdyka-łojasiewicz inequality and its applications to linear convergence of first-order methods. Found. Comput. Math. 18(5), 1199–1232 (2018)
Article MathSciNet MATH Google Scholar
Lobo, M.S., Fazel, M., Boyd, S.: Portfolio optimization with linear and fixed transaction costs. Annal Operat. Res. 152(1), 341–365 (2007)
Article MathSciNet MATH Google Scholar
Canyi Lu, Yunchao Wei, Zhouchen Lin, Shuicheng Yan.: Proximal iteratively reweighted algorithm with multiple splitting for nonconvex sparsity optimization. In Twenty-Eighth AAAI conference on artificial intelligence, 2014
Zhaosong, Lu.: Iterative reweighted minimization methods for $\ell _p$ regularized unconstrained nonlinear programming. Math. Program. 147(1–2), 277–307 (2014)
MathSciNet MATH Google Scholar
Weixin Luo, Wen Liu, and Shenghua Gao.: A revisit of sparse coding based anomaly detection in stacked rnn framework. In Proceedings of the IEEE international conference on computer vision, pp. 341–349, 2017
Luo, Zhi-Quan., Pang, Jong-Shi., Ralph, Daniel: Mathematical programs with equilibrium constraints. Cambridge University Press, Cambridge (1996)
Book MATH Google Scholar
Lustig, M., Donoho, D., Pauly, J.M.: Sparse mri: the application of compressed sensing for rapid mr imaging. Magnetic Resonance Med.: Official J. Int. Soc. Magnetic Resonance Med. 58(6), 1182–1195 (2007)
Article Google Scholar
Mairal, Julien, Elad, Michael, Sapiro, Guillermo: Sparse representation for color image restoration. IEEE Trans. Image Process. 17(1), 53–69 (2007)
Article MathSciNet Google Scholar
Mairal, J., Bach, F., Ponce, J., Sapiro, G.: Online learning for matrix factorization and sparse coding. J. Mach. Learn. Res. 11(Jan), 19–60 (2010)
MathSciNet MATH Google Scholar
Nesterov, Yu.: Gradient methods for minimizing composite functions. Math. Program. 140(1), 125–161 (2013)
Article MathSciNet MATH Google Scholar
Nesterov, Yurii: Introductory lectures on convex programming volume i: basic course. Lecture Notes 3(4), 5 (1998)
Google Scholar
Nesterov, Yurii: Primal-dual subgradient methods for convex problems. Math. Program. 120(1), 221–259 (2009)
Article MathSciNet MATH Google Scholar
Nesterov, Y.E.: A method for solving the convex programming problem with convergence rate o (1/k$\hat{}$ 2). In Dokl. Akad. Nauk Sssr 269, 543–547 (1983)
MathSciNet Google Scholar
Tseng, Paul: Approximation accuracy, gradient methods, and error bound for structured convex optimization. Math. Program. 125(2), 263–295 (2010)
Article MathSciNet MATH Google Scholar
Wang, Feng: Study on the Kurdyka-Łojasiewicz exponents of $\ell _p$ regularization functions (in Chinese). PhD thesis Southwest Jiaotong University, Chengdu (2021)
Google Scholar
Wang, H.,Zeng, H., Wang, J.: Relating $\ell _p$ regularization and reweighted $\ell _1$ regularization. arXiv preprint arXiv:1912.00723, 2019
Wen, B., Chen, X., Pong, T.K.: A proximal difference-of-convex algorithm with extrapolation. Comput. Optim. Appl. 69(2), 297–324 (2018)
Article MathSciNet MATH Google Scholar
Yangyang, Xu., Yin, Wotao: A block coordinate descent method for regularized multiconvex optimization with applications to nonnegative tensor factorization and completion. SIAM J. Imag. Sci. 6(3), 1758–1789 (2013)
Article MathSciNet MATH Google Scholar
Peiran Yu and Ting Kei Pong: Iteratively reweighted $\ell _1$ algorithms with extrapolation. Comput. Optim. Appl. 73(2), 353–386 (2019)
Article MathSciNet MATH Google Scholar
Yu, P., Li, G., Pong, T.K.: Kurdyka-łojasiewicz exponent via inf-projection. Found. Comput. Math. 22, 1–47 (2021)
MATH Google Scholar
Zeng, Jinshan, Lin, Shaobo, Zongben, Xu.: Sparse regularization: convergence of iterative jumping thresholding algorithm. IEEE Trans. Signal Process. 64(19), 5106–5118 (2016)
Article MathSciNet MATH Google Scholar
Roman Zeyde, Michael Elad, Matan Protter.: On single image scale-up using sparse-representations. In International conference on curves and surfaces, pp. 711–730. Springer, 2010
Zhang, Cun-Hui., et al.: Nearly unbiased variable selection under minimax concave penalty. Ann. Stat. 38(2), 894–942 (2010)
Article MathSciNet MATH Google Scholar

Download references

Acknowledgements

Hao Wang was supported by the Natural Science Foundation of Shanghai under Grant 21ZR1442800 and the National Natural Science Foundation of China under Grant 12001367.The authors would like to thank Professor Ting Kei Pong and Professor Kaiwen Meng for their advice on calculating the KL exponent.

Author information

Authors and Affiliations

School of Information Science and Technology, ShanghaiTech University, Shanghai, China
Hao Wang & Hao Zeng
Department of Mathematics, University of Washington, Seattle, USA
Jiashan Wang

Authors

Hao Wang
View author publications
You can also search for this author in PubMed Google Scholar
Hao Zeng
View author publications
You can also search for this author in PubMed Google Scholar
Jiashan Wang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Hao Wang.

Ethics declarations

Conflict of interest

All authors disclosed no relevant relationships.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendix

The translation of the proof of Theorem 5.

Proof

It follows from $\nabla h(x^*)=0$ and the mean-value theorem that

$$\begin{aligned} h(x) - h(x^*) = (x-x^*)^T\nabla ^2h(x^*+t_x(x-x^*))(x-x^*),\end{aligned}$$

with $t_x \in (0,1)$. Since h is twice continuously differentiable on $\mathbb {B}(x^*, \varepsilon _0)$, there exists $L>0$ such that

$$\begin{aligned} h(x) - h(x^*) \le L\Vert x-x^*\Vert _2^2\end{aligned}$$

for any $x\in \mathbb {B}(x^*,\varepsilon _0)$. On the other hand,

$$ \begin{aligned} \nabla h(x) = & \nabla h(x) - \nabla h(x^{*} ) \\ = & \int_{0}^{1} {\nabla ^{2} } h(x + t(x - x^{*} ))(x - x^{*} )dt \\ = & \left[ {\begin{array}{*{20}c} {[\nabla ^{2} h(x^{*} + t_{{1x}} (x - x^{*} ))]_{1} } \\ \ldots \\ {\nabla ^{2} h(x^{*} + t_{{nx}} (x - x^{*} ))]_{n} } \\ \end{array} } \right](x - x^{*} ) \\ : = & A_{x} (x - x^{*} ), \\ \end{aligned} $$

where $t_{ix}\in (0,1), i=1,2, \ldots , n$.

Since $\nabla ^2 h(x^*)$ is nonsingular, there exists $0<\varepsilon _1 < \varepsilon _0$ such that for any $x\in \mathbb {B}(x^*, \varepsilon _1)$, $A_x$ is nonsingular. Therefore, $A_x^TA_x$ is positive definite on $\mathbb {B}(x^*, \varepsilon _1)$. Hence, there exists $0<\varepsilon < \varepsilon _1$ and $\sigma > 0$ satisfying $\sigma = \min _{x\in \mathbb {B}(x^*, \varepsilon )} \sigma _{\min }(A_x^TA_x)$. It then follows that

$$\begin{aligned} \Vert \nabla h(x)\Vert ^2 = (x-x^*)^TA_x^TA_x(x-x^*) \ge \sigma \Vert x-x^*\Vert ^2_2 \ge \frac{\sigma }{L}(h(x)-h(x^*)). \end{aligned}$$

This implies that there exists $\theta = \tfrac{1}{2} \in (0,1]$, $\varepsilon > 0$ and $c= \big ( \tfrac{L}{\sigma } \big )^{1/2}$ for any $x\in B(x^*, \varepsilon )$,

$$\begin{aligned} (h(x)-h(x^*))^{1-\theta }_+ \le c\Vert \nabla h(x)\Vert ,\end{aligned}$$

or, equivalently, h satisfies the KL inequality with $\theta = \tfrac{1}{2}$. $\square $

Rights and permissions

Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Wang, H., Zeng, H. & Wang, J. An extrapolated iteratively reweighted $\ell _1$ method with complexity analysis. Comput Optim Appl 83, 967–997 (2022). https://doi.org/10.1007/s10589-022-00416-5

Download citation

Received: 12 July 2020
Accepted: 19 September 2022
Published: 05 October 2022
Issue Date: December 2022
DOI: https://doi.org/10.1007/s10589-022-00416-5

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

An extrapolated iteratively reweighted \(\ell _1\) method with complexity analysis

Abstract

Access this article

Similar content being viewed by others

Convergence rate analysis of proximal iteratively reweighted $$\ell _1$$ methods for $$\ell _p$$ regularization problems

Relating $$\ell _p$$ regularization and reweighted $$\ell _1$$ regularization

Iteratively reweighted $$\ell _1$$ algorithms with extrapolation

Data Availability

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Appendix

Proof

Rights and permissions

About this article

Cite this article

Keywords

Navigation

An extrapolated iteratively reweighted \(\ell _1\) method with complexity analysis

Abstract

Access this article

Similar content being viewed by others

Convergence rate analysis of proximal iteratively reweighted $$\ell _1$$ methods for $$\ell _p$$ regularization problems

Relating $$\ell _p$$ regularization and reweighted $$\ell _1$$ regularization

Iteratively reweighted $$\ell _1$$ algorithms with extrapolation

Data Availability

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Appendix

Appendix

Proof

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation