An efficient augmented Lagrangian method with applications to total variation minimization

Abstract

Based on the classic augmented Lagrangian multiplier method, we propose, analyze and test an algorithm for solving a class of equality-constrained non-smooth optimization problems (chiefly but not necessarily convex programs) with a particular structure. The algorithm effectively combines an alternating direction technique with a nonmonotone line search to minimize the augmented Lagrangian function at each iteration. We establish convergence for this algorithm, and apply it to solving problems in image reconstruction with total variation regularization. We present numerical results showing that the resulting solver, called TVAL3, is competitive with, and often outperforms, other state-of-the-art solvers in the field.
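
To make the combination concrete, the following is a minimal, self-contained sketch in Python of an alternating-direction augmented Lagrangian iteration for the TV-type model min_x ||Dx||_1 subject to Ax = b. All function names and parameter values here are illustrative; this is not the authors' TVAL3 code. For simplicity the x-subproblem is solved exactly, whereas the algorithm of this paper takes inexact steps safeguarded by a nonmonotone line search (analyzed in the appendix).

```python
# A sketch only: illustrative names and parameter values, not TVAL3 itself.
import numpy as np

def shrink(z, t):
    """Soft-thresholding: minimizer of t*||w||_1 + 0.5*||w - z||^2."""
    return np.sign(z) * np.maximum(np.abs(z) - t, 0.0)

def tv_al_sketch(A, b, beta=10.0, mu=10.0, iters=200):
    """Alternating-direction minimization of the augmented Lagrangian of
    min ||D x||_1  s.t.  A x = b, with multiplier updates (nu for D x = w,
    lam for A x = b)."""
    m, n = A.shape
    D = np.diff(np.eye(n), axis=0)     # 1-D forward-difference operator
    x = A.T @ b                        # crude starting point
    nu, lam = np.zeros(n - 1), np.zeros(m)
    H = beta * D.T @ D + mu * A.T @ A  # x-step matrix (PD for generic A)
    for _ in range(iters):
        w = shrink(D @ x - nu / beta, 1.0 / beta)            # w-subproblem
        x = np.linalg.solve(H, D.T @ (beta * w + nu)         # x-subproblem
                               + A.T @ (mu * b + lam))
        nu -= beta * (D @ x - w)                             # multiplier
        lam -= mu * (A @ x - b)                              # updates
    return x

# Usage: recover a piecewise-constant signal from random projections.
rng = np.random.default_rng(0)
n, m = 64, 32
x_true = np.zeros(n); x_true[20:40] = 1.0
A = rng.standard_normal((m, n)) / np.sqrt(m)
x_rec = tv_al_sketch(A, A @ x_true)   # x_rec should approximate x_true
```

The w-step has a closed-form shrinkage solution, the x-step is a linear solve, and the two multipliers are updated in the classic augmented Lagrangian fashion; per the abstract, the paper's algorithm follows this alternating pattern but replaces the exact x-step with inexpensive steps accepted by a nonmonotone line search.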


References

  1. Barzilai, J., Borwein, J.M.: Two-point step size gradient methods. IMA J. Numer. Anal. 8, 141–148 (1988)

  2. Becker, S., Bobin, J., Candès, E.: NESTA: a fast and accurate first-order method for sparse recovery. SIAM J. Imaging Sci. 4, 1–39 (2011)

  3. Bioucas-Dias, J., Figueiredo, M.: A new TwIST: two-step iterative shrinkage/thresholding algorithms for image restoration. IEEE Trans. Image Process. 16(12), 2992–3004 (2007)

  4. Bioucas-Dias, J., Figueiredo, M.: Two-step algorithms for linear inverse problems with non-quadratic regularization. In: IEEE International Conference on Image Processing—ICIP 2007, San Antonio, TX, USA, September 2007

  5. Boykov, Y., Veksler, O., Zabih, R.: Fast approximate energy minimization via graph cuts. IEEE Trans. Pattern Anal. Mach. Intell. 23(11), 1222–1239 (2001)

  6. Candès, E., Romberg, J., Tao, T.: Robust uncertainty principles: exact signal reconstruction from highly incomplete frequency information. IEEE Trans. Inf. Theory 52(2), 489–509 (2006)

  7. Candès, E., Tao, T.: Near optimal signal recovery from random projections: universal encoding strategies. IEEE Trans. Inf. Theory 52(12), 5406–5425 (2006)

  8. Chambolle, A.: An algorithm for total variation minimization and applications. J. Math. Imaging Vis. 20, 89–97 (2004)

  9. Chan, T., Wong, C.K.: Total variation blind deconvolution. IEEE Trans. Image Process. 7(3), 370–375 (1998)

  10. Chang, T., He, L., Fang, T.: MR image reconstruction from sparse radial samples using Bregman iteration. In: ISMRM (2006)

  11. Donoho, D.: Compressed sensing. IEEE Trans. Inf. Theory 52(4), 1289–1306 (2006)

  12. Donoho, D.: Neighborly polytopes and sparse solution of underdetermined linear equations. IEEE Trans. Inf. Theory (2006)

  13. Duarte, M.F., Sarvotham, S., Baron, D., Wakin, M.B., Baraniuk, R.G.: Distributed compressed sensing of jointly sparse signals. In: 39th Asilomar Conference on Signals, Systems and Computers, pp. 1537–1541 (2005)

  14. Fortin, M., Glowinski, R.: Méthodes de Lagrangien Augmenté. Application à la Résolution Numérique de Problèmes aux Limites. Dunod-Bordas, Paris (1982) (in French)

  15. Gabay, D., Mercier, B.: A dual algorithm for the solution of nonlinear variational problems via finite element approximations. Comput. Math. Appl. 2, 17–40 (1976)

  16. Glowinski, R.: Numerical Methods for Nonlinear Variational Problems. Springer, Berlin (1984)

  17. Glowinski, R., Marrocco, A.: Sur l’approximation par éléments finis d’ordre un et la résolution par pénalisation-dualité d’une classe de problèmes de Dirichlet nonlinéaires. C. R. Math. Acad. Sci. Paris 278A, 1649–1652 (1974) (in French)

  18. Goldstein, T., Osher, S.: The split Bregman method for L1 regularized problems. SIAM J. Imaging Sci. 2(2), 323–343 (2009)

  19. Grippo, L., Lampariello, F., Lucidi, S.: A nonmonotone line search technique for Newton’s method. SIAM J. Numer. Anal. 23, 707–716 (1986)

  20. Hager, W.W., Phan, D.T., Zhang, H.: Gradient-based methods for sparse recovery. SIAM J. Imaging Sci. 4, 146–165 (2011)

  21. Hestenes, M.R.: Multiplier and gradient methods. J. Optim. Theory Appl. 4, 303–320 (1969)

  22. Jiang, H., Li, C., Haimi-Cohen, R., Wilford, P., Zhang, Y.: Scalable video coding using compressive sensing. Bell Labs Tech. J. 16, 149–169 (2012)

  23. Laska, J., Kirolos, S., Duarte, M., Ragheb, T., Baraniuk, R., Massoud, Y.: Theory and implementation of an analog-to-information converter using random demodulation. In: Proceedings of the IEEE International Symposium on Circuits and Systems (ISCAS), New Orleans, Louisiana (2007)

  24. Li, C., Jiang, H., Wilford, P., Zhang, Y.: Video coding using compressive sensing for wireless communications. In: IEEE Wireless Communications and Networking Conference (WCNC), pp. 2077–2082 (2011). doi:10.1109/WCNC.2011.5779474

  25. Li, C., Sun, T., Kelly, K., Zhang, Y.: A compressive sensing and unmixing scheme for hyperspectral data processing. IEEE Trans. Image Process. 21, 1200–1210 (2012)

  26. Li, C., Zhang, Y., Yin, W.: http://www.caam.rice.edu/~optimization/L1/TVAL3/

  27. Lions, P.L., Mercier, B.: Splitting algorithms for the sum of two nonlinear operators. SIAM J. Numer. Anal. 16, 964–979 (1979)

  28. Natarajan, B.K.: Sparse approximate solutions to linear systems. SIAM J. Comput. 24, 227–234 (1995)

  29. Nesterov, Yu.: Smooth minimization of non-smooth functions. Math. Program., Ser. A 103, 127–152 (2005)

  30. Osher, S., Burger, M., Goldfarb, D., Xu, J., Yin, W.: An iterated regularization method for total variation based image restoration. Multiscale Model. Simul. 4, 460–489 (2005)

  31. Powell, M.J.D.: A method for nonlinear constraints in minimization problems. In: Fletcher, R. (ed.) Optimization, pp. 283–298. Academic Press, London (1969)

  32. Rockafellar, R.T.: The multiplier method of Hestenes and Powell applied to convex programming. J. Optim. Theory Appl. 12(6), 555–562 (1973)

  33. Rudin, L., Osher, S., Fatemi, E.: Nonlinear total variation based noise removal algorithms. Physica D 60, 259–268 (1992)

  34. Sun, T., Woods, G.L., Duarte, M.F., Kelly, K.F., Li, C., Zhang, Y.: OBIC measurements without lasers or raster-scanning based on compressive sensing. In: Proceedings of the 35th International Symposium for Testing and Failure Analysis (2009)

  35. Takhar, D., Laska, J.N., Wakin, M.B., Duarte, M.F., Baron, D., Sarvotham, S., Kelly, K.F., Baraniuk, R.G.: A new compressive imaging camera architecture using optical-domain compression. In: Computational Imaging IV, vol. 6065, pp. 43–52 (2006)

  36. Wang, Y., Yang, J., Yin, W., Zhang, Y.: A new alternating minimization algorithm for total variation image reconstruction. SIAM J. Imaging Sci. 1(4), 248–272 (2008)

  37. Yang, J., Yin, W., Zhang, Y.: A fast alternating direction method for TVL1-L2 signal reconstruction from partial Fourier data. Technical Report, TR08-27, CAAM, Rice University (2008)

  38. Yin, W., Osher, S., Goldfarb, D., Darbon, J.: Bregman iterative algorithms for ℓ1-minimization with applications to compressed sensing. SIAM J. Imaging Sci. 1, 143–168 (2008)

  39. Yin, W., Morgan, S., Yang, J., Zhang, Y.: Practical compressive sensing with Toeplitz and circulant matrices. In: Proceedings of Visual Communications and Image Processing (VCIP) (2010)

  40. Zhang, H., Hager, W.W.: A nonmonotone line search technique and its application to unconstrained optimization. SIAM J. Optim. 14, 1043–1056 (2004)

  41. Xiao, Y., Song, H.: An inexact alternating directions algorithm for constrained total variation regularized compressive sensing problems. J. Math. Imaging Vis. 44, 114–127 (2012)


Acknowledgements

The work of the first author was supported in part by NSF Grant DMS-0811188. The work of the second author was supported in part by NSF grants DMS-07-48839 and ECCS-1028790, as well as ONR Grant N00014-08-1-1101. The work of the fourth author was supported in part by NSF Grant DMS-0811188, ONR Grant N00014-08-1-1101, and NSF Grant DMS-1115950. The first and the fourth authors also appreciate a gift fund from Bell Labs, Alcatel-Lucent to Rice University that partially supported their travels to international conferences. Last but not least, we thank the two anonymous referees for their constructive criticism and their helpful suggestions.

Author information

Correspondence to Chengbo Li.

Appendix: Proof of Theorem 1

For notational simplicity, let us define

$$ \phi_k(\cdot) = \phi(\cdot, y_k) \quad \mbox{and}\quad\nabla\phi_k(\cdot) = \partial_1 \phi( \cdot, y_k). $$
(22)

The proof of the theorem relies on two lemmas, both of which are modifications of their counterparts in [40]. Since our objective may contain a non-differentiable part, the key modification is to connect this non-differentiable part to the differentiable part by means of alternating minimization; otherwise, the line of proof follows closely that given in [40].

The first lemma presents some basic properties and establishes that the algorithm is well-defined.

Lemma 1

If $\nabla\phi_k(x_k)^T d_k \le 0$ holds for each $k$, then for the sequences generated by Algorithm-NADA we have $\phi_k(x_k) \le \phi_{k-1}(x_k) \le C_k$ for each $k$, and $\{C_k\}$ is monotonically non-increasing. Moreover, if $\nabla\phi_k(x_k)^T d_k < 0$, a step length $\alpha_k > 0$ always exists so that the nonmonotone Armijo condition (11) holds.

Proof

Define the real-valued function

$$D_k(t) = \frac{tC_{k-1} + \phi_{k-1}(x_k)}{t+1} \quad\mbox{for}\ t \ge0, $$

then

$$D_k'(t) = \frac{C_{k-1} - \phi_{k-1}(x_k)}{(t+1)^2} \quad\mbox{for}\ t \ge 0. $$

Due to the nonmonotone Armijo condition (11) and $\nabla\phi_k(x_k)^T d_k \le 0$, we have

$$C_{k-1} - \phi_{k-1}(x_k)\geq-\delta \alpha_{k-1} \nabla\phi_{k-1}(x_{k-1})^T d_{k-1} \geq0. $$

Therefore, $D_k'(t) \ge 0$ holds for any $t \ge 0$, and hence $D_k$ is non-decreasing.

Since

$$D_k(0) = \phi_{k-1}(x_k) \quad\mbox{and}\quad D_k(\eta_{k-1} Q_{k-1}) = C_k, $$

we have

$$\phi_{k-1}(x_k) \le C_k, \quad\forall k. $$

As defined in Algorithm-NADA,

$$y_k = \operatorname {argmin}_y \phi(x_k,y). $$

Therefore,

$$\phi(x_k, y_k) \leq\phi(x_{k}, y_{k-1}). $$

Hence, $\phi_k(x_k) \le \phi_{k-1}(x_k) \le C_k$ holds for any $k$.

Furthermore,

$$C_{k+1} = \frac{\eta_k Q_k C_k + \phi_k(x_{k+1})}{Q_{k+1}} \le\frac {\eta_k Q_k C_k + C_{k+1}}{Q_{k+1}} , $$

i.e.,

$$(\eta_k Q_k +1)C_{k+1} \le\eta_k Q_k C_k + C_{k+1}, $$

that is,

$$C_{k+1}\le C_k. $$

Thus, $\{C_k\}$ is monotonically non-increasing.

If $C_k$ is replaced by $\phi_k(x_k)$ in (11), the nonmonotone Armijo condition reduces to the standard Armijo condition. It is well known that a step length $\alpha_k > 0$ exists for the standard Armijo condition whenever $\nabla\phi_k(x_k)^T d_k < 0$ and $\phi$ is bounded below. Since $\phi_k(x_k) \le C_k$, it follows that $\alpha_k > 0$ exists as well for the nonmonotone Armijo condition:

$$\begin{aligned} \phi_k(x_k+\alpha_k d_k) \le C_k + \delta\alpha_k \nabla \phi_k(x_k)^T d_k. \end{aligned}$$

Now define the quantity $A_k$ by

$$\begin{aligned} A_k = \frac{1}{k+1} \sum _{i=0}^k \phi_i(x_i). \end{aligned}$$
(23)

By induction, it is easy to show that $C_k$ is bounded above by $A_k$. Together with the facts that $C_k$ is bounded below by $\phi_k(x_k)$ and that $\alpha_k > 0$ always exists, it is clear that Algorithm-NADA is well-defined. □
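
To make the recursions above concrete, here is a short Python sketch (illustrative names; a sketch of the Zhang–Hager-style bookkeeping from [40] as used by Algorithm-NADA, not the authors' implementation) of one nonmonotone Armijo backtracking step followed by the $C_k$ and $Q_k$ updates that appear in the displays above:

```python
# A sketch only: illustrative names, assuming `phi` and `grad` evaluate
# phi_k and its gradient with y_k held fixed (definition (22)).
import numpy as np

def nonmonotone_armijo(phi, grad, x, d, C, Q, eta=0.85, delta=1e-4,
                       alpha0=1.0, rho=0.5, max_backtracks=50):
    """One iteration: find alpha satisfying condition (11),
    phi(x + alpha*d) <= C + delta*alpha*grad(x)'d,
    then update the reference value C and the weight Q."""
    gTd = grad(x) @ d
    if gTd >= 0.0:                    # not a descent direction: no step
        return x, C, Q
    alpha = alpha0
    for _ in range(max_backtracks):
        if phi(x + alpha * d) <= C + delta * alpha * gTd:
            break
        alpha *= rho                  # backtrack
    x_new = x + alpha * d
    Q_new = eta * Q + 1.0                        # Q_{k+1} = eta_k*Q_k + 1
    C_new = (eta * Q * C + phi(x_new)) / Q_new   # C_{k+1}, as in the proof
    return x_new, C_new, Q_new

# Tiny usage example on a smooth convex quadratic:
phi = lambda x: float(x @ x)
grad = lambda x: 2.0 * x
x = np.array([1.0, -2.0])
C, Q = phi(x), 1.0                               # C_0 = phi(x_0), Q_0 = 1
for _ in range(10):
    x, C, Q = nonmonotone_armijo(phi, grad, x, -grad(x), C, Q)
print(x)                                         # [0, 0]: the minimizer
```

Note that the reference value $C$ may sit well above $\phi$ at the current iterate, which is exactly what permits nonmonotone steps while, by Lemma 1, $\{C_k\}$ itself never increases.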

The next lemma gives a lower bound on the step lengths generated by Algorithm-NADA.

Lemma 2

Assume that $\nabla\phi_k(x_k)^T d_k \le 0$ for all $k$ and that the Lipschitz condition (19) holds with constant $L$. Then

$$\begin{aligned} \alpha_k \geq\min\biggl\{ {\alpha_{\max} \over\rho}, {2(1-\delta)\over L\rho} {|\nabla\phi_k(x_k)^T d_k|\over\|d_k\| ^2} \biggr\}. \end{aligned}$$
(24)

The proof is omitted here since that of Lemma 2.1 in [40] applies directly.
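
As a quick numerical sanity check of (24) (purely illustrative, not part of the argument), one can run plain Armijo backtracking, for which the same bound argument applies, on a quadratic whose gradient has Lipschitz constant exactly $L$; all names below are hypothetical:

```python
# Hypothetical names; a numerical check, not a proof.
import numpy as np

L, delta, rho, alpha_max = 4.0, 0.1, 2.0, 1.0
phi = lambda x: 0.5 * L * float(x @ x)    # gradient is exactly L-Lipschitz
grad = lambda x: L * x

rng = np.random.default_rng(0)
for _ in range(1000):
    x = rng.standard_normal(5)
    d = -grad(x)                          # steepest-descent direction
    gTd = grad(x) @ d
    alpha = alpha_max                     # trials: alpha_max, alpha_max/rho, ...
    while phi(x + alpha * d) > phi(x) + delta * alpha * gTd:
        alpha /= rho
    bound = min(alpha_max / rho,
                2 * (1 - delta) * abs(gTd) / (L * rho * float(d @ d)))
    assert alpha >= bound - 1e-12         # the lower bound (24)
print("bound (24) held in all 1000 trials")
```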

With the aid of the lower bound (24), we are now ready to prove Theorem 1. We need to establish the two relationships given in (20).
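
For the reader's convenience, these two relationships, as they are invoked below, read as follows (restated here from the proof itself; the first is verified immediately, and the second is the limit established at the end):

$$0 \in\partial_2 \phi(x_k,y_k) \quad\forall k, \qquad\mbox{and}\qquad \lim_{k\rightarrow\infty} \partial_1 \phi(x_k, y_k) = 0. $$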

Proof

First, by definition in Algorithm-NADA,

$$y_k = \operatorname {argmin}_y \phi(x_k,y). $$

Hence, it always holds true that

$$0 \in\partial_2 \phi(x_k,y_k). $$

Now it suffices to show that the limit in (20) holds. Consider the nonmonotone Armijo condition:

$$\begin{aligned} \phi_k(x_k+\alpha_k d_k) \le C_k + \delta\alpha_k \nabla \phi_k(x_k)^T d_k. \end{aligned}$$
(25)

If $\rho\alpha_k < \alpha_{\max}$, then in view of the lower bound (24) on $\alpha_k$ in Lemma 2 and the direction assumption (18),

$$\begin{aligned} \phi_k(x_k+\alpha_k d_k) \le& C_k - \delta\frac{2(1-\delta)}{ L\rho} \frac{|\nabla\phi_k(x_k)^T d_k|^2}{\| d_k \| ^2} \\ \le& C_k - \frac{2 \delta(1-\delta)}{L\rho} \frac{c_1^2 \|\nabla\phi _k(x_k)\|^4}{c_2^2 \| \nabla\phi_k(x_k) \| ^2} \\ =& C_k - \biggl[ \frac{2\delta(1-\delta)c_1^2}{L\rho c_2^2} \biggr] \big\| \nabla \phi_k(x_k) \big\| ^2. \end{aligned}$$

On the other hand, if $\rho\alpha_k \ge \alpha_{\max}$, the lower bound (24), together with the direction assumption (18), gives

$$\begin{aligned} \phi_k(x_k+\alpha_k d_k) \le& C_k + \delta\alpha_k \nabla\phi_k(x_k)^T d_k \\ \le& C_k - \delta\alpha_k c_1 \big\| \nabla \phi_k(x_k) \big\| ^2 \\ \le& C_k - \frac{\delta\alpha_{\max} c_1}{\rho} \big\| \nabla\phi_k(x_k) \big\| ^2. \end{aligned}$$

Introducing a constant

$$\tilde{\tau} = \min\biggl\{ \frac{2\delta(1-\delta)c_1^2}{L \rho c_2^2}, \frac{\delta\alpha_{\max} c_1}{\rho} \biggr\}, $$

we can combine the above inequalities into

$$\begin{aligned} \phi_k(x_k+ \alpha_k d_k) \le C_k - \tilde{\tau} \big\| \nabla \phi_k(x_k) \big\| ^2. \end{aligned}$$
(26)

Next we show by induction that for all k

$$ \frac{1}{Q_k} \ge1-\eta_{\max}, $$
(27)

which obviously holds for $k=0$ given that $Q_0 = 1$. Assume that (27) holds for $k=j$. Then

$$\begin{aligned} Q_{j+1} = \eta_j Q_j +1 \le \frac{\eta_j}{1-\eta_{\max}} +1 \le\frac{\eta_{\max}}{1-\eta_{\max}} +1 = \frac{1}{1-\eta_{\max}}, \end{aligned}$$

implying that (27) also holds for $k=j+1$. Hence, (27) holds for all $k$.
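
The induction can also be illustrated numerically (again, purely a sanity check): starting from $Q_0 = 1$, the recursion $Q_{k+1} = \eta_k Q_k + 1$ keeps $1/Q_k$ above $1-\eta_{\max}$ for any choice of $\eta_k \in [0, \eta_{\max}]$:

```python
# Hypothetical names; a numerical illustration of the induction (27).
import numpy as np

eta_max = 0.85
rng = np.random.default_rng(1)
Q = 1.0                                   # Q_0 = 1
for _ in range(10_000):
    eta = rng.uniform(0.0, eta_max)       # any eta_k in [0, eta_max]
    Q = eta * Q + 1.0                     # Q_{k+1} = eta_k*Q_k + 1
    assert 1.0 / Q >= 1.0 - eta_max - 1e-12
print("1/Q_k >= 1 - eta_max held for all k")
```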

It follows from (26) and (27) that

$$\begin{aligned} C_k - C_{k+1} =& C_k - {\eta_k Q_k C_k + \phi_k(x_{k+1})\over Q_{k+1}} \\ =& {C_k(\eta_k Q_k + 1) - (\eta_k Q_k C_k + \phi_k(x_{k+1})) \over Q_{k+1} } \\ =& {C_k - \phi_k(x_{k+1}) \over Q_{k+1} } \\ \ge& \frac{\tilde{\tau} \|\nabla\phi_k(x_k)\|^2}{Q_{k+1}} \\ \ge& \tilde{\tau} (1-\eta_{\max}) \big\|\nabla\phi_k(x_k) \big\| ^2. \end{aligned}$$
(28)

Since $\phi$ is bounded below by assumption, $\{C_k\}$ is also bounded below. In addition, by Lemma 1, $\{C_k\}$ is monotonically non-increasing, hence convergent. Therefore, the left-hand side of (28) tends to zero, and so does the right-hand side; i.e., $\|\nabla\phi_k(x_k)\| \rightarrow 0$. Finally, by definition (22),

$$\lim_{k\rightarrow\infty} \partial_1 \phi(x_k, y_k) = 0, $$

which completes the proof. □

About this article

Li, C., Yin, W., Jiang, H. et al. An efficient augmented Lagrangian method with applications to total variation minimization. Comput Optim Appl 56, 507–530 (2013). https://doi.org/10.1007/s10589-013-9576-1