
Sparsity aware consistent and high precision variable selection

  • Original Paper
  • Signal, Image and Video Processing

Abstract

Variable selection is fundamental when dealing with sparse signals, which contain only a few nonzero elements. Such signals arise in many signal processing areas, ranging from high-dimensional statistical modeling to sparse signal estimation. This paper explores a new and efficient approach to modeling a system with underlying sparse parameters: given noisy observations, estimate the minimum number of underlying parameters while retaining acceptable estimation accuracy. The main challenge lies in the non-convex optimization problem that must be solved. The reconstruction stage minimizes a suitable objective function in order to estimate the original sparse signal by performing variable selection. This paper introduces such an objective function, designed to recover the true support of the underlying sparse signal while still achieving an acceptable estimation error. It is shown that the proposed method yields the best variable selection among the compared algorithms, while approaching the lowest mean squared error in almost all cases.

Notes

  1. According to the discussion in [24], three properties of the penalty function in the penalized least squares criterion are needed to obtain the oracle properties: (1) unbiasedness, (2) sparsity and (3) continuity. The Lasso achieves the last two, but at the price of shifting the resulting estimator by a constant, thus losing unbiasedness (see the thresholding sketch after these notes).

  2. The idea of local linear approximation was successfully used in [29] to maximize the penalized likelihood function.
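The following minimal sketch (an illustration added here, not taken from the paper) contrasts the soft-thresholding rule underlying the Lasso with hard thresholding, to make the bias mentioned in Note 1 concrete: the soft rule shrinks every surviving coefficient by the constant \(\lambda\), whereas hard thresholding leaves large coefficients untouched but is discontinuous in the data. All function names and values below are ours.

```python
import numpy as np

def soft_threshold(theta, lam):
    # Lasso / soft-thresholding: every surviving coefficient is shifted
    # toward zero by lam, which is the constant bias discussed in Note 1.
    return np.sign(theta) * np.maximum(np.abs(theta) - lam, 0.0)

def hard_threshold(theta, lam):
    # Hard thresholding: large coefficients are kept unchanged (unbiased),
    # but the rule is discontinuous in the data.
    return np.where(np.abs(theta) > lam, theta, 0.0)

theta, lam = np.array([0.05, -0.3, 2.0, -5.0]), 0.5
print(soft_threshold(theta, lam))  # large entries end up at 1.5 and -4.5 (shifted by lam)
print(hard_threshold(theta, lam))  # large entries stay at 2.0 and -5.0
```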

References

  1. Donoho, D.: Compressed sensing. IEEE Trans. Inf. Theory 52(4), 1289–1306 (2006)

  2. Candes, E., Romberg, J., Tao, T.: Robust uncertainty principles: exact signal reconstruction from highly incomplete frequency information. IEEE Trans. Inf. Theory 52(2), 489–509 (2006)

  3. Chen, S.S., Donoho, D.L., Saunders, M.A.: Atomic decomposition by basis pursuit. SIAM J. Sci. Comput. 20(1), 33–61 (1998)

  4. Tropp, J.: Greed is good: algorithmic results for sparse approximation. IEEE Trans. Inf. Theory 50(10), 2231–2242 (2004)

  5. Li, Y., Osher, S.: Coordinate descent optimization for \(L_{1}\) minimization with application to compressed sensing: greedy algorithm. Inverse Probl. Imaging 3(3), 487–503 (2009)

  6. Cetin, M., Malioutov, D.M., Willsky, A.S.: A variational technique for source localization based on a sparse signal reconstruction perspective. In: Proc. ICASSP, pp. 2965–2968 (2002)

  7. Candes, E.J., Wakin, M., Boyd, S.: Enhancing sparsity by reweighted \(L_{1}\) minimization. J. Fourier Anal. Appl. 14(5), 877–905 (2008)

  8. Chartrand, R.: Exact reconstruction of sparse signals via non-convex minimization. IEEE Signal Process. Lett. 14, 707–710 (2007)

  9. Chartrand, R., Yin, W.: Iteratively reweighted algorithms for compressive sensing. In: Proc. ICASSP, Las Vegas, pp. 3869–3872 (2008)

  10. Miosso, C.J., Borries, R.V., Argaez, M., Velazquez, L., Quintero, C., Potes, C.M.: Compressive sensing reconstruction with prior information by iteratively reweighted least-squares. IEEE Trans. Signal Process. 57(6), 2424–2431 (2009)

  11. Benesty, J., Gay, S.L.: An improved PNLMS algorithm. In: Proc. IEEE ICASSP, pp. 1881–1884 (2002)

  12. Duttweiler, D.L.: Proportionate normalized least-mean-squares adaptation in echo cancellers. IEEE Trans. Speech Audio Process. 8, 508–518 (2000)

  13. Gaensler, T., Gay, S.L., Sondhi, M.M., Benesty, J.: Double-talk robust fast converging algorithms for network echo cancellation. IEEE Trans. Speech Audio Process. 8, 656–663 (2000)

  14. Hoshuyama, O., Gubran, R.A., Sugiyama, A.: A generalized proportionate variable step-size algorithm for fast changing acoustic environments. In: Proc. IEEE ICASSP, pp. IV-161–IV-164 (2004)

  15. Benesty, J., Paleologu, C., Ciochina, S.: Proportionate adaptive filters from a basis pursuit perspective. IEEE Signal Process. Lett. 17(12), 985–988 (2010)

  16. Chen, Y., Gu, Y., Hero, A.O.: Sparse LMS for system identification. In: Proc. IEEE ICASSP, pp. 3125–3128 (2009)

  17. Gu, Y., Jin, J., Mei, S.: \(L_{0}\) norm constraint LMS algorithm for sparse system identification. IEEE Signal Process. Lett. 16(9), 774–777 (2009)

  18. Jin, J., Gu, Y., Mei, S.: A stochastic gradient approach on compressive sensing reconstruction based on adaptive filtering framework. IEEE J. Sel. Top. Signal Process. 4(2), 409–420 (2010)

  19. Eksioglu, E.M.: RLS adaptive filtering with sparsity regularization. In: 10th Int. Conf. on Inf. Sci., Signal Process., and their Applications, pp. 550–553 (2010)

  20. Tibshirani, R.: Regression shrinkage and selection via the Lasso. J. R. Stat. Soc. Ser. B 58(1), 267–288 (1996)

  21. Efron, B., Hastie, T., Johnstone, I., Tibshirani, R.: Least angle regression. Ann. Stat. 32, 407–499 (2004)

  22. Angelosante, D., Giannakis, G.B.: RLS-weighted Lasso for adaptive estimation of sparse signals. In: Proc. IEEE ICASSP, pp. 3245–3248 (2009)

  23. Angelosante, D., Bazerque, J.A., Giannakis, G.B.: Online adaptive estimation of sparse signals: where RLS meets the \(L_{1}\) norm. IEEE Trans. Signal Process. 58(7), 3436–3447 (2010)

  24. Fan, J., Li, R.: Variable selection via non-concave penalized likelihood and its oracle properties. J. Am. Stat. Assoc. 96(456), 1348–1360 (2001)

  25. Yuan, M., Lin, Y.: On the Nonnegative Garotte Estimator, Technical Report. School of Industrial and Systems Engineering, Georgia Inst. of Tech, Georgia (2005)

  26. Zou, H.: The adaptive Lasso and its oracle properties. J. Am. Stat. Assoc. 101(476), 1418–1429 (2006)

  27. Zhao, P., Yu, B.: On model selection consistency of Lasso. J. Mach. Learn. Res. 7, 2541–2563 (2006)

  28. Knight, K., Fu, W.: Asymptotics for Lasso-type estimators. Ann. Stat. 28, 1356–1378 (2000)

  29. Zou, H., Li, R.: One-step sparse estimation in non-concave penalized likelihood models. Ann. Stat. 36(4), 1509–1533 (2008)

  30. Donoho, D., Johnstone, I.: Ideal spatial adaptation by wavelet shrinkage. Biometrika 81, 425–455 (1994)

  31. Tinati, M.A., Yousefi Rezaii, T.: Adaptive sparsity-aware parameter vector reconstruction with application to compressed sensing. In: Proc. IEEE HPCS, pp. 350–356 (2011)

  32. Friedman, J., Hastie, T., Hofling, H., Tibshirani, R.: Pathwise coordinate optimization. Ann. Appl. Stat. 1(2), 302–332 (2007)

  33. Ward, R.: Compressed sensing with cross validation. IEEE Trans. Inf. Theory 55(12), 5773–5782 (2009)

  34. Boufounos, P., Duarte, M.F., Baraniuk, R.G.: Sparse signal reconstruction from noisy compressive measurements using cross validation. In: Proc. IEEE Workshop on Statistical Signal Processing, pp. 299–303 (2007)

Author information

Corresponding author

Correspondence to T. Yousefi Rezaii.

Appendices

Appendix A

In (7), we write the first two terms of the Taylor series expansion of \(\mathcal{P}_{\tau,\gamma}\left(\left|\theta_{j}\right|\right)\) about \(\theta_{j}^{o}\) as

$$\begin{aligned} \mathcal{P}_{\tau,\gamma}\left(\left|\theta_{j}\right|\right)\approx \mathcal{P}_{\tau,\gamma}\left(\left|\theta_{j}^{o}\right|\right)+\mathcal{P}^{\prime}_{\tau,\gamma}\left(\left|\theta_{j}^{o}\right|\right)\left(\left|\theta_{j}\right|-\left|\theta_{j}^{o}\right|\right). \end{aligned}$$
(36)

Substituting the approximation (36) into (7), the approximate objective function becomes

$$\begin{aligned} \mathcal{J}\left(\boldsymbol{\theta}\right)&\approx \frac{1}{2n}\left\| \mathbf{y}-\mathbf{X}\boldsymbol{\theta} \right\|_{2}^{2}\\&\quad +\sum_{j=1}^{d}\left[ \mathcal{P}_{\tau,\gamma}\left(\left|\theta_{j}^{o}\right|\right)+\mathcal{P}^{\prime}_{\tau,\gamma}\left(\left|\theta_{j}^{o}\right|\right)\left(\left|\theta_{j}\right|-\left|\theta_{j}^{o}\right|\right) \right]. \end{aligned}$$
(37)

Finally, excluding the constant terms from (37), the estimator \(\hat{\boldsymbol{\theta}}\) is obtained as

$$\begin{aligned} \hat{\boldsymbol{\theta}}=\mathop{\arg\min}\limits_{\boldsymbol{\theta}}\ \frac{1}{2n}\left\| \mathbf{y}-\mathbf{X}\boldsymbol{\theta} \right\|_{2}^{2}+\sum_{j=1}^{d}\mathcal{P}^{\prime}_{\tau,\gamma}\left(\left|\theta_{j}^{o}\right|\right)\left|\theta_{j}\right|. \end{aligned}$$
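The minimization above is a Lasso problem with coefficient-wise weights \(\mathcal{P}^{\prime}_{\tau,\gamma}\left(\left|\theta_{j}^{o}\right|\right)\). As a minimal sketch of how one such weighted-\(L_{1}\) step could be solved numerically, in the spirit of pathwise coordinate optimization [32] but not the authors' own implementation, the following code takes the weights as a generic vector (since \(\mathcal{P}_{\tau,\gamma}\) is defined in the main text) and uses a synthetic sparse system of our own for illustration.

```python
import numpy as np

def soft(z, t):
    """Soft-thresholding operator: sign(z) * max(|z| - t, 0)."""
    return np.sign(z) * np.maximum(np.abs(z) - t, 0.0)

def weighted_lasso_cd(X, y, weights, n_iter=200):
    """Cyclic coordinate descent for
       min_theta (1/(2n)) ||y - X theta||_2^2 + sum_j weights[j] * |theta_j|,
       i.e. one weighted-L1 step with weights w_j = P'_{tau,gamma}(|theta_j^o|)."""
    n, d = X.shape
    theta = np.zeros(d)
    col_norm = (X ** 2).sum(axis=0) / n      # (1/n) X_j^T X_j for each column j
    r = y - X @ theta                        # residual for the current theta
    for _ in range(n_iter):
        for j in range(d):
            r += X[:, j] * theta[j]          # partial residual excluding coordinate j
            z = X[:, j] @ r / n              # (1/n) X_j^T (partial residual)
            theta[j] = soft(z, weights[j]) / col_norm[j]
            r -= X[:, j] * theta[j]          # restore the full residual
    return theta

# Hypothetical usage on a synthetic sparse system; the constant weight vector
# stands in for P'_{tau,gamma} evaluated at a pilot estimate theta^o.
rng = np.random.default_rng(0)
n, d = 200, 20
X = rng.standard_normal((n, d))
theta_true = np.zeros(d)
theta_true[[2, 7, 11]] = [1.5, -2.0, 0.8]
y = X @ theta_true + 0.1 * rng.standard_normal(n)
theta_hat = weighted_lasso_cd(X, y, weights=np.full(d, 0.05))
print(np.flatnonzero(np.abs(theta_hat) > 1e-6))   # indices of the selected support
```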

Appendix B

Using (16), the term \(Z^{(n)}(\mathbf{u})-Z^{(n)}(\mathbf{0})\) can be written as follows:

$$\begin{aligned} Z^{(n)}(\mathbf{u})-Z^{(n)}(\mathbf{0})&=\frac{1}{2}\left\| \mathbf{y}-\mathbf{X}\left( \boldsymbol{\theta}^{*}+\frac{\mathbf{u}}{\sqrt{n}} \right) \right\|_{2}^{2}-\frac{1}{2}\left\| \mathbf{y}-\mathbf{X}\boldsymbol{\theta}^{*} \right\|_{2}^{2}\\&\quad +n\sum_{j=1}^{d}\mathcal{P}^{\prime}_{\tau,\gamma}\left(\left|\theta_{j}^{o}\right|\right)\left( \left|\theta_{j}^{*}+\frac{u_{j}}{\sqrt{n}}\right|-\left|\theta_{j}^{*}\right| \right). \end{aligned}$$

For notational simplicity, the rightmost (penalty) term of the above equation is suppressed in what follows, since it remains unchanged. We then have the following:

$$\begin{aligned}&\frac{1}{2}\left\| \mathbf{y}-\mathbf{X}\left( \boldsymbol{\theta}^{*}+\frac{\mathbf{u}}{\sqrt{n}} \right) \right\|_{2}^{2}-\frac{1}{2}\left\| \mathbf{y}-\mathbf{X}\boldsymbol{\theta}^{*} \right\|_{2}^{2}\\&\quad =\frac{1}{2}\left[ \mathbf{y}-\mathbf{X}\boldsymbol{\theta}^{*}-\frac{1}{\sqrt{n}}\mathbf{X}\mathbf{u} \right]^{T}\left[ \mathbf{y}-\mathbf{X}\boldsymbol{\theta}^{*}-\frac{1}{\sqrt{n}}\mathbf{X}\mathbf{u} \right]\\&\qquad -\frac{1}{2}\left[ \mathbf{y}-\mathbf{X}\boldsymbol{\theta}^{*} \right]^{T}\left[ \mathbf{y}-\mathbf{X}\boldsymbol{\theta}^{*} \right]. \end{aligned}$$

Expanding the quadratic, the terms \(\frac{1}{2}\left\| \mathbf{y}-\mathbf{X}\boldsymbol{\theta}^{*} \right\|_{2}^{2}\) cancel and we obtain the following:

$$\begin{aligned}&\frac{1}{2}\left\| \mathbf{y}-\mathbf{X}\left( \boldsymbol{\theta}^{*}+\frac{\mathbf{u}}{\sqrt{n}} \right) \right\|_{2}^{2}-\frac{1}{2}\left\| \mathbf{y}-\mathbf{X}\boldsymbol{\theta}^{*} \right\|_{2}^{2}\\&\quad =\frac{1}{\sqrt{n}}\mathbf{u}^{T}\mathbf{X}^{T}\mathbf{X}\boldsymbol{\theta}^{*}-\frac{1}{\sqrt{n}}\mathbf{u}^{T}\mathbf{X}^{T}\mathbf{y}+\frac{1}{2n}\mathbf{u}^{T}\mathbf{X}^{T}\mathbf{X}\mathbf{u}\\&\quad =\frac{-1}{\sqrt{n}}\mathbf{u}^{T}\mathbf{X}^{T}\left( \mathbf{y}-\mathbf{X}\boldsymbol{\theta}^{*} \right)+\frac{1}{2}\mathbf{u}^{T}\left( \frac{1}{n}\mathbf{X}^{T}\mathbf{X} \right)\mathbf{u}. \end{aligned}$$
(38)

Substituting \(\mathbf{v}=\mathbf{y}-\mathbf{X}\boldsymbol{\theta}^{*}\) into (38), we have

$$\begin{aligned}&\frac{1}{2}\left\| \mathbf{y}-\mathbf{X}\left( \boldsymbol{\theta}^{*}+\frac{\mathbf{u}}{\sqrt{n}} \right) \right\|_{2}^{2}-\frac{1}{2}\left\| \mathbf{y}-\mathbf{X}\boldsymbol{\theta}^{*} \right\|_{2}^{2}\\&\quad =\frac{-1}{\sqrt{n}}\mathbf{u}^{T}\mathbf{X}^{T}\mathbf{v}+\frac{1}{2}\mathbf{u}^{T}\left( \frac{1}{n}\mathbf{X}^{T}\mathbf{X} \right)\mathbf{u}. \end{aligned}$$
(39)

Knowing that \(\lim_{n\rightarrow\infty}\frac{1}{n}\mathbf{X}^{T}\mathbf{X}=\mathbf{C}\), the second term on the right-hand side of (39) tends to \(\frac{1}{2}\mathbf{u}^{T}\mathbf{C}\mathbf{u}\) as \(n\) goes to infinity. Using Slutsky's theorem together with the central limit theorem, the first term on the right-hand side of (39) converges in distribution to \(-\mathbf{u}^{T}\mathbf{W}\), where \(\mathbf{W}\) is a zero-mean normal random vector with covariance matrix \(\sigma^{2}\mathbf{C}\).
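As a rough numerical illustration of this limit (our own sketch, not part of the paper), the snippet below repeatedly draws a Gaussian design with \(\mathbf{C}=\mathbf{I}\) and i.i.d. noise of variance \(\sigma^{2}\), and checks that \(\frac{1}{n}\mathbf{X}^{T}\mathbf{X}\) approaches \(\mathbf{C}\) while \(\frac{1}{\sqrt{n}}\mathbf{X}^{T}\mathbf{v}\) has approximately zero mean and covariance \(\sigma^{2}\mathbf{C}\); all variable names and constants are ours.

```python
import numpy as np

# Monte Carlo check of the limits used above: with i.i.d. standard normal rows,
# (1/n) X^T X -> C = I and (1/sqrt(n)) X^T v behaves like N(0, sigma^2 C).
rng = np.random.default_rng(1)
d, sigma, n, reps = 3, 0.5, 500, 2000
samples = np.empty((reps, d))
for r in range(reps):
    X = rng.standard_normal((n, d))       # design matrix with C = I
    v = sigma * rng.standard_normal(n)    # noise vector with variance sigma^2
    samples[r] = X.T @ v / np.sqrt(n)     # the random term appearing in (39)

print(np.round(X.T @ X / n, 2))                    # close to the identity matrix C
print(np.round(samples.mean(axis=0), 2))           # close to the zero vector
print(np.round(np.cov(samples, rowvar=False), 2))  # close to sigma^2 * I = 0.25 * I
```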

About this article

Cite this article

Rezaii, T.Y., Tinati, M.A. & Beheshti, S. Sparsity aware consistent and high precision variable selection. SIViP 8, 1613–1624 (2014). https://doi.org/10.1007/s11760-012-0401-6
