Efficient Bi-level Variable Selection and Application to Estimation of Multiple Covariance Matrices

Phan, Duy Nhat; Le Thi, Hoai An; Pham, Dinh Tao

doi:10.1007/978-3-319-57454-7_24

Duy Nhat Phan¹⁹,
Hoai An Le Thi¹⁹ &
Dinh Tao Pham²⁰

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 10234))

Included in the following conference series:

Pacific-Asia Conference on Knowledge Discovery and Data Mining

3866 Accesses

Abstract

Variable selection plays an important role in analyzing high dimensional data. When the data possesses certain group structures in which individual variables are also meaningful scientifically, we are naturally interested in selecting important groups as well as important variables. We introduce a new regularization by combining the $\ell _{p,0}$-norm and $\ell _0$-norm for bi-level variable selection. Using an appropriate DC (Difference of Convex functions) approximation, the resulting problem can be solved by DC Algorithm. As an application, we implement the proposed algorithm for estimating multiple covariance matrices sharing some common structures such as the locations or weights of non-zero elements. The experimental results on both simulated and real datasets demonstrate the efficiency of our algorithm.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

$$\ell _1$$ Regularized Robust and Sparse Linear Modeling Using Discrete Optimization

Correction for Optimisation Bias in Structured Sparse High-Dimensional Variable Selection

Penalized Partial Least Square applied to structured data

Article Open access 04 March 2019

Notes

1.
https://archive.ics.uci.edu/ml/datasets.

References

Boyd, S., Parikh, N., Chu, E., Peleato, B., Eckstein, P.: Distributed optimization and statistical learning via the alternating direction method of multipliers. Foundat. Trends Mach. Learn. 3(1), 1–122 (2011)
Article MATH Google Scholar
Breheny, P.: The group exponential lasso for bi-level variable selection. Biometrics 71(3), 731–740 (2015)
Article MathSciNet MATH Google Scholar
Breheny, P., Huang, J.: Penalized methods for bi-level variable selection. Stat. Interface 2, 369–380 (2009)
Article MathSciNet MATH Google Scholar
Friedman, J., Hastie, T., Tibshirani, R.: A note on the group lasso and sparse group lasso. arXiv:1001.0736v1, pp. 1–8 (2010)
Huang, J., Ma, S., Xie, H., Zhang, C.H.: A group bridge approach for variable selection. Biometrika 96(2), 339–355 (2009)
Article MathSciNet MATH Google Scholar
Le Thi, H.A., Pham Dinh, T.: The DC (difference of convex functions) programming and DCA revisited with DC models of real world nonconvex optimization problems. Ann. Oper. Res. 133, 23–46 (2005)
Article MathSciNet MATH Google Scholar
Le Thi, H.A., Pham Dinh, T., Le Hoai, M., Vo Xuan, T.: DC approximation approaches for sparse optimization. Eur. J. Oper. Res. 244, 26–44 (2015)
Article MathSciNet MATH Google Scholar
Peleg, D., Meir, R.: A bilinear formulation for vector sparsity optimization. Sig. Process. 88(2), 375–389 (2008)
Article MATH Google Scholar
Pham Dinh, T., Le Thi, H.A.: Convex analysis approach to D.C. programming: theory, algorithms and applications. Acta Math. Vietnamica 22(1), 289–355 (1997)
MathSciNet MATH Google Scholar
Wu, T.T., Lange, K.: Coordinate descent algorithms for lasso penalized regression. Ann. Appl. Stat. 2(1), 224–244 (2008)
Article MathSciNet MATH Google Scholar

Download references

Author information

Authors and Affiliations

Laboratory of Theoretical and Applied Computer Science EA 3097, University of Lorraine, Ile du Saulcy, 57045, Metz, France
Duy Nhat Phan & Hoai An Le Thi
Laboratory of Mathematics, INSA–Rouen, University of Normandie, 76801, Saint-Etienne-du-Rouvray Cedex, France
Dinh Tao Pham

Authors

Duy Nhat Phan
View author publications
You can also search for this author in PubMed Google Scholar
Hoai An Le Thi
View author publications
You can also search for this author in PubMed Google Scholar
Dinh Tao Pham
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Duy Nhat Phan .

Editor information

Editors and Affiliations

Kangwon National University, Chuncheon, Korea (Republic of)
Jinho Kim
Seoul National University, Seoul, Korea (Republic of)
Kyuseok Shim
University of Technology Sydney, Sydney, New South Wales, Australia
Longbing Cao
KAIST, Daejeon, Korea (Republic of)
Jae-Gil Lee
University of New South Wales, Sydney, New South Wales, Australia
Xuemin Lin
Kangwon National University, Chuncheon, Korea (Republic of)
Yang-Sae Moon

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Phan, D.N., Le Thi, H.A., Pham, D.T. (2017). Efficient Bi-level Variable Selection and Application to Estimation of Multiple Covariance Matrices. In: Kim, J., Shim, K., Cao, L., Lee, JG., Lin, X., Moon, YS. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2017. Lecture Notes in Computer Science(), vol 10234. Springer, Cham. https://doi.org/10.1007/978-3-319-57454-7_24

Download citation

DOI: https://doi.org/10.1007/978-3-319-57454-7_24
Published: 23 April 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-57453-0
Online ISBN: 978-3-319-57454-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics