Skip to main content

Efficient Bi-level Variable Selection and Application to Estimation of Multiple Covariance Matrices

  • Conference paper
  • First Online:
  • 3741 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 10234))

Abstract

Variable selection plays an important role in analyzing high dimensional data. When the data possesses certain group structures in which individual variables are also meaningful scientifically, we are naturally interested in selecting important groups as well as important variables. We introduce a new regularization by combining the \(\ell _{p,0}\)-norm and \(\ell _0\)-norm for bi-level variable selection. Using an appropriate DC (Difference of Convex functions) approximation, the resulting problem can be solved by DC Algorithm. As an application, we implement the proposed algorithm for estimating multiple covariance matrices sharing some common structures such as the locations or weights of non-zero elements. The experimental results on both simulated and real datasets demonstrate the efficiency of our algorithm.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    https://archive.ics.uci.edu/ml/datasets.

References

  1. Boyd, S., Parikh, N., Chu, E., Peleato, B., Eckstein, P.: Distributed optimization and statistical learning via the alternating direction method of multipliers. Foundat. Trends Mach. Learn. 3(1), 1–122 (2011)

    Article  MATH  Google Scholar 

  2. Breheny, P.: The group exponential lasso for bi-level variable selection. Biometrics 71(3), 731–740 (2015)

    Article  MathSciNet  MATH  Google Scholar 

  3. Breheny, P., Huang, J.: Penalized methods for bi-level variable selection. Stat. Interface 2, 369–380 (2009)

    Article  MathSciNet  MATH  Google Scholar 

  4. Friedman, J., Hastie, T., Tibshirani, R.: A note on the group lasso and sparse group lasso. arXiv:1001.0736v1, pp. 1–8 (2010)

  5. Huang, J., Ma, S., Xie, H., Zhang, C.H.: A group bridge approach for variable selection. Biometrika 96(2), 339–355 (2009)

    Article  MathSciNet  MATH  Google Scholar 

  6. Le Thi, H.A., Pham Dinh, T.: The DC (difference of convex functions) programming and DCA revisited with DC models of real world nonconvex optimization problems. Ann. Oper. Res. 133, 23–46 (2005)

    Article  MathSciNet  MATH  Google Scholar 

  7. Le Thi, H.A., Pham Dinh, T., Le Hoai, M., Vo Xuan, T.: DC approximation approaches for sparse optimization. Eur. J. Oper. Res. 244, 26–44 (2015)

    Article  MathSciNet  MATH  Google Scholar 

  8. Peleg, D., Meir, R.: A bilinear formulation for vector sparsity optimization. Sig. Process. 88(2), 375–389 (2008)

    Article  MATH  Google Scholar 

  9. Pham Dinh, T., Le Thi, H.A.: Convex analysis approach to D.C. programming: theory, algorithms and applications. Acta Math. Vietnamica 22(1), 289–355 (1997)

    MathSciNet  MATH  Google Scholar 

  10. Wu, T.T., Lange, K.: Coordinate descent algorithms for lasso penalized regression. Ann. Appl. Stat. 2(1), 224–244 (2008)

    Article  MathSciNet  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Duy Nhat Phan .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Phan, D.N., Le Thi, H.A., Pham, D.T. (2017). Efficient Bi-level Variable Selection and Application to Estimation of Multiple Covariance Matrices. In: Kim, J., Shim, K., Cao, L., Lee, JG., Lin, X., Moon, YS. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2017. Lecture Notes in Computer Science(), vol 10234. Springer, Cham. https://doi.org/10.1007/978-3-319-57454-7_24

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-57454-7_24

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-57453-0

  • Online ISBN: 978-3-319-57454-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics