ABSTRACT
In this paper, an L1/2+1 regularized logistic regression model and a corresponding algorithm are proposed. The L1/2 penalty enjoys unbiasedness, sparsity, and oracle properties, while the L1 penalty contributes convexity in theory. The regularization term of the model is a linear combination of the L1/2 norm and the L1 norm, which effectively alleviates overfitting and improves the generalization ability of the model. The algorithm adopts the idea of coordinate descent, transforming the parameter estimation into a series of univariate extremum problems and thereby yielding an analytical expression for each parameter update. Experiments on simulated and real data show that, in some cases, the proposed model and algorithm outperform traditional logistic regression and several classical regularized logistic regression methods in variable selection and prediction ability, and are well suited to small-sample data sets with low correlation between variables.
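Since the abstract describes the method but no code is given, the following is a minimal Python sketch of the idea, not the authors' implementation. It fits logistic regression with the hybrid penalty lam1·||w||_1 + lam2·Σ|w_j|^{1/2} by cyclic coordinate descent. The paper's analytical coordinate update is replaced here, for illustration, by a numerical one-dimensional search with scipy.optimize.minimize_scalar; the function names, penalty weights, and search bounds are illustrative assumptions.

```python
# Illustrative sketch (not the authors' implementation): logistic regression
# with a hybrid penalty  lam1 * ||w||_1 + lam2 * sum_j |w_j|^{1/2},
# fit by cyclic coordinate descent. The paper derives analytical coordinate
# updates; here each univariate subproblem is minimized numerically instead.
import numpy as np
from scipy.optimize import minimize_scalar


def objective(w, X, y, lam1, lam2):
    """Negative log-likelihood plus the hybrid L1/2 + L1 penalty (y in {0,1})."""
    z = X @ w
    # log(1 + exp(z)) - y*z, computed stably via logaddexp
    nll = np.sum(np.logaddexp(0.0, z) - y * z)
    return nll + lam1 * np.sum(np.abs(w)) + lam2 * np.sum(np.abs(w) ** 0.5)


def fit_hybrid_logistic(X, y, lam1=0.1, lam2=0.1, n_sweeps=50, tol=1e-6):
    """Cyclic coordinate descent: optimize one coefficient at a time."""
    n, p = X.shape
    w = np.zeros(p)
    for _ in range(n_sweeps):
        w_old = w.copy()
        for j in range(p):
            def sub(wj, j=j):
                w_try = w.copy()
                w_try[j] = wj
                return objective(w_try, X, y, lam1, lam2)
            # bounded 1-D search around the current value (assumed window)
            res = minimize_scalar(sub, bounds=(w[j] - 5.0, w[j] + 5.0),
                                  method="bounded")
            w[j] = res.x
        if np.max(np.abs(w - w_old)) < tol:  # stop when a full sweep changes little
            break
    return w


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    X = rng.standard_normal((200, 10))
    true_w = np.zeros(10)
    true_w[:3] = [2.0, -1.5, 1.0]          # only the first 3 variables matter
    y = (rng.random(200) < 1.0 / (1.0 + np.exp(-(X @ true_w)))).astype(float)
    w_hat = fit_hybrid_logistic(X, y)
    print(np.round(w_hat, 3))              # irrelevant coefficients shrink toward 0
```

Note that the bounded scalar search does not exploit the nonsmoothness of the penalty at zero, so estimated coefficients here shrink toward zero rather than landing exactly on it; an analytical thresholding-style update, as the abstract describes, would produce exact zeros and hence true variable selection.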
Index Terms
- Sparse Logistic Regression with the Hybrid L1/2+1 Regularization