Why LASSO Seems to Simultaneously Decrease Bias and Variance in Machine Learning

ABSTRACT
We show that, upon an enhancement of the capacity of the function space used in regression, LASSO simultaneously decreases the bias and the variance of statistical models learned from training data, provided that the balance between minimization of the mean-squared error and the L1-regularization term is optimal. Further, if minimization of the mean-squared error is dominant, this seems to explain the occurrence of double descent in the modern interpolation regime of machine learning. Our main method is a decomposition of the mean-squared error plus complexity into bias, variance, and an unavoidable irreducible error inherent to the problem.
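The bias–variance decomposition underlying this analysis can be illustrated numerically. The following sketch is not the paper's construction but a hypothetical toy setup: an orthonormal design matrix (for which the LASSO solution is the well-known soft-thresholding of the OLS coefficients), a fixed sparse true coefficient vector, and Monte Carlo estimation of squared bias and variance of the fitted predictions as the regularization weight λ varies; all parameter values are illustrative.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy orthonormal design: when X^T X = I, the LASSO solution is obtained by
# soft-thresholding the OLS coefficients X^T y at the level lam.
n, p = 64, 16
X, _ = np.linalg.qr(rng.standard_normal((n, p)))  # columns are orthonormal
beta_true = np.concatenate([np.full(4, 2.0), np.zeros(p - 4)])  # sparse truth
sigma = 0.5  # noise level (the irreducible error is sigma**2)

def lasso_orthonormal(X, y, lam):
    """LASSO estimate via soft-thresholding (valid only for X^T X = I)."""
    b_ols = X.T @ y
    return np.sign(b_ols) * np.maximum(np.abs(b_ols) - lam, 0.0)

def bias_variance(lam, reps=2000):
    """Monte Carlo estimate of squared bias and variance of the fit."""
    preds = np.empty((reps, n))
    for r in range(reps):
        y = X @ beta_true + sigma * rng.standard_normal(n)
        preds[r] = X @ lasso_orthonormal(X, y, lam)
    mean_pred = preds.mean(axis=0)
    bias2 = np.mean((mean_pred - X @ beta_true) ** 2)
    var = np.mean(preds.var(axis=0))
    return bias2, var

for lam in [0.0, 0.25, 1.0]:
    b2, v = bias_variance(lam)
    print(f"lambda={lam:4.2f}  bias^2={b2:.4f}  variance={v:.4f}")
```

At λ = 0 the fit is unbiased with variance p·σ²/n; increasing λ trades variance for bias, recovering the classical tradeoff against which the simultaneous decrease discussed above is contrasted.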