skip to main content
10.1145/3475827.3475839acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicomsConference Proceedingsconference-collections
research-article

Why LASSO Seems to Simultaneously Decrease Bias and Variance in Machine Learning

Published:27 October 2021Publication History

ABSTRACT

We show that on an enhancement of the capacity of the function space used in regression, LASSO simultaneously decreases bias and variance of statistical models obtained in machine learning from training data, if the balance between minimization of the mean-squared error and the L1-regularization term is optimal. Further, if minimization of the mean-squared error is dominant, this seems to explain the occurrence of a double descent in the modern interpolation regime of machine learning. Our main method is a decomposition of mean squared error plus complexity into bias, variance and an unavoidable irreducible error inherent to the problem.

References

  1. T. Hastie, R. Tibshirani, M. Wainwright, Statistical learning with sparsity: the lasso and generalizations, CRC press, 2015.Google ScholarGoogle ScholarCross RefCross Ref
  2. V. N. Vapnik, The Nature of Statistical Learning Theory, Springer, 1995.Google ScholarGoogle ScholarCross RefCross Ref
  3. S. Geman, E. Bienenstock, R. Doursat, Neural computation 4 (1992), 1-58.Google ScholarGoogle Scholar
  4. S. Spigler, M. Geiger, S. d'Ascoli, L. Sagun, G. Biroli, M. Wyart, Journal of Physics A: Mathematical and Theoretical 52 (2019), 474001.Google ScholarGoogle ScholarCross RefCross Ref
  5. M. Belkin, D. Hsu, S. Ma, S. Mandal, Proceedings of the National Academy of Sciences 116 (2019), 15849-15854.Google ScholarGoogle ScholarCross RefCross Ref
  6. B. Ghojogh, M. Crowley, arXiv:1905.12787 (2019).Google ScholarGoogle Scholar
  7. B. Adlam, J. Pennington, arXiv:2011.03321 (2020).Google ScholarGoogle Scholar
  8. J. Merker, G. Schuldt, Proceedings of ICoMS 2020, ACM (2020), DOI: 10.1145/3409915.3409920Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. J. Merker, Journal of Advances in Applied Mathematics 2 (2017), 109-114.Google ScholarGoogle ScholarCross RefCross Ref
  1. Why LASSO Seems to Simultaneously Decrease Bias and Variance in Machine Learning

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Other conferences
      ICoMS '21: Proceedings of the 2021 4th International Conference on Mathematics and Statistics
      June 2021
      102 pages
      ISBN:9781450389907
      DOI:10.1145/3475827

      Copyright © 2021 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 27 October 2021

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
      • Research
      • Refereed limited

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format .

    View HTML Format