skip to main content
10.1145/3645279.3645310acmotherconferencesArticle/Chapter ViewAbstractPublication PagesbdmipConference Proceedingsconference-collections
research-article

A Correlation-Driven Adaptive Lasso for Robust Logistic Regression Model Using Trimming Step

Published:06 May 2024Publication History

ABSTRACT

The presence of contamination can influence the performance of parameter estimation in the binary logistic regression. Additionally, the emergence of collinearity among independent variables also gives rise to the issue of multicollinearity. In this work, we propose a novel correlation-driven adaptive lasso algorithm designed to enhance the robustness of logistic regression by incorporating a trimming step. The efficacy of this approach stems from the synergistic utilization of correlation-driven trimming techniques, which collectively serve to mitigate the impact of contaminated observations. The algorithm is designed to select information highly correlated features adaptively and to detect outilers simultaneously by maximizing a trimmed likelihood function. The proposed method has been evaluated and compared with other exisitng methods through a simulation study. Finally, an application to a real data set is given.

References

  1. S. Le Cessie and J. C. Van Houwelingen. 1992. Ridge Estimators in Logistic Regression. Journal of the Royal Statistical Society. Series C (Applied Statistics) 41, 1 (1992), 191–201.Google ScholarGoogle ScholarCross RefCross Ref
  2. Araveeporn Autcha and Yuwadee Klomwises. 2020. The estimated parameter of logistic regression model by Markov Chain Monte Carlo method with multicollinearity. Statistical Journal of the IAOS 36, 4 (2020), 1253-1259.Google ScholarGoogle ScholarCross RefCross Ref
  3. Robert Tibshirani. 1996. Regression Shrinkage and Selection via the Lasso. Journal of the Royal Statistical Society. Series B. 58, 1 (1996), 267-288.Google ScholarGoogle Scholar
  4. Hui Zou and Trevor Hastie. 2005. Regularization and Variable Selection Via the Elastic Net. Journal of the Royal Statistical Society Series B: Statistical Methodology 67, 2 (2005), 301–320Google ScholarGoogle ScholarCross RefCross Ref
  5. Netti Herawati. 2020. Selecting the Method to Overcome Partial and Full Multicollinearity in Binary Logistic Model. International journal of statistics and applications 10, 3 (2020), 55-59.Google ScholarGoogle Scholar
  6. Autcha Araveeporn and Choojai Kuharatanachai. 2019. Comparing Penalized Regression Analysis of Logistic Regression Model with Multicollinearity. In Proceedings of the 2019 2nd International Conference on Mathematics and Statistics (ICoMS'19), 52-27.Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Robert Tibshirani. 1996. Regression Shrinkage and Selection via the Lasso. Journal of the royal statistical society series biomethodological 58, 267-288.Google ScholarGoogle ScholarCross RefCross Ref
  8. Hui Zou. 2006. The Adaptive Lasso and Its Oracle Properties. Journal of the American Statistical Association 101, 1418 - 1429.Google ScholarGoogle ScholarCross RefCross Ref
  9. Miguel Patrcio, Jos Pereira, Joana Crisstomo, Paulo Matafome, Raquel Seia, and Francisco Caramelo. 2018. Breast Cancer Coimbra. UCI Machine Learning Repository.Google ScholarGoogle Scholar
  10. Kriegel, Hans Peter , Matthias Schubert , and Arthur Zimek . 2008. Angle-based outlier detection in high-dimensional data. In Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, ACM, Las Vegas, Nevada, USA, 444-452.Google ScholarGoogle Scholar
  11. Abir Smiti. 2020. A critical overview of outlier detection methods. Computer Science Review 38, 100306.Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Yasin Asar. 2017. Some new methods to solve multicollinearity in logistic regression. Communications in Statistics - Simulation and Computation 46, 4 (2017), 2576-2586.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. A Correlation-Driven Adaptive Lasso for Robust Logistic Regression Model Using Trimming Step

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Other conferences
      BDMIP '23: Proceedings of the 2023 International Conference on Big Data Mining and Information Processing
      November 2023
      223 pages
      ISBN:9798400709166
      DOI:10.1145/3645279

      Copyright © 2023 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 6 May 2024

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
      • Research
      • Refereed limited
    • Article Metrics

      • Downloads (Last 12 months)2
      • Downloads (Last 6 weeks)2

      Other Metrics

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format .

    View HTML Format