Abstract
The problem of selecting a parsimonious subset of variables from a large number of predictors in a regression model is a topic of high importance. When the data contains vertical outliers and/or leverage points, outlier detection and variable selection are inseparable problems. Therefore a robust method that can simultaneously detect outliers and select variables is needed. An outlier detection and robust variable selection method is introduced that combines robust least angle regression with least trimmed squares regression on jack-knife subsets. In a second stage the detected outliers are removed and standard least angle regression is applied on the cleaned data to robustly sequence the predictor variables in order of importance. The performance of this method is evaluated by simulations that contain vertical outliers and high leverage points. The results of the simulation study show the good performance of this method in both outlier detection and robust variable selection.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Akaike, H.: Statistical predictor identification. Annals of the Institute of Statistical Mathematics 22(1), 203–217 (1970)
Mallows, C.L.: Some comments on c p . Technometrics 15(4), 661–675 (1973)
Schwarz, G.: Estimating the dimension of a model. The Annals of Statistics 6(2), 461–464 (1978)
Ronchetti, E.: Robust model selection in regression. Statistics & Probability Letters 3(1), 21–23 (1985)
Ronchetti, E., Staudte, R.G.: A robust version of mallows’ c p . Journal of the American Statistical Association 89(426), 550–559 (1994)
Maronna, R.A., Martin, R.D., Yohai, V.J.: Robust Statistics: Theory and Methods. J. Wiley & Sons (2006)
Müller, S., Welsh, A.: Outlier robust model selection in linear regression. Journal of the American Statistical Association 100(472), 1297–1310 (2005)
Salibian-Barrera, M., Van Aelst, S.: Robust model selection using fast and robust bootstrap. Computational Statistics & Data Analysis 52(12), 5121–5135 (2008)
Atkinson, A.C., Riani, M.: Forward search added-variable t-tests and the effect of masked outliers on model selection. Biometrika 89(4), 939–946 (2002)
Cantoni, E., Ronchetti, E.: Robust inference for generalized linear models. Journal of the American Statistical Association 96(455), 1022–1030 (2001)
Weisberg, S.: Applied Linear Regression. J. Wiley & Sons, New York (2005)
Efron, B., Hastie, T., Johnstone, I., Tibshirani, R.: Least angle regression. The Annals of Statistics 32(2), 407–499 (2004)
Khan, J.A., Van Aelst, S., Zamar, R.H.: Robust linear model selection based on least angle regression. Journal of the American Statistical Association 102(480), 1289–1299 (2007)
Efron, B.: The jackknife, the bootstrap and other resampling plans, vol. 38. SIAM NSF-CBMS (1982)
Huber, P.J., Ronchetti, E.M.: Robust Statistics. Wiley, New York (2009)
Pison, G., Van Aelst, S., Willems, G.: Small sample corrections for lts and mcd. Metrika 55(1-2), 111–123 (2002)
Hubert, M., Rousseeuw, P.J., Van Aelst, S.: High-breakdown robust multivariate methods. Statistical Science, 92–119 (2008)
Frank, L.E., Friedman, J.H.: A statistical view of some chemometrics regression tools. Technometrics 35(2), 109–135 (1993)
Core Team, R.: R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna (2012)
Hastie, T., Efron, B.: lars: Least Angle Regression, Lasso and Forward Stagewise, R package version 1.2 (2013)
Alfons, A.: robustHD: Robust methods for high-dimensional data, R package version 0.4.0 (2013)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Shahriari, S., Faria, S., Gonçalves, A.M., Van Aelst, S. (2014). Outlier Detection and Robust Variable Selection for Least Angle Regression. In: Murgante, B., et al. Computational Science and Its Applications – ICCSA 2014. ICCSA 2014. Lecture Notes in Computer Science, vol 8581. Springer, Cham. https://doi.org/10.1007/978-3-319-09150-1_37
Download citation
DOI: https://doi.org/10.1007/978-3-319-09150-1_37
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-09149-5
Online ISBN: 978-3-319-09150-1
eBook Packages: Computer ScienceComputer Science (R0)