In the statistical analysis of data one is often confronted with observations that “appear to be inconsistent with the remainder of that set of data” (Barnett and Lewis 1994). Although such observations (the outliers) have been the subject of numerous investigations, there is no general accepted formal definition of outlyingness. Nevertheless, the outliers describe abnormal data behavior, i.e., data that are deviating from the natural data variability (see, e.g., Peña and Prieto 2001, Filzmoser 2004, and Filzmoser et al. 2008 for a discussion).
Sometimes outliers can grossly distort the statistical analysis, while at other times their influence may not be as noticeable. Statisticians have accordingly developed numerous algorithms for the detection and treatment of outliers, but most of these methods were developed for univariate data sets. They are based on the estimation of location and scale, or on quantiles of the data. Since in a univariate sample outliers may be identified as an...
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References and Further Reading
Barnett V, Lewis T (1994) Outliers in statistical data, 3rd edn. Wiley, Chichester
Becker C, Gather U (2001) The largest nonidentifiable outlier: a comparison of multivariate simultaneous outlier identification rules. Comput Stat Data Anal 36:119–127
Donoho D (1982) Breakdown properties of multivariate location estimators. Ph.D. thesis, Harvard University
Filzmoser P (2004) A multivariate outlier detection method. In: Aivazian S, Filzmoser P, Kharin Yu (eds) Proceedings of the seventh international conference on computer data analysis and modeling, vol 1. Belarusian State University, Minsk, pp 18–22
Filzmoser P, Maronna R, Werner M (2008) Outlier identification in high dimensions. Comput Stat Data Anal 52:1694–1711
Gnanadesikan R, Kettenring JR (1972) Robust estimates, residuals, and outlier detection with multiresponse data. Biometrics 28:81–124
Hardin J, Rocke D (2005) The distribution of robust distances. J Comput Graph Stat 14:928–946
Kruskal JB (1969) Toward a practical method which helps uncover the structure of a set of multivariate observations by finding the linear transformation which optimizes a new “index of condensation”. In: Milton RC, Nelder JA (eds) Statistical computation. Academic, New York, pp 427–440
Maronna RA, Yohai VJ (1995) The behavior of the Stahel-Donoho robust multivariate estimator. J Am Stat Assoc 90:330–341
Maronna RA, Martin RD, Yohai V (2006) Robust statistics: theory and methods. Wiley, New York
Peña D, Prieto FJ (2001) Multivariate outlier detection and robust covariance matrix estimation. Technometrics 43:286–310
Rousseeuw PJ (1984) Least median of squares regression. J Am Stat Assoc 79:871–880
Rousseeuw PJ, Van Driessen K (1999) A fast algorithm for the minimum covariance determinant estimator. Technometrics 41:212–223
Rousseeuw PJ, Debruyne M, Engelen S, Hubert M (2006) Robustness and outlier detection in chemometrics. Cr Rev Anal Chem 36:221–242
Stahel WA (1981) Robust estimation: infinitesimal optimality and covariance matrix estimators. Ph.D. thesis in German, Swiss Federal Institute of Technology, Zurich, Switzerland
Wilks SS (1963) Multivariate statistical outliers. Sankhya 25:407–426
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this entry
Cite this entry
Rodrigues, I.M., Boente, G. (2011). Multivariate Outliers. In: Lovric, M. (eds) International Encyclopedia of Statistical Science. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04898-2_396
Download citation
DOI: https://doi.org/10.1007/978-3-642-04898-2_396
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-04897-5
Online ISBN: 978-3-642-04898-2
eBook Packages: Mathematics and StatisticsReference Module Computer Science and Engineering