Abstract
Analysis of categorical data by means of Correspondence Analysis (CA) has recently become popular. The behavior of CA in the presence of outliers in the table is not sufficiently explored in the literature, especially in the case of multidimensional contingency tables. In our research we apply correspondence analysis to three-way contingency tables with outliers, generated by deviations from the independence model. Outliers in our work are chosen in such a way that they break the independence in the table, but still they are not large enough to be easily spotted without statistical analysis. We study the change in the correspondence analysis row and column coordinates caused by the outliers and perform numerical analysis of the outlier coordinates.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
The printed version of the paper contains only black-white pictures. Coloured versions of this pictres are available from the authors upon request.
References
Agresti, A. (2002). Categorical data analysis. Hoboken: Wiley.
Andersen, E. B. (1994). The statistical analysis of categorical data. Berlin: Springer.
Barnett, V., & Lewis, T. (1984). Outliers in statistical data (Wiley Series in Probability and Mathematical Statistics. Applied Probability and Statistics 2nd ed.). Chichester: Wiley.
Blasius, J. (2001). Korrespondenzanalyse. München: Oldenbourg Verlag.
Blasius, J., & Greenacre, M. (2006). Multiple correspondence analysis and related methods. London: Chapman and Hall.
Greenacre, M. J. (1984). Theory and applications of correspondence analysis. London: Academic.
Kroonenberg, P. M. (2007). Applied multiway data analysis. Hoboken: Wiley.
Kuhnt S. (2004). Outlier identification procedures for contingency tables using maximum likelihood and L 1 estimates. Scandinavian Journal of Statistics,31, 431–442.
Nenadić, O. & Greenacre, M. (2007). Correspondence analysis in R, with two- and three-dimensional graphics: The ca package. Journal of Statistical Software,20(3), 1–13.
Shane, K. V., & Simonoff, J. S. (2001). A robust approach to categorical data analysis. ComputGraphStat,10, 135–157.
Acknowledgements
We appreciate valuable comments of Mikhail Langovoy as well as of anonymous reviewers of our article.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Langovaya, A., Kuhnt, S., Chouikha, H. (2013). Correspondence Analysis in the Case of Outliers. In: Giusti, A., Ritter, G., Vichi, M. (eds) Classification and Data Mining. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-28894-4_8
Download citation
DOI: https://doi.org/10.1007/978-3-642-28894-4_8
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-28893-7
Online ISBN: 978-3-642-28894-4
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)