Correspondence Analysis in the Case of Outliers

  • Conference paper
Classification and Data Mining

Abstract

Analysis of categorical data by means of Correspondence Analysis (CA) has recently become popular. The behavior of CA in the presence of outliers in the table has not been sufficiently explored in the literature, especially in the case of multidimensional contingency tables. In our research we apply correspondence analysis to three-way contingency tables with outliers, generated by deviations from the independence model. Outliers in our work are chosen so that they break the independence in the table but are still not large enough to be easily spotted without statistical analysis. We study the change in the correspondence analysis row and column coordinates caused by the outliers and perform numerical analysis of the outlier coordinates.
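
The idea of an outlier as a deviation from the independence model can be illustrated with a minimal sketch. It is not the authors' procedure: it assumes a two-way table instead of a three-way one, uses hypothetical counts, and relies on NumPy. The helper name `correspondence_analysis` and the size of the perturbation are illustrative choices. The sketch computes simple CA through the singular value decomposition of the standardized residuals, once for a table that satisfies independence exactly and once after a moderate increase of a single cell.

```python
import numpy as np

def correspondence_analysis(table):
    """Simple CA of a two-way table via SVD of standardized residuals."""
    N = np.asarray(table, dtype=float)
    P = N / N.sum()                  # correspondence matrix (relative frequencies)
    r = P.sum(axis=1)                # row masses
    c = P.sum(axis=0)                # column masses
    expected = np.outer(r, c)        # cell proportions under independence
    S = (P - expected) / np.sqrt(expected)   # standardized residuals
    U, sv, Vt = np.linalg.svd(S, full_matrices=False)
    rows = (U * sv) / np.sqrt(r)[:, None]    # principal row coordinates
    cols = (Vt.T * sv) / np.sqrt(c)[:, None] # principal column coordinates
    return rows, cols, sv ** 2               # squared singular values = principal inertias

# Hypothetical margins; the resulting table satisfies independence exactly.
row_totals = np.array([40.0, 60.0, 100.0])
col_totals = np.array([50.0, 70.0, 80.0])
clean = np.outer(row_totals, col_totals) / col_totals.sum()

# Assumed outlier: a moderate increase in one cell that breaks independence.
contaminated = clean.copy()
contaminated[0, 0] += 8.0

rows_clean, cols_clean, inertia_clean = correspondence_analysis(clean)
rows_out, cols_out, inertia_out = correspondence_analysis(contaminated)

print("total inertia, independent table: ", inertia_clean.sum())
print("total inertia, contaminated table:", inertia_out.sum())
print("first-axis row coordinates, contaminated table:", rows_out[:, 0])
```

Under independence the total inertia is zero up to rounding, so all coordinates collapse to the origin. After the perturbation the inertia becomes positive and, in this toy table, the row containing the modified cell lies farthest from the origin on the first axis. A three-way table would first have to be turned into a two-way one, for example by combining two of the variables, which is an assumption beyond what the abstract specifies.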

Notes

  1. The printed version of the paper contains only black-and-white pictures. Coloured versions of these pictures are available from the authors upon request.

Acknowledgements

We appreciate the valuable comments of Mikhail Langovoy and of the anonymous reviewers of our article.

Corresponding author

Correspondence to Anna Langovaya.

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Langovaya, A., Kuhnt, S., Chouikha, H. (2013). Correspondence Analysis in the Case of Outliers. In: Giusti, A., Ritter, G., Vichi, M. (eds) Classification and Data Mining. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-28894-4_8
