DDα-Classification of Asymmetric and Fat-Tailed Data

  • Conference paper

Abstract

The DDα-procedure is a fast nonparametric method for supervised classification of d-dimensional objects into q ≥ 2 classes. It is based on q-dimensional depth plots and the α-procedure, an efficient algorithm for discrimination in the depth space [0, 1]^q. Specifically, we use two depth functions that can be computed efficiently in high dimensions, the zonoid depth and the random Tukey depth, and compare their performance on different simulated data sets, in particular asymmetric elliptically distributed and t-distributed data.
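
The mapping into the depth space described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it approximates the random Tukey depth by taking the minimum univariate halfspace depth over a finite set of random directions, and it replaces the α-procedure with a simple maximum-depth rule, used here only as a stand-in separator. The function names `random_tukey_depth` and `depth_space`, the number of directions `k`, and the simulated Gaussian classes are all assumptions made for illustration.

```python
import numpy as np

def random_tukey_depth(points, sample, directions):
    """Approximate Tukey depth of each row of `points` w.r.t. `sample`.

    For every direction, project both arrays onto it and take the smaller of
    the two empirical tail fractions; the depth is the minimum over directions.
    """
    proj_sample = sample @ directions.T   # shape (n, k)
    proj_points = points @ directions.T   # shape (m, k)
    at_or_below = (proj_sample[None, :, :] <= proj_points[:, None, :]).mean(axis=1)
    at_or_above = (proj_sample[None, :, :] >= proj_points[:, None, :]).mean(axis=1)
    return np.minimum(at_or_below, at_or_above).min(axis=1)

def depth_space(points, classes, directions):
    """Map each point to its vector of depths w.r.t. the q training classes."""
    return np.column_stack(
        [random_tukey_depth(points, c, directions) for c in classes]
    )

rng = np.random.default_rng(0)
d, k = 2, 50  # dimension and number of random directions (assumed values)
directions = rng.standard_normal((k, d))
directions /= np.linalg.norm(directions, axis=1, keepdims=True)

# Two simulated training classes (q = 2); hypothetical data, not from the paper.
train = [rng.standard_normal((200, d)), rng.standard_normal((200, d)) + 6.0]
test_points = np.vstack(
    [rng.standard_normal((50, d)), rng.standard_normal((50, d)) + 6.0]
)
test_labels = np.repeat([0, 1], 50)

dd = depth_space(test_points, train, directions)  # rows lie in [0, 1]^q

# Stand-in for the alpha-procedure: assign each point to the class of maximum depth.
predicted = dd.argmax(axis=1)
accuracy = (predicted == test_labels).mean()
```

Points lying outside the convex hull of every training class receive depth zero in all coordinates, so the maximum-depth rule resolves them arbitrarily (here, to class 0); a full classifier would need a separate treatment for such points.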

Author information

Corresponding author

Correspondence to Tatjana Lange.

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Lange, T., Mosler, K., Mozharovskyi, P. (2014). DDα-Classification of Asymmetric and Fat-Tailed Data. In: Spiliopoulou, M., Schmidt-Thieme, L., Janning, R. (eds) Data Analysis, Machine Learning and Knowledge Discovery. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Cham. https://doi.org/10.1007/978-3-319-01595-8_8
