Skip to main content
Log in

Using sliced mean variance–covariance inverse regression for classification and dimension reduction

  • Original Paper
  • Published:
Computational Statistics Aims and scope Submit manuscript

Abstract

The sliced mean variance–covariance inverse regression (SMVCIR) algorithm takes grouped multivariate data as input and transforms it to a new coordinate system where the group mean, variance, and covariance differences are more apparent. Other popular algorithms used for performing graphical group discrimination are sliced average variance estimation (SAVE, targetting the same differences but using a different arrangement for variances) and sliced inverse regression (SIR, which targets mean differences). We provide an improved SMVCIR algorithm and create a dimensionality test for the SMVCIR coordinate system. Simulations corroborating our theoretical results and comparing SMVCIR with the other methods are presented. We also provide examples demonstrating the use of SMVCIR and the other methods, in visualization and group discrimination by k-nearest neighbors. The advantages and differences of SMVCIR from SAVE and SIR are shown clearly in these examples and simulation.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17
Fig. 18
Fig. 19

Similar content being viewed by others

References

  • Asuncion A, Newman D (2007) UCI machine learning repository. http://www.ics.uci.edu/~mlearn/MLRepository.html

  • Bentler PM, Xie J (2000) Corrections to test statistics in principal hessian directions. Stat Probab Lett 47:381–389

    Article  MATH  Google Scholar 

  • Bura E, Cook RD (2001) Extending sliced inverse regression: the weighted chi-squared test. J Am Stat Assoc 96:996–1003

    Article  MATH  MathSciNet  Google Scholar 

  • Bura E, Yang J (2011) Dimension estimation in sufficient dimension reduction: a unifying approach. J Multivar Anal 102:130–142

    Article  MATH  MathSciNet  Google Scholar 

  • Clopper CJ, Pearson ES (1935) The use of confidence or fiducial limits illustrated in the case of the binomial. Biometrika 26:404–413

    Article  Google Scholar 

  • Cook RD (1998) Regression graphics: ideas for studying regressions through graphics. Wiley, New York

    Book  MATH  Google Scholar 

  • Cook RD (2000) Save: a method for dimension reduction and graphics in regression. Commun Stat Theory Methods 29:2109–2121

    Article  MATH  Google Scholar 

  • Cook RD, Critchley F (2000) Detecting regression outliers and mixtures graphically. J Am Stat Assoc 95:781–794

    Article  MATH  MathSciNet  Google Scholar 

  • Cook RD, Forzani LM, Tomassi DR (2011) Ldr: a package for likelihood-based sufficient dimension reduction. J Stat Softw 39:1–20, http://www.jstatsoft.org/v39/i03

    Google Scholar 

  • Eaton ML, Tyler DE (1994) Asymptotic distributions of singular values with applications to canonical correlations and correspondence analysis. J Multivar Anal 50:238–264

    Article  MATH  MathSciNet  Google Scholar 

  • Gannoun A, Saracco J (2003) An asymptotic theory for sir\(_{\alpha } \) method. Stat Sinica 13:297–310

    Google Scholar 

  • Hastie T, Tibshirani RJ, Friedman J (2009) The elements of statistical learning: data mining, inference, and prediction, 2nd edn. Springer, New York

    Book  Google Scholar 

  • Hodges JL, Fix E (1951) Discriminatory analysis: nonparametric discrimination, consistency properties. Tech. Rep. 4, Project No. 21–49-004, Randolph Field, Texas: Brooks Air Force Base, USAF School of Aviation Medicine

  • Izenman AJ (2008) Modern multivariate statistical techniques: regression, classification, and manifold learning. Springer, New York

    Book  Google Scholar 

  • Li KC (1991) Sliced inverse regression for dimension reduction. J Am Stat Assoc 86:316–327

    Article  MATH  Google Scholar 

  • Lindsey CD (2010) Sliced mean variance-covariance inverse regression dimensionality test. PhD thesis, Texas A &M University, available by request to the author

  • Loftsgaarden DO, Quesenberry CP (1965) A nonparametric estimate of a multivariate density function. Ann Math Stat 36:1049–1051

    Article  MATH  MathSciNet  Google Scholar 

  • Lütkepohl H (2007) New introduction to multiple time series analysis, 2nd edn. Springer, New York

    Google Scholar 

  • Muirhead RJ (1982) Aspects of multivariate statistical analysis. Wiley, New York

    Google Scholar 

  • R Development Core Team (2012) R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria, http://www.R-project.org, ISBN 3-900051-07-0

  • Rencher AC (2002) Methods of multivariate analysis, 2nd edn. Wiley, New York

    Book  MATH  Google Scholar 

  • Serfling RJ (1980) Approximation theorems of mathematical statistics. Wiley, New York

    Book  MATH  Google Scholar 

  • Shao Y, Cook RD, Weisberg S (2007) Marginal tests with sliced average variance estimation. Biometrika 94:285–296

    Article  MATH  MathSciNet  Google Scholar 

  • Sheather SJ, McKean JW, Crimin K (2008) Sliced mean variance-covariance inverse regression. Comput Stat Data Anal 52:1908–1927

    Article  MATH  MathSciNet  Google Scholar 

  • StataCorp (2011) Release 12. College Station, Texas, http://www.stata.com

  • Tyler DE (1981) Asymptotic inference for eigenvectors. Ann Stat 9:725–736

    Article  MATH  MathSciNet  Google Scholar 

  • Weisberg S (2009) dr: Methods for dimension reduction for regression. http://CRAN.R-project.org/package=dr, R Package Version 3.0.4

  • Yin X (2005) Tests of dimension with sliced average variance estimation. Research Report 13, Department of Statistics, University of Georgia

  • Zhu M, Hastie TJ (2003) Feature extraction for nonparametric discriminant analysis. J Comput Graph Stat 12:101–120

    Article  MathSciNet  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Charles D. Lindsey.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Lindsey, C.D., Sheather, S.J. & McKean, J.W. Using sliced mean variance–covariance inverse regression for classification and dimension reduction. Comput Stat 29, 769–798 (2014). https://doi.org/10.1007/s00180-013-0460-3

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00180-013-0460-3

Keywords

Navigation