Abstract
The sliced mean variance–covariance inverse regression (SMVCIR) algorithm takes grouped multivariate data as input and transforms it to a new coordinate system where the group mean, variance, and covariance differences are more apparent. Other popular algorithms used for performing graphical group discrimination are sliced average variance estimation (SAVE, targetting the same differences but using a different arrangement for variances) and sliced inverse regression (SIR, which targets mean differences). We provide an improved SMVCIR algorithm and create a dimensionality test for the SMVCIR coordinate system. Simulations corroborating our theoretical results and comparing SMVCIR with the other methods are presented. We also provide examples demonstrating the use of SMVCIR and the other methods, in visualization and group discrimination by k-nearest neighbors. The advantages and differences of SMVCIR from SAVE and SIR are shown clearly in these examples and simulation.



















Similar content being viewed by others
References
Asuncion A, Newman D (2007) UCI machine learning repository. http://www.ics.uci.edu/~mlearn/MLRepository.html
Bentler PM, Xie J (2000) Corrections to test statistics in principal hessian directions. Stat Probab Lett 47:381–389
Bura E, Cook RD (2001) Extending sliced inverse regression: the weighted chi-squared test. J Am Stat Assoc 96:996–1003
Bura E, Yang J (2011) Dimension estimation in sufficient dimension reduction: a unifying approach. J Multivar Anal 102:130–142
Clopper CJ, Pearson ES (1935) The use of confidence or fiducial limits illustrated in the case of the binomial. Biometrika 26:404–413
Cook RD (1998) Regression graphics: ideas for studying regressions through graphics. Wiley, New York
Cook RD (2000) Save: a method for dimension reduction and graphics in regression. Commun Stat Theory Methods 29:2109–2121
Cook RD, Critchley F (2000) Detecting regression outliers and mixtures graphically. J Am Stat Assoc 95:781–794
Cook RD, Forzani LM, Tomassi DR (2011) Ldr: a package for likelihood-based sufficient dimension reduction. J Stat Softw 39:1–20, http://www.jstatsoft.org/v39/i03
Eaton ML, Tyler DE (1994) Asymptotic distributions of singular values with applications to canonical correlations and correspondence analysis. J Multivar Anal 50:238–264
Gannoun A, Saracco J (2003) An asymptotic theory for sir\(_{\alpha } \) method. Stat Sinica 13:297–310
Hastie T, Tibshirani RJ, Friedman J (2009) The elements of statistical learning: data mining, inference, and prediction, 2nd edn. Springer, New York
Hodges JL, Fix E (1951) Discriminatory analysis: nonparametric discrimination, consistency properties. Tech. Rep. 4, Project No. 21–49-004, Randolph Field, Texas: Brooks Air Force Base, USAF School of Aviation Medicine
Izenman AJ (2008) Modern multivariate statistical techniques: regression, classification, and manifold learning. Springer, New York
Li KC (1991) Sliced inverse regression for dimension reduction. J Am Stat Assoc 86:316–327
Lindsey CD (2010) Sliced mean variance-covariance inverse regression dimensionality test. PhD thesis, Texas A &M University, available by request to the author
Loftsgaarden DO, Quesenberry CP (1965) A nonparametric estimate of a multivariate density function. Ann Math Stat 36:1049–1051
Lütkepohl H (2007) New introduction to multiple time series analysis, 2nd edn. Springer, New York
Muirhead RJ (1982) Aspects of multivariate statistical analysis. Wiley, New York
R Development Core Team (2012) R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria, http://www.R-project.org, ISBN 3-900051-07-0
Rencher AC (2002) Methods of multivariate analysis, 2nd edn. Wiley, New York
Serfling RJ (1980) Approximation theorems of mathematical statistics. Wiley, New York
Shao Y, Cook RD, Weisberg S (2007) Marginal tests with sliced average variance estimation. Biometrika 94:285–296
Sheather SJ, McKean JW, Crimin K (2008) Sliced mean variance-covariance inverse regression. Comput Stat Data Anal 52:1908–1927
StataCorp (2011) Release 12. College Station, Texas, http://www.stata.com
Tyler DE (1981) Asymptotic inference for eigenvectors. Ann Stat 9:725–736
Weisberg S (2009) dr: Methods for dimension reduction for regression. http://CRAN.R-project.org/package=dr, R Package Version 3.0.4
Yin X (2005) Tests of dimension with sliced average variance estimation. Research Report 13, Department of Statistics, University of Georgia
Zhu M, Hastie TJ (2003) Feature extraction for nonparametric discriminant analysis. J Comput Graph Stat 12:101–120
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Lindsey, C.D., Sheather, S.J. & McKean, J.W. Using sliced mean variance–covariance inverse regression for classification and dimension reduction. Comput Stat 29, 769–798 (2014). https://doi.org/10.1007/s00180-013-0460-3
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00180-013-0460-3