Abstract
We address the problem of describing several categorical variables with a prediction purpose. We focus on methods in the multiblock modelling framework, each block being formed of the indicator matrix associated with each qualitative variable.We propose a method, called categorical multiblock Redundancy Analysis, based on a well-identified global optimization criterion which leads to an eigensolution. In comparison with usual procedures, such as logistic regression, the method is well-adapted to the case of a large number of redundant explanatory variables. Practical uses of the proposed method are illustrated using an empirical example in the field of epidemiology.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
BARKER, M. and RAYENS, W. (2003): Partial least squares for discrimination. Journal of Chemometrics 17, 166-173.
BOUGEARD, S., HANAFI, M., LUPO, C. and QANNARI, E.M. (2008): From multiblock Partial Least Squares to multiblock redundancy analysis. A continuum approach. In: International Conference on Computational Statistics. Porto, 607–616
HANAFI, M. (2007): PLS path modelling: computation of latent variables with the estimation mode B. Computational Statistics and Data Analysis 22, 275-292.
JAKOBOWICZ, E. and DERQUENNE, C. (2007): A modified PLS path modeling algorithm handling reflective categorical variables and a new model building strategy. Computational Statistics and Data Analysis 51, 3666-3678.
KRÄMER, N. (2007): Analysis of high-dimensional data with Partial Least Squares and boosting. PhD thesis, Berlin.
LOHMÖLLER, J.B. (1989): Latent variable path modelling with partial least square. Physica-Verlag, Heidelberg.
SAPORTA, G. and NIANG, N. (2006): Correspondence analysis and classification (Chap. 16). Multiple correspondence analysis and related method. Chapman & Hall, 372-392.
SJÖSTRÖM, M., WOLD, H. and SÖDERSTRÖM, B. (1986): PLS discriminant plots. In: E.S. Gelsema & L.N. Kanal, Pattern Recognition in Practice II, Elsevier, Amsterdam, 461–470.
SKRONDAL, A. and RABE-HESKETH, S. (2005): Structural equation modeling: categorical variables. In: Everitt, B.S. & Howell, D., Encyclopedia of statistics in behavioral science. Wiley.
STONE, M. (1974): Cross-validatory choice and assessment of statistical predictions. Journal of the Royal Statistical Society 36, 111-147.
TENENHAUS, M. and HANAFI, M. (2007): A bridge between PLS path modeling and multi-block data analysis. In: Handbook on Partial Least Square (PLS): concepts, methods and applications.
WOLD, S. (1984): Three PLS algorithms according to SW. In: Symposium MULDAST, Umea, 26–30.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Bougeard, S., Qannari, E.M., Chauvin, C. (2010). Multiblock Method for Categorical Variables. Application to the Study of Antibiotic Resistance. In: Lechevallier, Y., Saporta, G. (eds) Proceedings of COMPSTAT'2010. Physica-Verlag HD. https://doi.org/10.1007/978-3-7908-2604-3_36
Download citation
DOI: https://doi.org/10.1007/978-3-7908-2604-3_36
Published:
Publisher Name: Physica-Verlag HD
Print ISBN: 978-3-7908-2603-6
Online ISBN: 978-3-7908-2604-3
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)