Abstract
A CLUstering model for SKew-symmetric data including EXTernal information (CLUSKEXT) is proposed, which relies on the decomposition of a skew-symmetric matrix into within and between cluster effects which are further decomposed into regression and residual effects when possible external information on the objects is available. In order to fit the imbalances between objects, the model jointly searches for a partition of objects and appropriate weights which are in turn linearly linked to the external variables. The proposal is fitted in a least-squares framework and a decomposition of the fit is derived. An appropriate Alternating Least-Squares algorithm is provided to fit the model to illustrative real and artificial data.
Similar content being viewed by others
References
Bell DR, Lattin JM (1998) Shopping behavior and consumer preference for store price format: Why ‘large basket’ shoppers prefer EDLP. Mark Sci 17:66–88
Borg I, Groenen PJF (1995) Asymmetries in multidimensional scaling. In: Faulbaum F (ed) Softstat ’95. Gustav Fischer, Stuttgart, pp 31–35
Borg I, Groenen P (2005) Modern multidimensional scaling. Theory and applications, 2nd edn. Springer, Berlin
Bove, G. (2006) Approaches to asymmetric multidimensional scaling with external information. In: Zani S, Cerioli A, Riani M, Vichi M (eds) Data analysis, classification and forward search. Series: Data analysis, studies in classification data analysis and knowledge organization. Springer, Heidelberg, pp 69–76
Constantine AG, Gower JC (1978) Graphic representations of asymmetric matrices. Appl Stat 27:297–304
Ekman GA (1963) A direct method for multidimensional ratio scaling. Psychometrika 28:3–41
Escoufier Y, Grorud A (1980) Analyse factorielle des matrices carrées non-symétriques. In: Diday E et al (eds) Data analysis and informatics. North Holland, Amsterdam, pp 263–276
Gower JC (1977) The analysis of asymmetry and orthogonality. In: Barra JR, Brodeau F, Romier G, Van Cutsem B (eds) Recent Developments in Statistics. North Holland, Amsterdam, pp 109–123
Kiers HAL, Vicari D, Vichi D (2005) Simultaneous classification and multidimensional scaling with external information. Psychometrika 70:433–470
Liu S, Trenkler G (2008) Hadamard, Khatri-Rao, Kronecker and other matrix products. Int J Inf Syst Sci 4:160–177
Martín-Merino M, Munoz A (2005) Visualizing asymmetric proximities with SOM and MDS models. Neurocomputing 63:171–192
Munoz A, Martín-Merino M (2002) New asymmetric iterative scaling models for the generation of textual word maps, JADT 2002: 6es Journées internationales d’Analyse statistique des Données Textuelles, Saint Malo, France, pp 593–603
Okada A (1990) A generalization of asymmetric multidimensional scaling. In: Schader M, Gaul W (eds) Knowledge, data and computer-assisted decisions. Springer, Berlin, pp 127–138
Okada A, Imaizumi T (1987) Geometric models for asymmetric similarity data. Behaviormetrika 21:81–96
Olszewski D (2012) K-means clustering of asymmetric data. In: Corchado, E. et al. (eds.), Hybrid Artificial Intelligent Systems 2012, Part I, Lecture Notes in Computer Science, vol 7208. Springer, Berlin Heidelberg, pp 243–254
Olszewski D, Ster B (2014) Asymmetric clustering using the alpha-beta divergence. Pattern Recognit 47(5):2031–2041
Rocci R, Bove G (2002) Rotation techniques in asymmetric multidimensional scaling. J Comput Graph Stat 11:405–419
Saito T, Yadohisa H (2005) Data analysis of asymmetric structures. Advanced Approaches in Computational Statistics, Marcel Dekker, New York
Vicari D (2014) Classification of asymmetric proximity data. J Classif 31(3):386–420
Zielman B, Heiser WJ (1996) Models for asymmetric proximities. Br J Math Stat Psychol 49:127–146
Acknowledgments
The author wishes to thank the associate editors and the anonymous referees for their constructive comments and suggestions which greatly improved the quality of the paper.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Vicari, D. CLUSKEXT: CLUstering model for SKew-symmetric data including EXTernal information. Adv Data Anal Classif 12, 43–64 (2018). https://doi.org/10.1007/s11634-015-0203-0
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11634-015-0203-0