Abstract
Generalization of the covariance concept is discussed for mixed categorical and numerical data. Gini’s definition of variance for categorical data gives us a starting point to address this issue. The value difference in the original definition is changed to a vector in value space, giving a new definition of covariance for categorical and numerical data. It leads to reasonable correlation coefficients when applied to typical contingency tables.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Takeuchi, K. et al. (eds.): Encyclopedia of Statistics (in Japanese). Toyo Keizai Shinpou Tokyo, (1990)
Okada, T.: Rule Induction in Cascade Model based on Sum of Squares Decomposition. In: Zytkow, J.M., Rauch, J. (eds.): Principles of Data Mining and Knowledge Discovery (PKDD’99). LNAI, Vol. 1704, Springer-Verlag (1999) 468–475
Gini, C.W.: Variability and Mutability, contribution to the study of statistical distributions and relations. Studi Economico-Giuridici della R. Universita de Cagliari (1912). Reviewed in: Light, R.J., Margolin, B.H.: An Analysis of Variance for Categorical Data. J. American Statistical Association, 66 (1971) 534–544
Okada, T.: Sum of Squares Decomposition for Categorical Data. Kwansei Gakuin Studies in Computer Science, 14 (1999) 1–6
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2000 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Okada, T. (2000). A Note on Covariances for Categorical Data. In: Leung, K.S., Chan, LW., Meng, H. (eds) Intelligent Data Engineering and Automated Learning — IDEAL 2000. Data Mining, Financial Engineering, and Intelligent Agents. IDEAL 2000. Lecture Notes in Computer Science, vol 1983. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44491-2_23
Download citation
DOI: https://doi.org/10.1007/3-540-44491-2_23
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-41450-6
Online ISBN: 978-3-540-44491-6
eBook Packages: Springer Book Archive