Abstract
Functional data analysis, as proposed by Ramsay (Psychometrika 47:379–396, 1982), has recently attracted many researchers. The most popular approach taken in recent studies of functional data has been the extension of statistical methods for the analysis of usual data to that of functional data (e.g., Ramsay and Silverman in Functional data Analysis Springer, Berlin Heidelberg New York, 1997, Applied functional data analysis: methods and case studies. Springer, Berlin Heidelberg New York, 2002; Mizuta in Proceedings of the tenth Japan and Korea Joint Conference of Statistics, pp 77–82, 2000; Shimokawa et al. in Japan J Appl Stat 29:27–39, 2000). In addition, several methods for clustering functional data have been proposed (Abraham et al. in Scand J Stat 30:581–595, 2003; Gareth and Catherine in J Am Stat Assoc 98:397–408, 2003; Tarpey and kinateder in J Classif 20:93–114, 2003; Rossi et al. in Proceedings of European Symposium on Artificial Neural Networks pp 305–312, 2004). Furthermore, Tokushige et al. (J Jpn Soc Comput Stat 15:319–326, 2002) defined several dissimilarities between functions for the case of functional data. In this paper, we extend existing crisp and fuzzy k-means clustering algorithms to the analysis of multivariate functional data. In particular, we consider the dissimilarity between functions as a function. Furthermore, cluster centers and memberships, which are defined as functions, are determined at the minimum of a certain target function by using a calculus-of-variations approach.
Similar content being viewed by others
References
Abraham C, Cornillon PA, Matzner-Lober E, Molinari N (2003) Unsupervised curve clustering using B-splines. Scand J Statist 30:581–595
D’Urso P (2004) Fuzzy C-means clustering models for multivariate time-varying data: different approaches. Int J Uncertain Fuzziness Knowl Based Syst 12(3):287–326
Gareth MJ, Catherine AS (2003) Clustering for sparsely sampled functional data. J Am Statist Assoc 98:397–408
MacQueen J (1967) Some methods for classification and analysis of multivariate observations. In: Proceedings of the 5th Berkeley symposium on mathematical statistics and probability, vol. 1, pp. 281–297, University of California Press, Berkeley
Mizuta M (2000) Functional multidimensional scaling. In: Proceedings of the Tenth Japan and Korea joint conference of statistics, pp. 77–82
Ramsay JO (1982) When the data are functions. Psychometrika 47:379–396
Ramsay JO, Silverman BW (1997) Functional data analysis. Springer, Berlin Heidelberg New York
Ramsay JO, Silverman BW, (2002) Applied functional data analysis: methods and case studies. Springer, Berlin Heidelberg New York
Rossi F, Conan-Guez B, El Golli A (2004) Clustering functional data with the SOM algorithm. In: Proceedings of European symposium on artificial neural networks, 2004, pp. 305–312
Ruspini EH (1969) Hierarchical grouping to optimize an objective function. J Am Statist Assoc 58:236–244
Shimokawa M, Mizuta M, Sato Y (2000) An expansion of functional regression analysis (in Japanese). Jpn J Appl Statist 29:27–39
Simonoff JS (1998) Smoothing methods in statistics. Springer, Berlin Heidelberg New York
Tarpey T, Kinateder KKJ (2003) Clustering functional data. J Classif 20:93–114
Tokushige S, Inada K, Yadohisa H (2002) Dissimilarity and related methods for functional data. J Jpn Soc Comput Statist 15:319–326
Tuddenham RD, Snyder MM (1954) Physical growth of California boys and girls from birth to eighteen years. University of California Press, Berkeley, Los Angeles
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Tokushige, S., Yadohisa, H. & Inada, K. Crisp and fuzzy k-means clustering algorithms for multivariate functional data. Computational Statistics 22, 1–16 (2007). https://doi.org/10.1007/s00180-006-0013-0
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00180-006-0013-0