Abstract
Traditional procedures for clustering time series are based mostly on crisp hierarchical or partitioning methods. Given that the dynamics of a time series may change over time, a time series might display patterns that may enable it to belong to one cluster over one period while over another period, its pattern may be more consistent with those in another cluster. The traditional clustering procedures are unable to identify the changing patterns over time. However, clustering based on fuzzy logic will be able to detect the switching patterns from one time period to another thus enabling some time series to simultaneously belong to more than one cluster. In particular, this paper proposes a fuzzy approach to the clustering of time series based on their variances through wavelet decomposition. We will show that this approach will distinguish between time series with different patterns in variability as well identifying time series with switching patterns in variability.
Similar content being viewed by others
References
ALAGON, J. (1989), “Spectral Discriminant for Two Groups of Time Series”, Journal of Time Series Analysis, 10, 3, 203–214.
ANDERBERG, M.R. (1973), Cluster Analysis for Applications, New York: Academic Press.
BASALTO, N., BELLOTTI, R., DE CARLO, F., FACCHI, P., PANTALEO, E., and PASCAZIO, S. (2007), “Hausdorff Clustering of Financial Time Series”, Physica A, 379, 635–644.
BELLMAN, R. (1961), Adaptive Control Processes: A Guided Tour, Princeton, NJ: Princeton University Press.
BEYEN, K., GOLDSTEIN, J., RAMAKRISHNAN, R., and SHAFT, U. (1999), “When is the Nearest Neighbor Meaningful?”, Proceeding of the 7th International Conference on Database Theory, 217–235.
BEZDEK, J.C. (1981), Pattern Recognition with Fuzzy Objective Function Algorithms, New York: Plenum Press.
BEZDEK, J.C., KELLER, J., KRISNAPURAM, R., and PAL, N.R. (1999), Fuzzy Models and Algorithms for Pattern Recognition and Image Processing, The Handbook of Fuzzy Sets, Volume 4, Dordrecht: Kluwer.
BOHTE, Z., CEPAR, D., and KOŠMELJ, K. (1980), Clustering of Time Series, COMPSTAT’80, Heidelberg: Physica-Verlag, pp. 587–593.
CAIADO, J., CRATO, N., and PEÑA, D. (2006), “A Periodogram-Based Metric for Time Series Classification”, Computational Statistics & Data Analysis, 50, 2668–2684.
CARRIERI, F., ERRUNZA, V., and HOGAN, K. (2002), Characterizing World Market Integration Through Time, Working Paper, McGill University.
CHAN K.P., and FU A.C, (1999), Efficient Time Series Matching by Wavelets. in ICDE, 1999, Proceedings of the 15th International Conference on Data Engineering, Sydney, Australia.
COPPI, R., and D’URSO, P. (2002) “The Dual Dynamic Factor Analysis Model”, in Classification, Automation, and New Media, eds. W. Gaul and G. Ritter, Heidelberg: Springer-Verlag, pp. 47–58.
COPPI, R., D’URSO, P., and GIORDANI, P. (2006), “Fuzzy C-Medoids Clustering Models for Time-Varying Data”, in Modern Information Processing: From Theory to Applications, eds. B. Bouchon-Meunier, G. Coletti, and R.R. Yager, Elsevier, pp. 195–206.
CORAZZIARI, I. (1999), “Dynamic Factor Analysis”, in Classification and Data Analysis. Theory and Application, eds. M. Vichi and O. Opitz, Heidelberg: Springer-Verlag, pp. 171–178.
D’URSO, P. (2004), Fuzzy C-means Clustering Models for Multivariate Time-Varying Data: Different Approaches, International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems, 12 (3), 287–326.
D’URSO, P. (2005), “Fuzzy Clustering for Data Time Array with Inlier and Outlier Time Trajectories, IEEE Transactions on Fuzzy Systems, 13, 5, pp. 583–604.
D’URSO, P., and DE GIOVANNI, L. (2008), “Temporal Self-organizing Maps for Telecommunications Market Segmentation”, Neurocomputing, 71, 2880–2892.
DOSE, C., and CINCOTTI, S. (2005), “Clustering of Financial Time Series with Application to Index and Enhanced Index Tracking Portfolio”, Physica A, 355, 145–151.
EVERITT, B.S., LANDAU, S., and LEESE, M. (2001), Cluster Analysis (4th ed.), London: Arnold Press.
FAMA, E.F. (1981), “Stock Returns, Real Activity, Inflation, and Money”, American Economic Review 71, 545–565.
FERSON, W., and HARVEY, C.R. (1993), “The Risk and Predictability of International Equity Returns”, Review of Financial Studies, 6, 527–544.
FIFIELD, G.M., POWER, D.M., and SINCLAIR, C.D. (2002), “Macroeconomic Factors and Share Returns: An Analysis Using Emerging Market Data, International Journal of Finance & Economics, 7, 1, 51–62.
GE, D., SRINIVASAN, N., and KRISHNAN, S.M. (2002), “Cardiac Arrhythmia Classification Using Autoregressive Modeling”, Biomedical Engineering. OnLine (http://www.biomedical-engineering-online.com).
GORDON, A.D. (1999), Classification, Boca Raton FL: CRC Press.
HARVEY, C.R. (1991), “The World Price of COVARIANCE RISK”, Journal of Finance, 46, 111–157.
HARVEY, C.R. (1995), “Predictable Risk and Returns in Emerging Markets”, Review of Financial Studies, Fall, 773–816.
HARVEY, C.R. (2001), “Asset Pricing in Emerging Markets”, in International Encyclopedia of the Social and Behavioral Sciences, ed. O. Ashenfelter, Amsterdam: Elsevier Science, pp. 840–845.
HEISER, W.J., and GROENEN, P.J.F. (1997), “Cluster Differences Scaling With a Within- Clusters Loss Component and a Fuzzy Successive Approximation Strategy to Avoid Local Minima, Psychometrika, 62, 63–83.
HWANG, H., DESARBO, W.S., and TAKANE, Y. (2007), “Fuzzy Clusterwise Generalized Structured Component Analysis”, Psychometrika, 72, 181–198.
KAKIZAWA, Y., SHUMWAY, H., and TANIGUCHI, M. (1998), “Discriminant and Clustering for Multivariate Time Series”, Journal of the American Statistical Association, 93 (441), 328–340.
KEOGH, E.J., and KASETTY, S. (2002), “On the Need for Time Series Data Mining Benchmarks: A Survey and Empirical Demonstration”, 8th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, July 23-26.
KOSĚC, D. (2000), Parametric Estimation of Continuous Non Stationary Spectrum and Its Dynamics in Surface EMG Studies, International Journal of Medical Informatics, 58–59, 59–69.
MCBRATNEY, A.B., and MOORE, A.W. (1985), “Application of Fuzzy Sets to Climatic Classification”, Agricultural and Forest Meteorology, 35, 165–185.
MAHARAJ, E.A., and ALONSO, A.M. (2007), “Discrimination of Locally Stationary Time Series Using Wavelets, Computational Statistics & Data Analysis, 52, 879–895.
MAC QUEEN, J.B. (1967), “Some Methods for Classification and Analysis of Multivariate Observations”, Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, 2, 281–297.
PERCIVAL, D.B., and WALDEN, A.T. (2000), Wavelet Methods for Time Series Analysis, Cambridge: Cambridge University Press.
PERCIVAL, D.B. (1995), “On the Estimation of the Wavelet Variance, Biometrika, 82(3), 619–631.
PRIESTLEY, M.B. (1966), “Design Relations for Non-Stationary Processes”, Journal of the Royal Statistical Society B, 28, 228–240.
ROMESBURG, H.C. (1984), Cluster Analysis for Researchers, Belmont, CA: Lifetime Learning Publications.
WEDEL, M., and KAMAKURA, W.A. (1998), Market Segmentation: Conceptual and Methodological Foundations, Boston: Kluwer Academic.
WANG, W., and ZHANG, Y. (2007), “On Fuzzy Cluster Validity Indices, Fuzzy Sets and Systems, 158 (19), 2095–2117.
ZHANG, H., HO T.B., ZHANG, Y., and LIN, M. (2005), Unsupervised Feature Extraction for Time Series Clustering Using Orthogonal Wavelet Transform, Informatica, 30, 305–319.
Author information
Authors and Affiliations
Corresponding author
Additional information
We thank the editor and three referees for their comments and suggestions that helped improve the presentation of this paper.
Rights and permissions
About this article
Cite this article
Ann Maharaj, E., D’Urso, P. & Galagedera, D.U.A. Wavelet-based Fuzzy Clustering of Time Series. J Classif 27, 231–275 (2010). https://doi.org/10.1007/s00357-010-9058-4
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00357-010-9058-4