Skip to main content
Log in

Wavelet-based Fuzzy Clustering of Time Series

  • Published:
Journal of Classification Aims and scope Submit manuscript

Abstract

Traditional procedures for clustering time series are based mostly on crisp hierarchical or partitioning methods. Given that the dynamics of a time series may change over time, a time series might display patterns that may enable it to belong to one cluster over one period while over another period, its pattern may be more consistent with those in another cluster. The traditional clustering procedures are unable to identify the changing patterns over time. However, clustering based on fuzzy logic will be able to detect the switching patterns from one time period to another thus enabling some time series to simultaneously belong to more than one cluster. In particular, this paper proposes a fuzzy approach to the clustering of time series based on their variances through wavelet decomposition. We will show that this approach will distinguish between time series with different patterns in variability as well identifying time series with switching patterns in variability.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • ALAGON, J. (1989), “Spectral Discriminant for Two Groups of Time Series”, Journal of Time Series Analysis, 10, 3, 203–214.

    Article  MathSciNet  Google Scholar 

  • ANDERBERG, M.R. (1973), Cluster Analysis for Applications, New York: Academic Press.

    MATH  Google Scholar 

  • BASALTO, N., BELLOTTI, R., DE CARLO, F., FACCHI, P., PANTALEO, E., and PASCAZIO, S. (2007), “Hausdorff Clustering of Financial Time Series”, Physica A, 379, 635–644.

    Article  Google Scholar 

  • BELLMAN, R. (1961), Adaptive Control Processes: A Guided Tour, Princeton, NJ: Princeton University Press.

    MATH  Google Scholar 

  • BEYEN, K., GOLDSTEIN, J., RAMAKRISHNAN, R., and SHAFT, U. (1999), “When is the Nearest Neighbor Meaningful?”, Proceeding of the 7th International Conference on Database Theory, 217–235.

  • BEZDEK, J.C. (1981), Pattern Recognition with Fuzzy Objective Function Algorithms, New York: Plenum Press.

    MATH  Google Scholar 

  • BEZDEK, J.C., KELLER, J., KRISNAPURAM, R., and PAL, N.R. (1999), Fuzzy Models and Algorithms for Pattern Recognition and Image Processing, The Handbook of Fuzzy Sets, Volume 4, Dordrecht: Kluwer.

  • BOHTE, Z., CEPAR, D., and KOŠMELJ, K. (1980), Clustering of Time Series, COMPSTAT’80, Heidelberg: Physica-Verlag, pp. 587–593.

    Google Scholar 

  • CAIADO, J., CRATO, N., and PEÑA, D. (2006), “A Periodogram-Based Metric for Time Series Classification”, Computational Statistics & Data Analysis, 50, 2668–2684.

    Article  MATH  MathSciNet  Google Scholar 

  • CARRIERI, F., ERRUNZA, V., and HOGAN, K. (2002), Characterizing World Market Integration Through Time, Working Paper, McGill University.

  • CHAN K.P., and FU A.C, (1999), Efficient Time Series Matching by Wavelets. in ICDE, 1999, Proceedings of the 15th International Conference on Data Engineering, Sydney, Australia.

  • COPPI, R., and D’URSO, P. (2002) “The Dual Dynamic Factor Analysis Model”, in Classification, Automation, and New Media, eds. W. Gaul and G. Ritter, Heidelberg: Springer-Verlag, pp. 47–58.

  • COPPI, R., D’URSO, P., and GIORDANI, P. (2006), “Fuzzy C-Medoids Clustering Models for Time-Varying Data”, in Modern Information Processing: From Theory to Applications, eds. B. Bouchon-Meunier, G. Coletti, and R.R. Yager, Elsevier, pp. 195–206.

  • CORAZZIARI, I. (1999), “Dynamic Factor Analysis”, in Classification and Data Analysis. Theory and Application, eds. M. Vichi and O. Opitz, Heidelberg: Springer-Verlag, pp. 171–178.

  • D’URSO, P. (2004), Fuzzy C-means Clustering Models for Multivariate Time-Varying Data: Different Approaches, International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems, 12 (3), 287–326.

    Article  MATH  MathSciNet  Google Scholar 

  • D’URSO, P. (2005), “Fuzzy Clustering for Data Time Array with Inlier and Outlier Time Trajectories, IEEE Transactions on Fuzzy Systems, 13, 5, pp. 583–604.

    Article  Google Scholar 

  • D’URSO, P., and DE GIOVANNI, L. (2008), “Temporal Self-organizing Maps for Telecommunications Market Segmentation”, Neurocomputing, 71, 2880–2892.

    Article  Google Scholar 

  • DOSE, C., and CINCOTTI, S. (2005), “Clustering of Financial Time Series with Application to Index and Enhanced Index Tracking Portfolio”, Physica A, 355, 145–151.

    Article  MathSciNet  Google Scholar 

  • EVERITT, B.S., LANDAU, S., and LEESE, M. (2001), Cluster Analysis (4th ed.), London: Arnold Press.

    Google Scholar 

  • FAMA, E.F. (1981), “Stock Returns, Real Activity, Inflation, and Money”, American Economic Review 71, 545–565.

    Google Scholar 

  • FERSON, W., and HARVEY, C.R. (1993), “The Risk and Predictability of International Equity Returns”, Review of Financial Studies, 6, 527–544.

    Google Scholar 

  • FIFIELD, G.M., POWER, D.M., and SINCLAIR, C.D. (2002), “Macroeconomic Factors and Share Returns: An Analysis Using Emerging Market Data, International Journal of Finance & Economics, 7, 1, 5162.

    Article  Google Scholar 

  • GE, D., SRINIVASAN, N., and KRISHNAN, S.M. (2002), “Cardiac Arrhythmia Classification Using Autoregressive Modeling”, Biomedical Engineering. OnLine (http://www.biomedical-engineering-online.com).

  • GORDON, A.D. (1999), Classification, Boca Raton FL: CRC Press.

    MATH  Google Scholar 

  • HARVEY, C.R. (1991), “The World Price of COVARIANCE RISK”, Journal of Finance, 46, 111–157.

    Article  Google Scholar 

  • HARVEY, C.R. (1995), “Predictable Risk and Returns in Emerging Markets”, Review of Financial Studies, Fall, 773–816.

    Google Scholar 

  • HARVEY, C.R. (2001), “Asset Pricing in Emerging Markets”, in International Encyclopedia of the Social and Behavioral Sciences, ed. O. Ashenfelter, Amsterdam: Elsevier Science, pp. 840–845.

  • HEISER, W.J., and GROENEN, P.J.F. (1997), “Cluster Differences Scaling With a Within- Clusters Loss Component and a Fuzzy Successive Approximation Strategy to Avoid Local Minima, Psychometrika, 62, 63–83.

    Article  MATH  MathSciNet  Google Scholar 

  • HWANG, H., DESARBO, W.S., and TAKANE, Y. (2007), “Fuzzy Clusterwise Generalized Structured Component Analysis”, Psychometrika, 72, 181–198.

    Article  MATH  MathSciNet  Google Scholar 

  • KAKIZAWA, Y., SHUMWAY, H., and TANIGUCHI, M. (1998), “Discriminant and Clustering for Multivariate Time Series”, Journal of the American Statistical Association, 93 (441), 328–340.

    Article  MATH  MathSciNet  Google Scholar 

  • KEOGH, E.J., and KASETTY, S. (2002), “On the Need for Time Series Data Mining Benchmarks: A Survey and Empirical Demonstration”, 8th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, July 23-26.

  • KOSĚC, D. (2000), Parametric Estimation of Continuous Non Stationary Spectrum and Its Dynamics in Surface EMG Studies, International Journal of Medical Informatics, 58–59, 59–69.

    Google Scholar 

  • MCBRATNEY, A.B., and MOORE, A.W. (1985), “Application of Fuzzy Sets to Climatic Classification”, Agricultural and Forest Meteorology, 35, 165–185.

    Article  Google Scholar 

  • MAHARAJ, E.A., and ALONSO, A.M. (2007), “Discrimination of Locally Stationary Time Series Using Wavelets, Computational Statistics & Data Analysis, 52, 879–895.

    Article  MATH  MathSciNet  Google Scholar 

  • MAC QUEEN, J.B. (1967), “Some Methods for Classification and Analysis of Multivariate Observations”, Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, 2, 281–297.

    Google Scholar 

  • PERCIVAL, D.B., and WALDEN, A.T. (2000), Wavelet Methods for Time Series Analysis, Cambridge: Cambridge University Press.

    MATH  Google Scholar 

  • PERCIVAL, D.B. (1995), “On the Estimation of the Wavelet Variance, Biometrika, 82(3), 619–631.

    Article  MATH  MathSciNet  Google Scholar 

  • PRIESTLEY, M.B. (1966), “Design Relations for Non-Stationary Processes”, Journal of the Royal Statistical Society B, 28, 228–240.

    MATH  MathSciNet  Google Scholar 

  • ROMESBURG, H.C. (1984), Cluster Analysis for Researchers, Belmont, CA: Lifetime Learning Publications.

    Google Scholar 

  • WEDEL, M., and KAMAKURA, W.A. (1998), Market Segmentation: Conceptual and Methodological Foundations, Boston: Kluwer Academic.

    Google Scholar 

  • WANG, W., and ZHANG, Y. (2007), “On Fuzzy Cluster Validity Indices, Fuzzy Sets and Systems, 158 (19), 2095–2117.

    Article  MATH  MathSciNet  Google Scholar 

  • ZHANG, H., HO T.B., ZHANG, Y., and LIN, M. (2005), Unsupervised Feature Extraction for Time Series Clustering Using Orthogonal Wavelet Transform, Informatica, 30, 305–319.

    MathSciNet  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Elizabeth Ann Maharaj.

Additional information

We thank the editor and three referees for their comments and suggestions that helped improve the presentation of this paper.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Ann Maharaj, E., D’Urso, P. & Galagedera, D.U.A. Wavelet-based Fuzzy Clustering of Time Series. J Classif 27, 231–275 (2010). https://doi.org/10.1007/s00357-010-9058-4

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00357-010-9058-4

Keywords

Navigation