Skip to main content
Log in

Metaheuristics on time series clustering problem: theoretical and empirical evaluation

  • Research Paper
  • Published:
Evolutionary Intelligence Aims and scope Submit manuscript

Abstract

Considering the literature and the importance of using of the metaheuristic techniques in time series data mining tasks, especially time series clustering (TsC), it seems lack of a comparative study of such techniques in terms of efficiency for TsC problem. Hence, we try to offer the possibility of theoretical and empirical evaluation of metaheuristic techniques in TsC problem. In fact, we follow two main goals by performing this theoretical and empirical evaluation. These goals include: at the first, we would like to prove the effective role of metaheuristic techniques to enhance the efficiency of TsC algorithms, eliminate some challenges of such algorithms (e.g., sensitivity to initialization of some primary parameters), and indicates the popularity of their use in TsC problems during the last years because of their characteristics. Second, we would like to offer the possibility of a comparison between some metaheuristic techniques to analyze the percentage of their effectiveness for enhancing the accuracy of clustering results in TsC problems. The comparative analysis of results of the empirical evaluation for ten standard time series data sets collected from the UCR time series data sets repository show three keys conclusions: (1) generally, metaheuristic techniques have provided reasonable results and significant improvement in terms of efficiency in TsC problem in all experiments due to the main characteristic of metaheuristics that is finding an approximate solution more quickly. But, it is concluded that fuzzy metaheuristic techniques based on population solution class (e.g., FATPSO and FPSO) to provide better results versus single solution class in TsC algorithms. (2) It is deducted that consuming maximum computational time presented by fuzzy metaheuristic techniques in comparison to other optimization techniques in the performed experiments. (3) The paper has proposed that a hybrid of fuzzy metaheuristics (e.g., FATPSO) and the base TsC algorithms can be considered as a more adequate choice to present better results in terms of accuracy in different application areas.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

References

  1. Gogna A, Tayal A (2013) Metaheuristics: review and application. J Exp Theor Artif Intell 25(4):503–526

    Article  Google Scholar 

  2. Tsai CW, Rodrigues J (2014) Metaheuristic scheduling for cloud: a survey. IEEE Syst J 8(1):279–9

    Article  Google Scholar 

  3. Talbi El-Ghazali (2009) Metaheuristics: from design to implementation, 1st edn. Wiley, Chichester

    Book  Google Scholar 

  4. Abualigah L (2020) Multi-verse optimizer algorithm: a comprehensive survey of its results, variants, and applications. Neural Comput Appl 32:12381–12401. https://doi.org/10.1007/s00521-020-04839-1

    Article  Google Scholar 

  5. Abualigah L, Diabat A, Geem ZW (2020) A comprehensive survey of the harmony search algorithm in clustering applications. Appl Sci 10(11):3827

    Article  Google Scholar 

  6. Abualigah L, Shehab M, Alshinwan M et al (2020) Ant lion optimizer: a comprehensive survey of its variants and applications. Arch Computat Methods Eng. https://doi.org/10.1007/s11831-020-09420-6

    Article  Google Scholar 

  7. Fu TC (2011) A review on time series data mining. Eng Appl Artif Intell 24(1):164–181

    Article  Google Scholar 

  8. Izakian Z, Mesgari M (2015) Fuzzy clustering of time series data: a particle swarm optimization approach. J AI Data Min 3(1):39–46

    Google Scholar 

  9. Pérez-Ortiz M, Durán-Rosal AM, Gutiérrez PA, Sánchez-Monedero J, Nikolaou A, Fernández-Navarro F, Hervás-Martínez C (2019) On the use of evolutionary time series analysis for segmenting paleoclimate data. Neurocomputing 326:3–14

    Article  Google Scholar 

  10. Mehrmolaei S, Keyvanpour MR (2017) Fuzzy-clustering time series: population-based an enhanced technique. In: Proceeding of IEEE conference on artificial intelligence and robotics (IRANOPEN), pp 84–90

  11. Deb S, Tian Z, Fong S, Wong R, Millham R, Wong KK (2018) Elephant search algorithm applied to data clustering. Soft Comput 22(18):6035–6046

    Article  Google Scholar 

  12. Chou CH, Hsieh SC, Qiu CJ (2017) Hybrid genetic algorithm and fuzzy clustering for bankruptcy prediction. Appl Soft Comput 56:298–316

    Article  Google Scholar 

  13. Islam MZ, Estivill-Castro V, Rahman MA, Bossomaier T (2018) Combining K-Means and a genetic algorithm through a novel arrangement of genetic operators for high quality clustering. Expert Syst Appl 91:402–417

    Article  Google Scholar 

  14. Abubakar AM, Man-Yahya S (2019) CAT swarm optimization based clustering algorithm for fuzzy time series forecasting. Arid Zone J Eng Technol Environ 15(1):40–54

    Google Scholar 

  15. Alonso AM, Peña D (2019) Clustering time series by linear dependency. Stat Comput 29(4):655–676

    Article  MathSciNet  Google Scholar 

  16. Silva Filho TM, Pimentel BA, Souza RM, Oliveira AL (2015) Hybrid methods for fuzzy clustering based on fuzzy c-means and improved particle swarm optimization. Expert Syst Appl 42(17–18):6315–6328

    Article  Google Scholar 

  17. Navajas-Guerrero A, Manjarres D, Portillo E, Landa-Torres I (2019, May) A novel heuristic approach for the simultaneous selection of the optimal clustering method and its internal parameters for time series data. In: International workshop on soft computing models in industrial and environmental applications, pp 179–189. Springer, Cham

  18. Ramadas M, Abraham A (2019) Metaheuristics and data clustering. In: Metaheuristics for data clustering and image segmentation, pp 7–55. Springer, Cham

  19. Menéndez HD, Otero FE, Camacho D (2016) Medoid-based clustering using ant colony optimization. Swarm Intell 10(2):123–145

    Article  Google Scholar 

  20. Oduntan OI, Thulasiraman P (2018, November) Hybrid metaheuristic algorithm for clustering. In: 2018 IEEE symposium series on computational intelligence (SSCI), pp 1–9. IEEE

  21. Kushwaha N, Pant M (2020) Fuzzy particle swarm page rank clustering algorithm. In: Soft computing: theories and applications, pp 895–904. Springer, Singapore

  22. Panapakidis IP, Christoforidis GC (2019) Exploring the potential of metaheuristics-based clustering in load profiling applications. In: 2019 1st international conference on energy transition in the mediterranean area (SyNERGY MED), pp 1–6. IEEE

  23. Anderson OD (1977) The Box-Jenkins approach to time series analysis. RAIRO Oper Res 11(1):3–29

    Article  MathSciNet  Google Scholar 

  24. Mehrmolaei S, Keyvanpour MR (2018) TsP-SA: usage of time series techniques on healthcare data. Int J Electron Healthc 10(3):190–230

    Article  Google Scholar 

  25. Keogh E (2008) Indexing and mining time series data. Encyclopedia of GIS, pp 493–497

  26. Aghabozorgi S, Shirkhorshidi AS, Wah TY (2015) Time-series clustering: a decade review. Inform Syst 53:16–38

    Article  Google Scholar 

  27. Guijo-Rubio D, Durán-Rosal AM, Gutiérrez PA, Troncoso A, Hervás-Martínez C (2020) Time-series clustering based on the characterization of segment typologies. In: IEEE transactions on cybernetics

  28. Li H (2019) Multivariate time series clustering based on common principal component analysis. Neurocomputing 349:239–247

    Article  Google Scholar 

  29. Sheikholeslami G, Chatterjee S, Zhang A (1998) Wavecluster: a multi-resolution clustering approach for very large spatial databases. VLDB 98:428–439

    Google Scholar 

  30. Corduas M, Piccolo D (2008) Time series clustering and classification by the autoregressive metric. Comput Stat Data Anal 52(4):1860–1872

    Article  MathSciNet  Google Scholar 

  31. Jahanbakhsh A, Seyedzadegan M (2016) An improved method of fuzzy c-means clustering by using feature selection and weighting. Int J Comput Sci Netw Secur 16(10):64–69

    Google Scholar 

  32. Wang X, Wang Y, Wang L (2004) Improving fuzzy c-means clustering based on feature-weight learning”. Pattern Recogn Lett 25(10):1123–1132

    Article  Google Scholar 

  33. Lee CH, Su, YY, Lin YC, Lee SJ (2017) Time series forecasting based on weighted clustering. In: 2017 2nd IEEE international conference on computational intelligence and applications (ICCIA), pp 421–425. IEEE

  34. Hathaway RJ, Bezdek JC (2003) Visual cluster validity for prototype generator clustering models. Pattern Recogn Lett 24(9–10):1563–1569

    Article  Google Scholar 

  35. Izakian H, Pedrycz W, Jamal I (2015) Fuzzy clustering of time series data using dynamic time warping distance. Eng Appl Artif Intell 39:235–244

    Article  Google Scholar 

  36. Halkidi M, Batistakis Y, Vazirgiannis M (2003) Clustering validity checking methods: part ii

  37. Kotsakos D, Trajcevski G, Gunopulos D, Aggarwal CC (2013) time-series data clustering, books. google.com. Chapter 15:357–380

    Google Scholar 

  38. Mori U, Mendiburu A, Lozano JA (2015) Similarity measure selection for clustering time series databases. IEEE Trans Knowl Data Eng 28(1):181–195

    Article  Google Scholar 

  39. Tan PN (2018) Introduction to data mining. Pearson Education India, Chennai

    Google Scholar 

  40. Moosavi SR, Qajar J, Riazi M (2018) A comparison of methods for denoising of well test pressure data. J Petrol Explor Prod Technol 8(4):1519–1534

    Article  Google Scholar 

  41. Ali M, Jones MW, Xie X, Williams M (2019) TimeCluster: dimension reduction applied to temporal data for visual analytics. Vis Comput 35(6–8):1013–1026

    Article  Google Scholar 

  42. Van Der Maaten L, Postma E, Van den Herik J (2009) Dimensionality reduction: a comparative. J Mach Learn Res 10(66–71):13

    Google Scholar 

  43. Yu P, Li Y, Lin H, Wu N (2016) Removal of random noise in seismic data by time-varying window-length time-frequency peak filtering. Acta Geophys 64(5):1703–1714

    Article  Google Scholar 

  44. Kane A, Shiri N (2017) Multivariate time series representation and similarity search using pca. Ind Conf Data Min. Springer, Cham, pp 122–136

    Google Scholar 

  45. Luchian H, Breaban ME, Bautu A (2015) On meta-heuristics in optimization and data analysis. Application to geosciences. In: Artificial intelligent approaches in petroleum geosciences, pp 53–100. Springer, Cham

  46. Asih AMS, Sopha BM, Kriptaniadewa G (2017) Comparison study of metaheuristics: empirical application of delivery problems. Int J Eng Bus Manag 9:1–12. https://doi.org/10.1177/1847979017743603

    Article  Google Scholar 

  47. Tsai CW, Chiang MC, Ksentini A, Chen M (2016) Metaheuristic algorithms for healthcare: open issues and challenges. Comput Electr Eng 53:421–434

    Article  Google Scholar 

  48. Razavi SH, Ebadati EOM, Asadi S, Kaur H (2015) An efficient grouping genetic algorithm for data clustering and big data analysis. Comput Intell Big Data Anal. Springer, Cham, pp 119–142

    Chapter  Google Scholar 

  49. Mehrmolaei S, Keyvanpour MR (2019) an enhanced hybrid model for event prediction in healthcare time series. Int J Knowl Based Intell Eng Syst 23:131–147

    Google Scholar 

  50. Niknam T, Amiri B (2010) An efficient hybrid approach based on PSO, ACO and k-means for cluster analysis. Appl Soft Comput 10(1):183–197

    Article  Google Scholar 

  51. Khosravanian R, Mansouri V, Wood DA, Alipour MR (2018) A comparative study of several metaheuristic algorithms for optimizing complex 3-D well-path designs. J Petrol Explor Prod Technol 8(4):1487–1503

    Article  Google Scholar 

  52. Ilango SS, Vimal S, Kaliappan M, Subbulakshmi P (2018) Optimization using artificial bee colony based clustering approach for big data. Cluster Comput, pp 1–9

  53. Antosiewicz M, Koloch G, Kamiński B (2013) Choice of best possible metaheuristic algorithm for the travelling salesman problem with limited computational time: quality, uncertainty and speed. J Theor Appl Comput Sci 7(1):46–55

    Google Scholar 

  54. Liu H, Abraham A (2005) Fuzzy adaptive turbulent particle swarm optimization. In: Fifth international conference on hybrid intelligent systems (HIS’05), p 6. IEEE

  55. Livada I, Synnefa A, Haddad S, Paolini R, Garshasbi S, Ulpiani G, Fiorito F, Vassilakopoulou K, Osmond P, Santamouris M (2019) Time series analysis of ambient air-temperature during the period 1970–2016 over Sydney, Australia. Sci Total Environ 648:1627–1638

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Mohammad Reza Keyvanpour.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Mehrmolaei, S., Keyvanpour, M.R. & Savargiv, M. Metaheuristics on time series clustering problem: theoretical and empirical evaluation. Evol. Intel. 15, 329–348 (2022). https://doi.org/10.1007/s12065-020-00511-8

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s12065-020-00511-8

Keywords

Navigation