Skip to main content
Log in

A time series classification method combining graph embedding and the bag-of-patterns algorithm

  • Published:
Applied Intelligence Aims and scope Submit manuscript

Abstract

Time series data mining techniques have attracted extensive attention from researchers worldwide. Of these techniques, time series classification is an important part of time series mining. Among the many time series classification algorithms, methods based on the bag-of-patterns algorithm have attracted much attention from researchers because of their high accuracy and execution efficiency. However, when using these methods, only the frequency of different patterns is considered. Features such as the position of patterns in a sequence are not mined. Therefore, the aim of this paper is to determine how to solve the problem that the positional relationships among patterns are ignored when using the bag-of-patterns algorithm. To solve this issue, we introduce the graph embedding technique, and an attempt is made to capture the positional relationships among the patterns of time series from the graph perspective. To verify the performance of the method, we perform extensive experiments with the UCR time series archive, and the experimental results demonstrate that our proposed method generally improves the classification ability of models based on the bag-of-patterns algorithm.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Algorithm 1
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10

Similar content being viewed by others

Data availability

The datasets used in our study are available in the UCR repository, https://www.cs.ucr.edu/ eamonn/time_series_data_ 2018/.

References

  1. Middlehurst M, Large J, Flynn M, Lines J, Bostrom A, Bagnall A (2021) Hive-cote 2.0: a new meta ensemble for time series classification. Mach Learn 110(11–12):3211–3243

    Article  MathSciNet  MATH  Google Scholar 

  2. Hadiyoso S, Aulia S, Rizal A (2019) One-lead electrocardiogram for biometric authentication using time series analysis and support vector machine. Int J Adv Comput Sci Appl 10(2):276–283

    Google Scholar 

  3. Al-Hadeethi H, Abdulla S, Diykh M, Deo RC, Green JH (2020) Adaptive boost ls-svm classification approach for time-series signal classification in epileptic seizure diagnosis applications. Expert Syst Appl 161:113676

    Article  Google Scholar 

  4. Nihale S, Sharma S, Parashar L, Singh U (2020) Network traffic prediction using long short-term memory. In 2020 International Conference on Electronics and Sustainable Communication Systems (ICESC), pp 338–343. IEEE

  5. Tareen ADK, Asim KM, Kearfott KJ, Rafique M, Nadeem MSA, Iqbal T, Rahman SU (2019) Automated anomalous behaviour detection in soil radon gas prior to earthquakes using computational intelligence techniques. J Environ Radioact 203:48–54

    Article  Google Scholar 

  6. Xu L, Wang J, Cui J, Hu M, Zhang K, Teng W (2018) Dynamic expression recognition based on dynamic time warping and active appearance model. J Electron Inf Technol 40(2):338–345

    Google Scholar 

  7. Shi M, Wang Z (2020) An interpretable gait recognition method based on time series features. Sci Sin Inf 50(3):438–460

    Article  Google Scholar 

  8. Luo X, Liu T, Liu J, Guo X, Wang G (2012) Design and implementation of a distributed fall detection system based on wireless sensor networks. EURASIP J Wirel Commun Netw 2012:1–13

    Article  Google Scholar 

  9. Shougang R, Jingxu Z, Xingjian G (2021) Overview of feature extraction algorithms for time series. J Chin Comput Syst 42(2):271–278

    Google Scholar 

  10. Lin J, Khade R, Li Y (2012) Rotation-invariant similarity in time series using bag-of-patterns representation. J Intell Inf Syst 39(2):287–315

    Article  Google Scholar 

  11. Cheng Z, Yang Y, Wang W, Wenjie Hu, Zhuang Y, Song G (2020) Time2graph: Revisiting time series modeling with dynamic shapelets. Proc AAAI Conf Artif Intell 34:3617–3624

    Google Scholar 

  12. Jeong Y-S, Jeong MK, Omitaomu OA (2011) Weighted dynamic time warping for time series classification. Pattern Recog 44(9):2231–2240

    Article  Google Scholar 

  13. Marteau P-F (2008) Time warp edit distance with stiffness adjustment for time series matching. IEEE Trans Pattern Anal Mach Intell 31(2):306–318

    Article  Google Scholar 

  14. Stefan A, Athitsos V, Das G (2012) The move-split-merge metric for time series. IEEE Trans Knowl Data Eng 25(6):1425–1438

    Article  Google Scholar 

  15. Lines J, Bagnall A (2015) Time series classification with ensembles of elastic distance measures. Data Min Knowl Disc 29:565–592

    Article  MathSciNet  MATH  Google Scholar 

  16. Lucas B, Shifaz A, Pelletier C, O’Neill L, Zaidi N, Goethals B, Petitjean F, Webb GI (2019) Proximity forest: an effective and scalable distance-based classifier for time series. Data Min Knowl Discov 33(3):607–635

    Article  Google Scholar 

  17. Shifaz A, Pelletier C, Petitjean F, Webb GI (2020) TS-CHIEF: a scalable and accurate forest algorithm for time series classification. Data Min Knowl Discov 34(3):742–775

    Article  MathSciNet  MATH  Google Scholar 

  18. Yeh C-CM, Zhu Y, Ulanova L, Begum N, Ding Y, Dau HA, Zimmerman Z, Silva DF, Mueen A, Keogh E (2018) Time series joins, motifs, discords and shapelets: a unifying view that exploits the matrix profile. Data Min Knowl Discov 32:83–123

    Article  MathSciNet  MATH  Google Scholar 

  19. Zhu Y, Gharghabi S, Silva DF, Dau HA, Yeh C-CM, Senobari NS, Almaslukh A, Kamgar K, Zimmerman Z, Funning G et al (2020) The Swiss army knife of time series data mining: ten useful things you can do with the matrix profile and ten lines of code. Data Min Knowl Discov 34:949–979

    Article  Google Scholar 

  20. Ye L, Keogh E (2011) Time series shapelets: a novel technique that allows accurate, interpretable and fast classification. Data Min Knowl Disc 22(1):149–182

    Article  MathSciNet  MATH  Google Scholar 

  21. Rakthanmanon T, Keogh E (2013) Fast-shapelets: a fast algorithm for discovering robust time series shapelets. In Proceedings of 11th SIAM international conference on data mining, pp 668–676

    Google Scholar 

  22. Hills J, Lines J, Baranauskas E, Mapp J, Bagnall A (2014) Classification of time series by shapelet transformation. Data Min Knowl Disc 28(4):851–881

    Article  MathSciNet  MATH  Google Scholar 

  23. Grabocka J, Schilling N, Wistuba M, Schmidt-Thieme L (2014) Learning time-series shapelets. In Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 392–401

  24. Li G, Choi B, Jianliang X, Bhowmick SS, Chun K-P, Wong GL-H (2020) Efficient shapelet discovery for time series classification. IEEE Trans Knowl Data Eng 34(3):1149–1163

    Article  Google Scholar 

  25. Lin J, Keogh E, Wei Li, Lonardi S (2007) Experiencing sax: a novel symbolic representation of time series. Data Min Knowl Discov 15(2):107–144

    Article  MathSciNet  Google Scholar 

  26. Schäfer P, Högqvist M (2012) Sfa: a symbolic fourier approximation and index for similarity search in high dimensional datasets. In Proceedings of the 15th international conference on extending database technology, pp 516–527

  27. Senin P, Malinchik S (2013) Sax-vsm: Interpretable time series classification using sax and vector space model. In 2013 IEEE 13th international conference on data mining, pp 1175–1180. IEEE

  28. Schäfer P (2015) The boss is concerned with time series classification in the presence of noise. Data Min Knowl Disc 29(6):1505–1530

    Article  MathSciNet  MATH  Google Scholar 

  29. Schäfer P, Leser U (2017) Fast and accurate time series classification with weasel. In Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, pp 637–646

  30. Nguyen TL, Ifrim G (2023) Fast time series classification with random symbolic subsequences. In Advanced Analytics and Learning on Temporal Data: 7th ECML PKDD Workshop, AALTD 2022, Grenoble, France, September 19–23, 2022, Revised Selected Papers, pp 50–65. Springer

  31. Middlehurst M, Large J, Cawley G, Bagnall A (2021) The temporal dictionary ensemble (TDE) classifier for time series classification. In Machine Learning and Knowledge Discovery in Databases: European Conference, ECML PKDD 2020, Ghent, Belgium, September 14–18, 2020, Proceedings, Part I, pp 660–676. Springer

  32. Large J, Bagnall A, Malinowski S, Tavenard R (2019) On time series classification with dictionary-based classifiers. Intell Data Anal 23(5):1073–1089

    Article  Google Scholar 

  33. ZhuoYa J (2021) Research on Key Techniques of Discriminative Patterns Discovery and Classification Methods of Time Series. PhD thesis, Beijing Jiaotong University

  34. Wang Z, Yan W, Oates T (2017) Time series classification from scratch with deep neural networks: A strong baseline. In 2017 International joint conference on neural networks (IJCNN), pp 1578–1585. IEEE

  35. Cui Z, Chen W, Chen Y (2016) Multi-scale convolutional neural networks for time series classification. arXiv preprint arXiv:1603.06995

  36. Karim F, Majumdar S, Darabi H, Chen S (2017) Lstm fully convolutional networks for time series classification. IEEE Access 6:1662–1669

    Article  Google Scholar 

  37. Ismail Fawaz H, Lucas B, Forestier G, Pelletier C, Schmidt DF, Weber J, Webb GI, Idoumghar L, Muller P-A, Petitjean F (2020) Inceptiontime: Finding alexnet for time series classification. Data Min Knowl Disc 34(6):1936–1962

    Article  MathSciNet  Google Scholar 

  38. Conde MV, Shubham K, Agnihotri P, Movva ND, Bessenyei S (2021) Weakly-Supervised Classification and Detection of Bird Sounds in the Wild. A BirdCLEF 2021 Solution. In Conference and Labs of the Evaluation Forum

  39. Zhao Y, Cheng J, Zhang P, Peng X (2020) ECG classification using deep CNN improved by wavelet transform. Comput Mater Continua 9:1615–1628

    Article  Google Scholar 

  40. Wang J, Wang Z, Li J, Wu J (2018) Multilevel wavelet decomposition network for interpretable time series analysis. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2437–2446

  41. Schäfer P (2016) Scalable time series classification. Data Min Knowl Disc 30(5):1273–1298

    Article  MathSciNet  MATH  Google Scholar 

  42. Wei Z, ZhiHai W, JiDong Y, ShiLei H (2020) Time series discriminative feature dictionary construction algorithm. J Softw 31(10):3216–3237

    MATH  Google Scholar 

  43. Cai H, Zheng VW, Chang KC-C (2018) A comprehensive survey of graph embedding: Problems, techniques, and applications. IEEE Trans Knowl Data Eng 30(9):1616–1637

    Article  Google Scholar 

  44. Perozzi B, Al-Rfou R, Skiena S (2014) Deepwalk: Online learning of social representations. In Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 701–710

  45. Dau HA, Bagnall A, Kamgar K, Yeh C-CM, Zhu Y, Gharghabi S, Ratanamahatana CA, Keogh E (2019) The ucr time series archive. IEEE/CAA J Autom Sin 6(6):1293–1305

    Article  Google Scholar 

  46. Bagnall A, Lines J, Bostrom A, Large J, Keogh E (2017) The great time series classification bake off: a review and experimental evaluation of recent algorithmic advances. Data Min Knowl Disc 31(3):606–660

    Article  MathSciNet  Google Scholar 

Download references

Funding

This work is partially supported by the National Natural Science Foundation of China Grant Number 61972424, in part by JSPS KAKENHI Grant Numbers JP19K20250,JP20H04174,JP22K11989, Leading Initiative for Excellent Young Researchers (LEADER), MEXT, Japan, and JST, PRESTO Grant Number JPMJPR21P3, Japan. The authors thank all the anonymous reviewers for their valuable comments.

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Huan Huang or Rui Hou.

Ethics declarations

Competing interest

There are no potential conflicts of interset to declare.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Ma, X., Yu, M., Huang, H. et al. A time series classification method combining graph embedding and the bag-of-patterns algorithm. Appl Intell 53, 26297–26312 (2023). https://doi.org/10.1007/s10489-023-04859-z

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10489-023-04859-z

Keywords

Navigation