Skip to main content
Log in

A two-phase filtering of discriminative shapelets learning for time series classification

  • Published:
Applied Intelligence Aims and scope Submit manuscript

Abstract

Compared to the full-length methods for time series classification, shapelet-based methods acquire better interpretation, higher efficiency and precision since shapelets are discriminative features that well represent a time series. However, because of the large number of shapelets candidates, determining how to filter out shapelets with higher discriminability remains a challenge. In this paper, we propose a two-phase shapelets learning filtering framework for time series classification. Time series is first split into groups using the extreme key points, and local linear discriminant analysis with sparse group lasso regularizer is proposed to find projection vector. Then, a two-phase filtering framework is established to measure the sparsity of groups in order to quickly find the key group, where l2-norm is introduced in phase-1 and group sparsity degree is defined in phase-2 to filter sparse groups. Following that, only a few groups are used to extract shapelets and classify them, reducing the number of shapelets significantly. Finally, the group with the highest classification accuracy, i.e., the key group, is determined accurately. Extensive experiments on 28 time series datasets show that, when compared to other state-of-the-art shapelet-based classification methods, our proposed method achieves significant improvement and a competitive time cost.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Algorithm 1
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

References

  1. Ahmed T, Singh D (2020) Probability density functions based classification of modis ndvi time series data and monitoring of vegetation growth cycle. Adv Space Res 66(4):873– 886

    Article  Google Scholar 

  2. Al-Hadeethi H, Abdulla S, Diykh M, Deo R C, Green JH (2020) Adaptive boost ls-svm classification approach for time-series signal classification in epileptic seizure diagnosis applications. Expert Syst Appl 161:113676

    Article  Google Scholar 

  3. Belkin M, Niyogi P (2003) Laplacian eigenmaps for dimensionality reduction and data representation. Neural Comput 15(6):1373–1396

    Article  MATH  Google Scholar 

  4. Chang KW, Deka B, Hwu WMW, Roth D (2012) Efficient pattern-based time series classification on gpu. In: 2012 IEEE 12th International conference on data mining. IEEE, pp 131– 140

  5. Chen J, Dai G, Zhang N (2020) An application of sparse-group lasso regularization to equity portfolio optimization and sector selection. Ann Oper Res 284(1):243–262

    Article  MathSciNet  MATH  Google Scholar 

  6. Dau HA, Bagnall A, Kamgar K, Yeh CCM, Zhu Y, Gharghabi S, Ratanamahatana CA, Keogh E (2019) The ucr time series archive. IEEE/CAA J Automatica Sinica 6(6):1293–1305

    Article  Google Scholar 

  7. Ding H, Trajcevski G, Scheuermann P, Wang X, Keogh E (2008) Querying and mining of time series data: experimental comparison of representations and distance measures. Proc VLDB Endowment 1 (2):1542–1552

    Article  Google Scholar 

  8. Fisher RA (1936) The use of multiple measurements in taxonomic problems. Annals Eugenics 7 (2):179–188

    Article  Google Scholar 

  9. Friedman JH (1989) Regularized discriminant analysis. J Amer Statal Association 84(405):165–175

    Article  MathSciNet  Google Scholar 

  10. Gordon D, Hendler D, Rokach L (2012) Fast randomized model generation for shapelet-based time series classification. arXiv:12095038

  11. Gordon D, Hendler D, Rokach L (2015) Fast and space-efficient shapelets-based time-series classification. Intell Data Anal 19(5):953–981

    Article  Google Scholar 

  12. Grabocka J, Schilling N, Wistuba M, Schmidt-Thieme L (2014) Learning time-series shapelets. In: Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 392–401

  13. Grabocka J, Wistuba M, Schmidt-Thieme L (2015) Scalable discovery of time-series shapelets. arXiv:150303238

  14. He X, Niyogi P (2004) Locality preserving projections. In: Advances in neural information processing systems, pp 153– 160

  15. Hills J, Lines J, Baranauskas E, Mapp J, Bagnall A (2014) Classification of time series by shapelet transformation. Data Min Knowl Disc 28(4):851–881

    Article  MathSciNet  MATH  Google Scholar 

  16. Hou L, Kwok JT, Zurada JM (2016) Efficient learning of timeseries shapelets. In: Proceedings of the thirtieth AAAI conference on artificial intelligence, pp 1209–1215

  17. Huang C, Wu X, Zhang X, Lin S, Chawla NV (2019) Deep prototypical networks for imbalanced time series classification under data scarcity. In: Proceedings of the 28th ACM international conference on information and knowledge management, pp 2141–2144

  18. Karampatziakis N, Mineiro P (2014) Discriminative features via generalized eigenvectors. In: International conference on machine learning, pp 494–502

  19. Kourou K, Rigas G, Papaloukas C, Mitsis M, Fotiadis DI (2020) Cancer classification from time series microarray data through regulatory dynamic bayesian networks. Comput Biol Med 103577:116

    Google Scholar 

  20. Li L, Chang D, Han L, Zhang X, Zaia J, Wan XF (2020) Multi-task learning sparse group lasso: a method for quantifying antigenicity of influenza a (h1n1) virus using mutations and variations in glycosylation of hemagglutinin. BMC Bioinformatics 21:1–22

    Article  Google Scholar 

  21. Liang Z, Wang H (2021) Efficient class-specific shapelets learning for interpretable time series classification. Inf Sci 570:428– 450

    Article  MathSciNet  Google Scholar 

  22. Lin J, Keogh E, Wei L, Lonardi S (2007) Experiencing sax: a novel symbolic representation of time series. Data Mining Knowl Discover 15(2):107–144

    Article  MathSciNet  Google Scholar 

  23. Lines J, Davis LM, Hills J, Bagnall A (2012) A shapelet transform for time series classification. In: Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 289–297

  24. Mueen A, Keogh E, Young N (2011) Logical-shapelets: an expressive primitive for time series classification. In: Proceedings of the 17th ACM SIGKDD international conference on knowledge discovery and data mining, pp 1154–1162

  25. Park H, Jung JY (2020) Sax-arm: deviant event pattern discovery from multivariate time series using symbolic aggregate approximation and association rule mining. Expert Syst Appl 141:112950

    Article  Google Scholar 

  26. Puk KM, Wang S, Rosenberger J, Gandy KC, Harris HN, Peng YB, Nordberg A, Lehmann P, Tommerdahl J, Chiao JC (2019) Emotion recognition and analysis using admm-based sparse group lasso. IEEE Trans Affect Comput 13(1):199–210

    Article  Google Scholar 

  27. Qian B, Xiao Y, Zheng Z, Zhou M, Zhuang W, Li S, Ma Q (2020) Dynamic multi-scale convolutional neural network for time series classification. IEEE Access 8:109732–109746

    Article  Google Scholar 

  28. Rahulamathavan Y, Phan RCW, Chambers JA, Parish DJ (2012) Facial expression recognition in the encrypted domain based on local fisher discriminant analysis. IEEE Trans Affect Comput 4(1):83–92

    Article  Google Scholar 

  29. Rakthanmanon T, Keogh E (2013) Fast shapelets: a scalable algorithm for discovering time series shapelets. In: Proceedings of the 2013 SIAM international conference on data mining, SIAM, pp 668-676

  30. Sart D, Mueen A, Najjar W, Keogh E, Niennattrakul V (2010) Accelerating dynamic time warping subsequence search with gpus and fpgas. In: 2010 IEEE international conference on data mining. IEEE, pp 1001–1006

  31. Shi Z, Hu J (2017) A kernel approach to implementation of local linear discriminant analysis for face recognition. IEEJ Trans Electr Electron Eng 12(1):62–70

    Article  Google Scholar 

  32. Simon N, Friedman J, Hastie T, Tibshirani R (2013) A sparse-group lasso. J Comput Graph Stat 22(2):231–245

    Article  MathSciNet  Google Scholar 

  33. Sugiyama M (2006) Local fisher discriminant analysis for supervised dimensionality reduction. In: Proceedings of the 23rd international conference on machine learning, pp 905– 912

  34. Tibshirani R (1996) Regression shrinkage and selection via the lasso. J Royal Stat Society Series B (Methodological) 58(1):267–288

    MathSciNet  MATH  Google Scholar 

  35. Tibshirani R, Saunders M, Rosset S, Zhu J, Knight K (2005) Sparsity and smoothness via the fused lasso. J Royal Stat Society Series B (Stat Methodology) 67(1):91–108

    Article  MathSciNet  MATH  Google Scholar 

  36. Ujan S, Navidi N, Landry Jr R et al (2020) Hierarchical classification method for radio frequency interference recognition and characterization in satcom. Appl Sci 10(13):4608

    Article  Google Scholar 

  37. Wang H, Wu J, Zhang P, Chen Y (2018) Learning shapelet patterns from network-based time series. IEEE Trans Industr Inform 15(7):3864–3876

    Article  Google Scholar 

  38. Wang Y, Emonet R, Fromont E, Malinowski S, Menager E, Mosser L, Tavenard R (2019) Learning interpretable shapelets for time series classification through adversarial regularization. arXiv:190600917

  39. Wang Z, Ruan Q, An G (2015) Face recognition using double sparse local fisher discriminant analysis. Math Probl Eng:2015

  40. Wistuba M, Grabocka J, Schmidt-Thieme L (2015) Ultra-fast shapelets for time series classification. arXiv:150305018

  41. Yahyaoui H, Al-Daihani R (2019) A novel trend based sax reduction technique for time series. Expert Syst Appl 130:113–123

    Article  Google Scholar 

  42. Ye L, Keogh E (2009) Time series shapelets: a new primitive for data mining. In: Proceedings of the 15th ACM SIGKDD international conference on knowledge discovery and data mining, pp 947–956

  43. Yeung JFA, Zk W, Chan KY, Lau HY, Yiu KFC (2020) Jump detection in financial time series using machine learning algorithms. Soft Comput 24(3):1789–1801

    Article  Google Scholar 

  44. Yu B, Li S, Qiu W, Wang M, Du J, Zhang Y, Chen X (2018) Prediction of subcellular location of apoptosis proteins by incorporating psepssm and dcca coefficient based on lfda dimensionality reduction. BMC genomics 19(1):1–17

    Article  Google Scholar 

  45. Yuan M, Lin Y (2006) Model selection and estimation in regression with grouped variables. J Royal Stat Society Series B (Stat Methodol) 68(1):49–67

    Article  MathSciNet  MATH  Google Scholar 

  46. Zaatour R, Bouzidi S, Zagrouba E (2019) Class-adapted local fisher discriminant analysis to reduce highly-dimensioned data on commodity hardware: application to hyperspectral images. Multimed Tools Appl 78(12):17113–17134

    Article  Google Scholar 

  47. Zakaria J, Mueen A, Keogh E, Young N (2016) Accelerating the discovery of unsupervised-shapelets. Data Mining Knowl Discover 30(1):243–281

    Article  MathSciNet  MATH  Google Scholar 

  48. Zhang Z, Zhang H, Wen Y, Zhang Y, Yuan X (2018) Discriminative extraction of features from time series. Neurocomputing 275:2317–2328

    Article  Google Scholar 

  49. Zhuo L, Cheng B, Zhang J (2014) A comparative study of dimensionality reduction methods for large-scale image retrieval. Neurocomputing 141:202–210

    Article  Google Scholar 

Download references

Acknowledgements

We thank the reviewers for their valuable comments. This work is supported by the Fundamental Research Funds for the Central Universities, China under Grant 2020ZY216 and 2021III030JC.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yuan Wan.

Ethics declarations

Conflict of Interests

The authors declare that they have no conflict of interest.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Li, C., Wan, Y., Zhang, W. et al. A two-phase filtering of discriminative shapelets learning for time series classification. Appl Intell 53, 13815–13833 (2023). https://doi.org/10.1007/s10489-022-04043-9

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10489-022-04043-9

Keywords

Navigation