Skip to main content
Log in

Learning-based shapelets discovery by feature selection for time series classification

  • Published:
Applied Intelligence Aims and scope Submit manuscript

Abstract

Shapelet-based methods have attracted widespread attention over the past decade in the time series classification for their benefits of high classification accuracy and good interpretability. The primary challenge of shapelet-based methods is to find discriminative shapelets that best distinguish different classes. Although the existing shapelet-based methods have achieved encouraging results, the number of obtained shapelets is still so large that the shapelets are not discriminative enough for the classification, and some of these shapelets are of less interpretability to reveal the most important patterns of different classes. In this paper, we propose a novel learning-based shapelets discovery method by feature selection (LSDF) for time series classification, of which the significance is transforming the shapelets discovery task into an optimization problem and efficiently learning interpretable shapelets. The value at each time point is regarded as a feature and Marginal Fisher Analysis (MFA) is combined with fused lasso to obtain the discriminative features (shapelets). The experimental results on real-world datasets from University of California, Riverside (UCR) repository show that LSDF outperforms the state-of-the-art shapelet-based and non-shapelet-based methods in classification accuracy and running time. Finally, the interpretability of shapelets learned by our methods is shown from different perspectives.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

References

  1. Abanda A, Mori U, Lozano J A (2019) A review on distance based time series classification. Data Min Knowl Disc 33(2):378–412

    Article  MathSciNet  Google Scholar 

  2. Bagnall A, Lines J, Hills J, Bostrom A (2015) Time-series classification with cote: the collective of transformation-based ensembles. IEEE Trans Knowl Data Eng 27(9):2522–2535

    Article  Google Scholar 

  3. Batista G E, Wang X, Keogh E J (2011) A complexity-invariant distance measure for time series. In: Proceedings of the 2011 SIAM international conference on data mining. SIAM, pp 699–710

  4. Baydogan M G, Runger G (2016) Time series representation and similarity based on local autopatterns. Data Min Knowl Disc 30(2):476–509

    Article  MathSciNet  Google Scholar 

  5. Baydogan M G, Runger G, Tuv E (2013) A bag-of-features framework to classify time series. IEEE Trans Pattern Anal Mach Intell 35(11):2796–2802

    Article  Google Scholar 

  6. Boyd S, Parikh N, Chu E (2011) Distributed optimization and statistical learning via the alternating direction method of multipliers. Now Publishers Inc.

  7. Dau H A, Bagnall A, Kamgar K, Yeh C C M, Zhu Y, Gharghabi S, Ratanamahatana C A, Keogh E (2019) The ucr time series archive. IEEE/CAA J Autom Sin 6(6):1293–1305

    Article  Google Scholar 

  8. Deng H, Runger G, Tuv E, Vladimir M (2013) A time series forest for classification and feature extraction. Inf Sci 239:142–153

    Article  MathSciNet  Google Scholar 

  9. Destrero A, De Mol C, Odone F, Verri A (2008) A sparsity-enforcing method for learning face features. IEEE Trans Image Process 18(1):188–201

    Article  MathSciNet  Google Scholar 

  10. Destrero A, De Mol C, Odone F, Verri A (2009) A regularized framework for feature selection in face detection and authentication. Int J Comput Vis 83(2):164–177

    Article  Google Scholar 

  11. Franceschi JY, Dieuleveut A, Jaggi M (2019) Unsupervised scalable representation learning for multivariate time series. arXiv:190110738

  12. Górecki T, ŁUczak M (2013) Using derivatives in time series classification. Data Min Knowl Discov 2:310–331

    Article  MathSciNet  Google Scholar 

  13. Górecki T, ŁUczak M (2014) Non-isometric transforms in time series classification using dtw. Knowl-Based Syst 61:98–108

    Article  Google Scholar 

  14. Grabocka J, Schilling N, Wistuba M, Schmidt-Thieme L (2014) Learning time-series shapelets. In: Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 392–401

  15. Helbing D, Brockmann D, et al CT (2015) Saving human lives: What complexity science and information systems can contribute. J Stat Phys 158:735–781. https://doi.org/10.1007/s10955-014-1024-9

  16. Hills J, Lines J, Baranauskas E, Mapp J, Bagnall A (2014) Classification of time series by shapelet transformation. Data Min Knowl Discov 28(4):851–881

    Article  MathSciNet  Google Scholar 

  17. Hou L, Kwok J, Zurada J (2016) Efficient learning of timeseries shapelets. Proc AAAI Conf Artif Intell 30(1). https://ojs.aaai.org/index.php/AAAI/article/view/10178

  18. Hussain AJ, Al-Jumeily D, Al-Askar H, Radi N (2016) Regularized dynamic self-organized neural network inspired by the immune algorithm for financial time series prediction. Neurocomputing 188:23–30. https://doi.org/10.1016/j.neucom.2015.01.109

  19. Li G, Choi B K K, Xu J, Bhowmick S S, Chun KP, Wong GL (2020) Efficient shapelet discovery for time series classification. IEEE Transactions on Knowledge and Data Engineering

  20. Liang Z, Wang H (2021) Efficient class-specific shapelets learning for interpretable time series classification. Inf Sci 570:428–450

    Article  MathSciNet  Google Scholar 

  21. Lines J, Bagnall A (2015) Time series classification with ensembles of elastic distance measures. Data Mining and Knowledge Discovery

  22. Ma Q, Zhuang W, Cottrell G (2019) Triple-shapelet networks for time series classification. In: 2019 IEEE International Conference on Data Mining (ICDM). IEEE, pp 1246–1251

  23. Ma Q, Zhuang W, Li S, Huang D, Cottrell G (2020) Adversarial dynamic shapelet networks. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol 34, pp 5069–5076

  24. Martinez A M, Kak A C (2001) Pca versus lda. IEEE Trans Anal Mach Intell 23(2):228–233

    Article  Google Scholar 

  25. Mueen A, Keogh E, Young N (2011) Logical-shapelets: an expressive primitive for time series classification. In: Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 1154–1162

  26. Rakthanmanon T, Keogh E (2013) Fast shapelets: a scalable algorithm for discovering time series shapelets. In: proceedings of the 2013. SIAM International Conference on Data Mining. SIAM, pp 668–676

  27. Rodríguez-Bermúdez G, García-Laencina PJ, Roca-González J, Roca-Dorda J (2013) Efficient feature selection and linear discrimination of eeg signals. Neurocomputing 115:161–165

  28. Schäfer P (2015) The boss is concerned with time series classification in the presence of noise. Data Min Knowl Disc 29(6):1505–1530

    Article  MathSciNet  Google Scholar 

  29. Shah M, Grabocka J, Schilling N, Wistuba M, Schmidt-Thieme L (2016) Learning dtw-shapelets for time-series classification. In: Proceedings of the 3rd IKDD Conference on Data Science, vol 2016, pp 1–8

  30. Tibshirani R (1996) Regression shrinkage and selection via the lasso. J R Stat Soc Ser B (Methodol) 58(1):267–288

    MathSciNet  MATH  Google Scholar 

  31. Tibshirani R, Saunders M, Rosset S, Zhu J, Knight K (2005) Sparsity and smoothness via the fused lasso. J R Stat Soc Ser B (Stat Methodol) 67(1):91–108

    Article  MathSciNet  Google Scholar 

  32. Wang Z, Yan W, Oates T (2017) Time Series classification from scratch with deep neural networks: A strong baseline. In: 2017 International joint conference on neural networks (IJCNN). IEEE, pp 1578–1585

  33. Yan S, Xu D, Zhang B, Zhang H J, Yang Q, Lin S (2006) Graph embedding and extensions: a general framework for dimensionality reduction. IEEE Trans Pattern Anal Mach Intell 29(1):40–51

    Article  Google Scholar 

  34. Ye L, Keogh E (2009) Time series shapelets: a new primitive for data mining. In: Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 947–956

  35. Zhang Z, Zhang H, Wen Y, Zhang Y, Yuan X (2018) Discriminative extraction of features from time series. Neurocomputing 275:2317–2328

    Article  Google Scholar 

Download references

Acknowledgements

This work is supported by the Fundamental Research Funds for the Central Universities, China under Grant 2021III030JC.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yuan Wan.

Ethics declarations

Conflict of Interests

The authors declare that they have no conflict of interest.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Chen, J., Wan, Y., Wang, X. et al. Learning-based shapelets discovery by feature selection for time series classification. Appl Intell 52, 9460–9475 (2022). https://doi.org/10.1007/s10489-021-03009-7

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10489-021-03009-7

Keywords

Navigation