Abstract
The uniform sampling strategy is widely used in time series segmentation, but unable to handle time warping problem or preserve the latent patterns in time series. To solve these shortcomings, a brand new data-driven segmentation method is proposed, which could segment time series into subsequences with different lengths adaptively. Then a time series classification method under the bag-of-word framework is proposed. Two kinds of mutually complementary features, i.e., interval feature and normal cloud model feature, are extracted from subsequences. And then time series are encoded into Fisher Vectors. Finally, a linear support vector machine is used as the classifier. Experiments on 43 UCR datasets show that the newly proposed method has promising classification accuracies comparing with state of the art baselines. Moreover, due to the data-driven segmentation and timesaving local feature extraction, the method has low time complexity, which is also demonstrated in the experiments.
Similar content being viewed by others
References
Berndt DJ, Clifford J (1994) Using dynamic time warping to find patterns in time series. In: Working notes of the knowledge discovery in databases workshop
Shen J et al (2016) A novel similarity measure model for multivariate time series based on LMNN and DTW. Neural Process Lett 2016:1–13
Weinberger KQ, Saul LK (2009) Distance metric learning for large margin nearest neighbor classification. J Mach Learn Res 2009(10):207–244
Vlachos M, Kollios G, Gunopulos D (2002) Discovering similar multidimensional trajectories. In: International conference on data engineering
Batista GE et al (2014) CID: an efficient complexity-invariant distance for time series. Data Min Knowl Discov 2014(28):634–669
Lines J, Bagnall A (2015) Time series classification with ensembles of elastic distance measures. Data Min Knowl Discov 2015(29):565–592
Wang X et al (2013) Experimental comparison of representation methods and distance measures for time series data. Data Min Knowl Discov 2013(26):275–309
Gudmundsson S, Runarsson T.P, Sigurdsson S (2008) Support vector machines and dynamic time warping for time series. In: IEEE international joint conference on neural networks
Kate RJ (2016) Using dynamic time warping distances as features for improved time series classification. Data Min Knowl Discov 2016(30):283–312
Ye L, Keogh E (2011) Time series shapelets: a novel technique that allows accurate, interpretable and fast classification. Data Min Knowl Discov 2011(22):149–182
Hills J et al (2014) Classification of time series by shapelet transformation. Data Min Knowl Discov 2014(28):851–881
He X, Shao C, Xiong Y (2016) A non-parametric symbolic approximate representation for long time series. Pattern Anal Appl 2016(19):111–127
Lei Y, He Z, Zi Y (2010) Application of an intelligent classification method to mechanical fault diagnosis. Radiat Effects Defects Solids 2010(36):9941–9948
Lin J, Khade R, Li Y (2012) Rotation-invariant similarity in time series using bag-of-patterns representation. J Intell Inf Syst 2012(39):287–315
Lin J et al (2007) Experiencing SAX: a novel symbolic representation of time series. Data Min Knowl Discov 2007(15):107–144
Baydogan MG, Runger G, Tuv E (2013) A bag-of-features framework to classify time series. IEEE Trans Pattern Anal Mach Intell 2013(35):2796–2802
Grabocka J, Schmidt-Thieme L (2014) Invariant time-series factorization. Data Min Knowl Discov 2014(28):1455–1479
Zhao J, Itti L (2016) Classifying time series using local descriptors with hybrid sampling. IEEE Trans Knowl Data Eng 2016(28):623–637
Li D (1997) Knowledge representation in KDD based on linguistic atoms. J Comput Sci Technol 1997(12):481–496
Keogh E.J et al (2001) An online algorithm for segmenting time series. In: IEEE international conference on data mining
Li H, Guo C (2011) Piecewise cloud approximation for time series mining. Knowl Based Syst 2011(24):492–500
Deng W, Wang G, Xu J (2016) Piecewise two-dimensional normal cloud representation for time-series data mining. Inf Sci 2016(374):32–50
Liu BX, Hai-Lin LI, Yang LB (2009) Cloud decision analysis method. Control Decis 2009(24):957–960
Jaakkola TS, Haussler D (1998) Exploiting generative models in discriminative classifiers. Adv Neural Inf Process Syst 1998(11):487–493
Perronnin F, Nchez J, Mensink T (2010) Improving the fisher kernel for large-scale image classification. In: European conference on computer vision
Perronnin F, Dance C (2007) Fisher Kernels on visual vocabularies for image categorization. In: IEEE conference on computer vision and pattern recognition
Chen Y, Keogh E, Hu B, Begum N, Bagnall A, Mueen A, Batista G (2015) The UCR time series classification archive. www.cs.ucr.edu/~eamonn/time_series_data/
Rakthanmanon T, Keogh E (2013) Fast shapelets: a scalable algorithm for discovering time series shapelets. In: Proceedings of the 13th SIAM international conference on data mining
Acknowledgements
This study was supported by the National Natural Science Foundation of China (U1664264, U1509203).
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Huang, W., Yue, B., Chi, Q. et al. Integrating Data-Driven Segmentation, Local Feature Extraction and Fisher Kernel Encoding to Improve Time Series Classification. Neural Process Lett 49, 43–66 (2019). https://doi.org/10.1007/s11063-018-9798-4
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11063-018-9798-4