Abstract
Many representation schemes for time series have been proposed and most of them require predefined parameters. In case of classification, the accuracy is considerably influenced by these predefined parameters. Also, the users usually have difficulty in determining the parameters. The aim of this paper is to develop a representation method for time series that can automatically select the parameters for the classification task. To this end, we exploit the multi-scale property of wavelet decomposition that allows us to automatically extract features and achieve high classification accuracy. Two main contributions of this work are: (1) selecting features of a representation that helps to prevent time series shifts, and (2) choosing appropriate features, namely, features in an appropriate wavelet decomposition scale according to the concentration of wavelet coefficients within this scale.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Agrawal, R., Faloutsos, C., Swami, A.: Efficient similarity search in sequence databases. In: Proceedings of the 4th Conference on Foundations of Data Organization and Algorithms, October 1993, pp. 69–84 (1993)
Alcock, R.J., Manolopoulos, Y.: Time-series similarity queries employing a feature-based approach. In: Proceedings of the 7th Hellenic Conference on Informatics, August 1999, pp. 1–9 (1999)
Burrus, C.S., Gopinath, R.A., Guo, H.: Introduction to Wavelets and Wavelet Transforms, A Primer. Prentice Hall, Englewood Cliffs (1997)
Chan, F.K., Fu, A.W.: Harr wavelets for efficient similarity search of time-series: with and without time warping. IEEE Trans. on Knowledge and Data Engineering 15(3), 686–705 (2003)
Chan, K.P., Fu, A.W.: Efficient time series matching by wavelets. In: Proceedings of the 15th Internation Conference on Data Engineering, March 1999, pp. 126–133 (1999)
Coifman, R.R., Wickerhauser, M.V.: Entropy-based algorithms for best basis selection. IEEE Trans. on Information Theory 38(2), 713–718 (1992)
Donoho, D.L.: De-noising by soft-thresholding. IEEE Trans. on Information Theory 41(3), 613–627 (1995)
Donoho, D.L., Johnson, I.M.: Ideal spatial adaptation via wavelet shrinkage. Biometrika 81, 425–455 (1994)
Geurts, P.: Pattern extraction for time series classification. In: Proceedings of the Principles of Data Mining and Knowledge Discovery, 5th European Conference, September 2001, pp. 115–127 (2001)
Hettich, S., Bay, S.D.: The uci kdd archive (1999), http://kdd.ics.uci.edu
Ho, T.B., Nguyen, T.D., Kawasaki, S., Le, S.Q., Nguyen, D.D., Yokoi, H., Takabayashi, K.: Mining hepatitis data with temporal abstraction. In: Proceedings of the 9th ACM International Conference on Knowledge Discovery and Data Mining, August 2003, pp. 369–377 (2003)
Kadous, M.W.: Learning comprehensible descriptions of multivariate time series. In: Proceedings of the 6th International Conference on Machine Learning, September 1999, pp. 454–463 (1999)
Keogh, E., Chakrabarti, K., Pazzani, M., Mehrotra, S.: Dimensionality reduction of fast similarity search in large time series databases. Journal of Knowledge and Information System 3, 263–286 (2000)
Keogh, E., Kasetty, S.: On the need for time series data mining benchmarks: A survey and empirical demonstration. Data Mining and Knowledge Discovery 7(4), 349–371 (2003)
Keogh, E., Pazzani, M.: An enhanced representation of time series which allows fast and accurate classification, clustering and relevance feedback. In: Proceedings of the 4th International Conference of Knowledge Discovery and Data Mining, August 1998, pp. 239–241 (1998)
Lin, J., Keogh, E., Lonardi, S., Chiu, B.: A symbolic representation of time series, with implications for streaming algorithms. In: Proceedings of the 8th ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery, June 2003, pp. 2–11 (2003)
Yi, B.K., Faloustos, C.: Fast time sequence indexing for arbitrary lp norms. In: Proceedings of the 26th International Conference on Very Large Databases, September 2000, pp. 385–394 (2000)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zhang, H., Ho, T.B., Lin, M.S. (2004). A Non-parametric Wavelet Feature Extractor for Time Series Classification. In: Dai, H., Srikant, R., Zhang, C. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2004. Lecture Notes in Computer Science(), vol 3056. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24775-3_71
Download citation
DOI: https://doi.org/10.1007/978-3-540-24775-3_71
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22064-0
Online ISBN: 978-3-540-24775-3
eBook Packages: Springer Book Archive