A Non-parametric Wavelet Feature Extractor for Time Series Classification

Zhang, Hui; Ho, Tu Bao; Lin, Mao Song

doi:10.1007/978-3-540-24775-3_71

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 3056)

Included in the following conference series:

Pacific-Asia Conference on Knowledge Discovery and Data Mining

Abstract

Many representation schemes for time series have been proposed, and most of them require predefined parameters. In classification, accuracy depends considerably on these parameters, yet users usually have difficulty choosing them. The aim of this paper is to develop a time series representation method that selects its parameters automatically for the classification task. To this end, we exploit the multi-scale property of wavelet decomposition, which allows us to extract features automatically and achieve high classification accuracy. The two main contributions of this work are: (1) selecting features of a representation that helps to mitigate the effect of time series shifts, and (2) choosing appropriate features, namely features at an appropriate wavelet decomposition scale, selected according to the concentration of wavelet coefficients within that scale.
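
The abstract does not spell out the selection rule, so the following is only a minimal sketch of the general idea of picking a decomposition scale by coefficient concentration. It assumes a Haar wavelet, a series whose length is a power of two, and Shannon entropy of the normalized squared coefficients as the concentration measure (lower entropy meaning more concentrated). The function names `haar_levels`, `entropy`, and `select_scale` are hypothetical, and the paper's actual criterion may differ.

```python
import math


def haar_levels(series):
    """Yield (scale, approximation, details) for each Haar decomposition scale.

    Assumption for this sketch: len(series) is a power of two.
    `details` holds all detail coefficients computed up to this scale.
    """
    n = len(series)
    if n < 2 or n & (n - 1):
        raise ValueError("series length must be a power of two, at least 2")
    approx = [float(x) for x in series]
    details = []  # one list of detail coefficients per scale, finest first
    scale = 0
    while len(approx) > 1:
        evens, odds = approx[0::2], approx[1::2]
        # Orthonormal Haar step: scaled pairwise differences and sums.
        details.append([(a - b) / math.sqrt(2) for a, b in zip(evens, odds)])
        approx = [(a + b) / math.sqrt(2) for a, b in zip(evens, odds)]
        scale += 1
        yield scale, approx, [d for level in details for d in level]


def entropy(coeffs):
    """Shannon entropy of normalized squared coefficients (assumed measure)."""
    energy = [c * c for c in coeffs]
    total = sum(energy)
    if total == 0.0:
        return 0.0
    return -sum((e / total) * math.log(e / total) for e in energy if e > 0.0)


def select_scale(series):
    """Return (scale, features) for the scale with the lowest entropy.

    The features are the approximation coefficients at the chosen scale;
    the entropy is taken over approximation plus detail coefficients.
    """
    best = None
    for scale, approx, details in haar_levels(series):
        score = entropy(approx + details)
        if best is None or score < best[0]:
            best = (score, scale, approx)
    return best[1], best[2]


# Usage: a step-shaped series of length 8.
scale, features = select_scale([1, 1, 1, 1, 5, 5, 5, 5])
print(scale, features)
```

Scoring the full coefficient vector at each scale, rather than the approximation alone, is a design choice of this sketch: the coarsest approximation has a single coefficient and would otherwise always win trivially.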

Author information

Authors and Affiliations

Japan Advanced Institute of Science and Technology, Tatsunokuchi, Ishikawa, 923-1292, Japan
Hui Zhang & Tu Bao Ho
Southwest University of Science and Technology, Mianyang, Sichuan, 621002, China
Mao Song Lin

Editor information

Editors and Affiliations

School of Engineering and Information Technology, Deakin University, VIC 3125, Australia
Honghua Dai
University of Illinois at Urbana-Champaign, 61801, Urbana, IL, USA
Ramakrishnan Srikant
Faculty of Engineering and Information Technology, Centre for Quantum Computation and Intelligent Systems, and Australian ACS National Committee for Artificial Intelligence, University of Technology, Sydney, Australia
Chengqi Zhang

About this paper

Cite this paper

Zhang, H., Ho, T.B., Lin, M.S. (2004). A Non-parametric Wavelet Feature Extractor for Time Series Classification. In: Dai, H., Srikant, R., Zhang, C. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2004. Lecture Notes in Computer Science, vol 3056. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24775-3_71

DOI: https://doi.org/10.1007/978-3-540-24775-3_71
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22064-0
Online ISBN: 978-3-540-24775-3
eBook Packages: Springer Book Archive
