Skip to main content

Abstract

The popularity of time-series databases in many applications has created an increasing demand for performing data-mining tasks (classification, clustering, outlier detection, etc.) on time-series data. Currently, however, no single system or library exists that specializes on providing efficient implementations of data-mining techniques for time-series data, supports the necessary concepts of representations, similarity measures and preprocessing tasks, and is at the same time freely available. For these reasons we have designed a multipurpose, multifunctional, extendable system FAP – Framework for Analysis and Prediction, which supports the aforementioned concepts and techniques for mining time-series data. This paper describes the architecture of FAP and the current version of its Java implementation which focuses on time-series similarity measures and nearest-neighbor classification. The correctness of the implementation is verified through a battery of experiments which involve diverse time-series data sets from the UCR repository.

This work was supported by project Abstract Methods and Applications in Computer Science (no. 144017A), of the Serbian Ministry of Science and Environmental Protection.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Faloutsos, C., Ranganathan, M., Manolopoulos, Y.: Fast Subsequence Matching in Time-Series Databases. In: SIGMOD Conference, pp. 419–429 (1994)

    Google Scholar 

  2. Han, J., Kamber, M.: Data Mining: Concepts and Techniques. Morgan Kaufmann Publishers, San Francisco (2005)

    Google Scholar 

  3. Keogh, E.J.: A Decade of Progress in Indexing and Mining Large Time Series Databases. In: VLDB, p. 1268 (2006)

    Google Scholar 

  4. Ding, H., Trajcevski, G., Scheuermann, P., Wang, X., Keogh, E.: Querying and Mining of Time Series Data: Experimental Comparison of Representations and Distance Measures. In: VLDB 2008, Auckland, New Zealand, pp. 1542–1552 (2008)

    Google Scholar 

  5. pong Chan, K., Fu, A.W.-C.: Efficient Time Series Matching by Wavelets. In: ICDE, pp. 126–133 (1999)

    Google Scholar 

  6. Keogh, E.J., Chakrabarti, K., Pazzani, M.J., Mehrotra, S.: Dimensionality Reduction for Fast Similarity Search in Large Time Series Databases. Knowl. Inf. Syst. 3(3), 263–286 (2001)

    Article  MATH  Google Scholar 

  7. Keogh, E.J., Chakrabarti, K., Mehrotra, S., Pazzani, M.J.: Locally Adaptive Dimensionality Reduction for Indexing Large Time Series Databases. In: SIGMOD Conference, pp. 151–162 (2001)

    Google Scholar 

  8. Lin, J., Keogh, E.J., Wei, L., Lonardi, S.: Experiencing SAX: a novel symbolic representation of time series. Data Min. Knowl. Discov. 15(2), 107–144 (2007)

    Article  MathSciNet  Google Scholar 

  9. Chen, Q., Chen, L., Lian, X., Liu, Y., Yu, J.X.: Indexable PLA for Efficient Similarity Search. In: VLDB, pp. 435–446 (2007)

    Google Scholar 

  10. Kurbalija, V., Ivanović, M., Budimac, Z.: Case-Based Curve Behaviour Prediction. Software: Practice and Experience 39(1), 81–103 (2009)

    Article  Google Scholar 

  11. Keogh, E.J., Ratanamahatana, C.A.: Exact indexing of dynamic time warping. Knowl. Inf. Syst. 7(3), 358–386 (2005)

    Article  Google Scholar 

  12. Vlachos, M., Gunopulos, D., Kollios, G.: Discovering similar multidimensional trajectories. In: ICDE, pp. 673–684 (2002)

    Google Scholar 

  13. Chen, L., Ng, R.T.: On the marriage of lp-norms and edit distance. In: VLDB, pp. 792–803 (2004)

    Google Scholar 

  14. Chen, L., Özsu, M.T., Oria, V.: Robust and fast similarity search for moving object trajectories. In: SIGMOD Conference, pp. 491–502 (2005)

    Google Scholar 

  15. Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.H.: The WEKA Data Mining Software: An Update. SIGKDD Explorations 11(1), 10–18 (2009)

    Article  Google Scholar 

  16. http://rapid-i.com/ (January 2010)

  17. http://www.sas.com (January 2010)

  18. Baiocchi, G., Distaso, W.: GRETL: Econometric software for the GNU generation. Journal of Applied Econometrics 18(1), 105–110

    Google Scholar 

  19. Sakoe, H., Chiba, S.: Dynamic programming algorithm optimization for spoken word recognition. IEEE Transactions on Acoustics, Speech, and Signal Processing 26(1), 43–49 (1978)

    Article  MATH  Google Scholar 

  20. Xi, X., Keogh, E.J., Shelton, C.R., Wei, L., Ratanamahatana, C.A.: Fast time series classification using numerosity reduction. In: ICML, pp. 1033–1040 (2006)

    Google Scholar 

  21. Keogh, E., Xi, X., Wei, L., Ratanamahatana, C.: The UCR Time Series dataset (2006), http://www.cs.ucr.edu/~eamonn/time_series_data/

  22. Keogh, E., Pazzani, M.: Relevance Feedback Retrieval of Time Series Data. In: The Twenty-Second Annual International ACM-SIGIR Conference on Research and Development in Information Retrieval, pp. 183–190 (1999)

    Google Scholar 

  23. Hochheiser, H., Shneiderman, B.: Interactive Exploration of Time-Series Data. In: Proc. of the 4th Int’l Conference on Discovery Science, Washington D.C., pp. 441–446 (2001)

    Google Scholar 

  24. van Wijk, J.J., van Selow, E.R.: Cluster and Calendar based Visualization of Time Series Data. In: Proceedings of IEEE Symposium on Information Visualization, pp. 4–9 (1999)

    Google Scholar 

  25. Weber, M., Alexa, M., Müller, W.: Visualizing Time-Series on Spirals. In: Proceedings of the IEEE Symposium on Information Visualization, pp. 7–14 (2001)

    Google Scholar 

  26. Lin, J., Keogh, E., Lonardi, S., Lankford, J.P., Nystrom, D.M.: Visually Mining and Monitoring Massive Time Series. In: Proc. of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. Seattle, WA, pp. 460–469 (2004)

    Google Scholar 

  27. Findley, D.F., et al.: New Capabilities and Methods of the X-12-ARIMA Seasonal-Adjustment Program. Journal of Business & Economic Statistics, American Statistical Association 16(2), 127–152 (1998)

    Google Scholar 

  28. Gómez, V., Maravall, A.: Guide for using the program TRAMO and SEATS. In: Working Paper 9805, Research Department, Banco de España (1998)

    Google Scholar 

  29. Ghil, M., Allen, R.M., Dettinger, M.D., Ide, K., Kondrashov, D., Mann, M.E., Robertson, A., Saunders, A., Tian, Y., Varadi, F., Yiou, P.: Advanced spectral methods for climatic time series. Rev. Geophys. 40(1), 3.1–3.41 (2002)

    Article  Google Scholar 

  30. Lütkepohl, H., Krätzig, M.: Applied Time Series Econometrics. Cambridge University Press, Cambridge (2004)

    Book  Google Scholar 

  31. Morse, M.D., Patel, J.M.: An efficient and accurate method for evaluating time series similarity. In: SIGMOD Conference, pp. 569–580 (2007)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Kurbalija, V., Radovanović, M., Geler, Z., Ivanović, M. (2010). A Framework for Time-Series Analysis. In: Dicheva, D., Dochev, D. (eds) Artificial Intelligence: Methodology, Systems, and Applications. AIMSA 2010. Lecture Notes in Computer Science(), vol 6304. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15431-7_5

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-15431-7_5

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-15430-0

  • Online ISBN: 978-3-642-15431-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics