Abstract
The popularity of time-series databases in many applications has created an increasing demand for performing data-mining tasks (classification, clustering, outlier detection, etc.) on time-series data. Currently, however, no single system or library exists that specializes on providing efficient implementations of data-mining techniques for time-series data, supports the necessary concepts of representations, similarity measures and preprocessing tasks, and is at the same time freely available. For these reasons we have designed a multipurpose, multifunctional, extendable system FAP – Framework for Analysis and Prediction, which supports the aforementioned concepts and techniques for mining time-series data. This paper describes the architecture of FAP and the current version of its Java implementation which focuses on time-series similarity measures and nearest-neighbor classification. The correctness of the implementation is verified through a battery of experiments which involve diverse time-series data sets from the UCR repository.
This work was supported by project Abstract Methods and Applications in Computer Science (no. 144017A), of the Serbian Ministry of Science and Environmental Protection.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Faloutsos, C., Ranganathan, M., Manolopoulos, Y.: Fast Subsequence Matching in Time-Series Databases. In: SIGMOD Conference, pp. 419–429 (1994)
Han, J., Kamber, M.: Data Mining: Concepts and Techniques. Morgan Kaufmann Publishers, San Francisco (2005)
Keogh, E.J.: A Decade of Progress in Indexing and Mining Large Time Series Databases. In: VLDB, p. 1268 (2006)
Ding, H., Trajcevski, G., Scheuermann, P., Wang, X., Keogh, E.: Querying and Mining of Time Series Data: Experimental Comparison of Representations and Distance Measures. In: VLDB 2008, Auckland, New Zealand, pp. 1542–1552 (2008)
pong Chan, K., Fu, A.W.-C.: Efficient Time Series Matching by Wavelets. In: ICDE, pp. 126–133 (1999)
Keogh, E.J., Chakrabarti, K., Pazzani, M.J., Mehrotra, S.: Dimensionality Reduction for Fast Similarity Search in Large Time Series Databases. Knowl. Inf. Syst. 3(3), 263–286 (2001)
Keogh, E.J., Chakrabarti, K., Mehrotra, S., Pazzani, M.J.: Locally Adaptive Dimensionality Reduction for Indexing Large Time Series Databases. In: SIGMOD Conference, pp. 151–162 (2001)
Lin, J., Keogh, E.J., Wei, L., Lonardi, S.: Experiencing SAX: a novel symbolic representation of time series. Data Min. Knowl. Discov. 15(2), 107–144 (2007)
Chen, Q., Chen, L., Lian, X., Liu, Y., Yu, J.X.: Indexable PLA for Efficient Similarity Search. In: VLDB, pp. 435–446 (2007)
Kurbalija, V., Ivanović, M., Budimac, Z.: Case-Based Curve Behaviour Prediction. Software: Practice and Experience 39(1), 81–103 (2009)
Keogh, E.J., Ratanamahatana, C.A.: Exact indexing of dynamic time warping. Knowl. Inf. Syst. 7(3), 358–386 (2005)
Vlachos, M., Gunopulos, D., Kollios, G.: Discovering similar multidimensional trajectories. In: ICDE, pp. 673–684 (2002)
Chen, L., Ng, R.T.: On the marriage of lp-norms and edit distance. In: VLDB, pp. 792–803 (2004)
Chen, L., Özsu, M.T., Oria, V.: Robust and fast similarity search for moving object trajectories. In: SIGMOD Conference, pp. 491–502 (2005)
Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.H.: The WEKA Data Mining Software: An Update. SIGKDD Explorations 11(1), 10–18 (2009)
http://rapid-i.com/ (January 2010)
http://www.sas.com (January 2010)
Baiocchi, G., Distaso, W.: GRETL: Econometric software for the GNU generation. Journal of Applied Econometrics 18(1), 105–110
Sakoe, H., Chiba, S.: Dynamic programming algorithm optimization for spoken word recognition. IEEE Transactions on Acoustics, Speech, and Signal Processing 26(1), 43–49 (1978)
Xi, X., Keogh, E.J., Shelton, C.R., Wei, L., Ratanamahatana, C.A.: Fast time series classification using numerosity reduction. In: ICML, pp. 1033–1040 (2006)
Keogh, E., Xi, X., Wei, L., Ratanamahatana, C.: The UCR Time Series dataset (2006), http://www.cs.ucr.edu/~eamonn/time_series_data/
Keogh, E., Pazzani, M.: Relevance Feedback Retrieval of Time Series Data. In: The Twenty-Second Annual International ACM-SIGIR Conference on Research and Development in Information Retrieval, pp. 183–190 (1999)
Hochheiser, H., Shneiderman, B.: Interactive Exploration of Time-Series Data. In: Proc. of the 4th Int’l Conference on Discovery Science, Washington D.C., pp. 441–446 (2001)
van Wijk, J.J., van Selow, E.R.: Cluster and Calendar based Visualization of Time Series Data. In: Proceedings of IEEE Symposium on Information Visualization, pp. 4–9 (1999)
Weber, M., Alexa, M., Müller, W.: Visualizing Time-Series on Spirals. In: Proceedings of the IEEE Symposium on Information Visualization, pp. 7–14 (2001)
Lin, J., Keogh, E., Lonardi, S., Lankford, J.P., Nystrom, D.M.: Visually Mining and Monitoring Massive Time Series. In: Proc. of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. Seattle, WA, pp. 460–469 (2004)
Findley, D.F., et al.: New Capabilities and Methods of the X-12-ARIMA Seasonal-Adjustment Program. Journal of Business & Economic Statistics, American Statistical Association 16(2), 127–152 (1998)
Gómez, V., Maravall, A.: Guide for using the program TRAMO and SEATS. In: Working Paper 9805, Research Department, Banco de España (1998)
Ghil, M., Allen, R.M., Dettinger, M.D., Ide, K., Kondrashov, D., Mann, M.E., Robertson, A., Saunders, A., Tian, Y., Varadi, F., Yiou, P.: Advanced spectral methods for climatic time series. Rev. Geophys. 40(1), 3.1–3.41 (2002)
Lütkepohl, H., Krätzig, M.: Applied Time Series Econometrics. Cambridge University Press, Cambridge (2004)
Morse, M.D., Patel, J.M.: An efficient and accurate method for evaluating time series similarity. In: SIGMOD Conference, pp. 569–580 (2007)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kurbalija, V., Radovanović, M., Geler, Z., Ivanović, M. (2010). A Framework for Time-Series Analysis. In: Dicheva, D., Dochev, D. (eds) Artificial Intelligence: Methodology, Systems, and Applications. AIMSA 2010. Lecture Notes in Computer Science(), vol 6304. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15431-7_5
Download citation
DOI: https://doi.org/10.1007/978-3-642-15431-7_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15430-0
Online ISBN: 978-3-642-15431-7
eBook Packages: Computer ScienceComputer Science (R0)