Skip to main content
Log in

A time series retrieval tool for sub-series matching

  • Published:
Applied Intelligence Aims and scope Submit manuscript

Abstract

The problem of retrieving time series similar to a specified query pattern has been recently addressed within the case based reasoning (CBR) literature. Providing a flexible and efficient way of dealing with such an issue is of paramount importance in many domains (e.g., medical), where the evolution of specific parameters is collected in the form of time series. In the past, we have developed a framework for retrieving time series, applying temporal abstractions. With respect to more classical (mathematical) approaches, our framework provides significant advantages. In particular, multi-level abstraction mechanisms and proper indexing techniques allow for flexible query issuing, and for efficient and interactive query answering. In this paper, we present an extension to such a framework, which aims to support sub-series matching as well. Indeed, sub-series retrieval may be crucial when the whole time series evolution is not of interest, while critical patterns to be searched for are only “local”. Moreover, sometimes the relative order of patterns, but not their precise location in time, may be known. Finally, an interactive search, at different abstraction levels, may be required by the decision maker. Our extended framework (which is currently being applied in haemodialysis, but is domain independent) deals with all these issues.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

Similar content being viewed by others

Notes

  1. The result of abstracting (one or more) dummy symbol(s) is obviously domain dependent, and has to be defined by experts while setting up the system.

  2. Our indexing approach may resemble association rule mining with frequent pattern (FP) trees. Association rule mining is a well-known method for discovering interesting relations between variables in large databases. The FP-growth algorithm [39] has been proposed in this context to mine frequent itemsets, that will be the input for rule mining. FP-growth is an efficient and scalable method for mining the complete set of frequent patterns, using an extended prefix-tree structure for storing information named FP-tree. Despite the use of the tree structure, it is, however, clear that our approach is only loosely related to FP trees, since our goal (i.e., time series retrieval) is very different.

References

  1. Aamodt A, Plaza E (1994) Case-based reasoning: foundational issues, methodological variations and systems approaches. AI Commun 7:39–59

    Google Scholar 

  2. Agrawal R, Faloutsos C, Swami AN (1993) Efficient similarity search in sequence databases. In: Lomet D (ed) Foundations of Data Organization and Algorithms, 4th International Conference, FODO’93. Springer, Berlin, pp 69–84

    Chapter  Google Scholar 

  3. Bas E, Uslu VR, Yolcu U, Egrioglu E (2014) A modified genetic algorithm for forecasting fuzzy time series. Appl Intell 41:453–463

    Article  Google Scholar 

  4. Bellazzi R, Larizza C, Magni P, Montani S, Stefanelli M (2000) Intelligent analysis of clinical time series: an application in the diabetes mellitus domain. Artif Intell Med 20:37–57

    Article  Google Scholar 

  5. Bellazzi R, Larizza C, Riva A (1998) Temporal abstractions for interpreting diabetic patients monitoring data. Intell Data Anal 2:97–122

    Article  Google Scholar 

  6. Berry G, Sethi R (1986) From regular expressions to deterministic automata. Theor Comput Sci 48(3):117–126

    Article  MATH  MathSciNet  Google Scholar 

  7. Branting LK, Hastings JD (1994) An empirical evaluation of model-based case matching and adaptation. In: Proceedings of the Workshop on Case-Based Reasoning, AAAI 1994

  8. Chan KP, Fu AWC (1999) Efficient time series matching by wavelets. In: Proc. ICDE 99. IEEE Computer Society Press, Washington, pp 126–133

  9. Daw CS, Finney CE, Tracy ER (2003) A review of symbolic analysis of experimental data. Rev Sci Instrum 74:915–930

    Article  Google Scholar 

  10. Dechter R, Meiri I, Pearl J (1991) Temporal constraint networks. In: Brachman RJ, Levesque HJ, Reiter R (eds) Knowledge Representation. MIT Press, London, pp 61–95

    Google Scholar 

  11. Feng L, Yin Z, Jin B, Zhao X (2011) Time series subsequence retrieval based on the properties of multi-index. AISS: Adv Inf Sci Serv Sci 3(9):300–308

    Article  Google Scholar 

  12. Fuad MMM, Marteau PF (2010) Fast retrieval of time series using a multi-resolution filter with multiple reduced spaces. In: Cao L, Feng Y, Zhong J (eds) Advanced Data Mining and Applications - 6th International Conference, ADMA 2010, Chongqing, China, November 19–21, 2010, Proceedings, Part I, volume 6440 of Lecture Notes in Computer Science. Springer, pp 137–148

  13. Fuch B, Mille A, Chiron B (1995) Operator decision aiding by adaptation of supervision strategies. In: Case-Based Reasoning Research and Development, LNAI. Springer, Berlin, pp 23–32

  14. Funk P, Xiong N (2008) Extracting knowledge from sensor signals for case-based reasoning with longitudinal time series data. In: Perner P (ed) Case-Based Reasoning in Signals and Images. Springer, pp 247–284

  15. Jaere MD, Aamodt A, Skalle P (2002) Representing temporal knowledge for case-based prediction. In: Craw S, Preece A (eds) Proceedings of the European Conference on Case Based Reasoning (ECCBR) 2002, Lecture Notes in Artificial Intelligence 2416. Springer, Berlin, pp 174–188

    Google Scholar 

  16. Kadar S, Wang J, Showalter K (1998) Noise-supported travelling waves in sub-excitable media. Nature 391:770–772

    Article  Google Scholar 

  17. Keogh E, Chakrabarti K, Pazzani M, Mehrotra S (2000) Dimensionality reduction for fast similarity search in large time series databases. Knowl Inf Syst 3(3):263–286

    Article  Google Scholar 

  18. Lam MS, Aho AV, Sethi R, Ullman JD (2006) Compilers: Principles, Techniques, and Tools. Addison-Wesley, Reading (MA)

    Google Scholar 

  19. Lin J, Keogh E, Lonardi S, Chiu B (2003) A symbolic representation of time series, with implications for streaming algorithms. In: Proceedings of ACM-DMKD, San Diego, pp 2–11

  20. Lin K, Jagadish HV, Faloutsos C (1994) The tv-tree: An index structure for high-dimensional data. VLDB 3(4):517–542

    Article  Google Scholar 

  21. Lin Y, McCool MD (2010) Subseries join: a similarity-based time series match approach. In: Zaki MJ, Yu JX, Ravindran B, Pudi V (eds) Advances in Knowledge Discovery and Data Mining, 14th Pacific-Asia Conference, PAKDD 2010, Hyderabad, India, June 21-24, 2010. Proceedings. Part I, volume 6118 of Lecture Notes in Computer Science. Springer, pp 238–245

  22. Miksch S, Horn W, Popow C, Paky F (1996) Utilizing temporal data abstractions for data validation and therapy planning for artificially ventilated newborn infants. Artif Intell Med 8:543–576

    Article  Google Scholar 

  23. Montani S, Leonardi G, Bottrighi A, Portinale L, Terenziani P (2013) Supporting flexible, efficient and user-interpretable retrieval of similar time series. IEEE Trans Knowl Data Eng 25(3):677–689

    Article  Google Scholar 

  24. Montani S, Portinale L, Leonardi G, Bellazzi R, Bellazzi R (2006) Case-based retrieval to support the treatment of end stage renal failure patients. Artif Intell Med 37:31–42

    Article  Google Scholar 

  25. Nakhaeizadeh G (1994) Learning prediction from time series: a theoretical and empirical comparison of cbr with some other approaches. In: Topics in Case-Based Reasoning, LNAI 837. Springer, Berlin, pp 65–76

  26. Nilsson M, Funk P, Olsson E, vonScheele B, Xiong N (2006) Clinical decision-support for diagnosing stress-related disorders by applying psychophysiological medical knowledge to an instance-based learning system. Artif Intell Med 36:159–176

    Article  Google Scholar 

  27. Nilsson M, Funk P, Xiong N (2005) Clinical decision support by time series classification using wavelets. In: Chen CS, Filipe J, Seruca I, Cordeiro J (eds) Proc. Seventh International Conference on Enterprise Information Systems (ICEIS’05). INSTICC Press, pp 169–175

  28. Oppenheim AV, Shafer RW (1975) Digital signal processing. Prentice Hall, London

    MATH  Google Scholar 

  29. Ram A, Santamaria JC (1993) Continuous case-based reasoning. In: Proc. AAAI Case-Based Reasoning Workshop, pp 86– 93

  30. Sakoe H, Chiba S (1978) Dynamic programming algorithm optimization for spoken word recognition. IEEE Trans Acoust, Speech Signal Process 26(1):43–49

    Article  MATH  Google Scholar 

  31. Serra J, Arcos JL (2012) A competitive measure to assess the similarity between two time series. In: Díaz-Agudo B, Watson I (eds) Case-Based Reasoning Research and Development - 20th International Conference, ICCBR 2012, Lyon, France, September 3-6, 2012. Proceedings, volume 7466 of Lecture Notes in Computer Science. Springer, pp 414–427

  32. Serra J, Serra X, Andrzejak RG (2009) Cross recurrence quantification for cover song identification. New J Phys 11(9):093017

  33. Shahar Y (1997) A framework for knowledge-based temporal abstractions. Artif Intell 90:79–133

    Article  MATH  Google Scholar 

  34. Shahar Y, Musen MA (1996) Knowledge-based temporal abstraction in clinical domains. Artif Intell Med 8:267–298

    Article  Google Scholar 

  35. Stacey M (2005) Knowledge based temporal abstractions within the neonatal intensive care domain. In: Proc. CSTE Innovation Conference, University of Western Sidney

  36. Terenziani P, German E, Shahar Y (2008) The temporal aspects of clinical guidelines. In: Ten Teije A, Miksch S, Lucas P (eds) Computer-based Medical Guidelines and Protocols: A Primer and Current Trends

  37. van Leeuwen J (1994) Handbook of Theoretical Computer Science. Mit Press, Cambridge (CA)

    Google Scholar 

  38. Wang Z, Huang S, Wang L, Li H, Wang Y, Yang H (2013) Accelerating subsequence similarity search based on dynamic time warping distance with fpga. In: Proceedings of the ACM/SIGDA International Symposium on Field Programmable Gate Arrays, FPGA ’13. ACM, New York, pp 53–62

  39. Witten IH, Frank E, Hall MA (2011) Data Mining: Practical Machine Learning Tools and Techniques. Morgan Kaufmann, San Mateo (CA)

    Google Scholar 

  40. Xylogiannopoulosand KF, Karampelas P, Alhajj R (2014) Analyzing very large time series using suffix arrays. Appl Intell:1–15

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Stefania Montani.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Bottrighi, A., Leonardi, G., Montani, S. et al. A time series retrieval tool for sub-series matching. Appl Intell 43, 132–149 (2015). https://doi.org/10.1007/s10489-014-0628-8

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10489-014-0628-8

Keywords

Navigation