Abstract
The problem of retrieving time series similar to a specified query pattern has been recently addressed within the case based reasoning (CBR) literature. Providing a flexible and efficient way of dealing with such an issue is of paramount importance in many domains (e.g., medical), where the evolution of specific parameters is collected in the form of time series. In the past, we have developed a framework for retrieving time series, applying temporal abstractions. With respect to more classical (mathematical) approaches, our framework provides significant advantages. In particular, multi-level abstraction mechanisms and proper indexing techniques allow for flexible query issuing, and for efficient and interactive query answering. In this paper, we present an extension to such a framework, which aims to support sub-series matching as well. Indeed, sub-series retrieval may be crucial when the whole time series evolution is not of interest, while critical patterns to be searched for are only “local”. Moreover, sometimes the relative order of patterns, but not their precise location in time, may be known. Finally, an interactive search, at different abstraction levels, may be required by the decision maker. Our extended framework (which is currently being applied in haemodialysis, but is domain independent) deals with all these issues.
Similar content being viewed by others
Notes
The result of abstracting (one or more) dummy symbol(s) is obviously domain dependent, and has to be defined by experts while setting up the system.
Our indexing approach may resemble association rule mining with frequent pattern (FP) trees. Association rule mining is a well-known method for discovering interesting relations between variables in large databases. The FP-growth algorithm [39] has been proposed in this context to mine frequent itemsets, that will be the input for rule mining. FP-growth is an efficient and scalable method for mining the complete set of frequent patterns, using an extended prefix-tree structure for storing information named FP-tree. Despite the use of the tree structure, it is, however, clear that our approach is only loosely related to FP trees, since our goal (i.e., time series retrieval) is very different.
References
Aamodt A, Plaza E (1994) Case-based reasoning: foundational issues, methodological variations and systems approaches. AI Commun 7:39–59
Agrawal R, Faloutsos C, Swami AN (1993) Efficient similarity search in sequence databases. In: Lomet D (ed) Foundations of Data Organization and Algorithms, 4th International Conference, FODO’93. Springer, Berlin, pp 69–84
Bas E, Uslu VR, Yolcu U, Egrioglu E (2014) A modified genetic algorithm for forecasting fuzzy time series. Appl Intell 41:453–463
Bellazzi R, Larizza C, Magni P, Montani S, Stefanelli M (2000) Intelligent analysis of clinical time series: an application in the diabetes mellitus domain. Artif Intell Med 20:37–57
Bellazzi R, Larizza C, Riva A (1998) Temporal abstractions for interpreting diabetic patients monitoring data. Intell Data Anal 2:97–122
Berry G, Sethi R (1986) From regular expressions to deterministic automata. Theor Comput Sci 48(3):117–126
Branting LK, Hastings JD (1994) An empirical evaluation of model-based case matching and adaptation. In: Proceedings of the Workshop on Case-Based Reasoning, AAAI 1994
Chan KP, Fu AWC (1999) Efficient time series matching by wavelets. In: Proc. ICDE 99. IEEE Computer Society Press, Washington, pp 126–133
Daw CS, Finney CE, Tracy ER (2003) A review of symbolic analysis of experimental data. Rev Sci Instrum 74:915–930
Dechter R, Meiri I, Pearl J (1991) Temporal constraint networks. In: Brachman RJ, Levesque HJ, Reiter R (eds) Knowledge Representation. MIT Press, London, pp 61–95
Feng L, Yin Z, Jin B, Zhao X (2011) Time series subsequence retrieval based on the properties of multi-index. AISS: Adv Inf Sci Serv Sci 3(9):300–308
Fuad MMM, Marteau PF (2010) Fast retrieval of time series using a multi-resolution filter with multiple reduced spaces. In: Cao L, Feng Y, Zhong J (eds) Advanced Data Mining and Applications - 6th International Conference, ADMA 2010, Chongqing, China, November 19–21, 2010, Proceedings, Part I, volume 6440 of Lecture Notes in Computer Science. Springer, pp 137–148
Fuch B, Mille A, Chiron B (1995) Operator decision aiding by adaptation of supervision strategies. In: Case-Based Reasoning Research and Development, LNAI. Springer, Berlin, pp 23–32
Funk P, Xiong N (2008) Extracting knowledge from sensor signals for case-based reasoning with longitudinal time series data. In: Perner P (ed) Case-Based Reasoning in Signals and Images. Springer, pp 247–284
Jaere MD, Aamodt A, Skalle P (2002) Representing temporal knowledge for case-based prediction. In: Craw S, Preece A (eds) Proceedings of the European Conference on Case Based Reasoning (ECCBR) 2002, Lecture Notes in Artificial Intelligence 2416. Springer, Berlin, pp 174–188
Kadar S, Wang J, Showalter K (1998) Noise-supported travelling waves in sub-excitable media. Nature 391:770–772
Keogh E, Chakrabarti K, Pazzani M, Mehrotra S (2000) Dimensionality reduction for fast similarity search in large time series databases. Knowl Inf Syst 3(3):263–286
Lam MS, Aho AV, Sethi R, Ullman JD (2006) Compilers: Principles, Techniques, and Tools. Addison-Wesley, Reading (MA)
Lin J, Keogh E, Lonardi S, Chiu B (2003) A symbolic representation of time series, with implications for streaming algorithms. In: Proceedings of ACM-DMKD, San Diego, pp 2–11
Lin K, Jagadish HV, Faloutsos C (1994) The tv-tree: An index structure for high-dimensional data. VLDB 3(4):517–542
Lin Y, McCool MD (2010) Subseries join: a similarity-based time series match approach. In: Zaki MJ, Yu JX, Ravindran B, Pudi V (eds) Advances in Knowledge Discovery and Data Mining, 14th Pacific-Asia Conference, PAKDD 2010, Hyderabad, India, June 21-24, 2010. Proceedings. Part I, volume 6118 of Lecture Notes in Computer Science. Springer, pp 238–245
Miksch S, Horn W, Popow C, Paky F (1996) Utilizing temporal data abstractions for data validation and therapy planning for artificially ventilated newborn infants. Artif Intell Med 8:543–576
Montani S, Leonardi G, Bottrighi A, Portinale L, Terenziani P (2013) Supporting flexible, efficient and user-interpretable retrieval of similar time series. IEEE Trans Knowl Data Eng 25(3):677–689
Montani S, Portinale L, Leonardi G, Bellazzi R, Bellazzi R (2006) Case-based retrieval to support the treatment of end stage renal failure patients. Artif Intell Med 37:31–42
Nakhaeizadeh G (1994) Learning prediction from time series: a theoretical and empirical comparison of cbr with some other approaches. In: Topics in Case-Based Reasoning, LNAI 837. Springer, Berlin, pp 65–76
Nilsson M, Funk P, Olsson E, vonScheele B, Xiong N (2006) Clinical decision-support for diagnosing stress-related disorders by applying psychophysiological medical knowledge to an instance-based learning system. Artif Intell Med 36:159–176
Nilsson M, Funk P, Xiong N (2005) Clinical decision support by time series classification using wavelets. In: Chen CS, Filipe J, Seruca I, Cordeiro J (eds) Proc. Seventh International Conference on Enterprise Information Systems (ICEIS’05). INSTICC Press, pp 169–175
Oppenheim AV, Shafer RW (1975) Digital signal processing. Prentice Hall, London
Ram A, Santamaria JC (1993) Continuous case-based reasoning. In: Proc. AAAI Case-Based Reasoning Workshop, pp 86– 93
Sakoe H, Chiba S (1978) Dynamic programming algorithm optimization for spoken word recognition. IEEE Trans Acoust, Speech Signal Process 26(1):43–49
Serra J, Arcos JL (2012) A competitive measure to assess the similarity between two time series. In: Díaz-Agudo B, Watson I (eds) Case-Based Reasoning Research and Development - 20th International Conference, ICCBR 2012, Lyon, France, September 3-6, 2012. Proceedings, volume 7466 of Lecture Notes in Computer Science. Springer, pp 414–427
Serra J, Serra X, Andrzejak RG (2009) Cross recurrence quantification for cover song identification. New J Phys 11(9):093017
Shahar Y (1997) A framework for knowledge-based temporal abstractions. Artif Intell 90:79–133
Shahar Y, Musen MA (1996) Knowledge-based temporal abstraction in clinical domains. Artif Intell Med 8:267–298
Stacey M (2005) Knowledge based temporal abstractions within the neonatal intensive care domain. In: Proc. CSTE Innovation Conference, University of Western Sidney
Terenziani P, German E, Shahar Y (2008) The temporal aspects of clinical guidelines. In: Ten Teije A, Miksch S, Lucas P (eds) Computer-based Medical Guidelines and Protocols: A Primer and Current Trends
van Leeuwen J (1994) Handbook of Theoretical Computer Science. Mit Press, Cambridge (CA)
Wang Z, Huang S, Wang L, Li H, Wang Y, Yang H (2013) Accelerating subsequence similarity search based on dynamic time warping distance with fpga. In: Proceedings of the ACM/SIGDA International Symposium on Field Programmable Gate Arrays, FPGA ’13. ACM, New York, pp 53–62
Witten IH, Frank E, Hall MA (2011) Data Mining: Practical Machine Learning Tools and Techniques. Morgan Kaufmann, San Mateo (CA)
Xylogiannopoulosand KF, Karampelas P, Alhajj R (2014) Analyzing very large time series using suffix arrays. Appl Intell:1–15
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Bottrighi, A., Leonardi, G., Montani, S. et al. A time series retrieval tool for sub-series matching. Appl Intell 43, 132–149 (2015). https://doi.org/10.1007/s10489-014-0628-8
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10489-014-0628-8