Abstract
We study the problem of online similarity search in sequences of temporal intervals: given a standing query and a time-evolving sequence of event-intervals, we want to assess the existence of the query in the sequence over time. Since indexing is inapplicable to our problem, the goal is to reduce runtime without sacrificing retrieval accuracy. We present three lower-bounding and two early-abandoning methods for speeding up search while guaranteeing no false dismissals. We also present ABIDE, a framework for combining lower bounds with early abandoning. Empirical evaluation on eight real datasets and two synthetic datasets suggests that ABIDE provides speedups of at least an order of magnitude and up to 6977 times on average, compared to existing approaches and a baseline. We conclude that ABIDE is more powerful than existing methods and attains the same pruning power with fewer CPU computations.
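The general idea of combining lower bounds with early abandoning can be illustrated with a minimal sketch. This is not the ABIDE framework itself, and it uses none of the paper's three bounds or two abandoning methods, which the abstract does not specify. As a stand-in, it assumes fixed-length numeric windows compared under the L1 distance, with a hypothetical cheap bound `lb_sum` and hypothetical helpers `l1_early_abandon` and `search`. The point it shows is that a candidate is skipped only when a valid lower bound already exceeds the threshold, so no true match is dismissed.

```python
from typing import Callable, List, Optional, Sequence, Tuple

Vector = Sequence[float]
LowerBound = Callable[[Vector, Vector], float]


def lb_sum(q: Vector, w: Vector) -> float:
    """Cheap lower bound on the L1 distance (illustrative, not from the paper).

    By the triangle inequality, |sum(q) - sum(w)| <= sum(|q_i - w_i|).
    """
    return abs(sum(q) - sum(w))


def l1_early_abandon(q: Vector, w: Vector, threshold: float) -> Optional[float]:
    """Exact L1 distance, abandoned as soon as the partial sum exceeds threshold."""
    total = 0.0
    for a, b in zip(q, w):
        total += abs(a - b)
        if total > threshold:
            return None  # cannot be a match; stop computing
    return total


def search(
    query: Vector,
    windows: Sequence[Vector],
    threshold: float,
    lower_bounds: Sequence[LowerBound],
) -> List[Tuple[int, float]]:
    """Return (index, distance) for every window within threshold of the query."""
    matches = []
    for i, w in enumerate(windows):
        # Prune when any lower bound already exceeds the threshold.
        # Because each bound never overestimates the true distance,
        # this step cannot discard a true match.
        if any(lb(query, w) > threshold for lb in lower_bounds):
            continue
        d = l1_early_abandon(query, w, threshold)
        if d is not None:
            matches.append((i, d))
    return matches


query = [1.0, 2.0, 3.0]
windows = [[1.0, 2.0, 3.0], [10.0, 10.0, 10.0], [3.0, 2.0, 1.0], [1.0, 2.0, 4.0]]
result = search(query, windows, threshold=1.0, lower_bounds=[lb_sum])
print(result)
```

In this toy run the second window is pruned by the bound alone, while the third passes the bound but is abandoned partway through the exact computation. Ordering bounds from cheapest to most expensive is a common design choice in such cascades, since `any` stops at the first bound that prunes.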
Notes
- 1. DR generates synthetic datasets, given the statistical properties of an input dataset, i.e., the e-sequence length, count of intervals per event label, and the total duration of the e-stream. We can also create denser streams by multiplying the number of intervals and their total duration by a given scalar value, the ‘density multiplier’.
Acknowledgments
This work was partly supported by the VR-2016-03372 Swedish Research Council Starting Grant.
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Kostakis, O., Papapetrou, P. (2017). ABIDE: Querying Time-Evolving Sequences of Temporal Intervals. In: Adams, N., Tucker, A., Weston, D. (eds) Advances in Intelligent Data Analysis XVI. IDA 2017. Lecture Notes in Computer Science, vol 10584. Springer, Cham. https://doi.org/10.1007/978-3-319-68765-0_15
DOI: https://doi.org/10.1007/978-3-319-68765-0_15
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-68764-3
Online ISBN: 978-3-319-68765-0
eBook Packages: Computer Science, Computer Science (R0)