Abstract
Time series motif discovery is an important primitive for the time series data mining. With the explosion of new sensing technology, there is a continuously increasing amount of time series data in every aspect of our lives, from seismology, entomology, human activity monitoring, medicine and so on. Considering the rich information included in time series, motif discovery has become an essential part of many data mining tasks. In recent years, the problem of consensus motif discovery in multiple time series begins to appear in our vision. For this task, the existing approaches can only search the consensus motif of a fixed length. However, variable-length motif mining is more common in real applications. To address this problem, the brute force version of the existing fixed-length approach is prohibitively expensive. In this paper, we propose an efficient, scalable and exact algorithm VACOMI to search the consensus motif of all lengths in a given motif length range. We evaluate the performance of VACOMI on four real datasets. The results show that VACOMI can reduce up to 96% of the running time compared with the state-of-the-art approach.
The work is supported by the Ministry of Science and Technology of China, National Key Research and Development Program (No. 2020YFB1710001).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Agrawal, R., Faloutsos, C., Swami, A.: Efficient similarity search in sequence databases. In: Lomet, D.B. (ed.) FODO 1993. LNCS, vol. 730, pp. 69–84. Springer, Heidelberg (1993). https://doi.org/10.1007/3-540-57301-1_5
Chiu, B., Keogh, E., Lonardi, S.: Probabilistic discovery of time series motifs. In: SIGKDD, pp. 493–498 (2003)
Dau, H.A., et al.: The UCR time series archive. IEEE/CAA J. Automatica Sinica 6(6), 1293–1305 (2019)
De Paepe, D., Van Hoecke, S.: Mining recurring patterns in real-valued time series using the radius profile. In: ICDM, pp. 984–989. IEEE (2020)
Gao, Y., Lin, J.: HIME: discovering variable-length motifs in large-scale time series. KAIS 61(1), 513–542 (2018). https://doi.org/10.1007/s10115-018-1279-6
Kamgar, K., Gharghabi, S., Keogh, E.: Matrix profile XV: exploiting time series consensus motifs to find structure in time series sets. In: ICDM, pp. 1156–1161. IEEE (2019)
Lei, W., Li, P., Han, Y., Gong, S., Yang, L., Hou, M.: EPG recordings reveal differential feeding behaviors in Sogatella Furcifera in response to plant virus infection and transmission success. Sci. Rep. 6(1), 1–9 (2016)
Linardi, M., Zhu, Y., Palpanas, T., Keogh, E.: Matrix profile X: VALMOD-scalable discovery of variable-length motifs in data series. In: SIGMOD, pp. 1053–1066 (2018)
Mueen, A., Chavoshi, N.: Enumeration of time series motifs of all lengths. KAIS 45(1), 105–132 (2014). https://doi.org/10.1007/s10115-014-0793-4
Paparrizos, J., Gravano, L.: k-Shape: efficient and accurate clustering of time series. In: SIGMOD, pp. 1855–1870 (2015)
Wang, X., et al.: RPM: representative pattern mining for efficient time series classification. In: EDBT, pp. 185–196 (2016)
Ye, L., Keogh, E.: Time series shapelets: a new primitive for data mining. In: SIGKDD, pp. 947–956 (2009)
Yeh, C.C.M., et al.: Matrix profile I: all pairs similarity joins for time series: a unifying view that includes motifs, discords and shapelets. In: ICDM, pp. 1317–1322. IEEE (2016)
Zhu, Y., Yeh, C.C.M., Zimmerman, Z., Kamgar, K., Keogh, E.: Matrix profile XI: SCRIMP++: time series motif discovery at interactive speeds. In: ICDM, pp. 837–846. IEEE (2018)
Zhu, Y., et al.: Matrix profile II: exploiting a novel algorithm and GPUs to break the one hundred million barrier for time series motifs and joins. In: ICDM, pp. 739–748. IEEE (2016)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Zhang, M., Wang, P., Wang, W. (2022). Efficient Consensus Motif Discovery of All Lengths in Multiple Time Series. In: Bhattacharya, A., et al. Database Systems for Advanced Applications. DASFAA 2022. Lecture Notes in Computer Science, vol 13246. Springer, Cham. https://doi.org/10.1007/978-3-031-00126-0_39
Download citation
DOI: https://doi.org/10.1007/978-3-031-00126-0_39
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-00125-3
Online ISBN: 978-3-031-00126-0
eBook Packages: Computer ScienceComputer Science (R0)