Skip to main content

Efficient Consensus Motif Discovery of All Lengths in Multiple Time Series

  • Conference paper
  • First Online:
Book cover Database Systems for Advanced Applications (DASFAA 2022)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 13246))

Included in the following conference series:

Abstract

Time series motif discovery is an important primitive for the time series data mining. With the explosion of new sensing technology, there is a continuously increasing amount of time series data in every aspect of our lives, from seismology, entomology, human activity monitoring, medicine and so on. Considering the rich information included in time series, motif discovery has become an essential part of many data mining tasks. In recent years, the problem of consensus motif discovery in multiple time series begins to appear in our vision. For this task, the existing approaches can only search the consensus motif of a fixed length. However, variable-length motif mining is more common in real applications. To address this problem, the brute force version of the existing fixed-length approach is prohibitively expensive. In this paper, we propose an efficient, scalable and exact algorithm VACOMI to search the consensus motif of all lengths in a given motif length range. We evaluate the performance of VACOMI on four real datasets. The results show that VACOMI can reduce up to 96% of the running time compared with the state-of-the-art approach.

The work is supported by the Ministry of Science and Technology of China, National Key Research and Development Program (No. 2020YFB1710001).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 89.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 119.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Agrawal, R., Faloutsos, C., Swami, A.: Efficient similarity search in sequence databases. In: Lomet, D.B. (ed.) FODO 1993. LNCS, vol. 730, pp. 69–84. Springer, Heidelberg (1993). https://doi.org/10.1007/3-540-57301-1_5

    Chapter  Google Scholar 

  2. Chiu, B., Keogh, E., Lonardi, S.: Probabilistic discovery of time series motifs. In: SIGKDD, pp. 493–498 (2003)

    Google Scholar 

  3. Dau, H.A., et al.: The UCR time series archive. IEEE/CAA J. Automatica Sinica 6(6), 1293–1305 (2019)

    Article  Google Scholar 

  4. De Paepe, D., Van Hoecke, S.: Mining recurring patterns in real-valued time series using the radius profile. In: ICDM, pp. 984–989. IEEE (2020)

    Google Scholar 

  5. Gao, Y., Lin, J.: HIME: discovering variable-length motifs in large-scale time series. KAIS 61(1), 513–542 (2018). https://doi.org/10.1007/s10115-018-1279-6

    Article  MathSciNet  Google Scholar 

  6. Kamgar, K., Gharghabi, S., Keogh, E.: Matrix profile XV: exploiting time series consensus motifs to find structure in time series sets. In: ICDM, pp. 1156–1161. IEEE (2019)

    Google Scholar 

  7. Lei, W., Li, P., Han, Y., Gong, S., Yang, L., Hou, M.: EPG recordings reveal differential feeding behaviors in Sogatella Furcifera in response to plant virus infection and transmission success. Sci. Rep. 6(1), 1–9 (2016)

    Article  Google Scholar 

  8. Linardi, M., Zhu, Y., Palpanas, T., Keogh, E.: Matrix profile X: VALMOD-scalable discovery of variable-length motifs in data series. In: SIGMOD, pp. 1053–1066 (2018)

    Google Scholar 

  9. Mueen, A., Chavoshi, N.: Enumeration of time series motifs of all lengths. KAIS 45(1), 105–132 (2014). https://doi.org/10.1007/s10115-014-0793-4

    Article  Google Scholar 

  10. Paparrizos, J., Gravano, L.: k-Shape: efficient and accurate clustering of time series. In: SIGMOD, pp. 1855–1870 (2015)

    Google Scholar 

  11. Wang, X., et al.: RPM: representative pattern mining for efficient time series classification. In: EDBT, pp. 185–196 (2016)

    Google Scholar 

  12. Ye, L., Keogh, E.: Time series shapelets: a new primitive for data mining. In: SIGKDD, pp. 947–956 (2009)

    Google Scholar 

  13. Yeh, C.C.M., et al.: Matrix profile I: all pairs similarity joins for time series: a unifying view that includes motifs, discords and shapelets. In: ICDM, pp. 1317–1322. IEEE (2016)

    Google Scholar 

  14. Zhu, Y., Yeh, C.C.M., Zimmerman, Z., Kamgar, K., Keogh, E.: Matrix profile XI: SCRIMP++: time series motif discovery at interactive speeds. In: ICDM, pp. 837–846. IEEE (2018)

    Google Scholar 

  15. Zhu, Y., et al.: Matrix profile II: exploiting a novel algorithm and GPUs to break the one hundred million barrier for time series motifs and joins. In: ICDM, pp. 739–748. IEEE (2016)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Peng Wang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Zhang, M., Wang, P., Wang, W. (2022). Efficient Consensus Motif Discovery of All Lengths in Multiple Time Series. In: Bhattacharya, A., et al. Database Systems for Advanced Applications. DASFAA 2022. Lecture Notes in Computer Science, vol 13246. Springer, Cham. https://doi.org/10.1007/978-3-031-00126-0_39

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-00126-0_39

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-00125-3

  • Online ISBN: 978-3-031-00126-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics