Skip to main content

Two-Phase Mining for Frequent Closed Episodes

  • Conference paper
  • First Online:
Book cover Web-Age Information Management (WAIM 2016)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 9658))

Included in the following conference series:

Abstract

The concept of episodes was introduced for discovering the useful and interesting temporal patterns from the sequential data. Over the years, many episode mining strategies have been suggested, which can be roughly classified into two classes: Apriori-based breadth-first algorithms and projection-based depth-first algorithms. As we know, both kinds of algorithms are level-wise pattern growth methods, so that they have higher computational overhead due to level-wise growth iteration. In addition, their mining time will increase with the increase of sequence length. In the paper, we propose a novel two-phase strategy to discover frequent closed episodes. That is, in phase I, we present a level-wise shrinking mechanism, based on maximal duration episodes, to find the candidate frequent closed episodes from the episodes with the same 2-neighboring episode prefix, and in phase II, we compare the candidates with different prefixes to discover the final frequent closed episodes. The advantage of the suggested mining strategy is it can reduce mining time due to narrowing episode mapping range when doing closure judgment. Experiments on simulated and real datasets demonstrate that the suggested strategy is effective and efficient.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Mannila, H., Toivonen, H., Verkamo, A.I.: Discovering frequent episodes in sequences (Extended Abstract). In: Proceedings of KDD 1995, pp. 210–215 (1995)

    Google Scholar 

  2. Mannila, H., Toivonen, H., Verkamo, A.I.: Discovery of frequent episodes in event sequences. Data Min. Knowl. Disc. 1(3), 259–289 (1997)

    Article  Google Scholar 

  3. Laxman, S., Tankasali, V., White, R.: Stream prediction using a generative model based on frequent episodes in event sequences. In: Proceedings of KDD 2008, pp. 453–461 (2008)

    Google Scholar 

  4. Ng, A., Fu, A.W.C.: Mining frequent episodes for relating financial events and stock trends. In: Proceedings of PAKDD 2003, pp. 27–39 (2003)

    Google Scholar 

  5. Wan, L., Chen, L., Zhang, C.: Mining frequent serial episodes over uncertain sequence data. In: Proceedings of EDBT 2013, pp. 215–226 (2013)

    Google Scholar 

  6. Wan, L., Chen, L., Zhang, C.: Mining dependent frequent serial episodes from uncertain sequence data. In: Proceedings of ICDM 2013, pp. 1211–1216 (2013)

    Google Scholar 

  7. Katoh, T., Tago, S., Asai, T., Morikawa, H., Shigezumi, J., Inakoshi, H.: Mining frequent partite episodes with partwise constraints. In: Appice, A., Ceci, M., Loglisci, C., Manco, G., Masciari, E., Ras, Z.W. (eds.) NFMCP 2013. LNCS, vol. 8399, pp. 117–131. Springer, Heidelberg (2014)

    Google Scholar 

  8. Ma, X., Pang, H., Tan, L.: Finding constrained frequent episodes using minimal occurrences. In: Proceedings of ICDM 2004, pp. 471–474 (2004)

    Google Scholar 

  9. Laxman, S., Sastry, P.S., Unnikrishnan, K.P.: Discovering frequent episodes and learning hidden markov models: A formal connection. IEEE Trans. Knowl. Data Eng. 17(11), 1505–1517 (2005)

    Article  Google Scholar 

  10. Laxman, S., Sastry, P., Unnikrishnan, K.: A fast algorithm for finding frequent episodes in event streams. In: Proceedings of KDD 2007, pp. 410–419 (2007)

    Google Scholar 

  11. Huang, K., Chang, C.: Efficient mining of frequent episodes from complex sequences. Inf. Syst. 33(1), 96–114 (2008)

    Article  Google Scholar 

  12. Zhou, W., Liu, H., Cheng, H.: Mining closed episodes from event sequences efficiently. In: Proceedings of PAKDD 2010, pp. 310–318 (2010)

    Google Scholar 

  13. Tatti, N., Cule, B.: Mining closed strict episodes. In: Proceedings of ICDM 2010, pp. 501–510 (2010)

    Google Scholar 

  14. Tatti, N., Cule, B.: Mining closed episodes with simultaneous events. In: Proceedings of KDD 2011, pp. 1172–1180 (2011)

    Google Scholar 

  15. Zhu, H., Wang, W., Shi, B.: Frequent closed episode mining based on minimal and non-overlaping occurrence. J. Comput. Res. Dev. 50(4), 852–860 (2013)

    Google Scholar 

  16. Wu, C., Lin, Y., Yu, P.S., Tseng, V.S.: Mining high utility episodes in complex event sequences. In: Proceedings of KDD 2013, pp. 536–544 (2013)

    Google Scholar 

  17. Ao, X., Luo, P., Li, C., Zhuang, F., He, Q.: Online frequent episode mining. In: Proceedings of ICDE 2015, pp. 891–902 (2015)

    Google Scholar 

  18. Tatti, N.: Discovering episodes with compact minimal windows. Data Min. Knowl. Disc. 28(4), 1046–1077 (2014)

    Article  MathSciNet  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Guoqiong Liao .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Liao, G., Yang, X., Xie, S., Yu, P.S., Wan, C. (2016). Two-Phase Mining for Frequent Closed Episodes. In: Cui, B., Zhang, N., Xu, J., Lian, X., Liu, D. (eds) Web-Age Information Management. WAIM 2016. Lecture Notes in Computer Science(), vol 9658. Springer, Cham. https://doi.org/10.1007/978-3-319-39937-9_5

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-39937-9_5

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-39936-2

  • Online ISBN: 978-3-319-39937-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics