Skip to main content

Identifying Evolutionary Topic Temporal Patterns Based on Bursty Phrase Clustering

  • Conference paper
  • First Online:
Book cover Web and Big Data (APWeb-WAIM 2017)

Abstract

We discuss a temporal text mining task on finding evolutionary patterns of topics from a collection of article revisions. To reveal the evolution of topics, we propose a novel method for finding key phrases that are bursty and significant in terms of revision histories. Then we show a time series clustering method to group phrases that have similar burst histories, where additions and deletions are separately considered, and time series is abstracted by burst detection. In clustering, we use dynamic time warping to measure the distance between time sequences of phrase frequencies. Experimental results show that our method clusters phrases into groups that actually share similar bursts which can be explained by real-world events.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Aji, A., Wang, Y., Agichtein, E., et al.: Using the past to score the present: extending term weighting models through revision history analysis. In: Proceedings 19th ACM International Conference on Information and Knowledge Management, pp. 629–638 (2010)

    Google Scholar 

  2. Adwan, S., Arof, H.: On improving dynamic time warping for pattern matching. Measurement 45(6), 1609–1620 (2012)

    Article  Google Scholar 

  3. Doan, A., Ramakrishnan, R., Halevy, A.Y.: Crowdsourcing systems on the world-wide web. Commun. ACM 54(4), 86–96 (2011)

    Article  Google Scholar 

  4. http://opennlp.apache.org/

  5. Kalogeratos, A., Zagorisios, P., Likas, A.: Improving text stream clustering using term burstiness and co-burstiness. In: Proceedings of the 9th Hellenic Conference Artificial Intelligence, p. 16. ACM (2016)

    Google Scholar 

  6. Kleinberg, J.: Bursty and hierarchical structure in streams. Data Mining Knowl. Discov. 7(4), 373–397 (2003)

    Article  MathSciNet  Google Scholar 

  7. Liu, Y., Gao, Z., Iwaihara, M.: Identifying evolutionary topic temporal patterns based on bursty phrase clustering. DEIM Forum C5-1, March 2017

    Google Scholar 

  8. A Press: Wikipedia and Artificial Intelligence: An Evolving Synergy

    Google Scholar 

  9. Subašic, I., Berendt, B.: From bursty patterns to bursty facts: the effectiveness of temporal text mining for news. In: Proceedings of the ECAI (2010)

    Google Scholar 

  10. Tran, T., Ceroni, A., Georgescu, M., et al: Wikipevent: leveraging Wikipedia edit history for event detection. In: International Conference on Web Information Systems Engineering. Springer International Publishing, pp. 90–108 (2014)

    Google Scholar 

  11. Wikipedia: http://en.wikipedia.org/wiki/Wikipedia

  12. Yang, J., Leskovec, J.: Patterns of temporal variation in online media. In: Proceedings of the 4th ACM International Conference on Web Search and Data Mining, pp. 177–186 (2011)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Yixuan Liu , Zihao Gao or Mizuho Iwaihara .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Liu, Y., Gao, Z., Iwaihara, M. (2017). Identifying Evolutionary Topic Temporal Patterns Based on Bursty Phrase Clustering. In: Chen, L., Jensen, C., Shahabi, C., Yang, X., Lian, X. (eds) Web and Big Data. APWeb-WAIM 2017. Lecture Notes in Computer Science(), vol 10367. Springer, Cham. https://doi.org/10.1007/978-3-319-63564-4_22

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-63564-4_22

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-63563-7

  • Online ISBN: 978-3-319-63564-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics