
Generating timeline summaries with social media attention

  • Research Article
  • Frontiers of Computer Science

Abstract

Timeline generation is an important research task that helps users quickly understand the overall evolution of a given topic. Previous methods simply split the time span into fixed, equal intervals without studying how the evolutionary patterns of the underlying topic affect timeline generation. In addition, few of these methods take users’ collective interests into consideration when generating timelines.

We use social media attention to address these two problems for two reasons: 1) social media is an important pool of real users’ collective interests; 2) the information cascades generated in it may be good indicators of the boundaries of topic phases. Using Twitter as a basis, we propose to incorporate topic phases and users’ collective interests, both learnt from social media, into a unified timeline generation algorithm. We construct one informativeness-oriented and three interestingness-oriented evaluation sets over five topics. We demonstrate that the proposed approach is effective at generating timelines that are both informative and interesting. In addition, our idea naturally leads to a novel presentation of timelines, i.e., phase-based timelines, which can potentially improve user experience.

Author information

Corresponding author

Correspondence to Wayne Xin Zhao.

Additional information

Wayne Xin Zhao is currently an assistant professor at the School of Information, Renmin University of China, China. He received his PhD degree from Peking University, China in 2014. His research interests are web text mining and natural language processing. He has published several refereed papers in international conferences and journals such as ACL, EMNLP, COLING, ECIR, CIKM, SIGIR, SIGKDD, ACM TOIS, ACM TIST and IEEE TKDE.

Ji-Rong Wen is a professor at the School of Information, Renmin University of China, China. Before that, he had been a senior researcher and group manager of the Web Search and Mining Group at Microsoft Research Asia (MSRA), China since 2008. He has published extensively in prestigious international conferences and journals and has served as a program committee member or chair for many international conferences. He was the chair of the WWW in China track of the 17th World Wide Web conference. He is currently an associate editor of ACM Transactions on Information Systems (TOIS).

Xiaoming Li is a professor at the School of Electronic Engineering and Computer Science and the director of the Institute of Network Computing and Information Systems at Peking University, China. He is a senior member of IEEE and currently serves as vice president of the China Computer Federation. His research interests include search engines and web mining, and web-technology-enabled social sciences.

Cite this article

Zhao, W.X., Wen, JR. & Li, X. Generating timeline summaries with social media attention. Front. Comput. Sci. 10, 702–716 (2016). https://doi.org/10.1007/s11704-015-5145-3

  • DOI: https://doi.org/10.1007/s11704-015-5145-3
