Abstract
To identify the evolution relationship between events, this paper presents a new model which utilizes random walk model to weight the cosine similarity of events according to the chronological order. This model is proved to be effective and accurate in identifying relationship of events through comparisons with other models. This paper also puts forward an innovative method to detect hot topics which applies the concept of related events. This paper introduces parallel computing, including Spark and MapReduce, to significantly improve the efficiency of the event evolution calculation and hot topic detection.
This work was supported by National Basic Research (973) Program of China (No. 2014CB329606) and Special Fund for Beijing Common Construction Project.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Shahaf, D., Guestrin, C.: Connecting the dots between news articles. In: Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 25-28 July, Washington, D.C. ACM Press, New York (2010)
Swan, R., Jensen, D.: TimeMines: constructing timelines with statistical models of word usage. In: Proceedings of the Sixth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 20–23 August, Boston, MA. ACM Press, New York (2000)
Makkonen, J.: Investigations on event evolution in TDT. In: Proceedings of HLT NAACL 2003 Student Research Workshop, pp, 43–48 (2003)
Shahaf, D., Guestrin, C., Horvitz E.: Trains of thought: generating information maps. In: Proceedings of the 21st International Conference on the World Wide Web, 16–20 April, Lyon, France, pp. 899–908. ACM Press, New York (2012)
Shahaf, D., Guestrin, C., Horvitz, E., Leskovec, J.: Information cartography. Commun. ACM 58(11), 62–73 (2015)
Hui, Z., Guohui, L., Li, J., et al.: A method of constructing news events evolution. J. Natl. Univ. Defense Technol. 2013(04), 166–170 (2013)
Kalyanam, J., Mantrach, A., Saez-Trumper, D., Vahabi, H., Lanckriet, G.: Leveraging social context for modeling topic evolution. In: Knowledge Discovery and Data Mining, pp. 517–526 (2015)
Pei, L., Lakshmanan, L.V.S., et al.: Incremental cluster evolution tracking from highly dynamic network data. In: Proceedings of 2014 IEEE 30th International Conference on Data Engineering, 31 March – 4 April (2014)
Cai, H., Huang, Z., Srivastava, D., et al.: Indexing evolving events from tweet streams. In: Proceedings of 2016 IEEE 32nd International Conference on Data Engineering, 16-20 May 2016, pp. 1538–1539 (2016)
Xingxing, L.: Finding Trend and Automatic Topic Features Extraction. Central China Normal University (2009)
Zhifang, Y., Chao, W., Fan, Z., et al.: Emerging rumor identification for social media with hot topic detection. In: 2015 12th Web Information System and Application Conference (WISA), pp. 53–58 (2015)
Goulong, L., Xiaofei, X., Ying, Z., et al.: An improved latent Dirichlet allocation model for hot topic extraction. In: 2014 IEEE Fourth International Conference on Big Data and Cloud Computing (BdCloud), pp. 470–476 (2014)
Jeffrey, D., Sanjay, G.: MapReduce: simplified data processing on large clusters. Commun. ACM 51, 107–113 (2008)
Matei, Z., Mosharaf, C., et al.: Spark: cluster computing with working sets. In: Proceedings of the 2nd USENIX Conference on Hot Topics in Cloud Computing (HotCloud 2010), p. 10 (2010)
Hui, Z., Guohui, L., Li, J., et al.: On-line news event detection based on TF·IEF model. J. Natl. Univ. Defense Technol. 35(3), 9 (2013)
Brin, S., Page, L.: The anatomy of a large-scale hyper textual Web search engine. Comput. Netw. ISDN Syst. 30, 107–117 (1998)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing AG
About this paper
Cite this paper
Wu, C., Wu, B., Wang, B. (2016). Event Evolution Model Based on Random Walk Model with Hot Topic Extraction. In: Li, J., Li, X., Wang, S., Li, J., Sheng, Q. (eds) Advanced Data Mining and Applications. ADMA 2016. Lecture Notes in Computer Science(), vol 10086. Springer, Cham. https://doi.org/10.1007/978-3-319-49586-6_42
Download citation
DOI: https://doi.org/10.1007/978-3-319-49586-6_42
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-49585-9
Online ISBN: 978-3-319-49586-6
eBook Packages: Computer ScienceComputer Science (R0)