Skip to main content

Analysis of Topic Evolution on News Comments Based on Word Vectors

  • Conference paper
  • First Online:
Cloud Computing and Security (ICCCS 2016)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 10040))

Included in the following conference series:

Abstract

The analysis of topic evolution mainly refers to the mining of topic content which evolves as the time goes on. With the assumption that topic content may be embodied by key words, this article adopted word2vec for the training of 750,000 pieces of news and micro-blog texts and thus established the model of word vector. Then, the text information flow was applied into the model and all word vectors by time series were acquired. Finally, the word vectors were clustered by K-means before the key words were drawn and the analysis of topic evolution was visualized. By comparing the effect of the model of word vector on drawing topic with those of LDA or PLSA topic models, the results showed that the former is superior to the latter two models. Besides, to collect abundant and varied data will facilitate the training of the model of word vector with better generalization ability and the investigation on real-time analysis of topic evolution.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://code.google.com/p/word2vec/.

References

  • Chen, H., Wang, F., Zeng, D.: Intelligence and security informatics for homeland security: information, communication, and transportation. IEEE Trans. Intell. Transp. Syst. 5(4), 329–341 (2004)

    Article  MathSciNet  Google Scholar 

  • Wang, F.: Decision service and academic analytics for development of S&T based on open source intelligence and big data. Bull. Chin. Acad. Sci. 27(5), 527–537 (2012)

    Google Scholar 

  • Allan, J., Papka, R., Lavrenko, V.: On-line new event detection and tracking. In: Proceedings of the 21st Annual International Conference on Research and Development in Information Retrieval (SIGIR), pp. 37–45. ACM (1998)

    Google Scholar 

  • Han, Z., Chen, N., Le, J.J., Duan, D., Sun, J.: An efficient and effective clustering algorithm for time series of hot topics. Chin. J. Comput. 35(11), 2337–2347 (2012)

    Article  Google Scholar 

  • Zhang, S., Liu, Z.: Study on clustering method for internet public opinion hotspot topic. J. Chin. Comput. Syst. 34(3), 471–474 (2013)

    Google Scholar 

  • Sakaki, T., Okazaki, M., Matsuo, Y.: Earthquake shakes twitter users: real-time event detection by social sensors. In: Proceedings of the Nineteenth International World Wide Web Conference Series (WWW), pp. 851–860. ACM (2010)

    Google Scholar 

  • Zhou, H., Liu, J., Wang, X.: Retrospective topic identification model for short text information flow. J. Chin. Inf. Process. 29(1), 111–117 (2015a)

    MathSciNet  Google Scholar 

  • Zhou, Y., Liu, X., Youtian, D., Guan, X., Liu, J.: A method for Identifying the evolutionary focuses of online social topics. Chin. J. Comput. 38(2), 261–271 (2015b)

    MathSciNet  Google Scholar 

  • Xing, E.P., Yan, R., Hauptmann, A.G.: Mining associated text and images with dual-wing harmoniums. In: Proceedings of the 21st Conference on Uncertainty in Artificial Intelligence (UAI), pp. 633–641 (2005)

    Google Scholar 

  • Lin, J., Zhou, Y., Yang, A., Chen, Y., Chen, X.: Analysis of public emotion evolution based on probabilistic latent semantic analysis. J. Comput. Appl. 35(10), 2747–2751 (2015). 2756

    Google Scholar 

  • He, J., Chen, X., Min, D., Jiang, H.: Topic evolution analysis based on improved online LDA model. J. Cent. S. Univ. (Sci. Technol.) 2, 547–553 (2015)

    Google Scholar 

  • Lin, P., Huang, W.: Topic evolution analysis of internet emergency based on LDA model. Inf. Sci. 32(10), 20–23 (2014)

    Google Scholar 

  • Cao, J., Wang, H., Xia, Y., Qiao, F., Zhang, X.: Bi-path evolution model for online topic model based on LDA. Acta Automatica Sin. 40(12), 2877–2886 (2014)

    Google Scholar 

  • Alsumait, L., Barbará, D., Domeniconi, C.: On-line LDA: adaptive topic models for mining text streams with applications to topic detection and tracking. In: Proceedings of the 8th IEEE International Conference on Data Mining (ICDM), pp. 3–12. IEEE (2008)

    Google Scholar 

  • Keane, N., Yee, C., Zhou, L.: Using topic modeling and similarity thresholds to detect events. In: Proceedings of the 3rd Workshop on EVENTS at the NAACL-HLT, pp. 34–42 (2015)

    Google Scholar 

  • Chu, K., Li, F.: LDA model-based news topic evolution. Comput. Appl. Softw. 28(4), 4–7 (2011)

    Google Scholar 

  • Zhangjie, F., Xingming, S., Qi, L., Lu, Z., Jiangang, S.: Achieving efficient cloud search services: multi-keyword ranked search over encrypted cloud data supporting parallel computing. IEICE Trans. Commun. E98-B(1), 190–200 (2015)

    Article  Google Scholar 

  • Bin, G., Sheng, V.S., Tay, K.Y., Romano, W., Li, S.: Incremental support vector learning for ordinal regression. IEEE Trans. Neural Netw. Learn. Syst. 26(7), 1403–1416 (2015)

    Article  MathSciNet  Google Scholar 

  • Xia, Z., Wang, X., Sun, X., Wang, Q.: A secure and dynamic multi-keyword ranked search scheme over encrypted cloud data. IEEE Trans. Parallel Distrib. Syst. 27(2), 340–352 (2015)

    Article  MathSciNet  Google Scholar 

  • Zhang, D., Xua, H., Sua, Z., Xua, Y.: Chinese comments sentiment classification based on word2vec and SVMperf. Expert Syst. Appl. 42(4), 1857–1863 (2015)

    Article  Google Scholar 

Download references

Acknowledgment

Supported by The National Social Science Funding Project of China (12BYY045); The Twelfth Five-Year Plan for Philosophy and Social Sciences Project of GuangZhou (15Q16); The Philosophy and Social Sciences Project of GuangDong (GD15YTS01).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Zhou Yongmei .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing AG

About this paper

Cite this paper

Jianghao, L., Yongmei, Z., Aimin, Y., Jin, C. (2016). Analysis of Topic Evolution on News Comments Based on Word Vectors. In: Sun, X., Liu, A., Chao, HC., Bertino, E. (eds) Cloud Computing and Security. ICCCS 2016. Lecture Notes in Computer Science(), vol 10040. Springer, Cham. https://doi.org/10.1007/978-3-319-48674-1_41

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-48674-1_41

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-48673-4

  • Online ISBN: 978-3-319-48674-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics