Skip to main content

Tweet Integration by Finding the Shortest Paths on a Word Graph

  • Chapter
  • First Online:
Modern Approaches for Intelligent Information and Database Systems

Part of the book series: Studies in Computational Intelligence ((SCI,volume 769))

Abstract

Twitter is a well-known social network service. Every second, users post a large number of tweets on different topics, which leads to a significant problem-it is time-consuming for users to get useful information for their individual purposes. It is difficult for a user to receive necessary information from all topics with high accuracy. Thus, integrating the tweets to create summaries is very convenient solution for users. There are some previous works trying to solve the problem of tweet integration. However, they did not consider automatic grouping tweets into small clusters according to topic. Moreover, the tweets have not analyzed for sentiment mining before summarization. In this study, we propose an approach to integrate tweets by taking into account techniques such as topic modeling to automatically determine the number of topics as well as the tweets inside each topic, plus sentiment analysis to classify the attitudes of the users. The experimental results show that the proposed model achieves promising results.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    http://www.internetlivestats.com/internet-users/.

  2. 2.

    https://www.statista.com/statistics/278414/number-of-worldwide-social-network-users/.

  3. 3.

    http://www.internetlivestats.com/twitter-statistics/.

  4. 4.

    http://linanqiu.github.io/2015/10/07/word2vec-sentiment/.

References

  1. Alvarez-Melis, D., Saveski, M.: Topic modeling in twitter: aggregating tweets by conversations. In: ICWSM, pp. 519–522 (2016)

    Google Scholar 

  2. Sharifi, B., Hutton, M.A., Kalita, J.: Summarizing microblogs automatically. In: Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics, pp. 685–688. Association for Computational Linguistics (2010)

    Google Scholar 

  3. Yajuan, D., Zhimin, C., Furu, W., Ming, Z., Shum, H.Y.: Twitter topic summarization by ranking tweets using social influence and content quality. In: Proceedings of the 24th International Conference on Computational Linguistics, pp. 763–780 (2012)

    Google Scholar 

  4. Zhang, R., Li, W., Gao, D., Ouyang, Y.: Automatic twitter topic summarization with speech acts. IEEE Trans. Audi Speech Lang. Process. 21(3), 649–658 (2013)

    Article  Google Scholar 

  5. Sharifi, B., Hutton, M.A., Kalita, J.K.: Experiments in microblog summarization. In: 2010 IEEE Second International Conference on Social Computing (SocialCom), pp. 49–56. IEEE (2010)

    Google Scholar 

  6. Salton, G., Buckley, C.: Term-weighting approaches in automatic text retrieval. Inf. Process. Manag. 24(5), 513–523 (1988)

    Article  Google Scholar 

  7. Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent dirichlet allocation. J. Mach. Learn. Res. 3(Jan), 993–1022 (2003)

    Google Scholar 

  8. Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient estimation of word representations in vector space. arXiv:1301.3781 (2013)

  9. Zhou, S., Li, K., Liu, Y.: Text categorization based on topic model. Int. J. Comput. Intell. Syst. 2(4), 398–409 (2009). https://doi.org/10.1080/18756891.2009.9727671

    Article  Google Scholar 

  10. Heinrich, G.: Infinite LDA implementing the HDP with minimum code complexity. Technical note, Feb 170 (2011)

    Google Scholar 

  11. Hoang, D.T., Tran, V.C., Nguyen, V.D., Nguyen, N.T., Hwang, D.: Improving academic event recommendation using research similarity and interaction strength between authors. Cybern. Syst. 48(3), 210–230 (2017)

    Article  Google Scholar 

  12. Le, Q., Mikolov, T.: Distributed representations of sentences and documents. In: Proceedings of the 31st International Conference on Machine Learning (ICM-14), pp. 1188–1196 (2014)

    Google Scholar 

  13. Filippova, K.: Multi-sentence compression: finding shortest paths in word graphs. In: Proceedings of the 23rd International Conference on Computational Linguistics, pp. 322–330. Association for Computational Linguistics (2010)

    Google Scholar 

  14. Steinberger, J., Ježek, K.: Evaluation measures for text summarization. Comput. Inf. 28(2), 251–275 (2012)

    MATH  Google Scholar 

  15. Vanderwende, L., Suzuki, H., Brockett, C., Nenkova, A.: Beyond sumbasic: task-focused summarization with sentence simplification and lexical expansion. Inf. Process. Manag. 43(6), 1606–1618 (2007)

    Article  Google Scholar 

  16. Erkan, G., Radev, D.R.: Lexrank: graph-based lexical centrality as salience in text summarization. J. Artif. Intell. Res. 22, 457–479 (2004)

    Google Scholar 

Download references

Acknowledgements

This research was supported by Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Science, ICT & Future Planning (2017R1A2B4009410).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Dosam Hwang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer International Publishing AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Phan, H.T., Hoang, D.T., Nguyen, N.T., Hwang, D. (2018). Tweet Integration by Finding the Shortest Paths on a Word Graph. In: Sieminski, A., Kozierkiewicz, A., Nunez, M., Ha, Q. (eds) Modern Approaches for Intelligent Information and Database Systems. Studies in Computational Intelligence, vol 769. Springer, Cham. https://doi.org/10.1007/978-3-319-76081-0_8

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-76081-0_8

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-76080-3

  • Online ISBN: 978-3-319-76081-0

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics