Skip to main content

Advertisement

Log in

Topic modeling and sentiment analysis of global climate change tweets

  • Original Article
  • Published:
Social Network Analysis and Mining Aims and scope Submit manuscript

Abstract

Social media websites can be used as a data source for mining public opinion on a variety of subjects including climate change. Twitter, in particular, allows for the evaluation of public opinion across both time and space because geotagged tweets include timestamps and geographic coordinates (latitude/longitude). In this study, a large dataset of geotagged tweets containing certain keywords relating to climate change is analyzed using volume analysis and text mining techniques such as topic modeling and sentiment analysis. Latent Dirichlet allocation was applied for topic modeling to infer the different topics of discussion, and Valence Aware Dictionary and sEntiment Reasoner was applied for sentiment analysis to determine the overall feelings and attitudes found in the dataset. These techniques are used to compare and contrast the nature of climate change discussion between different countries and over time. Sentiment analysis shows that the overall discussion is negative, especially when users are reacting to political or extreme weather events. Topic modeling shows that the different topics of discussion on climate change are diverse, but some topics are more prevalent than others. In particular, the discussion of climate change in the USA is less focused on policy-related topics than other countries.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16

Similar content being viewed by others

References

  • Alampalayam SP, Kumar A (2004) Predictive security model using data mining. In: Global telecommunications conference, 2004. GLOBECOM’04, vol 4, pp 2208–2212. IEEE

  • Alampalayam SK, Natsheh EF (2008) Multivariate fuzzy analysis for mobile ad hoc network threat detection. Int J Bus Data Commun Netw (IJBDCN) 4(3):1–30

    Article  Google Scholar 

  • Blei DM (2012) Probabilistic topic models. Commun ACM 55(4):77–84

    Article  Google Scholar 

  • Blei DM, Ng AY, Jordan MI (2003) Latent Dirichlet allocation. J Mach Learn Res 3:993–1022

    MATH  Google Scholar 

  • Cody EM, Reagan AJ, Mitchell L, Dodds PS, Danforth CM (2015) Climate change sentiment on Twitter: an unsolicited public opinion poll. Plos One. https://doi.org/10.1371/journal.pone.0136092

    Article  Google Scholar 

  • Ghosh D, Guha R (2013) What are we “tweeting” about obesity? Mapping tweets with topic modeling and geographic information system. Cartogr Geogr Inf Sci 40(2):90–102. https://doi.org/10.1080/15230406.2013.776210

    Article  Google Scholar 

  • Hutto CJ, Gilbert EE (2014) VADER: a parsimonious rule-based model for sentiment analysis of social media text. In: Eighth international conference on weblogs and social media (ICWSM-14), Ann Arbor, MI

  • IPCC (2014) Contribution of working groups I, II and III to the fifth assessment report of the Intergovernmental Panel on Climate Change. Climate change 2014: synthesis report. IPCC, Geneva, Switzerland

  • Karami A, Dahl A, Turner-McGrievy G, Kharrazi H, Shaw G (2018) Characterizing diabetes, diet, exercise, and obesity comments on Twitter. Int J Inf Manag 38(01):1–6

    Article  Google Scholar 

  • Kirilenko AP, Stepchenkova SO (2014) Public microblogging on climate change: one year of Twitter worldwide. Glob Environ Change 26:171–182. https://doi.org/10.1016/j.gloenvcha.2014.02.008

    Article  Google Scholar 

  • Koto F, Adriani M (2015) A comparative study on Twitter sentiment analysis: which features are good? In: Natural language processing and information systems. NLDB 2015. Lecture notes in computer science, pp 453–457. https://doi.org/10.1007/978-3-319-19581-0_46

  • Lansley G, Longley PA (2016) The geography of Twitter topics in London. Comput Environ Urban Syst 58:85–96. https://doi.org/10.1016/j.compenvurbsys.2016.04.002

    Article  Google Scholar 

  • Leas EC, Althouse BM, Dredze M, Obradovich N, Fowler JH et al (2016) Big data sensors of organic advocacy: the case of Leonardo DiCaprio and climate change. PLoS ONE 11(8):e0159885

    Article  Google Scholar 

  • Lee GM, Qui L, Whinston AB (2016) A friend like me: modeling network formation in a location-based social network. J Manag Inf Syst 33(4):1008–1033

    Article  Google Scholar 

  • Mandel B, Culotta A, Boulahanis J, Stark D, Lewis B, Rodrigue J (2012) A demographic analysis of online sentiment during Hurricane Irene. In: Proceedings of the second workshop on language in social media

  • Manning C, Raghavan R, Schütze H (2008) Introduction to information retrieval. Retrieved from https://nlp.stanford.edu/IR-book/information-retrieval-book.html

  • Medhat W, Hassan A, Korashy H (2014) Sentiment analysis algorithms and applications: a survey. Ain Shams Eng J. 5(4):1093–1113

    Article  Google Scholar 

  • Mehrotra R, Sanner S, Buntine W, Xie L (2013) Improving LDA topic models for microblogs via tweet pooling and automatic labeling. In: Proceedings of the 36th international ACM SIGIR conference on research and development in information retrieval-SIGIR 13. https://doi.org/10.1145/2484028.2484166

  • Omnicore Agency (2018) Twitter by the numbers (2018): stats, demographics & fun facts. Retrieved from https://www.omnicoreagency.com/twitter-statistics/

  • Perrin A (2015) Social network usage: 2005–2015. Pew Research Center. Retrieved from http://www.pewinternet.org/2015/10/08/2015/Social-Networking-Usage-2005-2015/

  • Raj P, Kumar SA (2017) Big data analytics processes and platforms facilitating smart cities. Smart cities: foundations, principles, and applications. Wiley, pp 23–52

  • Rehurek R, Sojka P (2010) Software framework for topic modeling with large corpora. In: Proceedings of the LREC 2010 workshop on new challenges for NLP frameworks, pp 45–50

  • Röder M, Both A, Hinneburg A (2015) Exploring the space of topic coherence measures. In: Proceedings of ACM international conference on web search and data mining

  • Saad L (2017) Global warming concern at three-decade high in U.S. Gallup. Retrieved from https://news.gallup.com/poll/206030/global-warming-concern-three-decade-high.aspx

  • Shi Z, Lee GM, Whinston AB (2016) Toward a better measure of business proximity: topic modeling for industry intelligence. MIS Q 40(4):1035–1056

    Article  Google Scholar 

  • Steinskog A, Therkelsen J, Gambäck B (2017) Twitter topic modeling by tweet aggregation. In: Proceedings of the 21st nordic conference on computational linguistics, pp 77–86

  • Thampi A (2016) Reverse geocoder [computer software]. Retrieved from https://github.com/thampiman/reverse-geocoder

  • Tumasjan A, Sprenger TO, Sandner PG, Welpe IM (2010) Predicting elections with Twitter: what 140 characters reveal about political sentiment. In: Proceedings of the fourth international AAAI conference on weblogs and social media

  • Van der Linden S (2017) Determinants and measurement of climate change risk perception, worry, and concern. In: Nisbett M (ed) Oxford research encyclopedia of climate science. Oxford University Press, Oxford. https://doi.org/10.1093/acrefore/9780190228620.013.318

    Chapter  Google Scholar 

  • Xu BH, Kumar SA (2014) Big data analytics framework for improved decision making. In: Proceedings on the international conference on internet computing (ICOMP) (p 1). The steering committee of the world congress in computer science, computer engineering and applied computing (WorldComp)

  • Xu B, Kumar S (2015a) A text mining classification framework and its experiments using aviation datasets. In: 25th workshop of information technology and systems (WITS), Dallas, TX, 2015 SSRN

  • Xu B, Kumar SA (2015b) Big data analytics framework for system health monitoring. In: 2015 IEEE international congress on big data (bigdata congress). IEEE, pp 401–408

  • Xu J, Wang S, Su S, Kumar SA, Wu C (2016) Latent interest and topic mining on user-item bipartite networks. In: 2016 IEEE international conference on services computing (SCC). IEEE, pp 778–781

  • Yan X, Guo J, Lan Y, Cheng X (2013) A biterm topic model for short texts. In: Proceedings of the 22nd international conference on world wide web, pp 1145–1456

  • Zubiaga A, Spina D, Martínez R, Fresno V (2014) Real-time classification of Twitter trends. J Assoc Inf Sci Technol 66(3):462–473. https://doi.org/10.1002/asi.23186

    Article  Google Scholar 

Download references

Acknowledgements

This work was supported by National Science Foundation (Grant No. AGS 1560210) and South Carolina Research Foundation, University of South Carolina (USA) (Grant No. ASPIRE-I 13540-17-44820).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sathish A. P. Kumar.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Dahal, B., Kumar, S.A.P. & Li, Z. Topic modeling and sentiment analysis of global climate change tweets. Soc. Netw. Anal. Min. 9, 24 (2019). https://doi.org/10.1007/s13278-019-0568-8

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s13278-019-0568-8

Navigation