Abstract
Social media websites can be used as a data source for mining public opinion on a variety of subjects including climate change. Twitter, in particular, allows for the evaluation of public opinion across both time and space because geotagged tweets include timestamps and geographic coordinates (latitude/longitude). In this study, a large dataset of geotagged tweets containing certain keywords relating to climate change is analyzed using volume analysis and text mining techniques such as topic modeling and sentiment analysis. Latent Dirichlet allocation was applied for topic modeling to infer the different topics of discussion, and Valence Aware Dictionary and sEntiment Reasoner was applied for sentiment analysis to determine the overall feelings and attitudes found in the dataset. These techniques are used to compare and contrast the nature of climate change discussion between different countries and over time. Sentiment analysis shows that the overall discussion is negative, especially when users are reacting to political or extreme weather events. Topic modeling shows that the different topics of discussion on climate change are diverse, but some topics are more prevalent than others. In particular, the discussion of climate change in the USA is less focused on policy-related topics than other countries.
Similar content being viewed by others
References
Alampalayam SP, Kumar A (2004) Predictive security model using data mining. In: Global telecommunications conference, 2004. GLOBECOM’04, vol 4, pp 2208–2212. IEEE
Alampalayam SK, Natsheh EF (2008) Multivariate fuzzy analysis for mobile ad hoc network threat detection. Int J Bus Data Commun Netw (IJBDCN) 4(3):1–30
Blei DM (2012) Probabilistic topic models. Commun ACM 55(4):77–84
Blei DM, Ng AY, Jordan MI (2003) Latent Dirichlet allocation. J Mach Learn Res 3:993–1022
Cody EM, Reagan AJ, Mitchell L, Dodds PS, Danforth CM (2015) Climate change sentiment on Twitter: an unsolicited public opinion poll. Plos One. https://doi.org/10.1371/journal.pone.0136092
Ghosh D, Guha R (2013) What are we “tweeting” about obesity? Mapping tweets with topic modeling and geographic information system. Cartogr Geogr Inf Sci 40(2):90–102. https://doi.org/10.1080/15230406.2013.776210
Hutto CJ, Gilbert EE (2014) VADER: a parsimonious rule-based model for sentiment analysis of social media text. In: Eighth international conference on weblogs and social media (ICWSM-14), Ann Arbor, MI
IPCC (2014) Contribution of working groups I, II and III to the fifth assessment report of the Intergovernmental Panel on Climate Change. Climate change 2014: synthesis report. IPCC, Geneva, Switzerland
Karami A, Dahl A, Turner-McGrievy G, Kharrazi H, Shaw G (2018) Characterizing diabetes, diet, exercise, and obesity comments on Twitter. Int J Inf Manag 38(01):1–6
Kirilenko AP, Stepchenkova SO (2014) Public microblogging on climate change: one year of Twitter worldwide. Glob Environ Change 26:171–182. https://doi.org/10.1016/j.gloenvcha.2014.02.008
Koto F, Adriani M (2015) A comparative study on Twitter sentiment analysis: which features are good? In: Natural language processing and information systems. NLDB 2015. Lecture notes in computer science, pp 453–457. https://doi.org/10.1007/978-3-319-19581-0_46
Lansley G, Longley PA (2016) The geography of Twitter topics in London. Comput Environ Urban Syst 58:85–96. https://doi.org/10.1016/j.compenvurbsys.2016.04.002
Leas EC, Althouse BM, Dredze M, Obradovich N, Fowler JH et al (2016) Big data sensors of organic advocacy: the case of Leonardo DiCaprio and climate change. PLoS ONE 11(8):e0159885
Lee GM, Qui L, Whinston AB (2016) A friend like me: modeling network formation in a location-based social network. J Manag Inf Syst 33(4):1008–1033
Mandel B, Culotta A, Boulahanis J, Stark D, Lewis B, Rodrigue J (2012) A demographic analysis of online sentiment during Hurricane Irene. In: Proceedings of the second workshop on language in social media
Manning C, Raghavan R, Schütze H (2008) Introduction to information retrieval. Retrieved from https://nlp.stanford.edu/IR-book/information-retrieval-book.html
Medhat W, Hassan A, Korashy H (2014) Sentiment analysis algorithms and applications: a survey. Ain Shams Eng J. 5(4):1093–1113
Mehrotra R, Sanner S, Buntine W, Xie L (2013) Improving LDA topic models for microblogs via tweet pooling and automatic labeling. In: Proceedings of the 36th international ACM SIGIR conference on research and development in information retrieval-SIGIR 13. https://doi.org/10.1145/2484028.2484166
Omnicore Agency (2018) Twitter by the numbers (2018): stats, demographics & fun facts. Retrieved from https://www.omnicoreagency.com/twitter-statistics/
Perrin A (2015) Social network usage: 2005–2015. Pew Research Center. Retrieved from http://www.pewinternet.org/2015/10/08/2015/Social-Networking-Usage-2005-2015/
Raj P, Kumar SA (2017) Big data analytics processes and platforms facilitating smart cities. Smart cities: foundations, principles, and applications. Wiley, pp 23–52
Rehurek R, Sojka P (2010) Software framework for topic modeling with large corpora. In: Proceedings of the LREC 2010 workshop on new challenges for NLP frameworks, pp 45–50
Röder M, Both A, Hinneburg A (2015) Exploring the space of topic coherence measures. In: Proceedings of ACM international conference on web search and data mining
Saad L (2017) Global warming concern at three-decade high in U.S. Gallup. Retrieved from https://news.gallup.com/poll/206030/global-warming-concern-three-decade-high.aspx
Shi Z, Lee GM, Whinston AB (2016) Toward a better measure of business proximity: topic modeling for industry intelligence. MIS Q 40(4):1035–1056
Steinskog A, Therkelsen J, Gambäck B (2017) Twitter topic modeling by tweet aggregation. In: Proceedings of the 21st nordic conference on computational linguistics, pp 77–86
Thampi A (2016) Reverse geocoder [computer software]. Retrieved from https://github.com/thampiman/reverse-geocoder
Tumasjan A, Sprenger TO, Sandner PG, Welpe IM (2010) Predicting elections with Twitter: what 140 characters reveal about political sentiment. In: Proceedings of the fourth international AAAI conference on weblogs and social media
Van der Linden S (2017) Determinants and measurement of climate change risk perception, worry, and concern. In: Nisbett M (ed) Oxford research encyclopedia of climate science. Oxford University Press, Oxford. https://doi.org/10.1093/acrefore/9780190228620.013.318
Xu BH, Kumar SA (2014) Big data analytics framework for improved decision making. In: Proceedings on the international conference on internet computing (ICOMP) (p 1). The steering committee of the world congress in computer science, computer engineering and applied computing (WorldComp)
Xu B, Kumar S (2015a) A text mining classification framework and its experiments using aviation datasets. In: 25th workshop of information technology and systems (WITS), Dallas, TX, 2015 SSRN
Xu B, Kumar SA (2015b) Big data analytics framework for system health monitoring. In: 2015 IEEE international congress on big data (bigdata congress). IEEE, pp 401–408
Xu J, Wang S, Su S, Kumar SA, Wu C (2016) Latent interest and topic mining on user-item bipartite networks. In: 2016 IEEE international conference on services computing (SCC). IEEE, pp 778–781
Yan X, Guo J, Lan Y, Cheng X (2013) A biterm topic model for short texts. In: Proceedings of the 22nd international conference on world wide web, pp 1145–1456
Zubiaga A, Spina D, Martínez R, Fresno V (2014) Real-time classification of Twitter trends. J Assoc Inf Sci Technol 66(3):462–473. https://doi.org/10.1002/asi.23186
Acknowledgements
This work was supported by National Science Foundation (Grant No. AGS 1560210) and South Carolina Research Foundation, University of South Carolina (USA) (Grant No. ASPIRE-I 13540-17-44820).
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Dahal, B., Kumar, S.A.P. & Li, Z. Topic modeling and sentiment analysis of global climate change tweets. Soc. Netw. Anal. Min. 9, 24 (2019). https://doi.org/10.1007/s13278-019-0568-8
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s13278-019-0568-8