Skip to main content

GreenScreen: A Multimodal Dataset for Detecting Corporate Greenwashing in the Wild

  • Conference paper
  • First Online:
MultiMedia Modeling (MMM 2024)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 14565))

Included in the following conference series:

  • 667 Accesses

Abstract

Greenwashing, a form of deceptive marketing where organizations attempt to convince consumers that their offerings and operations are environmentally sound, can cause lasting damage to sustainability efforts by confusing consumers and eroding trust in genuine pro-sustainability actions. Nonetheless, capturing greenwashing “in the wild” remains challenging because greenwashed content frequently employs subliminal messaging through abstract semantic concepts that require subjective interpretation and contextualization within the context of the parent company’s actual environmental performance. Moreover, this task typically presents itself as a weakly-supervised set-relevance problem, where the detection of greenwashing in individual media relies on utilizing supervisory signals available at the company level. To open up the task of detecting greenwashing in the wild to the wider multimedia retrieval community, we present a dataset that combines large-scale text and image collections, obtained from Twitter accounts for Fortune-1000 companies, with authoritative environmental risk scores on fine-grained issue categories like emissions, effluent discharge, resource usage, and greenhouse gas emissions. Furthermore, we offer a simple baseline method that uses state-of-the-art content encoding techniques to represent social media content and to understand the connection between content and its tendency for greenwashing.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

Notes

  1. 1.

    Twitter is currently rebranding as X. However, for the remainder of this work, we will continue to refer to this platform by its former name – Twitter.

  2. 2.

    https://www.msci.com/our-solutions/esg-investing.

  3. 3.

    https://www.spglobal.com/spdji/en/index-family/esg/.

  4. 4.

    https://www.sustainalytics.com/esg-ratings.

  5. 5.

    https://wrds-www.wharton.upenn.edu/pages/about/data-vendors/sustainalytics/.

References

  1. Bonneuil, C., Choquet, P.L., Franta, B.: Early warnings and emerging accountability: total’s responses to global warming, 1971–2021. Glob. Environ. Chang. 71, 102386 (2021)

    Article  Google Scholar 

  2. Broadstock, D.C., Chan, K., Cheng, L.T., Wang, X.: The role of ESG performance during times of financial crisis: evidence from COVID-19 in china. Financ. Res. Lett. 38, 101716 (2021)

    Article  Google Scholar 

  3. Cai, H., Yang, Y., Li, X., Huang, Z.: What are popular: exploring twitter features for event detection, tracking and visualization. In: Proceedings of the 23rd ACM International Conference on Multimedia, pp. 89–98. MM 2015, Association for Computing Machinery, New York (2015)

    Google Scholar 

  4. Carbonneau, M.A., Cheplygina, V., Granger, E., Gagnon, G.: Multiple instance learning: a survey of problem characteristics and applications. Pattern Recogn. 77, 329–353 (2018)

    Article  Google Scholar 

  5. Cherti, M., et al.: Reproducible scaling laws for contrastive language-image learning. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 2818–2829 (2023)

    Google Scholar 

  6. Deng, X., Cheng, X.: Can ESG indices improve the enterprises’ stock market performance?-an empirical study from China. Sustainability 11(17), 4765 (2019)

    Article  Google Scholar 

  7. Ding, K., Wang, R., Wang, S.: Social media popularity prediction: a multiple feature fusion approach with deep neural networks. In: Proceedings of the 27th ACM International Conference on Multimedia, pp. 2682–2686 (2019)

    Google Scholar 

  8. Dunlap, R.E., McCright, A.M., et al.: Organized climate change denial. Oxford Handb. Clim. Change Soc. 1, 144–160 (2011)

    Google Scholar 

  9. Gillan, S.L., Koch, A., Starks, L.T.: Firms and social responsibility: a review of ESG and CSR research in corporate finance. J. Corp. Finan. 66, 101889 (2021)

    Article  Google Scholar 

  10. Halbritter, G., Dorfleitner, G.: The wages of social responsibility - where are they? A critical review of ESG investing. Rev. Financ. Econ. 26, 25–35 (2015)

    Article  Google Scholar 

  11. Harman, D.: Overview of the first TREC conference. In: Proceedings of the 16th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 36–47 (1993)

    Google Scholar 

  12. Ilharco, G., et al.: Openclip (2021)

    Google Scholar 

  13. Ilse, M., Tomczak, J., Welling, M.: Attention-based deep multiple instance learning. In: International Conference on Machine Learning, pp. 2127–2136. PMLR (2018)

    Google Scholar 

  14. Kumar, A., Garg, G.: Sentiment analysis of multimodal twitter data. Multimedia Tools Appl. 78, 24103–24119 (2019)

    Article  Google Scholar 

  15. Larson, M., et al.: Automatic tagging and geotagging in video collections and communities. In: Proceedings of the 1st ACM International Conference on Multimedia Retrieval. ICMR 2011, Association for Computing Machinery, New York (2011)

    Google Scholar 

  16. Ma, Y., Yang, X., Liao, L., Cao, Y., Chua, T.S.: Who, where, and what to wear? Extracting fashion knowledge from social media. In: Proceedings of the 27th ACM International Conference on Multimedia, pp. 257–265. MM 2019, Association for Computing Machinery, New York (2019)

    Google Scholar 

  17. Nguyen, D.Q., Vu, T., Nguyen, A.T.: BERTweet: a pre-trained language model for English tweets. In: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, pp. 9–14 (2020)

    Google Scholar 

  18. Radford, A., et al.: Learning transferable visual models from natural language supervision. In: International Conference on Machine Learning, pp. 8748–8763. PMLR (2021)

    Google Scholar 

  19. Reimers, N., Gurevych, I.: Sentence-BERT: sentence embeddings using siamese BERT-networks. In: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics (2019). https://arxiv.org/abs/1908.10084

  20. Schuhmann, C., et al.: LAION-5b: an open large-scale dataset for training next generation image-text models. In: Thirty-sixth Conference on Neural Information Processing Systems Datasets and Benchmarks Track (2022). https://openreview.net/forum?id=M3Y74vmsMcY

  21. Smeaton, A.F., Over, P., Kraaij, W.: Evaluation campaigns and TRECVid. In: Proceedings of the 8th ACM International Workshop on Multimedia Information Retrieval, pp. 321–330 (2006)

    Google Scholar 

  22. Summers, E., et al.: Docnow/twarc: v2.13.0 (2022). https://doi.org/10.5281/zenodo.7484102

  23. Supran, G., Oreskes, N.: Assessing ExxonMobil’s climate change communications (1977–2014). Environ. Res. Lett. 12(8), 084019 (2017)

    Article  Google Scholar 

  24. Tian, L., Zhang, X., Wang, Y., Liu, H.: Early detection of rumours on twitter via stance transfer learning. In: Jose, J.M., et al. (eds.) ECIR 2020. LNCS, vol. 12035, pp. 575–588. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-45439-5_38

    Chapter  Google Scholar 

  25. Wong, J.B., Zhang, Q.: Stock market reactions to adverse ESG disclosure via media channels. Br. Account. Rev. 54(1), 101045 (2022)

    Article  Google Scholar 

  26. Zhai, X., Kolesnikov, A., Houlsby, N., Beyer, L.: Scaling vision transformers. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 12104–12113 (2022)

    Google Scholar 

  27. Zhai, X., et al.: Lit: zero-shot transfer with locked-image text tuning. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 18123–18133 (2022)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ujjwal Sharma .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Sharma, U., Rudinac, S., Demmers, J., van Dolen, W., Worring, M. (2024). GreenScreen: A Multimodal Dataset for Detecting Corporate Greenwashing in the Wild. In: Rudinac, S., et al. MultiMedia Modeling. MMM 2024. Lecture Notes in Computer Science, vol 14565. Springer, Cham. https://doi.org/10.1007/978-3-031-56435-2_8

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-56435-2_8

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-56434-5

  • Online ISBN: 978-3-031-56435-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics