GreenScreen: A Multimodal Dataset for Detecting Corporate Greenwashing in the Wild

Sharma, Ujjwal; Rudinac, Stevan; Demmers, Joris; van Dolen, Willemijn; Worring, Marcel

doi:10.1007/978-3-031-56435-2_8

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 14565)

Included in the following conference series:

International Conference on Multimedia Modeling

Abstract

Greenwashing, a form of deceptive marketing in which organizations attempt to convince consumers that their offerings and operations are environmentally sound, can cause lasting damage to sustainability efforts by confusing consumers and eroding trust in genuine pro-sustainability actions. Nonetheless, capturing greenwashing “in the wild” remains challenging because greenwashed content frequently employs subliminal messaging through abstract semantic concepts that require subjective interpretation and must be contextualized against the parent company’s actual environmental performance. Moreover, the task typically presents itself as a weakly supervised set-relevance problem, in which greenwashing in individual media items must be detected using supervisory signals available at the company level. To open up the task of detecting greenwashing in the wild to the wider multimedia retrieval community, we present a dataset that combines large-scale text and image collections, obtained from the Twitter accounts of Fortune-1000 companies, with authoritative environmental risk scores on fine-grained issue categories like emissions, effluent discharge, resource usage, and greenhouse gas emissions. Furthermore, we offer a simple baseline method that uses state-of-the-art content encoding techniques to represent social media content and to understand the connection between content and its tendency for greenwashing.
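The weakly supervised set-relevance framing in the abstract can be illustrated with a small sketch. It is not the authors' baseline: it assumes each company is a "bag" of already-encoded post embeddings and uses a generic attention-style pooling step to turn the bag into one company-level prediction. The names attention_pool, predict_company_score, query, and regressor are hypothetical, and the toy vectors stand in for real encoder outputs.

```python
import math
from typing import List, Sequence, Tuple

Vector = Sequence[float]


def softmax(scores: Sequence[float]) -> List[float]:
    """Numerically stable softmax over a list of raw scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a: Vector, b: Vector) -> float:
    """Inner product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def attention_pool(bag: Sequence[Vector], query: Vector) -> Tuple[List[float], List[float]]:
    """Pool a bag of post embeddings into a single company-level vector.

    Each post is scored against a (hypothetical, normally learned) query
    vector; the softmax of those scores gives one weight per post.
    """
    weights = softmax([dot(post, query) for post in bag])
    dim = len(bag[0])
    pooled = [sum(w * post[i] for w, post in zip(weights, bag)) for i in range(dim)]
    return pooled, weights


def predict_company_score(
    bag: Sequence[Vector], query: Vector, regressor: Vector, bias: float = 0.0
) -> Tuple[float, List[float]]:
    """Linear prediction of a company-level score from the pooled vector.

    Only the company-level score would be supervised; the per-post weights
    are a by-product that can be inspected as a rough relevance signal.
    """
    pooled, weights = attention_pool(bag, query)
    return dot(pooled, regressor) + bias, weights


# Toy bag: three 2-dimensional "post embeddings" for one company (made-up values).
company_posts = [[1.0, 0.0], [0.0, 1.0], [0.0, 1.0]]
query = [2.0, 0.0]       # hypothetical attention parameters
regressor = [1.0, -1.0]  # hypothetical regression parameters

score, weights = predict_company_score(company_posts, query, regressor)
print("company-level score:", round(score, 3))
print("per-post weights:", [round(w, 3) for w in weights])
```

In this sketch the first post aligns with the query and therefore receives the largest weight. In a trained model, the query and regressor would be fitted so that the pooled prediction matches the company-level environmental risk score.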

Notes

1. Twitter is currently rebranding as X. However, for the remainder of this work, we will continue to refer to this platform by its former name – Twitter.
2. https://www.msci.com/our-solutions/esg-investing
3. https://www.spglobal.com/spdji/en/index-family/esg/
4. https://www.sustainalytics.com/esg-ratings
5. https://wrds-www.wharton.upenn.edu/pages/about/data-vendors/sustainalytics/

Author information

Authors and Affiliations

University of Amsterdam, Amsterdam, The Netherlands
Ujjwal Sharma, Stevan Rudinac, Joris Demmers, Willemijn van Dolen & Marcel Worring

Corresponding author

Correspondence to Ujjwal Sharma.

Editor information

Editors and Affiliations

University of Amsterdam, Amsterdam, The Netherlands
Stevan Rudinac
Delft University of Technology, Delft, The Netherlands
Alan Hanjalic
Delft University of Technology, Delft, The Netherlands
Cynthia Liem
University of Amsterdam, Amsterdam, The Netherlands
Marcel Worring
Reykjavik University, Reykjavik, Iceland
Björn Þór Jónsson
Microsoft Research Lab – Asia, Beijing, China
Bei Liu
The University of Tokyo, Tokyo, Japan
Yoko Yamakata

Cite this paper

Sharma, U., Rudinac, S., Demmers, J., van Dolen, W., Worring, M. (2024). GreenScreen: A Multimodal Dataset for Detecting Corporate Greenwashing in the Wild. In: Rudinac, S., et al. MultiMedia Modeling. MMM 2024. Lecture Notes in Computer Science, vol 14565. Springer, Cham. https://doi.org/10.1007/978-3-031-56435-2_8

DOI: https://doi.org/10.1007/978-3-031-56435-2_8
Published: 20 March 2024
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-56434-5
Online ISBN: 978-3-031-56435-2
eBook Packages: Computer Science, Computer Science (R0)
