Skip to main content
Log in

ESG score prediction through random forest algorithm

  • Original Paper
  • Published:
Computational Management Science Aims and scope Submit manuscript

Abstract

Environment-related risks affect assets in various sectors of the global economy, as well as social and governance aspects, giving birth to what is known as ESG investments. Sustainable and responsible finance has become a major aim for asset managers who are regularly dealing with the measurement and management of ESG risks. To this purpose, Financial Institutions and Rating Agencies have created an ESG score aimed to provide disclosure on the environment, social, and governance (corporate social responsibilities) metrics. CSR/ESG ratings are becoming quite popular even if highly questioned in terms of reliability. Asset managers do not always believe that markets consistently and correctly price climate risks into company valuations, in these cases ESG ratings, when available, provide an important tool in the company’s fundraising process or on the shares’ return. Assuming we can choose a reliable set of CSR/ESG ratings, we aim to assess how structural data- balance sheet items- may affect ESG scores assigned to regularly traded stocks. Using a Random Forest algorithm, we investigate how structural data affect the Thomson Reuters Refinitiv ESG scores for the companies which constitute the STOXX 600 Index. We find that balance sheet data provide a crucial element to explain ESG scores.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14

Similar content being viewed by others

Notes

  1. In science, computing, and engineering, a black box is a device, system, or object which can be viewed in terms of its inputs and outputs, without any knowledge of its internal workings. Its implementation is opaque or “black”. Source: Investopedia.

  2. For the target variables E.Score, S.Score and G.Score, the algorithm’s parameters are set respectively as follows: mtry = 9, 9, 10 and nodesize = 1, 1, 1.

References

  • Aas K, Jullum M, Løland A (2019) Explaining individual predictions when features are dependent: more accurate approximations to shapley values. arXiv preprint arXiv:1903.10464

  • Adler P, Falk C, Friedler SA, Nix T, Rybeck G, Scheidegger C, Smith B, Venkatasubramanian S (2018) Auditing black-box models for indirect influence. Knowl Inf Syst 54(1):95–122

    Article  Google Scholar 

  • Alberg, J., Lipton, Z. (2017). Improving factor-based quantitative investing by forecasting company fundamentals. Working paper, Cornell University. Available at https://arxiv.org/abs/1711.04837

  • Angel J, Rivoli P (1997) Does ethical investing impose a cost upon the firm? a theoretical perspective. J Invest 6:57–61

    Article  Google Scholar 

  • Anderson TW (1984) An introduction to multivariate statistical analysis. Wiley, New York

    Google Scholar 

  • Arribas I, Espinós-Vañó MD, García F, Oliver J (2019) Defining socially responsible companies according to retail investors preferences. Entrep Sustain Issues 7(2):1641–1653

    Google Scholar 

  • Arribas I, Espinós-Vañó MD, Garcîa F, Morales-Bañuelos PB (2019) The inclusion of socially irresponsible companies in sustainable stock indices. Sustain 11:2047

    Article  Google Scholar 

  • Aydin AD, Caliskan Cavdar S (2015) Two different points of view through artificial intelligence and vector autoregressive models for Ex Post and Ex ante forecasting. Comput Intell (Neuroscience)

  • Batres-Estrada, G. (2015). Deep Learning for Multivariate Financial Time Series. Master’s thesis, KTH Royal Institute of Technology. https://www.math.kth.se/matstat/seminarier/ reports/M-exjobb15/150612a.pdf

  • Bhagat, S. Bolton, B. Corporate governance and firm performance. J Corp Financ. 14: 257–273

  • Bauer R, Koedijk K, Otten R (2005) International evidence on ethical mutual fund performance and investment style. J Bank Financ 29(7):1751–1767

    Article  Google Scholar 

  • Belghitar Y, Clark E, Deshmukh N (2014) Does it pay to be ethical? Evidence from the FTSE4Good. J Bank Financ 47:54–62

    Article  Google Scholar 

  • Bello ZY (2005) Socially responsible investing and portfolio diversification. J Financ Res 28(1):41–57

    Article  Google Scholar 

  • Benabou R, Tirole J (2010) Individual and corporate social responsibility. Economica. https://doi.org/10.1111/j.1468-0335.2009.00843.x

    Article  Google Scholar 

  • Berg F, Koelbel JF, Rigobon R (2019) Aggregate Confusion: The Divergence of ESG Ratings. MIT Sloan Research Paper No. 5822–19

  • Bernard S, Adam S, Heutte L (2012) Dynamic Random Forests. Pattern Recognit Lett 33(12)

  • Biau G, Scornet E (2016) A random forest guided tour. TEST 25:197–227

  • Boiral O, Paillé P (2012) Organizational citizenship behaviour for the environment: measurement and validation. J Bus Ethics 109(4):431–445

    Article  Google Scholar 

  • Bontempi G, Ben Taieb S, Le Borgne YA, (2013). Machine Learning Strategies for Time Series Forecasting. In: Aufaure MA., Zimányi E. (eds) Business Intelligence. eBISS, (2012) Lecture notes in business information processing, vol 138. Springer, Berlin, Heidelberg

  • Bowen HR (1953) Social responsibilities of the businessman. Ethics and economics of society, New York, Harper

    Google Scholar 

  • Breiman L, Friedman J et al (1984) Classification and regression trees

  • Breiman L (1996) Bagging predictors. Mach Learn 24(2):123–140

    Google Scholar 

  • Breiman L (1999) Using adaptive bagging to debias regressions. Technical Report 547, Statistics Dept. UCB

  • Breiman L (2001) Random forests. Mach Learn 45(1):5–32

    Article  Google Scholar 

  • Brownlee J (2020) Introduction to Time Series Forecasting With Python, Mach Learn Mastery

  • Carannante M, D’Amato V (2021) Weather Index Insurance by satellite data: the basis risk measurement, presented at Energy Finance 2021

  • Cerin P, Dobers P (2001) What does the performance of the Dow Jones sustainability group index tell us? Eco-management and auditing. J Corp Environ Manag 8(3):123–133

    Google Scholar 

  • Chatterji AK, Durand R, Levine DI, Touboul S (2016) Do ratings of firms converge? Implications for managers, investors and strategy researchers. Strategic Manag J 37(8):1597–1614

    Article  Google Scholar 

  • Chava, S., Environmental Externalities and Cost of Capital (June 15, 2011). Available at SSRN: https://ssrn.com/abstract=1677653 or http://dx.doi.org/10.2139/ssrn.1677653

  • Cohen B, Winn MI (2007) Market imperfections, opportunity and sustainable entrepreneurship. J Bus Ventur 22(1):29–49

    Article  Google Scholar 

  • Cremers M, Nair VN (2005) Governance mechanism and equity prices. J Financ 60(6):2859–2894

    Article  Google Scholar 

  • Cui Y, Geobey S, Weber O, Lin H (2018) The impact of green lending on credit risk in China. Sustain 10(6)

  • Derwall J, Guenster N, Bauer R, Koedijk K (2005) The eco-efficiency premium puzzle. Financ Anal J 61(2):51–63

    Article  Google Scholar 

  • Sustainable Investing: Establishing Long term Value and Performance. 2012

  • Dietterich TG (2002) Machine Learning for Sequential Data: A Review, : Caelli T., Amin A., Duin R.P.W., de Ridder D., Kamel M. (eds) Structural, Syntactic, and Statistical Pattern Recognition. SSPR, Lecture Notes in Computer Science, vol 2396. Springer, Berlin, Heidelberg

  • Drempetic S, Klein C, Zwergel B (2020) The influence of firm size on the ESG Score: corporate sustainability ratings under review. J Bus Ethics 167:333–360

    Article  Google Scholar 

  • EBA (2020). Discussion paper on management and supervision of ESG risks for credit institutions and investment firms. Consultation Paper (EBA/DP/2020/03)

  • EBA (2020). Risk Assessment of the European Banking System. Available at https://www.eba.europa.eu/sites/default/documents/files/document_library/Risk%20Analysis%20and%20Data/Risk%20Assessment %20Reports/2020/December%202020/961060/Risk%20Assessment_Report_December_2020.pdf

  • EBF (2021). EBF Response to the Discussion Paper on management and supervision of ESG risks for credit institutions and investment firms

  • European Investment Bank (2018). Sustainability Reporting Disclosures In accordance with the GRI Standards. https://www.eib.org/attachments/documents/gri_standards_2018_en.pdf. [Online; accessed 30-05-2018]

  • Friede G, Busch T, Bassen A (2015) ESG and financial performance: aggregated evidence from more than 2000 empirical studies. J Sustain Financ Invest 5(4):210–233

    Article  Google Scholar 

  • Garcia F, González-Bueno J, Guijarro F, Oliver J (2020) Forecasting the environmental, social, and governance rating of firms by using corporate financial performance variables: a rough set approach. Sustain 12:3324

    Article  Google Scholar 

  • Gu S, Kelly B, Xiu D (2020) Empirical asset pricing via machine learning. Rev Financ Stud 33:2223–2273

    Article  Google Scholar 

  • Gunning D, Stefik M, Choi J, Miller T, Stumpf S, Yang GZ (2019) XAI-Explainable artificial intelligence, Sci Robot

  • Hachenberg B, Schiereck D (2018) Are green bonds priced differently from conventional bonds? J Asset Manag 19(6):371–383

    Article  Google Scholar 

  • Hacker P, Krestel R, Grundmann S, Naumann F (2020) Explainable AI under contract and tort law: legal incentives and technical challenges. Artif Intell Law 28:415–439

    Article  Google Scholar 

  • Hamilton S, Jo H, Statman M (1993) Doing well while doing good? The investment performance of socially responsible mutual funds. Financ Anal J 49(6):62–66

    Article  Google Scholar 

  • Hartzmark SM, Sussman AB (2019) Do investors value sustainability? A natural experiment examining ranking and fund flows. J Financ 74(6):2789–2837

    Article  Google Scholar 

  • Harvard Law School Forum on Corporate Governance (2017). ESG Reports and Ratings: What They Are, Why They Matter. https://corpgov.law.harvard.edu/2017/07/27/ esg-reports-and-ratings-what-they-are-why-they-matter/. [Online; accessed 27-07-2017]

  • James G, Witten D, Hastie T, Tibshirani R (2017) An Introduction to Statistical Learning: with Applications in R. Springer Texts in Statistics. ISBN 10: 1461471370

  • Jewell J, Livingston M (1998) Split ratings, bond yields, and underwriter spreads. J Financ Res 21(2):185–204

    Article  Google Scholar 

  • Joliet R, Titova Y (2018) Equity SRI funds vacillate between ethics and money: An analysis of the funds stock holding decisions. J Bank Financ 97:70–86

    Article  Google Scholar 

  • Kreander N, Gray R, Power D, Sinclair C (2005) Evaluating the performance of ethical and non-ethical funds: a matched pair analysis. J Bus Financ Account 32(7/8):1465–1493

    Article  Google Scholar 

  • Kumar E, Venkatasubramanian S, Scheidegger C, Friedler S (2020) Problems with Shapley-value-based explanations as feature importance measures. Computer Science-Artif Intell arXiv:2002.11097

  • In SY, Rook D, Monk A (2019) Integrating alternative data (Also Known as ESG Data) in investment decision making. Global Econ Rev 48(3):237–260

    Article  Google Scholar 

  • Lansilahti S (2012) Market reactions to Environmental, Social, and Governance (ESG)-news: Evidence from European Markets, WP AAlto University

  • Liaw A (2018) Package randomforest. Available on line at https://cran.r-project.org/ web/packages/randomForest/randomForest.pdf

  • Limkriangkrai M, Koh S, Durand RB (2017) Environmental, social, and governance (ESG) profiles, stock returns, and financial policy: Australian evidence. Int Rev Financ 17(3):461–471

    Article  Google Scholar 

  • Lin WL, Law SH, Ho JA, Sambasivan M (2019) The causality direction of the corporate social responsibility-Corporate financial performance Nexus: Application of Panel Vector Autoregression approach. North Am J Econ Financ 48:401–418

    Article  Google Scholar 

  • Lins KV, Servaes H, Tamayo A (2017) Social capital, trust, and firm performance: The value of corporate social responsibility during the financial crisis. J Financ 72(4):1785–1824

    Article  Google Scholar 

  • Loh WY (2011) Classification and regression trees. Data Mining and Knowledge Discovery, Wiley Interdisciplinary Reviews

  • Mahjoub LB, Khamoussi H (2012) Environmental and Social Policy and Earning Persistence. Bus Strategy Environ, 22(3). https://doi.org/10.1002/bse.1739

  • Mahler D, Barker J, Belsand L, Schulz O (2009) Green winners: the performance of sustainability focused companies during the financial crisis. Technical report, A.T., Kearney

  • Makni R, Francoeur C, Bellavance F (2008) Causality between corporate social performance and financial performance: Evidence from Canadian firms. J Bus Ethics 89(3):409

    Article  Google Scholar 

  • Melas N, Padmakar K (2018) Factor investing and ESG integration. Expert Syst Appl. https://doi.org/10.1016/B978-1-78548-201-4.50015-5

    Article  Google Scholar 

  • Merrick L, Taly A (2019) The explanation game: Explaining machine learning models with cooperative game theory. arXiv preprint arXiv:1909.08128

  • Monk A, Prins M, Rook D (2019) Rethinking alternative data in institutional investment. J Financ Data Sci 1(1):14–31

    Article  Google Scholar 

  • Moritz B, and Zimmermann T (2016) Tree-based conditional portfolio sorts: The relation between past and future stock returns. Working Paper, Ludwig Maximilian University of Munich

  • Nakao Y, Amano A, Matsumura K, Genba K, Nakano M (2007) Relationship between environmental performance and financial performance: an empirical analysis of Japanese corporations. Bus Strategy Environ 16:106–118

    Article  Google Scholar 

  • Nami S, Shajari M (2018) Cost-sensitive payment card fraud detection based on dynamic random forest and k-nearest neighbors, Expert Syst Appl, 110: 381–392. ISSN 0957–4174

  • Oshiro M., Santoro Perez P., 2012, Baranauskas J. A., How Many Trees in a Random Forest?, Machine Learning and Data Mining in Pattern Recognition, Springer

  • Oztemel E (2003) Yapay Sinir Aglari (Artificial Neural Networks). Papatya Publisher, Istanbul

    Google Scholar 

  • Petitjean M (2019) Eco-friendly policies and financial performance: Was the financial crisis a game changer for large us companies? Energ Econ 80:502–511

    Article  Google Scholar 

  • Principles for Responsible Investment (2016). A practical guide to ESG integration for equity investing. United Nations. Available on line at: https://www.unpri.org/listed-equity/a-practical-guide-to-esg-integration-for-equity-investing/10.article

  • Quinlan JR (1986) Induction of decision trees. Mach Learn 1:81–106

    Google Scholar 

  • Rai A (2020) Explainable AI: from black box to glass box. J Acad Mark Sci 48:137–141

    Article  Google Scholar 

  • Renneboog L, Ter Horst J, Zhang C (2008) Socially responsible investments: institutional aspects, performance, and investor behavior. J Bank Financ 32(9):1723–1742

    Article  Google Scholar 

  • Searcy C, Elkhawas D (2012) Corporate sustainability ratings: an investigation into how corporations use the Dow Jones Sustainability Index. J Clean Prod 35:79–92

    Article  Google Scholar 

  • Schaltegger S, Figge F (2000) Environmental shareholder value: economic success with corporate environmental management. Eco-Manag Audit: J Corp Environ Manag 7(1):29–42

    Article  Google Scholar 

  • Shapley LS. A value for n-person games. Contrib. Theory Games 2(28), 307-317

  • Simpson WG, Kohers T (2002) The link between corporate social and financial performance: Evidence from the banking industry. J Bus Ethics 35:97–109

    Article  Google Scholar 

  • Statman M (2000) Socially responsible mutual funds. Financ Anal J 56(3):30–39

    Article  Google Scholar 

  • Takeuchi L, Lee YYA (2013) Applying Deep Learning to Enhance Momentum Trading Strategies in Stocks. Working paper, Stanford University. http://cs229.stanford.edu/proj2013/TakeuchiLee-ApplyingDeepLearningToEnhanceMomentumTradingStrategiesInStocks.pdf

  • Thomson Reuters (2020). Environmental, social and governance (ESG) scores. Available from Refinitiv at: Refinitiv. https://www.refinitiv.com/content/dam/marketing/en_us/documents/methodology/refinitiv-esg-scores-methodology.pdf

  • Thomson Reuters (2020).TRBC Sector Classification. Available from Refinitiv at: https://www.refinitiv.com/content/dam/marketing/en_us/documents/quick-reference-guides/trbc-business-classification-quick-guide.pdf

  • Trucost and Mercer (2010). Carbon Counts: Assessing the Carbon Exposure of Canadian Institutional Investment Portfolios. Posted: Sept 1, 2010

  • Tu JV (1996) Advantages and disadvantages of using artificial neural networks versus logistic regression for predicting medical outcomes. J Clin Epidemiol 49(11):1225–1231

    Article  Google Scholar 

  • Utz S, Wimmer M (2014) Are they any good at all? A financial and ethical analysis of socially responsible mutual funds. J Asset Manag 15(1):72–82

    Article  Google Scholar 

  • Van de Velde E, Vermeir W, Corten F (2005) Corporate social responsibility and financial performance. Corp Gov 5:129–138

    Article  Google Scholar 

  • Wang S, and Luo Y (2012) Signal Processing: The Rise of the Machines. Deutsche Bank Quantitative Strategy (5 June)

  • Wang S, Luo Y (2014) Signal Processing: The Rise of the Machines III. Deutsche Bank Quantitative Strategy

  • WBCSD (2019) ESG Disclosure Handbook. World Business Council for Sustainable Development, Available on line at (https://docs.wbcsd.org/2019/04/ESG_Disclosure_Handbook.pdf)

  • Weber O (2013) Measuring the Impact of Socially Responsible Investing. Working paper. Available at SSRN: https ://ssrn.com/abstract=2217784 or http ://dx.doi.org/10.2139/ssrn.2217784

  • Weber O (2017) Corporate sustainability and financial performance of Chinese banks. Sustain Acc, Manag Policy J, 8(3): 358-385. https://doi.org/10.1108/SAMPJ-09-2016-0066

  • Weber O, Koellner T, Habegger D, Steffensen H, Ohnemus P (2008) The relation between the GRI indicators and the financial performance of firms. Progress Ind Ecol, Int J 5(3):236–254

    Article  Google Scholar 

  • Weber O, Mansfeld M, Schirrmann E (2010) The Financial Performance of SRI Funds Between 2002 and 2009 (June 25, 2010). Available at SSRN: https://ssrn.com/abstract=1630502 or http://dx.doi.org/10.2139/ssrn.1630502

  • Windolph SE (2011) Assessing corporate sustainability through ratings: challenges and their causes. J Environ Sustain 1(1):61–80

    Article  Google Scholar 

  • Winn M, Pinkse J, Illge L (2012) Case studies on trade-offs in corporate sustainability. Corp Soc Responsib Environ MGMT 19:63–68. https://doi.org/10.1002/csr.293

    Article  Google Scholar 

  • Zerbib OD (2019) The effect of pro-environmental preferences on bond prices: Evidence from green bonds. J Bank Financ 98:39–60

    Article  Google Scholar 

  • Xie J, Nozawa W, Yagi M, Fujii H, Managi S (2019) Do environmental, social, and governance activities improve corporate financial performance? Bus Stratergy Environ 28:286–300

    Article  Google Scholar 

  • Xu X, Chen W (2017) Implementation and Performance Optimization of Dynamic Random Forest, International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery (CyberC), Nanjing, China, 2017: 283-289

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Valeria D’Amato.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

D’Amato, V., D’Ecclesia, R. & Levantesi, S. ESG score prediction through random forest algorithm. Comput Manag Sci 19, 347–373 (2022). https://doi.org/10.1007/s10287-021-00419-3

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10287-021-00419-3

Keywords

JEL Classifications

Navigation