skip to main content
10.1145/3589334.3648147acmconferencesArticle/Chapter ViewAbstractPublication PagesthewebconfConference Proceedingsconference-collections
research-article

Unveiling Climate Drivers via Feature Importance Shift Analysis in New Zealand

Published: 13 May 2024 Publication History

Abstract

In the face of rising surface temperatures from climate change, impacting biodiversity, extreme weather events, and agricultural productivity, understanding the drivers behind temperature changes is imperative. Traditional global climate models (GCMs) are computationally expensive, limiting their applicability, while machine learning approaches, though promising, face interpretability challenges due to their "black box" nature, especially in a dynamic setting where the data is constantly evolving. We propose DUO, a framework to identify shifts in important features and feature combinations as the data distribution changes over time. Our model independently assesses the importance of features and their interactions while also evaluating their relevance when combined with additional features, contributing to the target class. As a case study, we apply DUO to assess the shifts in climate drivers for station-level temperatures in six locations across New Zealand from 1980 to 2020, we identify specific humidity, geopotential height, and air temperature at high atmospheric pressure levels as the most important features for describing temperature variability. By revealing how climate drivers change over time, DUO contributes to a deeper understanding of temperature change patterns, enabling practitioners to develop targeted and adaptive mitigation strategies.

Supplemental Material

MKV File
Supplemental video
MP4 File
Video presentation

References

[1]
Kashif Abbass, Muhammad Zeeshan Qasim, Huaming Song, Muntasir Murshed, Haider Mahmood, and Ijaz Younis. 2022. A review of the global climate change impacts, adaptation, and sustainable mitigation measures. Environmental Science and Pollution Research, Vol. 29, 28 (2022), 42539--42559.
[2]
Rakesh Agrawal, Tomasz Imielinski, and Arun Swami. 1993. Database mining: A performance perspective. IEEE Transactions on Knowledge and Data Engineering, Vol. 5, 6 (1993), 914--925.
[3]
Rakesh Agrawal, Ramakrishnan Srikant, et al. 1994. Fast algorithms for mining association rules. In Proceedings of the 20th International Conference on Very Large Data Bases, Vol. 1215. Santiago, Chile, 487--499.
[4]
Abdallah Alabdallah, Sepideh Pashami, Thorsteinn Rögnvaldsson, and Mattias Ohlsson. 2022. SurvSHAP: A Proxy-Based Algorithm for Explaining Survival Models with SHAP. In 2022 IEEE 9th International Conference on Data Science and Advanced Analytics (DSAA). IEEE, 1--10.
[5]
Albert Bifet and Ricard Gavalda. 2007. Learning from time-changing data with adaptive windowing. In Proceedings of the 2007 SIAM International Conference on Data Mining. SIAM, 443--448.
[6]
Bogdan Bochenek and Zbigniew Ustrnul. 2022. Machine learning in weather prediction and climate analyses-applications and perspectives. Atmosphere, Vol. 13, 2 (2022), 180.
[7]
Leo Breiman. 2001. Random forests. Machine learning, Vol. 45 (2001), 5--32.
[8]
Fausto G da Costa, Felipe SLG Duarte, Rosane MM Vallim, and Rodrigo F de Mello. 2017. Multidimensional surrogate stability to detect data stream concept drift. Expert Systems with Applications, Vol. 87 (2017), 15--29.
[9]
Fengqian Ding and Chao Luo. 2019. The entropy-based time domain feature extraction for online concept drift detection. Entropy, Vol. 21, 12 (2019), 1187.
[10]
Ministry for the Environment. 2018. Climate Change Projections for New Zealand: Atmosphere Projections Based on Simulations from the IPCC Fifth Assessment, 2nd Edition. Ministry for the Environment, Wellington.
[11]
David J Frame, Suzanne M Rosier, Ilan Noy, Luke J Harrington, Trevor Carey-Smith, Sarah N Sparrow, Dáith'i A Stone, and Samuel M Dean. 2020. Climate change attribution and the economic costs of extreme weather events: a study on damages from extreme rainfall and drought. Climatic Change, Vol. 162 (2020), 781--797.
[12]
Jerome H Friedman. 1991. Multivariate adaptive regression splines. The Annals of Statistics, Vol. 19, 1 (1991), 1--67.
[13]
Anuroop Gaddam, Me Al-Hrooby, and WF Esmael. 2020. Designing a wireless sensors network for monitoring and predicting droughts. International Journal on Smart Sensing and Intelligent Systems, Vol. 7, 5 (2020), 1--6.
[14]
Krishna Gade, Sahin Geyik, Krishnaram Kenthapadi, Varun Mithal, and Ankur Taly. 2020. Explainable AI in industry: Practical challenges and lessons learned. In Companion Proceedings of the Web Conference 2020. 303--304.
[15]
Joao Gama, Pedro Medas, Gladys Castillo, and Pedro Rodrigues. 2004. Learning with drift detection. In Advances in Artificial Intelligence - SBIA. Springer, 286--295.
[16]
Peter B Gibson, William E Chapman, Alphan Altinok, Luca Delle Monache, Michael J DeFlorio, and Duane E Waliser. 2021. Training machine learning models on climate model output yields skillful interpretable seasonal precipitation forecasts. Communications Earth & Environment, Vol. 2, 1 (2021), 159.
[17]
Leilani H Gilpin, David Bau, Ben Z Yuan, Ayesha Bajwa, Michael Specter, and Lalana Kagal. 2018. Explaining explanations: An overview of interpretability of machine learning. In 2018 IEEE 5th International Conference on Data Science and Advanced Analytics (DSAA). IEEE, 80--89.
[18]
Brandon M Greenwell, Bradley C Boehmke, and Andrew J McCarthy. 2018. A simple and effective model-based variable importance measure. arXiv preprint arXiv:1805.04755 (2018).
[19]
Michael Richard Grose, Sugata Narsey, FP Delage, Andrew J Dowdy, Margot Bador, Ghyslaine Boschat, Christine Chung, JB Kajtar, Surendra Rauniyar, MB Freund, et al. 2020. Insights from CMIP6 for Australia's future climate. Earth's Future, Vol. 8, 5 (2020).
[20]
Joel Guinto. 2023. Cyclone Gabrielle: Rebuilding cost on par with Christchurch quake - NZ.
[21]
Ben Halstead, Yun Sing Koh, Patricia Riddle, Mykola Pechenizkiy, Albert Bifet, and Russel Pears. 2021. Fingerprinting concepts in data streams with supervised and unsupervised meta-information. In 2021 IEEE 37th International Conference on Data Engineering (ICDE). IEEE, 1056--1067.
[22]
Luke J Harrington, Sam M Dean, Shaun Awatere, Suzanne Rosier, Laura Queen, Peter B Gibson, Clair Barnes, Mariam Zachariah, Sjoukje Philip, Sarah Kew, et al. 2023. The role of climate change in extreme rainfall associated with Cyclone Gabrielle over Aotearoa New Zealand's East Coast. (2023).
[23]
Sijie He, Xinyan Li, Vidyashankar Sivakumar, and Arindam Banerjee. 2019. Interpretable predictive modeling for climate variables with weighted lasso. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 33. 1385--1392.
[24]
Lieve Helsen. 2021. Tackling Climate Change with Machine Learning. In Conference on Neural Information Processing System (NeurIPS)-panelist workshop, Location: online.
[25]
Hans Hersbach, Bill Bell, Paul Berrisford, Shoji Hirahara, András Horányi, Joaqu'in Mu noz-Sabater, Julien Nicolas, Carole Peubey, Raluca Radu, Dinand Schepers, et al. 2020. The ERA5 global reanalysis. Quarterly Journal of the Royal Meteorological Society, Vol. 146, 730 (2020), 1999--2049.
[26]
Geoffrey E Hinton and Ruslan R Salakhutdinov. 2006. Reducing the dimensionality of data with neural networks. science, Vol. 313, 5786 (2006), 504--507.
[27]
Sanaa Hobeichi, Nidhi Nishant, Yawen Shao, Gab Abramowitz, Andy Pitman, Steve Sherwood, Craig Bishop, and Samuel Green. 2023. Using machine learning to cut the cost of dynamical downscaling. Earth's Future, Vol. 11, 3 (2023).
[28]
Tongxi Hu, Xuesong Zhang, Gil Bohrer, Yanlan Liu, Yuyu Zhou, Jay Martin, Yang Li, and Kaiguang Zhao. 2023. Crop yield prediction via explainable AI and interpretable machine learning: Dangers of black box models for evaluating climate change impacts on crop yield. Agricultural and Forest Meteorology, Vol. 336 (2023), 109458.
[29]
Jakub Jakubowski, Przemysław Stanisz, Szymon Bobek, and Grzegorz J Nalepa. 2021. Explainable anomaly detection for Hot-rolling industrial process. In 2021 IEEE 8th International Conference on Data Science and Advanced Analytics (DSAA). IEEE, 1--10.
[30]
Andrew K Jorgenson, Shirley Fiske, Klaus Hubacek, Jia Li, Tom McGovern, Torben Rick, Juliet B Schor, William Solecki, Richard York, and Ariela Zycherman. 2019. Social science perspectives on drivers of and responses to global climate change. Wiley Interdisciplinary Reviews: Climate Change, Vol. 10, 1 (2019), e554.
[31]
Muhammad Firmansyah Kasim, D Watson-Parris, L Deaconu, S Oliver, P Hatfield, Dustin H Froula, Gianluca Gregori, M Jarvis, S Khatiwala, J Korenaga, et al. 2021. Building high accuracy emulators for scientific simulations with deep neural architecture search. Machine Learning: Science and Technology, Vol. 3, 1 (2021), 015013.
[32]
Arash Khodadadi and Daniel J McDonald. 2019. Algorithms for estimating trends in global temperature volatility. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 33. 614--621.
[33]
Yuxuan Liang, Yutong Xia, Songyu Ke, Yiwei Wang, Qingsong Wen, Junbo Zhang, Yu Zheng, and Roger Zimmermann. 2023. Airformer: Predicting nationwide air quality in china with transformers. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 37. 14329--14337.
[34]
Ning Lu, Guangquan Zhang, and Jie Lu. 2014. Concept drift detection via competence models. Artificial Intelligence, Vol. 209 (2014), 11--28.
[35]
Scott M Lundberg and Su-In Lee. 2017. A unified approach to interpreting model predictions. Advances in Neural Information Processing Systems, Vol. 30 (2017).
[36]
Valérie Masson-Delmotte, Panmao Zhai, Hans-Otto Pörtner, Debra Roberts, Jim Skea, Priyadarshi R Shukla, et al. 2022. Global Warming of 1.5 C: IPCC special report on impacts of global warming of 1.5 C above pre-industrial levels in context of strengthening response to climate change, sustainable development, and efforts to eradicate poverty. Cambridge University Press.
[37]
Christoph Molnar, Gunnar König, Julia Herbinger, Timo Freiesleben, Susanne Dandl, Christian A Scholbeck, Giuseppe Casalicchio, Moritz Grosse-Wentrup, and Bernd Bischl. 2022. General pitfalls of model-agnostic interpretation methods for machine learning models. In xxAI-Beyond Explainable AI: International Workshop. Springer, 39--68.
[38]
Maximilian Muschalik, Fabian Fumagalli, Barbara Hammer, and Eyke Hüllermeier. 2022. Agnostic Explanation of Model Change based on Feature Importance. KI-Künstliche Intelligenz (2022), 1--14.
[39]
Nimisha G Nair, Pallavi Satpathy, Jabez Christopher, et al. 2019. Covariate shift: A review and analysis on classifiers. In 2019 Global Conference for Advancement in Technology (GCAT). IEEE, 1--6.
[40]
Rebecca Newman and Ilan Noy. 2023. The global costs of extreme weather that are attributable to climate change. Nature Communications, Vol. 14, 1 (2023), 6103.
[41]
J Jake Nichol, Matthew G Peterson, Kara J Peterson, G Matthew Fricke, and Melanie E Moses. 2021. Machine learning feature analysis illuminates disparity between E3SM climate models and observed climate change. J. Comput. Appl. Math., Vol. 395 (2021), 113451.
[42]
Neelesh Rampal, Peter B Gibson, Abha Sood, Stephen Stuart, Nicolas C Fauchereau, Chris Brandolino, Ben Noll, and Tristan Meyers. 2022. High-resolution downscaling with interpretable deep learning: Rainfall extremes over New Zealand. Weather and Climate Extremes, Vol. 38 (2022), 100525.
[43]
Stephan Rasp, Michael S Pritchard, and Pierre Gentine. 2018. Deep learning to represent subgrid processes in climate models. Proceedings of the National Academy of Sciences, Vol. 115, 39 (2018), 9684--9689.
[44]
Markus Reichstein, Gustau Camps-Valls, Bjorn Stevens, Martin Jung, Joachim Denzler, Nuno Carvalhais, and fnm Prabhat. 2019. Deep learning and process understanding for data-driven Earth system science. Nature, Vol. 566, 7743 (2019), 195--204.
[45]
Ashkan Rezaei, Anqi Liu, Omid Memarrast, and Brian D Ziebart. 2021. Robust fairness under covariate shift. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 35. 9419--9427.
[46]
David Rolnick, Priya L Donti, Lynn H Kaack, Kelly Kochanski, Alexandre Lacoste, Kris Sankaran, Andrew Slavin Ross, Nikola Milojevic-Dupont, Natasha Jaques, Anna Waldman-Brown, et al. 2022. Tackling climate change with machine learning. ACM Computing Surveys (CSUR), Vol. 55, 2 (2022), 1--96.
[47]
Jeffrey C Schlimmer and Richard H Granger. 1986. Incremental learning from noisy data. Machine learning, Vol. 1 (1986), 317--354.
[48]
Tapio Schneider, Jo ao Teixeira, Christopher S Bretherton, Florent Brient, Kyle G Pressel, Christoph Sch"ar, and A Pier Siebesma. 2017. Climate goals and computing the future of clouds. Nature Climate Change, Vol. 7, 1 (2017), 3--5.
[49]
Patrick Schober, Christa Boer, and Lothar A Schwarte. 2018. Correlation coefficients: Appropriate use and interpretation. Anesthesia & Analgesia, Vol. 126, 5 (2018), 1763--1768.
[50]
Vanessa Jine Schweizer, Jude Herijadi Kurniawan, and Aidan Power. 2022. Semi-automated Literature Review for Scientific Assessment of Socioeconomic Climate Change Scenarios. In Companion Proceedings of the Web Conference 2022. 789--799.
[51]
Qiaohong Sun, Xuebin Zhang, Francis Zwiers, Seth Westra, and Lisa V Alexander. 2021. A global, continental, and regional analysis of changes in extreme precipitation. Journal of Climate, Vol. 34, 1 (2021), 243--258.
[52]
Byomkesh Talukder, Nilanjana Ganguli, Richard Matthew, Keith W Hipel, James Orbinski, et al. 2022. Climate change-accelerated ocean biodiversity loss & associated planetary health impacts. The Journal of Climate Change and Health, Vol. 6 (2022), 100114.
[53]
Apoorva Upadhyaya, Marco Fisichella, and Wolfgang Nejdl. 2023. A Multi-task Model for Emotion and Offensive Aided Stance Detection of Climate Change Tweets. In Proceedings of the ACM Web Conference 2023. 3948--3958.
[54]
Chiem Van Straaten, Kirien Whan, Dim Coumou, Bart Van den Hurk, and Maurice Schmeits. 2022. Using explainable machine learning forecasts to discover subseasonal drivers of high summer temperatures in western and central Europe. Monthly Weather Review, Vol. 150, 5 (2022), 1115--1134.
[55]
Geoffrey I Webb, Roy Hyde, Hong Cao, Hai Long Nguyen, and Francois Petitjean. 2016. Characterizing concept drift. Data Mining and Knowledge Discovery, Vol. 30, 4 (2016), 964--994.
[56]
Di Zhao and Yun Sing Koh. 2020. Feature drift detection in evolving data streams. In Database and Expert Systems Applications: 31st International Conference, DEXA 2020, Bratislava, Slovakia, September 14--17, 2020, Proceedings, Part II 31. Springer, 335--349.
[57]
Alexander Zien, Nicole Kr"amer, Sören Sonnenburg, and Gunnar R"atsch. 2009. The feature importance ranking measure. arXiv preprint arXiv:0906.4258 (2009).
[58]
G Zittis, M Almazroui, P Alpert, P Ciais, W Cramer, Y Dahdal, M Fnais, D Francis, P Hadjinicolaou, F Howari, et al. 2022. Climate change and weather extremes in the Eastern Mediterranean and Middle East. Reviews of Geophysics, Vol. 60, 3 (2022), e2021RG000762. io

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
WWW '24: Proceedings of the ACM Web Conference 2024
May 2024
4826 pages
ISBN:9798400701719
DOI:10.1145/3589334
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 13 May 2024

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. climate modelling
  2. feature importance
  3. interpretability

Qualifiers

  • Research-article

Conference

WWW '24
Sponsor:
WWW '24: The ACM Web Conference 2024
May 13 - 17, 2024
Singapore, Singapore

Acceptance Rates

Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 129
    Total Downloads
  • Downloads (Last 12 months)129
  • Downloads (Last 6 weeks)10
Reflects downloads up to 05 Mar 2025

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media