Abstract
Owing to the remarkable development of deep learning technology, there have been a series of efforts to build deep learning-based climate models. Whereas most of them utilize recurrent neural networks and/or graph neural networks, we design a novel climate model based on two concepts, the neural ordinary differential equation (NODE) and the advection–diffusion equation. The advection–diffusion equation is widely used for climate modeling because it describes many physical processes involving Brownian and bulk motions in climate systems. On the other hand, NODEs are to learn a latent governing equation of ODE from data. In our presented method, we combine them into a single framework and propose a concept, called neural advection–diffusion equation (NADE). Our NADE, equipped with the advection–diffusion equation and one more additional neural network to model inherent uncertainty, can learn an appropriate latent governing equation that best describes a given climate dataset. In our experiments with three real-world and two synthetic datasets and fourteen baselines, our method consistently outperforms existing baselines by non-trivial margins.
Similar content being viewed by others
Notes
Each research domain has its own preference on graph models. For instance, social networks typically assume scale-free networks.
A well-posed problem means (i) its solution uniquely exists, and (ii) its solution continuously changes as input data changes.
References
Zaytar MA, El Amrani C (2016) Sequence to sequence weather forecasting with long short-term memory recurrent neural networks. Int J Comput Appl 143:7–11
Shi X, Chen Z, Wang H, Yeung D-Y, Wong W-K, Woo W-C (2015) Convolutional lstm network: a machine learning approach for precipitation nowcasting. NeurIPS 28
Shi X, Gao Z, Lausen L, Wang H, Yeung D-Y, Wong W-K, Woo W-C (2017) Deep learning for precipitation nowcasting: a benchmark and a new model. Adv Neural Inf Process Syst 30
Liu Y, Racah E, Correa J, Khosrowshahi A, Lavers D, Kunkel K, Wehner M, Collins W et al (2016) Application of deep convolutional neural networks for detecting extreme weather in climate datasets. arXiv preprint arXiv:1605.01156
Racah E, Beckham C, Maharaj T, Ebrahimi Kahou S, Prabhat M, Pal C (2017)Extremeweather: a large-scale climate dataset for semi-supervised detection, localization, and understanding of extreme weather events. Adv Neural Inf Process Syst 30
Kurth T, Treichler S, Romero J, Mudigonda M, Luehr N, Phillips E, Mahesh A, Matheson M, Deslippe J, Fatica M (2018) Exascale deep learning for climate analytics. In: International conference for high performance computing, networking, storage and analysis. IEEE
Cheng L, Zang H, Ding T, Sun R, Wang M, Wei Z, Sun G (2018) Ensemble recurrent neural network based probabilistic wind speed forecasting approach. Energies 11(8):1958
Cheng W, Shen Y, Zhu Y, Huang L (2018) A neural attention model for urban air quality inference: learning the weights of monitoring stations. In: Proceedings of the AAAI conference on artificial intelligence, vol 32(1)
Hossain M, Rekabdar B, Louis SJ, Dascalu S (2015) Forecasting the weather of nevada: a deep learning approach. In: IJCNN. IEEE
Ren X, Li X, Ren K, Song J, Xu Z, Deng K, Wang X (2021) Deep learning-based weather prediction: a survey. Big Data Res 23:100178
Tekin SF, Karaahmetoglu O, Ilhan F, Balaban I, Kozat SS (2021) Spatio-temporal weather forecasting and attention mechanism on convolutional lstms. arXiv preprint arXiv:2102.00696
Rasp S, Lerch S (2018) Neural networks for postprocessing ensemble weather forecasts. Mon Weather Rev
Scher S (2018) Toward data-driven weather and climate forecasting: approximating a simple general circulation model with deep learning. Geophys Res Lett 45:22
Seo S, Liu Y (2019) Differentiable physics-informed graph networks. arXiv preprint arXiv:1902.02950
Seo S, Meng C, Liu Y (2019) Physics-aware difference graph networks for sparsely-observed dynamics. In: ICLR
Lin Y, Mago N, Gao Y, Li Y, Chiang Y-Y, Shahabi C, Ambite JL (2018) Exploiting spatiotemporal patterns for accurate air quality forecasting using deep learning. In: ACM SIGSPATIAL
Zhang P, Jia Y, Gao J, Song W, Leung H (2018) Short-term rainfall forecasting using multi-layer perceptron. IEEE Trans Big Data 6:93–106
Liu H, Mi X, Li Y (2018) Smart deep learning based wind speed prediction model using wavelet packet decomposition, convolutional neural network and convolutional long short term memory network. Energy Convers Manag 166:120–131
Zhu Q, Chen J, Zhu L, Duan X, Liu Y (2018) Wind speed prediction with spatio-temporal correlation: a deep learning approach. Energies 11:705
Chattopadhyay A, Hassanzadeh P, Pasha S (2020) Predicting clustered weather patterns: a test case for applications of convolutional neural networks to spatio-temporal climate data. Sci Rep 10:1–13
De Bézenac E, Pajot A, Gallinari P (2019) Deep learning for physical processes: incorporating prior scientific knowledge. J Stat Mech Theory Exp 2019:124009
Han J, Liu H, Xiong H, Yang J (2022) Semi-supervised air quality forecasting via self-supervised hierarchical graph neural network. IEEE Trans Knowl Data Eng
Wang Y, Song G, Du L, Lu Z (2019) Real-time estimation of the urban air quality with mobile sensor system. ACM Trans Knowl Discov Data (TKDD) 13(5):11–19
Xu Y, Zhu Y, Shen Y, Yu J (2019) Fine-grained air quality inference with remote sensing data and ubiquitous urban data. ACM Trans Knowl Discov Data (TKDD) 13(5):1–27
Tey FJ, Wu T-Y, Chen J-L (2022) Machine learning-based short-term rainfall prediction from sky data. ACM Trans Knowl Discov Data (TKDD) 16(6):1–18
Qiu M, Zhao P, Zhang K, Huang J, Shi X, Wang X, Chu W (2017) A short-term rainfall prediction model using multi-task convolutional neural networks. In: 2017 IEEE international conference on data mining (ICDM). IEEE, pp 395–404
Liu X, Tan P-N, Abraham Z, Luo L, Hatami P (2018) Distribution preserving multi-task regression for spatio-temporal data. In: 2018 IEEE international conference on data mining (ICDM). IEEE, pp 1134–1139
Wilson T, Tan P-N, Luo L (2018) A low rank weighted graph convolutional approach to weather prediction. In: 2018 IEEE international conference on data mining (ICDM). IEEE, pp 627–636
Zhang W, Han L, Sun J, Guo H, Dai J (2019) Application of multi-channel 3d-cube successive convolution network for convective storm nowcasting. In: 2019 IEEE international conference on big data (big data). IEEE, pp 1705–1710
Rolnick D, Donti PL, Kaack LH, Kochanski K, Lacoste A, Sankaran K, Ross AS, Milojevic-Dupont N, Jaques N, Waldman-Brown A (2022) Tackling climate change with machine learning. ACM Comput Surv (CSUR) 55(2):1–96
Banerjee A (2011) Probabilistic graphical models for climate data analysis. In: Proceedings of the 2011 workshop on climate knowledge discovery, pp 3–3
Geng Y-A, Li Q, Lin T, Zhang J, Xu L, Yao W, Zheng D, Lyu W, Huang H (2020) A heterogeneous spatiotemporal network for lightning prediction. In: 2020 IEEE international conference on data mining (ICDM), pp 1034–1039
Chen RTQ, Rubanova Y, Bettencourt J, Duvenaud DK (2018) Neural ordinary differential equations. In: NeurIPS
Greydanus S, Dzamba M, Yosinski J (2019) Hamiltonian neural networks. Adv Neural Inf Process Syst 32
Finzi M, Wang KA, Wilson AG (2020) Simplifying Hamiltonian and Lagrangian neural networks via explicit constraints. Adv Neural Inf Process Syst 33:13880–13889
Cranmer M, Greydanus S, Hoyer S, Battaglia P, Spergel D, Ho S (2020) Lagrangian neural networks. arXiv preprint arXiv:2003.04630
Lutter M, Ritter C, Peters J (2019) Deep Lagrangian networks: using physics as model prior for deep learning. arXiv preprint arXiv:1907.04490
Stocker T (2011) Introduction to climate modelling. Springer, Berlin
Larwa B (2019) Heat transfer model to predict temperature distribution in the ground. Energies 12(1):25
Dormand JR, Prince PJ (1980) A family of embedded Runge–Kutta formulae. J Comput Appl Math 6(1):19–26
Zhuang J, Dvornek N, Li X, Tatikonda S, Papademetris X, Duncan J (2020) Adaptive checkpoint adjoint method for gradient estimation in neural ode. In: ICML
Eagleson GK (1973) Brownian motion and diffusion. J R Stat Soc Ser A (Gener) 136(1):105–106
Shikano Y, Wada T, Horikawa J (2014) Discrete-time quantum walk with feed-forward quantum coin. Sci Rep 4:1–7
dos SantosMendes R, Lenzi EK, Malacarne LC, Picoli S, Jauregui M (2017) Random walks associated with nonlinear Fokker–Planck equations. Entropy 19(4):155
Plastino A, Curado E, Nobre F, Tsallis C (2018) From the nonlinear Fokker–Planck equation to the Vlasov description and back: confined interacting particles with drag. Phys Rev E 97:022120
Mendes G, Ribeiro M, Mendes R, Lenzi E, Nobre F (2015) Nonlinear Kramers equation associated with nonextensive statistical mechanics. Phys Rev E 91:052106
Wu F, Souza A, Zhang T, Fifty C, Yu T, Weinberger K (2019) Simplifying graph convolutional networks. In: ICML. PMLR
Wang Y, Wang Y, Yang J, Lin Z (2021) Dissecting the diffusion process in linear graph convolutional networks. Adv Neural Inf Process Syst 34:5758–5769
Choi J, Jeon J, Park N (2021) Lt-ocf: learnable-time ode-based collaborative filtering. In: Proceedings of the 30th ACM international conference on information and knowledge management, pp 251–260
Chamberlain BP, Rowbottom J, Goronova M, Webb S, Rossi E, Bronstein MM (2021) Grand: graph neural diffusion. In: ICML
Choi J, Hong S, Park N, Cho S-B (2022) Gread: graph neural reaction–diffusion equations. arXiv preprint arXiv:2211.14208
DeWan A, Dubois N, Theoharides K, Boshoven J (2010) Understanding the impacts of climate change on fish and wildlife in North Carolina. Defenders of Wildlife, Washington
Hanna SR, Briggs GA, Hosker RP Jr (1982) Handbook on atmospheric diffusion. Technical report, National Oceanic and Atmospheric Administration
Wu Z, Pan S, Long G, Jiang J, Chang X, Zhang C (2020) Connecting the dots: multivariate time series forecasting with graph neural networks. In: Proceedings of the 26th ACM SIGKDD international conference on knowledge discovery and data mining, pp 753–763
Brouwer ED, Simm J, Arany A, Moreau Y (2019) Gru-ode-bayes: continuous modeling of sporadically-observed time series. In: NeurIPS
Kidger P, Morrill J, Foster J, Lyons T (2020) Neural controlled differential equations for irregular time series. Adv Neural Inf Process Syst 33:6696–6707
Lyons T, Caruana M, Lévy T (2004) Differential equations driven by rough paths (2004) École D’Eté de Probabilités de Saint-Flour XXXIV
Gozolchiani A, Havlin S, Yamasaki K (2011) Emergence of El Niño as an autonomous component in the climate network. Phys Rev Lett 107(14):148501
Yang X-S (2001) Small-world networks in geophysics. Geophys Res Lett 28(13):2549–2552
Humphries MD, Gurney K, Prescott TJ (2006) The brainstem reticular formation is a small-world, not scale-free, network. Proc R Soc B Biol Sci 273:503–511
Humphries MD, Gurney K (2008) Network ‘small-world-ness’: a quantitative method for determining canonical network equivalence. PLoS ONE 3:e0002051
Watts DJ, Strogatz SH (1998) Collective dynamics of ‘small-world’networks. Nature 393(6684):440–442
Dormand JR (1996) Numerical methods for differential equations: a computational approach. CRC Press, Cambridge
Zang C, Wang F (2020) Neural dynamics on complex networks. In: KDD
Raissi M, Perdikaris P, Karniadakis GE (2019) Physics-informed neural networks: a deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations. J Comput Phys 378:686–707
Sanchez-Gonzalez A, Heess N, Springenberg JT, Merel J, Riedmiller M, Hadsell R, Battaglia P (2018) Graph networks as learnable physics engines for inference and control. In: ICML. PMLR
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: CVPR
Hwang J, Choi J, Choi H, Lee K, Lee D, Park N (2021) Climate modeling with neural diffusion equations. In: 2021 IEEE international conference on data mining (ICDM). IEEE, pp 230–239. https://doi.org/10.1109/ICDM51629.2021.00033
Bai L, Yao L, Li C, Wang X, Wang C (2020) Adaptive graph convolutional recurrent network for traffic forecasting. Adv Neural Inf Process Syst 33:17804–17815
Choi J, Choi H, Hwang J, Park N (2022) Graph neural controlled differential equations for traffic forecasting. Proc AAAI Conf Artif Intell 36(6):6367–6374
Chen RTQ, Duvenaud DK (2019) Neural networks with cheap differential operators. In: NeurIPS
Kong L, Sun J, Zhang C (2020) SDE-net: equipping deep neural networks with uncertainty estimates. In: ICML
Acknowledgements
Noseong Park is the corresponding author. This work was supported by the Institute of Information & Communications Technology Planning & Evaluation (IITP) grant funded by the Korean government (MSIT) (No. 2020-0-01361, Artificial Intelligence Graduate School Program at Yonsei University, 10%), and (No. 2022-0-00857, Development of AI/data-based financial/economic digital twin platform, 45%) and (No. 2022-0-00113, Developing a Sustainable Collaborative Multi-modal Lifelong Learning Framework, 45%).
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Choi, H., Choi, J., Hwang, J. et al. Climate modeling with neural advection–diffusion equation. Knowl Inf Syst 65, 2403–2427 (2023). https://doi.org/10.1007/s10115-023-01829-2
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10115-023-01829-2