Accuracy and Architecture Studies of Residual Neural Network Method for Ordinary Differential Equations

Qiu, Changxin; Bendickson, Aaron; Kalyanapu, Joshua; Yan, Jue

doi:10.1007/s10915-023-02173-x

Published: 28 March 2023

Journal of Scientific Computing, Volume 95, article number 50 (2023)

Changxin Qiu¹,
Aaron Bendickson²,
Joshua Kalyanapu³ &
…
Jue Yan ORCID: orcid.org/0000-0002-4821-9197²

Abstract

In this paper, we investigate the residual neural network (ResNet) method for solving ordinary differential equations. We verify that the accuracy order of the ResNet ODE solver matches the accuracy order of the training data. Forward Euler, Runge–Kutta2 and Runge–Kutta4 finite difference schemes are adopted to generate three learning data sets, which are applied to train three ResNet ODE solvers independently. The well-trained ResNet solvers attain one-step errors of 2nd, 3rd and 5th order and behave just like their finite difference counterparts for linear and nonlinear ODEs with regular solutions. In particular, we carry out (1) an architecture study in terms of the number of hidden layers and neurons per layer to obtain the optimal network structure; (2) a target study to verify that the ResNet solver is as accurate as its finite difference counterpart; (3) solution trajectory simulations. A sequence of numerical examples is presented to demonstrate the accuracy and capability of the ResNet solver.
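As a rough illustration of how such learning data sets might be assembled, the sketch below builds one-step pairs (y1, y2) in which the target y2 is produced by a finite difference scheme. Everything specific here is an assumption made for illustration and is not taken from the paper: the right-hand side `f`, the sampling interval, the fixed seed, and the helper names `euler_target`, `rk2_target` and `make_pairs`. The network and its training are not reproduced.

```python
import random

def f(x, t):
    # Hypothetical right-hand side F(x, t); the paper's test problems are not reproduced here.
    return -x * x

def euler_target(x, t, dt):
    # One forward Euler step, used as the learning target.
    return x + dt * f(x, t)

def rk2_target(x, t, dt):
    # One step of the 2-stage Runge-Kutta scheme with slopes k1 and k2.
    k1 = f(x, t)
    k2 = f(x + dt * k1, t + dt)
    return x + dt * (k1 + k2) / 2.0

def make_pairs(scheme, n_samples, dt, low=0.0, high=1.0, seed=0):
    # Sample initial states y1 uniformly (an assumed sampling strategy)
    # and attach the scheme's one-step target y2 to each of them.
    rng = random.Random(seed)
    pairs = []
    for _ in range(n_samples):
        y1 = rng.uniform(low, high)
        pairs.append((y1, scheme(y1, 0.0, dt)))
    return pairs

euler_pairs = make_pairs(euler_target, n_samples=5, dt=0.1)
rk2_pairs = make_pairs(rk2_target, n_samples=5, dt=0.1)
print(euler_pairs[0], rk2_pairs[0])
```

Because both calls use the same seed, the two data sets share their inputs y1 and differ only in the targets, which makes it easy to compare solvers trained on data of different accuracy order.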

Data Availability

The original contributions presented in the study are included in the article/supplementary material; further inquiries can be directed to the corresponding author.

Funding

Changxin Qiu: The research of this author is supported by the National Natural Science Foundation of China under Grant No. 12201327 and the Ningbo Natural Science Foundation under Grant No. 2022J087. Aaron Bendickson and Joshua Kalyanapu: The research of these authors is partially supported by National Science Foundation grant DMS-1457443. Jue Yan: The research of this author is supported by National Science Foundation grant DMS-1620335 and Simons Foundation grant 637716.

Author information

Authors and Affiliations

School of Mathematics and Statistics, Ningbo University, Ningbo, 315211, People’s Republic of China
Changxin Qiu
Department of Mathematics, Iowa State University, Ames, 50011, USA
Aaron Bendickson & Jue Yan
Department of Electrical and Computer Engineering, Iowa State University, Ames, 50011, USA
Joshua Kalyanapu

Corresponding author

Correspondence to Jue Yan.

Ethics declarations

Conflict of interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendix A

In this appendix, we revisit the one-step error between the target and the exact solution through interpolation polynomial approximation. With $\Delta $ denoting the step size, the one-step errors are of order $O(\Delta ^2)$, $O(\Delta ^3)$ and $O(\Delta ^5)$ for the first order forward Euler method (3.2), the second order Runge–Kutta2 method (3.3) and the fourth order Runge–Kutta4 method (3.4), respectively.

Case I: ${\textbf {y}}_j^2$ obtained from Forward Euler method (3.2)

Given ${\textbf {y}}_j^1={\textbf {x}}_j(t_0)$, subtracting the exact solution ${\textbf {x}}_j(t_0+\Delta )$ of (3.1) from ${\textbf {y}}_j^2$ of the forward Euler method (3.2) gives

$$\begin{aligned} \Vert {\textbf {y}}_j^2-{\textbf {x}}_j(t_0+\Delta )\Vert _2&=\left\| \int _{t_0}^{t_0+\Delta } \left( {\textbf {F}}({\textbf {x}}_j(t_0), t_0)- {\textbf {F}}({\textbf {x}}_j(t),t)\right) ~dt\right\| _2 \nonumber \\&= \left\| \int _{t_0}^{t_0+\Delta } \frac{d}{dt}{\textbf {F}}({\textbf {x}}_j(\xi (t)),\xi (t)) (t-t_0) ~dt\right\| _2 \nonumber \\&=\frac{\Delta ^2}{2}\left\| \frac{d}{dt}{\textbf {F}}({\textbf {x}}_j(\eta ),\eta ) \right\| _2\le C\Delta ^2. \end{aligned}$$

(A.1)

Here $\frac{d}{dt}{\textbf {F}}({\textbf {x}}(t),t)=\frac{\partial {\textbf {F}}}{\partial {\textbf {x}}}{\textbf {F}}+ \frac{\partial {\textbf {F}}}{\partial t}$ denotes the total derivative with respect to $t$, where $\frac{\partial {\textbf {F}}}{\partial {\textbf {x}}}$ is the Jacobian matrix of the vector function ${\textbf {F}}$ with respect to ${\textbf {x}}(t)$ and $\frac{d {\textbf {x}}}{dt}={\textbf {F}}$. The forward Euler method can be viewed as a constant (one-point) quadrature rule approximating the integral form of the ODE system (3.1). The weighted mean value theorem is applied to estimate the error term.
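The estimate (A.1) can be checked numerically on a simple example. The sketch below is only an illustration under stated assumptions: the scalar test problem $x'=-x^2$, $x(0)=1$ with exact solution $1/(1+t)$ is hypothetical and not taken from the paper, as are the function names. For this problem $\frac{d}{dt}F=2x^3\le 2$ on $[0,\Delta ]$, so (A.1) predicts a one-step error of at most $\Delta ^2$.

```python
import math

def f(x, t):
    # Hypothetical test problem (not from the paper): x' = -x^2, x(0) = 1.
    return -x * x

def exact(t):
    # Exact solution of the test problem above.
    return 1.0 / (1.0 + t)

def euler_one_step_error(dt, t0=0.0):
    y1 = exact(t0)              # start on the exact solution, y1 = x(t0)
    y2 = y1 + dt * f(y1, t0)    # forward Euler target
    return abs(y2 - exact(t0 + dt))

steps = [0.1 / 2 ** k for k in range(5)]
errors = [euler_one_step_error(dt) for dt in steps]

# Bound from (A.1): (dt^2 / 2) * max|dF/dt| with |dF/dt| = 2 x^3 <= 2 here.
bounds = [dt ** 2 for dt in steps]

# Observed order: log2 of the error ratio when the step size is halved.
orders = [math.log2(errors[i] / errors[i + 1]) for i in range(len(errors) - 1)]

for dt, err, bound in zip(steps, errors, bounds):
    print(f"dt={dt:.5f}  error={err:.3e}  bound={bound:.3e}")
print("observed orders:", [round(p, 3) for p in orders])
```

The observed orders approach 2 as the step size is halved, in line with the $O(\Delta ^2)$ one-step error.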

Case II: ${\textbf {y}}_j^2$ obtained from 2nd order Runge–Kutta method (3.3)

Again we have ${\textbf {y}}_j^1={\textbf {x}}_j(t_0)$. Subtracting ${\textbf {x}}_j(t_0+\Delta )$ of (3.1) from ${\textbf {y}}_j^2$ of the second order Runge–Kutta method (3.3), we have

$$\begin{aligned} \Vert {\textbf {y}}_j^2-{\textbf {x}}_j(t_0+\Delta )\Vert _2&=\left\| \Delta \times \left( \frac{k_1+k_2}{2}\right) -\int _{t_0}^{t_0+\Delta } {\textbf {F}}({\textbf {x}}_j(t),t) ~dt\right\| _2 \\&\le \left\| \Delta \times \left( \frac{k_1+\widetilde{k_2}}{2}\right) -\int _{t_0}^{t_0+\Delta } {\textbf {F}}({\textbf {x}}_j(t),t) ~dt\right\| _2 +\frac{\Delta }{2}\Vert k_2-\widetilde{k_2}\Vert _2, \end{aligned}$$

where $k_1={\textbf {F}}({\textbf {y}}_j^1,t_0)$, $k_2={\textbf {F}}({\textbf {y}}_j^1+\Delta k_1, t_0+\Delta )$ and $\widetilde{k_2}={\textbf {F}}({\textbf {x}}_j(t_0+\Delta ), t_0+\Delta )$. Since ${\textbf {y}}_j^1+\Delta k_1$ is the forward Euler approximation of ${\textbf {x}}_j(t_0+\Delta )$, whose one-step error is $O(\Delta ^2)$ by (A.1), the Lipschitz continuity of ${\textbf {F}}$ of the dynamic system (with Lipschitz constant $L$) gives

$$\begin{aligned} \Vert k_2-\widetilde{k_2}\Vert _2&= \left\| {\textbf {F}}({\textbf {y}}_j^1+\Delta k_1, t_0+\Delta )-{\textbf {F}}({\textbf {x}}_j(t_0+\Delta ), t_0+\Delta )\right\| _2 \\&\le L\Vert {\textbf {y}}_j^1+\Delta k_1-{\textbf {x}}_j(t_0+\Delta )\Vert _2 \le C\Delta ^2. \end{aligned}$$

Here $C$ represents a generic constant. The error of the two-point quadrature rule can be estimated as

$$\begin{aligned}&\left\| \Delta \times \left( \frac{k_1+\widetilde{k_2}}{2}\right) -\int _{t_0}^{t_0+\Delta } {\textbf {F}}({\textbf {x}}_j(t),t) ~dt\right\| _2 =\left\| \int _{t_0}^{t_0+\Delta } ({\textbf {G}}_1(t)- {\textbf {F}}({\textbf {x}}_j(t),t)) ~dt\right\| _2 \\&\quad =\frac{1}{2}\left\| \frac{d^2}{dt^2}{\textbf {F}}({\textbf {x}}_j(\eta ),\eta )\right\| _2\left| \int _{t_0}^{t_0+\Delta } (t-t_0)\left( t-(t_0+\Delta )\right) ~dt\right| \\&\quad =\frac{\Delta ^3}{12}\left\| \frac{d^2}{dt^2}{\textbf {F}}({\textbf {x}}_j(\eta ),\eta )\right\| _2 \le C\Delta ^3. \end{aligned}$$

Combining the above estimates, we obtain

$$\begin{aligned} \Vert {\textbf {y}}_j^2-{\textbf {x}}_j(t_0+\Delta )\Vert _2\le C\Delta ^3. \end{aligned}$$

(A.2)

Here ${\textbf {G}}_1(t)$ denotes the linear interpolation polynomial that interpolates ${\textbf {F}}({\textbf {x}}(t),t)$ at $t_0$ and $t_0+\Delta $, and $\frac{d^2}{dt^2}{\textbf {F}}({\textbf {x}}(\cdot ),\cdot )$ denotes the total second derivative of ${\textbf {F}}({\textbf {x}}(t),t)$ with respect to $t$. This 2-stage Runge–Kutta method can be viewed as the trapezoidal quadrature rule approximating the integral.
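The two ingredients of the Case II argument, the $O(\Delta ^2)$ gap between $k_2$ and $\widetilde{k_2}$ and the resulting $O(\Delta ^3)$ one-step error (A.2), can be observed numerically. The sketch below is an illustration only; the scalar test problem $x'=-x^2$, $x(0)=1$ and all function names are assumptions that do not come from the paper, while the stages $k_1$, $k_2$ follow the definitions given above.

```python
import math

def f(x, t):
    # Hypothetical test problem (not from the paper): x' = -x^2, x(0) = 1.
    return -x * x

def exact(t):
    # Exact solution of the test problem above.
    return 1.0 / (1.0 + t)

def rk2_quantities(dt, t0=0.0):
    y1 = exact(t0)
    k1 = f(y1, t0)
    k2 = f(y1 + dt * k1, t0 + dt)             # slope at the Euler prediction
    k2_tilde = f(exact(t0 + dt), t0 + dt)     # slope at the exact solution
    y2 = y1 + dt * (k1 + k2) / 2.0            # 2-stage Runge-Kutta target
    return abs(k2 - k2_tilde), abs(y2 - exact(t0 + dt))

def observed_orders(values):
    # log2 of successive ratios when the step size is halved.
    return [math.log2(values[i] / values[i + 1]) for i in range(len(values) - 1)]

steps = [0.1 / 2 ** k for k in range(5)]
stage_gaps, errors = zip(*(rk2_quantities(dt) for dt in steps))

stage_orders = observed_orders(stage_gaps)
error_orders = observed_orders(errors)
print("order of |k2 - k2_tilde|:", [round(p, 3) for p in stage_orders])
print("order of one-step error: ", [round(p, 3) for p in error_orders])
```

On this example the stage gap behaves like $\Delta ^2$ and the one-step error like $\Delta ^3$, matching the two estimates that are combined in (A.2).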

Case III: ${\textbf {y}}_j^2$ obtained from 4th order Runge–Kutta method (3.4)

With ${\textbf {y}}_j^1={\textbf {x}}_j(t_0)$, subtracting ${\textbf {x}}_j(t_0+\Delta )$ of (3.1) from ${\textbf {y}}_j^2$ of the fourth order Runge–Kutta method (3.4), we have

$$\begin{aligned} \Vert {\textbf {y}}_j^2-{\textbf {x}}_j(t_0+\Delta )\Vert _2&=\left\| \frac{\Delta (k_1+3k_2+3k_3+k_4)}{8}-\int _{t_0}^{t_0+\Delta } {\textbf {F}}({\textbf {x}}_j(t),t) ~dt\right\| _2 \\&\le \left\| \frac{\Delta (k_1+3\widetilde{k_2}+3\widetilde{k_3}+\widetilde{k_4})}{8}-\int _{t_0}^{t_0+\Delta } {\textbf {F}}({\textbf {x}}_j(t),t) ~dt\right\| _2\\&\quad +\left\| \frac{\Delta (k_1+3k_2+3k_3+k_4)}{8}-\frac{\Delta (k_1+3\widetilde{k_2}+3\widetilde{k_3}+\widetilde{k_4})}{8}\right\| _2. \end{aligned}$$

The terms $k_2, k_3$ and $k_4$ are those of the Runge–Kutta4 method (3.4), with $k_1={\textbf {F}}({\textbf {y}}_j^1,t_0)={\textbf {F}}({\textbf {x}}_j(t_0),t_0)$. We introduce $\widetilde{k_2}={\textbf {F}}({\textbf {x}}_j(t_0+\frac{\Delta }{3}),t_0+\frac{\Delta }{3})$, $\widetilde{k_3}={\textbf {F}}({\textbf {x}}_j(t_0+\frac{2\Delta }{3}),t_0+\frac{2\Delta }{3})$ and $\widetilde{k_4}={\textbf {F}}({\textbf {x}}_j(t_0+\Delta ),t_0+\Delta )$, the values of ${\textbf {F}}$ along the exact solution that $k_2, k_3$ and $k_4$ approximate. Rewriting the Runge–Kutta4 method (3.4) as a one-step method, ${\textbf {y}}_j^2={\textbf {y}}_j^1+\Delta \Phi \left( t_0,{\textbf {y}}_j^1,{\textbf {F}}({\textbf {y}}_j^1),\Delta \right) $, we have

$$\begin{aligned}&\left\| \frac{\Delta (k_1+3k_2+3k_3+k_4)}{8}-\frac{\Delta (k_1+3\widetilde{k_2}+3\widetilde{k_3}+\widetilde{k_4})}{8}\right\| _2 \\&\quad = \Delta \left\| \Phi \left( t_0,{\textbf {y}}_j^1,{\textbf {F}}({\textbf {y}}_j^1),\Delta \right) -\Phi \left( t_0,{\textbf {x}}_j(t_0),{\textbf {F}}({\textbf {x}}_j(t_0)),\Delta \right) \right\| _2 \le C\Delta ^5. \end{aligned}$$

Here $C$ represents a generic constant. The error of the four-point quadrature rule can be estimated as

$$\begin{aligned}&\left\| \Delta \times \left( \frac{k_1+3\widetilde{k_2}+3\widetilde{k_3}+\widetilde{k_4}}{8}\right) -\int _{t_0}^{t_0+\Delta } {\textbf {F}}({\textbf {x}}_j(t),t) ~dt\right\| _2 \\&\quad = \left\| \int _{t_0}^{t_0+\Delta } ({\textbf {G}}_3(t)- {\textbf {F}}({\textbf {x}}_j(t),t)) ~dt\right\| _2 \\&\quad \le C\left\| \frac{d^4 {\textbf {F}}}{dt^4}\right\| _2 \left| \int _{t_0}^{t_0+\Delta } (t-t_0)(t-(t_0+\frac{\Delta }{3}))(t-(t_0+\frac{2\Delta }{3}))\left( t-(t_0+\Delta )\right) ~dt\right| \le C \Delta ^5. \end{aligned}$$

Again $C$ represents a generic constant. Summarizing the above arguments, we have

$$\begin{aligned} \Vert {\textbf {y}}_j^2-{\textbf {x}}_j(t_0+\Delta )\Vert _2\le C\Delta ^5. \end{aligned}$$

(A.3)

Here ${\textbf {G}}_3(t)$ denotes the cubic interpolation polynomial that interpolates ${\textbf {F}}({\textbf {x}}(t),t)$ at $t_0$, $t_0+\Delta /3$, $t_0+2\Delta /3$ and $t_0+\Delta $, and $\frac{d^4{\textbf {F}}}{dt^4}$ denotes the total fourth derivative of ${\textbf {F}}({\textbf {x}}(t),t)$ with respect to $t$, evaluated at some intermediate point. This version of the 4-stage Runge–Kutta method can be viewed as Simpson's three-eighths quadrature rule approximating the integral.
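A numerical check of (A.3) is sketched below. Scheme (3.4) itself is not reproduced in this appendix, so the stage formulas used here are an assumption: they are those of Kutta's classical 3/8-rule, which has the same weights $(1,3,3,1)/8$ and nodes $t_0$, $t_0+\Delta /3$, $t_0+2\Delta /3$, $t_0+\Delta $ as the expressions above. The linear test problem $x'=x$, $x(0)=1$ is also hypothetical; it is chosen so that the one-step errors stay well above double precision round-off.

```python
import math

def f(x, t):
    # Hypothetical linear test problem (not from the paper): x' = x, x(0) = 1.
    return x

def rk4_three_eighths_step(x, t, dt):
    # Kutta's classical 3/8-rule; assumed here to correspond to scheme (3.4).
    k1 = f(x, t)
    k2 = f(x + dt * k1 / 3.0, t + dt / 3.0)
    k3 = f(x + dt * (-k1 / 3.0 + k2), t + 2.0 * dt / 3.0)
    k4 = f(x + dt * (k1 - k2 + k3), t + dt)
    return x + dt * (k1 + 3.0 * k2 + 3.0 * k3 + k4) / 8.0

def one_step_error(dt, t0=0.0):
    y1 = math.exp(t0)                          # start on the exact solution
    y2 = rk4_three_eighths_step(y1, t0, dt)    # Runge-Kutta4 target
    return abs(y2 - math.exp(t0 + dt))

steps = [0.4 / 2 ** k for k in range(4)]
errors = [one_step_error(dt) for dt in steps]

# Observed order: log2 of the error ratio when the step size is halved.
orders = [math.log2(errors[i] / errors[i + 1]) for i in range(len(errors) - 1)]

for dt, err in zip(steps, errors):
    print(f"dt={dt:.4f}  error={err:.3e}")
print("observed orders:", [round(p, 3) for p in orders])
```

For this linear problem the step reproduces the Taylor polynomial of $e^{\Delta }$ up to degree four, so the one-step error is dominated by $\Delta ^5/120$ and the observed orders settle near 5.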

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

About this article

Cite this article

Qiu, C., Bendickson, A., Kalyanapu, J. et al. Accuracy and Architecture Studies of Residual Neural Network Method for Ordinary Differential Equations. J Sci Comput 95, 50 (2023). https://doi.org/10.1007/s10915-023-02173-x

Received: 09 September 2021
Revised: 25 May 2022
Accepted: 06 March 2023
Published: 28 March 2023
DOI: https://doi.org/10.1007/s10915-023-02173-x
