
A deep neural network for operator learning enhanced by attention and gating mechanisms for long-time forecasting of tumor growth

  • Original Article
  • Published in Engineering with Computers

Abstract

Forecasting tumor progression and assessing the uncertainty of predictions play a crucial role in clinical settings, especially for determining disease outlook and making informed decisions about treatment approaches. In this work, we propose TGM-ONets, a computational framework based on physics-informed deep operator networks (PI-DeepONets) that combines bioimaging and tumor growth modeling (TGM) for enhanced prediction of tumor growth. Deep neural operators have recently emerged as a powerful tool for learning solution maps between function spaces, and once trained they generalize to make predictions for unseen input instances. Incorporating physical laws into the loss function of a deep neural operator can significantly reduce the amount of training data required. The novelty of TGM-ONets lies in the use of a convolutional block attention module (CBAM) and a gating mechanism, i.e., a mixture of experts (MoE), to extract features from the input images. Our results show that TGM-ONets not only captures the detailed morphological characteristics of mild and aggressive tumors within and outside the training domain but also predicts the long-term dynamics of both mild and aggressive tumor growth for up to 6 months, with a maximum error below 6.7 \(\times 10^{-2}\) for unseen input instances when two or three snapshots are added. We also systematically study the effects of the number of training snapshots and of noisy data on the performance of TGM-ONets, and we quantify the uncertainty of the model predictions. We demonstrate the efficiency and accuracy of the framework by comparing its performance with three state-of-the-art (SOTA) baseline models.
In summary, we propose a new deep learning model that integrates the TGM with sequential observations of tumor morphology to improve current approaches for predicting tumor growth, thus providing an advanced computational tool for patient-specific tumor prognosis.
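The core mechanics named in the abstract, a DeepONet-style branch–trunk combination and a mixture-of-experts gate in the feature extractor, can be illustrated compactly. The following is a minimal NumPy sketch, not the authors' TGM-ONets architecture: all layer sizes, weight matrices, and function names are illustrative assumptions, and the real model uses trained convolutional branch networks with CBAM rather than random linear maps.

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative sizes: m sensor points sampling the input function, p latent basis terms.
m, p = 50, 20

# Branch net (sketch): encodes the input function u sampled at m sensors into p coefficients.
W_b = rng.standard_normal((p, m)) / np.sqrt(m)
# Trunk net (sketch): encodes a 2D space-time query coordinate y into p basis values.
W_t = rng.standard_normal((p, 2)) / np.sqrt(2)

def relu(x):
    return np.maximum(x, 0.0)

def deeponet(u_sensors, y):
    """DeepONet evaluation: G(u)(y) ~ sum_k b_k(u) * t_k(y)."""
    b = relu(W_b @ u_sensors)  # branch coefficients b_k(u)
    t = relu(W_t @ y)          # trunk basis functions t_k(y)
    return float(b @ t)        # inner product gives the operator output at y

def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()

# MoE gate (sketch): two branch "experts" blended by input-dependent gate weights.
W_g = rng.standard_normal((2, m)) / np.sqrt(m)
W_b2 = rng.standard_normal((p, m)) / np.sqrt(m)

def moe_branch(u_sensors):
    g = softmax(W_g @ u_sensors)  # gate decides how much each expert contributes
    return g[0] * relu(W_b @ u_sensors) + g[1] * relu(W_b2 @ u_sensors)

u = np.sin(np.linspace(0, np.pi, m))  # one input-function snapshot
y = np.array([0.3, 0.7])              # a space-time query point
pred = deeponet(u, y)
```

In the physics-informed variant, the residual of the governing tumor-growth PDE evaluated at collocation points is added to the training loss, which is what lets the operator be trained with few labeled snapshots.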




Data availability statement

The data supporting this study's findings are available from the corresponding author upon reasonable request.

References

  1. Lorenzo G, Heiselman J S, Liss M A, Miga M I, Gomez H, Yankeelov T E, Reali A, Hughes T J. Patient-specific computational forecasting of prostate cancer growth during active surveillance using an imaging-informed biomechanistic model, arXiv preprint arXiv:2310.00060

  2. Xu J, Wang Y, Gomez H, Feng X-Q. Biomechanical modelling of tumor growth with chemotherapeutic treatment: A review, Smart Materials and Structures https://doi.org/10.1088/1361-665X/acf79a

  3. Lorenzo G, Ahmed S R, Hormuth II D A, Vaughn B, Kalpathy-Cramer J, Solorio L, Yankeelov T E, Gomez H. Patient-specific, mechanistic models of tumor growth incorporating artificial intelligence and big data, arXiv preprint arXiv:2308.14925

  4. Yankeelov TE, Atuegwu N, Hormuth D, Weis JA, Barnes SL, Miga MI, Rericha EC, Quaranta V (2013) Clinically relevant modeling of tumor growth and treatment response. Science Translational Medicine 5(187):187ps9-187ps9. https://doi.org/10.1126/scitranslmed.3005686

    Article  MATH  Google Scholar 

  5. Lorenzo G, Scott MA, Tew K, Hughes TJ, Zhang YJ, Liu L, Vilanova G, Gomez H (2016) Tissue-scale, personalized modeling and simulation of prostate cancer growth. Proc Natl Acad Sci 113(48):E7663–E7671. https://doi.org/10.1073/pnas.1615791113

    Article  MATH  Google Scholar 

  6. Lorenzo G, Scott M, Tew K, Hughes T, Gomez H (2017) Hierarchically refined and coarsened splines for moving interface problems, with particular application to phase-field models of prostate tumor growth. Comput Methods Appl Mech Eng 319:515–548. https://doi.org/10.1016/j.cma.2017.03.009

    Article  MathSciNet  MATH  Google Scholar 

  7. Lorenzo G, Hughes TJ, Dominguez-Frojan P, Reali A, Gomez H (2019) Computer simulations suggest that prostate enlargement due to benign prostatic hyperplasia mechanically impedes prostate cancer growth. Proc Natl Acad Sci 116(4):1152–1161. https://doi.org/10.1073/pnas.1815735116

    Article  MATH  Google Scholar 

  8. Colli P, Gomez H, Lorenzo G, Marinoschi G, Reali A, Rocca E (2020) Mathematical analysis and simulation study of a phase-field model of prostate cancer growth with chemotherapy and antiangiogenic therapy effects. Math Models Methods Appl Sci 30(07):1253–1295. https://doi.org/10.1142/S0218202520500220

    Article  MathSciNet  MATH  Google Scholar 

  9. Benítez JM, García-Mozos L, Santos A, Montáns FJ, Saucedo-Mora L (2022) A simple agent-based model to simulate 3D tumor-induced angiogenesis considering the evolution of the hypoxic conditions of the cells. Engineering with Computers 38(5):4115–4133. https://doi.org/10.1007/s00366-022-01625-6

    Article  Google Scholar 

  10. Feng Y, Fuentes D, Hawkins A, Bass J, Rylander MN, Elliott A, Shetty A, Stafford RJ, Oden JT (2009) Nanoshell-mediated laser surgery simulation for prostate cancer treatment. Engineering with Computers 25:3–13. https://doi.org/10.1007/s00366-008-0109-y

    Article  MATH  Google Scholar 

  11. Srinivasan A, Moure A, Gomez H (2023) Computational modeling of flow-mediated angiogenesis: Stokes–Darcy flow on a growing vessel network, Engineering with Computers 1–19 https://doi.org/10.1007/s00366-023-01889-6

  12. Lagergren JH, Nardini JT, Baker RE, Simpson MJ, Flores KB (2020) Biologically-informed neural networks guide mechanistic modeling from sparse experimental data. PLoS Comput Biol 16(12):e1008462. https://doi.org/10.1371/journal.pcbi.1008462

    Article  MATH  Google Scholar 

  13. Oden JT, Lima EA, Almeida RC, Feng Y, Rylander MN, Fuentes D, Faghihi D, Rahman MM, DeWitt M, Gadde M et al (2016) Toward predictive multiscale modeling of vascular tumor growth. Archives of Computational Methods in Engineering 23(4):735–779. https://doi.org/10.1007/s11831-015-9156-x

    Article  MathSciNet  MATH  Google Scholar 

  14. Fritz M, Jha PK, Köppl T, Oden JT, Wagner A, Wohlmuth B (2021) Modeling and simulation of vascular tumors embedded in evolving capillary networks. Comput Methods Appl Mech Eng 384:113975. https://doi.org/10.1016/j.cma.2021.113975

    Article  MathSciNet  MATH  Google Scholar 

  15. Wise SM, Lowengrub JS, Frieboes HB, Cristini V (2008) Three-dimensional multispecies nonlinear tumor growth-I: model and numerical method. J Theor Biol 253(3):524–543. https://doi.org/10.1016/j.jtbi.2008.03.027

    Article  MathSciNet  MATH  Google Scholar 

  16. Frieboes HB, Jin F, Chuang Y-L, Wise SM, Lowengrub JS, Cristini V (2010) Three-dimensional multispecies nonlinear tumor growth-II: tumor invasion and angiogenesis. J Theor Biol 264(4):1254–1278. https://doi.org/10.1016/j.jtbi.2010.02.036

    Article  MathSciNet  MATH  Google Scholar 

  17. Macklin P, McDougall S, Anderson AR, Chaplain MA, Cristini V, Lowengrub J (2009) Multiscale modelling and nonlinear simulation of vascular tumour growth. J Math Biol 58(4):765–798. https://doi.org/10.1007/s00285-008-0216-9

    Article  MathSciNet  MATH  Google Scholar 

  18. Anderson AR, Quaranta V (2008) Integrative mathematical oncology. Nat Rev Cancer 8(3):227–234. https://doi.org/10.1038/nrc2329

    Article  MATH  Google Scholar 

  19. Cristini V, Lowengrub J (2010) Multiscale modeling of cancer: An integrated experimental and mathematical modeling approach. Cambridge University Press, Cambridge

    Book  MATH  Google Scholar 

  20. Oden JT (2018) Adaptive multiscale predictive modelling. Acta Numer 27:353–450. https://doi.org/10.1017/S096249291800003X

    Article  MathSciNet  MATH  Google Scholar 

  21. Rahman MM, Feng Y, Yankeelov TE, Oden JT (2017) A fully coupled space-time multiscale modeling framework for predicting tumor growth. Comput Methods Appl Mech Eng 320:261–286. https://doi.org/10.1016/j.cma.2017.03.021

    Article  MathSciNet  MATH  Google Scholar 

  22. Rocha H, Almeida R, Lima E, Resende A, Oden J, Yankeelov T (2018) A hybrid three-scale model of tumor growth. Math Models Methods Appl Sci 28(01):61–93. https://doi.org/10.1142/S0218202518500021

    Article  MathSciNet  MATH  Google Scholar 

  23. Lima E, Oden J, Almeida R (2014) A hybrid ten-species phase-field model of tumor growth. Math Models Methods Appl Sci 24(13):2569–2599. https://doi.org/10.1142/S0218202514500304

    Article  MathSciNet  MATH  Google Scholar 

  24. Shen D, Wu G, Suk H-I (2017) Deep learning in medical image analysis. Annu Rev Biomed Eng 19:221–248. https://doi.org/10.1146/annurev-bioeng-071516-044442

    Article  MATH  Google Scholar 

  25. Haque IRI, Neubert J (2020) Deep learning approaches to biomedical image segmentation. Informatics in Medicine Unlocked 18:100297. https://doi.org/10.1016/j.imu.2020.100297

    Article  Google Scholar 

  26. Zhang Q, Sampani K, Xu M, Cai S, Deng Y, Li H, Sun JK, Karniadakis GE (2022) AOSLO-net: a deep learning-based method for automatic segmentation of retinal microaneurysms from adaptive optics scanning laser ophthalmoscopy images. Translational Vision Science & Technology 11(8):7–7. https://doi.org/10.1167/tvst.11.8.7

    Article  Google Scholar 

  27. Pereira SP, Oldfield L, Ney A, Hart PA, Keane MG, Pandol SJ, Li D, Greenhalf W, Jeon CY, Koay EJ et al (2020) Early detection of pancreatic cancer. The Lancet Gastroenterology & Hepatology 5(7):698–710. https://doi.org/10.1016/S2468-1253(19)30416-9

    Article  Google Scholar 

  28. Giampaolo F, De Rosa M, Qi P, Izzo S, Cuomo S (2022) Physics-informed neural networks approach for 1D and 2D Gray-Scott systems. Advanced Modeling and Simulation in Engineering Sciences 9(1):1–17. https://doi.org/10.1186/s40323-022-00219-7

    Article  MATH  Google Scholar 

  29. Weng Y, Zhou D (2022) Multiscale physics-informed neural networks for stiff chemical kinetics. J Phys Chem A 126(45):8534–8543. https://doi.org/10.1021/acs.jpca.2c06513

    Article  MATH  Google Scholar 

  30. Colin T, Iollo A, Lagaert J-B, Saut O (2014) An inverse problem for the recovery of the vascularization of a tumor. Journal of Inverse and Ill-posed Problems 22(6):759–786. https://doi.org/10.1515/jip-2013-0009

    Article  MathSciNet  MATH  Google Scholar 

  31. Feng X, Hormuth DA, Yankeelov TE (2019) An adjoint-based method for a linear mechanically-coupled tumor model: Application to estimate the spatial variation of murine glioma growth based on diffusion weighted magnetic resonance imaging. Comput Mech 63:159–180. https://doi.org/10.1007/s00466-018-1589-2

    Article  MathSciNet  MATH  Google Scholar 

  32. Gholami A, Mang A, Biros G (2016) An inverse problem formulation for parameter estimation of a reaction-diffusion model of low grade gliomas. J Math Biol 72(1):409–433. https://doi.org/10.1007/s00285-015-0888-x

    Article  MathSciNet  MATH  Google Scholar 

  33. Hogea C, Davatzikos C, Biros G (2008) An image-driven parameter estimation problem for a reaction-diffusion glioma growth model with mass effects. J Math Biol 56(6):793–825. https://doi.org/10.1007/s00285-007-0139-x

    Article  MathSciNet  MATH  Google Scholar 

  34. Knopoff DA, Fernández DR, Torres GA, Turner CV (2013) Adjoint method for a tumor growth pde-constrained optimization problem. Computers & Mathematics with Applications 66(6):1104–1119. https://doi.org/10.1016/j.camwa.2013.05.028

    Article  MathSciNet  MATH  Google Scholar 

  35. Subramanian S, Scheufele K, Mehl M, Biros G (2020) Where did the tumor start? An inverse solver with sparse localization for tumor growth models. Inverse Prob 36(4):045006. https://doi.org/10.1088/1361-6420/ab649c

    Article  MathSciNet  MATH  Google Scholar 

  36. Chen X, Summers RM, Yao J (2012) Kidney tumor growth prediction by coupling reaction-diffusion and biomechanical model. IEEE Trans Biomed Eng 60(1):169–173

    Article  MATH  Google Scholar 

  37. Konukoglu E, Clatz O, Menze BH, Stieltjes B, Weber M-A, Mandonnet E, Delingette H, Ayache N (2009) Image guided personalization of reaction-diffusion type tumor growth models using modified anisotropic eikonal equations. IEEE Trans Med Imaging 29(1):77–95

    Article  MATH  Google Scholar 

  38. Mi H, Petitjean C, Dubray B, Vera P, Ruan S (2014) Prediction of lung tumor evolution during radiotherapy in individual patients with PET. IEEE Trans Med Imaging 33(4):995–1003

    Article  MATH  Google Scholar 

  39. Wong KC, Summers RM, Kebebew E, Yao J (2016) Pancreatic tumor growth prediction with elastic-growth decomposition, image-derived motion, and FDM-FEM coupling. IEEE Trans Med Imaging 36(1):111–123

    Article  Google Scholar 

  40. Hormuth DA II, Weis JA, Barnes SL, Miga MI, Rericha EC, Quaranta V, Yankeelov TE (2015) Predicting in vivo glioma growth with the reaction diffusion equation constrained by quantitative magnetic resonance imaging data. Phys Biol 12(4):046006. https://doi.org/10.1088/1478-3975/12/4/046006

    Article  MATH  Google Scholar 

  41. Scheufele K, Mang A, Gholami A, Davatzikos C, Biros G, Mehl M (2019) Coupling brain-tumor biophysical models and diffeomorphic image registration. Comput Methods Appl Mech Eng 347:533–567. https://doi.org/10.1016/j.cma.2018.12.008

    Article  MathSciNet  MATH  Google Scholar 

  42. Raissi M (2018) Deep hidden physics models: Deep learning of nonlinear partial differential equations. The Journal of Machine Learning Research 19(1):932–955

    MathSciNet  MATH  Google Scholar 

  43. Raissi M, Perdikaris P, Karniadakis GE (2019) Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations. J Comput Phys 378:686–707. https://doi.org/10.1016/j.jcp.2018.10.045

    Article  MathSciNet  MATH  Google Scholar 

  44. Li S, Wang G, Di Y, Wang L, Wang H, Zhou Q (2023) A physics-informed neural network framework to predict 3D temperature field without labeled data in process of laser metal deposition. Eng Appl Artif Intell 120:105908. https://doi.org/10.1016/j.engappai.2023.105908

    Article  MATH  Google Scholar 

  45. Cai S, Li H, Zheng F, Kong F, Dao M, Karniadakis GE, Suresh S (2021) Artificial intelligence velocimetry and microaneurysm-on-a-chip for three-dimensional analysis of blood flow in physiology and disease. Proc Natl Acad Sci 118(13):e2100697118. https://doi.org/10.1073/pnas.2100697118

    Article  Google Scholar 

  46. Kissas G, Yang Y, Hwuang E, Witschey WR, Detre JA, Perdikaris P (2020) Machine learning in cardiovascular flows modeling: Predicting arterial blood pressure from non-invasive 4D flow MRI data using physics-informed neural networks. Comput Methods Appl Mech Eng 358:112623. https://doi.org/10.1016/j.cma.2019.112623

    Article  MathSciNet  MATH  Google Scholar 

  47. Sahli Costabal F, Yang Y, Perdikaris P, Hurtado DE, Kuhl E (2020) Physics-informed neural networks for cardiac activation mapping. Frontiers in Physics 8:42. https://doi.org/10.3389/fphy.2020.00042

    Article  MATH  Google Scholar 

  48. Lei J, Liu Q, Wang X (2022) Physics-informed multi-fidelity learning-driven imaging method for electrical capacitance tomography. Eng Appl Artif Intell 116:105467. https://doi.org/10.1016/j.engappai.2022.105467

    Article  MATH  Google Scholar 

  49. Ouyang H, Zhu Z, Chen K, Tian B, Huang B, Hao J (2023) Reconstruction of hydrofoil cavitation flow based on the chain-style physics-informed neural network. Eng Appl Artif Intell 119:105724. https://doi.org/10.1016/j.engappai.2022.105724

    Article  MATH  Google Scholar 

  50. Nguyen TNK, Dairay T, Meunier R, Mougeot M (2022) Physics-informed neural networks for non-Newtonian fluid thermo-mechanical problems: An application to rubber calendering process. Eng Appl Artif Intell 114:105176. https://doi.org/10.1016/j.engappai.2022.105176

    Article  Google Scholar 

  51. Ren P, Rao C, Sun H, Liu Y. SeismicNet: Physics-informed neural networks for seismic wave modeling in semi-infinite domain, arXiv preprint arXiv:2210.14044

  52. Lorenzo G, Hormuth DA II, Jarrett AM, Lima EA, Subramanian S, Biros G, Oden JT, Hughes TJ, Yankeelov TE (2022) Quantitative in vivo imaging to enable tumour forecasting and treatment optimization. In: Cancer Complexity (ed) Computation. New York, Springer, pp 55–97

  53. Zhang E, Dao M, Karniadakis GE, Suresh S (2022) Analyses of internal structures and defects in materials using physics-informed neural networks. Sci Adv 8(7):eabk0644. https://doi.org/10.1126/sciadv.abk0644

    Article  MATH  Google Scholar 

  54. Karniadakis GE, Kevrekidis IG, Lu L, Perdikaris P, Wang S, Yang L (2021) Physics-informed machine learning. Nature Reviews Physics 3(6):422–440. https://doi.org/10.1038/s42254-021-00314-5

    Article  MATH  Google Scholar 

  55. Cai S, Mao Z, Wang Z, Yin M, Karniadakis G E (2022) Physics-informed neural networks (PINNs) for fluid mechanics: A review, Acta Mechanica Sinica 1–12 https://doi.org/10.1007/s10409-021-01148-1

  56. Jagtap AD, Kharazmi E, Karniadakis GE (2020) Conservative physics-informed neural networks on discrete domains for conservation laws: Applications to forward and inverse problems. Comput Methods Appl Mech Eng 365:113028. https://doi.org/10.1016/j.cma.2020.113028

    Article  MathSciNet  MATH  Google Scholar 

  57. Yang L, Meng X, Karniadakis GE (2021) B-PINNs: Bayesian physics-informed neural networks for forward and inverse PDE problems with noisy data. J Comput Phys 425:109913. https://doi.org/10.1016/j.jcp.2020.109913

    Article  MathSciNet  MATH  Google Scholar 

  58. Du P, Zhu X, Wang J-X (2022) Deep learning-based surrogate model for three-dimensional patient-specific computational fluid dynamics. Phys Fluids 34(8):081906. https://doi.org/10.1063/5.0101128

    Article  MATH  Google Scholar 

  59. Chen Q, Ye Q, Zhang W, Li H, Zheng X (2023) TGM-Nets: A deep learning framework for enhanced forecasting of tumor growth by integrating imaging and modeling. Eng Appl Artif Intell 126:106867. https://doi.org/10.1016/j.engappai.2023.106867

    Article  MATH  Google Scholar 

  60. Ruiz Herrera C, Grandits T, Plank G, Perdikaris P, Sahli Costabal F, Pezzuto S (2022) Physics-informed neural networks to learn cardiac fiber orientation from multiple electroanatomical maps, Engineering with Computers 38(5), 3957–3973. https://doi.org/10.1007/s00366-022-01709-3

  61. Tajdari M, Tajdari F, Shirzadian P, Pawar A, Wardak M, Saha S, Park C, Huysmans T, Song Y, Zhang YJ et al (2022) Next-generation prognosis framework for pediatric spinal deformities using bio-informed deep learning networks. Engineering with Computers 38(5):4061–4084. https://doi.org/10.1007/s00366-022-01742-2

    Article  MATH  Google Scholar 

  62. Lee SY, Park C-S, Park K, Lee HJ, Lee S (2023) A physics-informed and data-driven deep learning approach for wave propagation and its scattering characteristics. Engineering with Computers 39(4):2609–2625. https://doi.org/10.1007/s00366-022-01640-7

    Article  MATH  Google Scholar 

  63. Fallah A, Aghdam M M (2023) Physics-informed neural network for bending and free vibration analysis of three-dimensional functionally graded porous beam resting on elastic foundation, Engineering with Computers 1–18 https://doi.org/10.1007/s00366-023-01799-7

  64. Mai H T, Mai D D, Kang J, Lee J, Lee J (2023) Physics-informed neural energy-force network: a unified solver-free numerical simulation for structural optimization, Engineering with Computers 1–24 https://doi.org/10.1007/s00366-022-01760-0

  65. Wang S, Wang H, Perdikaris P (2021) Learning the solution operator of parametric partial differential equations with physics-informed DeepONets. Sci Adv 7(40):eabi8605. https://doi.org/10.1126/sciadv.abi8605

    Article  MATH  Google Scholar 

  66. Koric S, Viswantah A, Abueidda D W, Sobh N A, Khan K (2023) Deep learning operator network for plastic deformation with variable loads and material properties, Engineering with Computers 1–13 https://doi.org/10.1007/s00366-023-01822-x

  67. Linka K, Schäfer A, Meng X, Zou Z, Karniadakis GE, Kuhl E (2022) Bayesian physics informed neural networks for real-world nonlinear dynamical systems. Comput Methods Appl Mech Eng 402:115346. https://doi.org/10.1016/j.cma.2022.115346

    Article  MathSciNet  MATH  Google Scholar 

  68. Zakir Ullah M, Zheng Y, Song J, Aslam S, Xu C, Kiazolu GD, Wang L (2021) An attention-based convolutional neural network for acute lymphoblastic leukemia classification. Appl Sci 11(22):10662. https://doi.org/10.3390/app112210662

    Article  Google Scholar 

  69. Yin W, Schütze H, Xiang B, Zhou B (2016) Abcnn: Attention-based convolutional neural network for modeling sentence pairs. Transactions of the Association for computational linguistics 4:259–272. https://doi.org/10.1162/tacl_a_00097

    Article  Google Scholar 

  70. Ling H, Wu J, Huang J, Chen J, Li P (2020) Attention-based convolutional neural network for deep face recognition. Multimedia Tools and Applications 79:5595–5616. https://doi.org/10.1007/s11042-019-08422-2

    Article  MATH  Google Scholar 

  71. Shen Y, Huang X-J (2016) Attention-based convolutional neural network for semantic relation extraction, in: Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers, 2526–2536

  72. Jacobs RA, Jordan MI, Nowlan SJ, Hinton GE (1991) Adaptive mixtures of local experts. Neural Comput 3(1):79–87

    Article  MATH  Google Scholar 

  73. Wang S, Perdikaris P (2023) Long-time integration of parametric evolution equations with physics-informed deeponets. J Comput Phys 475:111855. https://doi.org/10.1016/j.jcp.2022.111855

    Article  MathSciNet  MATH  Google Scholar 

  74. Michałowska K, Goswami S, Karniadakis G E, Riemer-Sørensen S. Neural operator learning for long-time integration in dynamical systems with recurrent neural networks, arXiv preprint arXiv:2303.02243

  75. Zhu M, Zhang H, Jiao A, Karniadakis GE, Lu L (2023) Reliable extrapolation of deep neural operators informed by physics or sparse observations. Comput Methods Appl Mech Eng 412:116064. https://doi.org/10.1016/j.cma.2023.116064

    Article  MathSciNet  MATH  Google Scholar 

  76. Osband I, Aslanides J, Cassirer A. Randomized prior functions for deep reinforcement learning, Advances in Neural Information Processing Systems 31

  77. Xu J, Vilanova G, Gomez H (2016) A mathematical model coupling tumor growth and angiogenesis. PLoS ONE 11(2):e0149422. https://doi.org/10.1371/journal.pone.0149422

    Article  MATH  Google Scholar 

  78. Xu S, Xu Z, Kim OV, Litvinov RI, Weisel JW, Alber M (2017) Model predictions of deformation, embolization and permeability of partially obstructive blood clots under variable shear flow. J R Soc Interface 14(136):20170441. https://doi.org/10.1098/rsif.2017.0441

    Article  MATH  Google Scholar 

  79. Xu J, Vilanova G, Gomez H (2020) Phase-field model of vascular tumor growth: Three-dimensional geometry of the vascular network and integration with imaging data. Comput Methods Appl Mech Eng 359:112648. https://doi.org/10.1016/j.cma.2019.112648

    Article  MathSciNet  MATH  Google Scholar 

  80. Kobayashi R (2010) A brief introduction to phase field method, in: AIP Conference Proceedings, Vol. 1270, American Institute of Physics, 282–291. https://doi.org/10.1063/1.3476232

  81. Lu L, Jin P, Pang G, Zhang Z, Karniadakis GE (2021) Learning nonlinear operators via DeepONet based on the universal approximation theorem of operators. Nature Machine Intelligence 3(3):218–229

    Article  MATH  Google Scholar 

  82. Chen T, Chen H (1995) Universal approximation to nonlinear operators by neural networks with arbitrary activation functions and its application to dynamical systems. IEEE Trans Neural Networks 6(4):911–917

    Article  MATH  Google Scholar 

  83. Deng B, Shin Y, Lu L, Zhang Z, Karniadakis GE (2022) Approximation rates of DeepONets for learning operators arising from advection-diffusion equations. Neural Netw 153:411–426. https://doi.org/10.1016/j.neunet.2022.06.019

    Article  MATH  Google Scholar 

  84. Lu L, Jin P, Karniadakis G E. Deeponet: Learning nonlinear operators for identifying differential equations based on the universal approximation theorem of operators, arXiv preprint arXiv:1910.03193

  85. Lu L, Meng X, Cai S, Mao Z, Goswami S, Zhang Z, Karniadakis GE (2022) A comprehensive and fair comparison of two neural operators (with practical extensions) based on fair data. Comput Methods Appl Mech Eng 393:114778. https://doi.org/10.1016/j.cma.2022.114778

    Article  MathSciNet  MATH  Google Scholar 

  86. He J, Kushwaha S, Park J, Koric S, Abueidda D, Jasiuk I (2024) Sequential Deep Operator networks (S-DeepONet) for predicting full-field solutions under time-dependent loads. Eng Appl Artif Intell 127:107258. https://doi.org/10.1016/j.engappai.2023.107258

    Article  MATH  Google Scholar 

  87. Sun Y, Moya C, Lin G, Yue M, Deepgraphonet: A deep graph operator network to learn and zero-shot transfer the dynamic response of networked systems, IEEE Systems Journal

  88. Goswami S, Yin M, Yu Y, Karniadakis GE (2022) A physics-informed variational deeponet for predicting crack path in quasi-brittle materials. Comput Methods Appl Mech Eng 391:114587. https://doi.org/10.1016/j.cma.2022.114587

    Article  MathSciNet  MATH  Google Scholar 

  89. Goswami S, Bora A, Yu Y, E G (2023) Karniadakis, Physics-informed deep neural operator networks, in: Machine Learning in Modeling and Simulation: Methods and Applications, Springer, New York, pp. 219–254

  90. Koric S, Abueidda DW (2023) Data-driven and physics-informed deep learning operators for solution of heat conduction equation with parametric heat source. Int J Heat Mass Transf 203:123809. https://doi.org/10.1016/j.ijheatmasstransfer.2022.123809

    Article  MATH  Google Scholar 

  91. Hao Y, Di Leoni PC, Marxen O, Meneveau C, Karniadakis GE, Zaki TA (2023) Instability-wave prediction in hypersonic boundary layers with physics-informed neural operators. Journal of Computational Science 73:102120. https://doi.org/10.1016/j.jocs.2023.102120

    Article  MATH  Google Scholar 

  92. Iqbal S, Ghani MU, Saba T, Rehman A (2018) Brain tumor segmentation in multi-spectral MRI using convolutional neural networks (CNN). Microsc Res Tech 81(4):419–427. https://doi.org/10.1002/jemt.22994

    Article  Google Scholar 

  93. Chen L, Wu Y, DSouza A M, Abidin A Z, Wismüller A, Xu C (2018) MRI tumor segmentation with densely connected 3D CNN, in: Medical Imaging 2018: Image Processing, Vol. 10574, SPIE, pp. 357–364. https://doi.org/10.1117/12.2293394

  94. Pereira S, Pinto A, Alves V, Silva CA (2016) Brain tumor segmentation using convolutional neural networks in MRI images. IEEE Trans Med Imaging 35(5):1240–1251

    Article  MATH  Google Scholar 

  95. Havaei M, Davy A, Warde-Farley D, Biard A, Courville A, Bengio Y, Pal C, Jodoin P-M, Larochelle H (2017) Brain tumor segmentation with deep neural networks. Med Image Anal 35:18–31. https://doi.org/10.1016/j.media.2016.05.004

    Article  Google Scholar 

  96. Havaei M, Dutil F, Pal C, Larochelle H, Jodoin P-M (2016) A convolutional neural network approach to brain tumor segmentation, in: Brainlesion: Glioma, Multiple Sclerosis, Stroke and Traumatic Brain Injuries: First International Workshop, Brainles 2015, Held in Conjunction with MICCAI 2015, Munich, Germany, October 5, 2015, Revised Selected Papers 1, Springer, pp. 195–208. https://doi.org/10.1007/978-3-319-30858-6_17

  97. Woo S, Park J, Lee J-Y, Kweon I S (2018) CBAM: Convolutional block attention module, in: Proceedings of the European Conference on Computer Vision (ECCV), pp. 3–19

  98. Ioffe S, Szegedy C (2015) Batch normalization: Accelerating deep network training by reducing internal covariate shift, in: International Conference on Machine Learning, PMLR, pp. 448–456

  99. Zhou Y, Li D, Huo S, Kung S-Y (2021) Shape autotuning activation function. Expert Syst Appl 171:114534. https://doi.org/10.1016/j.eswa.2020.114534

    Article  Google Scholar 

  100. Wang S, Wang H, Perdikaris P (2022) Improved architectures and training algorithms for deep operator networks. J Sci Comput 92(2):35. https://doi.org/10.1007/s10915-022-01881-0

    Article  MathSciNet  MATH  Google Scholar 

  101. Waterhouse S, Cook G, Ensemble methods for phoneme classification, Advances in Neural Information Processing Systems 9

  102. Nguyen MH, Abbass HA, Mckay RI (2006) A novel mixture of experts model based on cooperative coevolution. Neurocomputing 70(1–3):155–163. https://doi.org/10.1016/j.neucom.2006.04.009

    Article  MATH  Google Scholar 

  103. Ebrahimpour R, Kabir E, Yousefi MR (2007) Face detection using mixture of MLP experts. Neural Process Lett 26:69–82. https://doi.org/10.1007/s11063-007-9043-z

    Article  MATH  Google Scholar 

  104. Übeyli ED, Ilbay K, Ilbay G, Sahin D, Akansel G (2010) Differentiation of two subtypes of adult hydrocephalus by mixture of experts. J Med Syst 34:281–290. https://doi.org/10.1007/s10916-008-9239-4

    Article  Google Scholar 

  105. Ebrahimpour R, Nikoo H, Masoudnia S, Yousefi MR, Ghaemi MS (2011) Mixture of MLP-experts for trend forecasting of time series: A case study of the tehran stock exchange. Int J Forecast 27(3):804–816. https://doi.org/10.1016/j.ijforecast.2010.02.015

    Article  MATH  Google Scholar 

  106. Kingma D P, Ba J, Adam: A method for stochastic optimization, arXiv preprint arXiv:1412.6980

  107. Raissi M, Yazdani A, Karniadakis GE (2020) Hidden fluid mechanics: Learning velocity and pressure fields from flow visualizations. Science 367(6481):1026–1030. https://doi.org/10.1126/science.aaw4741

    Article  MathSciNet  MATH  Google Scholar 

  108. Yin M, Zheng X, Humphrey JD, Karniadakis GE (2021) Non-invasive inference of thrombus material properties with physics-informed neural networks. Comput Methods Appl Mech Eng 375:113603. https://doi.org/10.1016/j.cma.2020.113603

    Article  MathSciNet  MATH  Google Scholar 

  109. Shi X, Chen Z, Wang H, Yeung D-Y, Wong W-K, Woo W-c, Convolutional LSTM network: A machine learning approach for precipitation nowcasting, Advances in Neural Information Processing Systems 28

  110. Kirby R M, Karniadakis G E, Spectral element and hp methods, Encyclopedia of Computational Mechanics

  111. Hanahan D, Weinberg RA (2011) Hallmarks of cancer: the next generation. Cell 144(5):646–674. https://doi.org/10.1016/j.cell.2011.02.013

    Article  MATH  Google Scholar 

  112. Lu L, Dao M, Kumar P, Ramamurty U, Karniadakis GE, Suresh S (2020) Extraction of mechanical properties of materials through deep learning from instrumented indentation. Proc Natl Acad Sci 117(13):7052–7062. https://doi.org/10.1073/pnas.1922210117

    Article  Google Scholar 

  113. Sanga S, Sinek JP, Frieboes HB, Ferrari M, Fruehauf JP, Cristini V (2006) Mathematical modeling of cancer progression and response to chemotherapy. Expert Rev Anticancer Ther 6(10):1361–1376. https://doi.org/10.1586/14737140.6.10.1361

  114. Ayensa-Jiménez J, Doweidar MH, Sanz-Herrera JA, Doblare M (2022) Understanding glioblastoma invasion using physically-guided neural networks with internal variables. PLoS Comput Biol 18(4):e1010019. https://doi.org/10.1371/journal.pcbi.1010019

  115. Gao Q, Lin H, Qian J, Liu X, Cai S, Li H, Fan H, Zheng Z (2023) A deep learning model for efficient end-to-end stratification of thrombotic risk in left atrial appendage. Eng Appl Artif Intell 126:107187. https://doi.org/10.1016/j.engappai.2023.107187

  116. Qi C R, Su H, Mo K, Guibas L J (2017) Pointnet: Deep learning on point sets for 3D classification and segmentation, in: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 652–660

  117. Garcia-Garcia A, Gomez-Donoso F, Garcia-Rodriguez J, Orts-Escolano S, Cazorla M, Azorin-Lopez J (2016) PointNet: a 3D convolutional neural network for real-time object class recognition. In: 2016 International Joint Conference on Neural Networks (IJCNN), IEEE, pp 1578–1584

  118. Aoki Y, Goforth H, Srivatsan R A, Lucey S (2019) Pointnetlk: Robust & efficient point cloud registration using pointnet, in: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 7163–7172

Acknowledgements

Q.C. and X.Z. gratefully acknowledge support from the starting fund of Jinan University, Guangzhou, Guangdong Province, China.

Author information

Contributions

Qijing Chen: Conceptualization (equal); Formal analysis (equal); Investigation (equal); Methodology (equal); Software (equal); Validation (equal); Writing-original draft (equal); Writing-review & editing (equal). He Li: Conceptualization (equal); Writing-original draft (equal); Writing-review & editing (equal). Xiaoning Zheng: Conceptualization (equal); Funding acquisition (equal); Formal analysis (equal); Investigation (equal); Methodology (equal); Software (equal); Supervision (equal); Writing-original draft (equal); Writing-review & editing (equal).

Corresponding author

Correspondence to Xiaoning Zheng.

Ethics declarations

Conflict of interest

The authors declare no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendices

A Convergence of spectral/hp element (Nektar) results for TGMs

We give a brief introduction to the spectral/hp element method, which we use to solve the PDEs for tumor growth and generate the synthetic data; more details can be found in [110]. We first define the weak form of the PDE and impose the boundary conditions, and then discretize the computational domain into subdomains. Below we use the one-dimensional Poisson equation on the interval \(0< x \le 1\) for illustration: \(\Delta u + f = 0\), with the Dirichlet condition \(u(x = 0) = g_D = 1\) and the Neumann condition \(\frac{\partial u}{\partial x}(x = 1) = g_N = 1\).

  1.

    We obtain the weak form by multiplying the equation by a test function from a discrete test space and integrating the second-order derivative by parts: \(\int _{0}^{1}\frac{\partial v^{\delta }}{\partial x}\frac{\partial u^{\delta }}{\partial x}\,dx = \int _{0}^{1} v^{\delta }f \,dx + v^{\delta }(1)g_{N}.\)

  2.

    We lift the known Dirichlet data from the problem by decomposing the solution into a part satisfying the Dirichlet boundary condition and a homogeneous part, \(u^{\delta } = u^{D}+u^{H}\), so that the weak form becomes \(\int _{0}^{1}\frac{\partial v^{\delta }}{\partial x}\frac{\partial u^{H}}{\partial x}\,dx = \int _{0}^{1} v^{\delta }f \,dx + v^{\delta }(1)g_{N}-\int _{0}^{1}\frac{\partial v^{\delta }}{\partial x}\frac{\partial u^{D}}{\partial x}\,dx.\)

We use piecewise linear basis functions and decompose the domain into two subdomains. A finer mesh yields h-convergence, while higher-order polynomial basis functions yield p-type convergence. For the linear two-subdomain case, the approximate expansion has the form \(u^{\delta } = \sum _{i = 0}^{2} \hat{u}_{i}\Phi _i(x)\), where the \(\Phi _i(x)\) are piecewise linear functions. We then represent f in terms of the basis functions, \(f(x) = \sum _{i = 0}^2\hat{f}_i\Phi _i(x)\). Finally, we solve the resulting linear system of equations for the numerical solution u, which for this example coincides with a finite element approximation.
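The two-step procedure above can be sketched numerically for this model problem. Below is a minimal linear finite element solve of the Poisson example (constant \(f = 1\), \(g_D = g_N = 1\)), assuming exact integration of the load; the function name and element count are illustrative and not part of the Nektar implementation.

```python
import numpy as np

def solve_poisson_1d(n_elem, f=1.0, g_D=1.0, g_N=1.0):
    """Linear FEM solve of u'' + f = 0 on (0, 1) with
    u(0) = g_D and u'(1) = g_N, assuming constant f."""
    h = 1.0 / n_elem
    n = n_elem + 1                              # number of nodes
    A = np.zeros((n, n))                        # global stiffness matrix
    F = np.zeros(n)                             # global load vector
    for e in range(n_elem):                     # element-by-element assembly
        A[e:e + 2, e:e + 2] += np.array([[1.0, -1.0], [-1.0, 1.0]]) / h
        F[e:e + 2] += f * h / 2.0               # exact load for constant f
    F[-1] += g_N                                # Neumann term v(1) * g_N
    u = np.empty(n)
    u[0] = g_D                                  # lift the Dirichlet value
    u[1:] = np.linalg.solve(A[1:, 1:], F[1:] - A[1:, 0] * g_D)
    return u

u = solve_poisson_1d(n_elem=2)                  # two subdomains, as in the text
# Exact solution for f = 1: u(x) = -x**2/2 + 2*x + 1
```

For this 1D problem with exact load integration, the linear FEM solution is exact at the nodes, which makes the sketch easy to check against the closed-form solution.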

We solve the governing equations for tumor growth in Sect. 2.1 using a spectral/hp element Nektar solver with \(\Delta t\) = \(1.0\,\times \,10^{-2}\), \(1.0\,\times \,10^{-3}\), and \(1.0\,\times \,10^{-4}\), together with a reference run at \(\Delta t\) = \(1.0\,\times \,10^{-5}\). The characteristic length and time scales are 1 mm and 1 day. We found that for \(\Delta t \le 1.0\,\times \,10^{-3}\), the differences between the results at different \(\Delta t\) are marginal. For aggressive tumors, the maximum difference (i.e., the maximum pointwise absolute difference in \(\phi\) between two simulation runs) between \(\Delta t = 1.0\,\times \,10^{-2}\) and \(\Delta t = 1.0\,\times \,10^{-5}\) is \(2.08\,\times \,10^{-2}\); between \(\Delta t = 1.0\,\times \,10^{-3}\) and \(\Delta t = 1.0\,\times \,10^{-5}\) it is \(2.99\,\times \,10^{-3}\); and between \(\Delta t = 1.0\,\times \,10^{-4}\) and \(\Delta t = 1.0\,\times \,10^{-5}\) it is \(1.32\,\times \,10^{-3}\). For all numerical simulations conducted with Nektar, we use polynomial order 3, time step size \(\Delta t\) = \(1.0\,\times \,10^{-3}\), and mesh size \(6.67\,\times \,10^{-3}\) in both the x- and y-directions, which results in 22,500 quadrilateral elements, and we run the solver in parallel on 256 CPU nodes. Table 43 shows the parameters used in the simulations. It takes about 1.1 h to run one mild tumor case up to 80 days and 3.9 h to run one aggressive tumor case up to 200 days.
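As a small illustration, the convergence metric described in the text (the maximum pointwise absolute difference between \(\phi\) fields from two runs on the same grid) can be computed as follows; the toy field values are placeholders, not simulation output.

```python
import numpy as np

def max_pointwise_diff(phi_a, phi_b):
    """Maximum pointwise absolute difference between two fields
    sampled on the same grid (the time-step convergence metric)."""
    return float(np.max(np.abs(np.asarray(phi_a) - np.asarray(phi_b))))

# Toy fields: identical except one grid point perturbed by 0.01
phi_coarse = np.ones((4, 4))
phi_fine = np.ones((4, 4))
phi_fine[2, 2] = 0.99
```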

Table 43 Parameters used for mechanistic simulations using phase-field model

B Forecast the tumor growth using the initial density of nutrients as the input for the branch net

Fig. 32

Prediction for tumor cells and nutrient dynamics for mild tumor cases mapping from the initial density of nutrients. a Prediction errors for training and testing datasets. The blue lines represent the mean of prediction errors in training datasets. The red lines represent the mean of prediction errors in testing datasets. The shaded region represents the region encompassed by the maximum and minimum of the prediction errors. b Predictions of the tumor morphologies \(\phi\) at different times. left: R = 0.07 mm; right: R = 0.21 mm (R: the length of the minor axis of the initial ellipsoidal tumor)

Fig. 33

Prediction for tumor cells and nutrient dynamics for aggressive tumor cases mapping from the initial density of nutrients. a Prediction errors for training and testing datasets. The blue lines represent the mean of prediction errors in training datasets. The red lines represent the mean of prediction errors in testing datasets. The shaded region represents the region encompassed by the maximum and minimum of the prediction errors. b Predictions of the tumor morphologies \(\phi\) at different times. left: R = 0.07 mm; right: R = 0.23 mm (R: the length of the minor axis of the initial ellipsoidal tumor)

We also test the performance of TGM-ONets in forecasting tumor growth using the initial density of nutrients as the input for the branch net. The hyper-parameters (i.e., the initial learning rate, the decay step, \(\omega _{PDE}\), and \(\omega _{data}\)) are the same as in Sect. 3.1.1. We parameterize the initial density of nutrients for both mild and aggressive tumors as:

$$\begin{aligned} \sigma (0, x, y) = 1 - 0.8\phi (0, x, y), \end{aligned}$$
(24)

which represents an ellipsoidal nutrient field corresponding to an ellipsoidal tumor in the computational domain whose y-semiaxis is twice its x-semiaxis. We use the same training datasets as in Sect. 3.1.1 to train TGM-ONets.
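As an illustration of Eq. (24), a minimal construction of the initial fields on a uniform grid is sketched below. Only the relation \(\sigma = 1 - 0.8\phi\) comes from Eq. (24); the tanh interface profile, the interface width `eps`, and the grid resolution are assumptions made for illustration.

```python
import numpy as np

# Illustrative construction of the initial fields on a unit grid
nx = ny = 101
x, y = np.meshgrid(np.linspace(0, 1, nx), np.linspace(0, 1, ny))

R, eps = 0.05, 0.01          # minor (x) semi-axis and assumed interface width
# Ellipse whose y-semiaxis is twice the x-semiaxis, centered at (0.5, 0.5)
r = np.sqrt(((x - 0.5) / R)**2 + ((y - 0.5) / (2 * R))**2)
phi0 = 0.5 * (1.0 - np.tanh((r - 1.0) / eps))   # ~1 inside tumor, ~0 outside

sigma0 = 1.0 - 0.8 * phi0                        # Eq. (24)
```

Inside the tumor the nutrient density is depleted to 0.2, and far from the tumor it approaches 1, consistent with the ellipsoidal nutrient field described above.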

For mild tumor cases, the prediction errors for all training and testing cases given by TGM-ONets are presented in Fig. 32a, b, which show that the average prediction errors for both \(\phi\) and \(\sigma\) are under \(1.0\,\times \,10^{-3}\) in the training datasets and \(2.0\,\times \,10^{-2}\) in the testing datasets. The maximum prediction errors are around \(2.0\,\times \,10^{-3}\) for \(\phi\) and \(5.0\,\times \,10^{-4}\) for \(\sigma\) in the training datasets, and \(6.0\,\times \,10^{-2}\) for \(\phi\) and \(5.0\,\times \,10^{-2}\) for \(\sigma\) in the testing datasets. Predictions for two specific cases, R = 0.07 mm (in-distribution) and R = 0.21 mm (out-of-distribution), are illustrated in Fig. 32c, d.

For aggressive tumor cases, the prediction errors for all training and testing cases given by TGM-ONets are presented in Fig. 33a, b, which show that the average prediction errors for both \(\phi\) and \(\sigma\) are under \(5.0\,\times \,10^{-4}\) in the training datasets and \(2.0\,\times \,10^{-2}\) in the testing datasets. The maximum prediction errors are under \(2.0\,\times \,10^{-3}\) for both \(\phi\) and \(\sigma\) in the training datasets and \(4.0\,\times \,10^{-2}\) for both in the testing datasets. Predictions for two specific cases, R = 0.07 mm (in-distribution) and R = 0.23 mm (out-of-distribution), are illustrated in Fig. 33c, d.

C Forecast the tumor growth using the initial density of tumor cells with varying shapes as the input for the branch net

We evaluate the performance of TGM-ONets in forecasting tumor growth using the initial density of tumor cells with varying shapes as the input for the branch net. For mild tumors, we vary the ratio of the y-semiaxis to the x-semiaxis (\(\delta\)) (C.1), the position within the domain (C.2), the length of the minor axis of the initial ellipsoidal tumor centered at (0.5, 0.5) (C.3) and not centered at (0.5, 0.5) (C.4), circular shapes (C.5), and oblique ellipsoidal tumors (C.6). For aggressive tumors, we vary the ratio of the y-semiaxis to the x-semiaxis (\(\delta\)) (C.7) and the position (C.8). The hyper-parameters (i.e., the initial learning rate, the decay step, \(\omega _{PDE}\), and \(\omega _{data}\)) are the same as in Sect. 3.1.1.

1.1 C.1 Forecast the mild tumor growth using the initial density of tumor cells with varying ratio (\(\delta\)) of y-semiaxis to the x-semiaxis

For mild tumor cases, we use TGM-ONets to learn the mapping from the initial density of tumor cells with varying \(\delta\) to the solutions for tumor cells and nutrients on the entire computational domain. The growth rate and the length of the minor axis of the initial ellipsoidal tumor R remain 1.5 1/day and 0.05 mm. We sample 1000 values of \(\delta\) from a uniform distribution U(1.0, 2.6). Assuming we have 8 cases of data recording the density of tumor cells and nutrients every 0.5 days up to 70.5 days with different values of \(\delta\) sampled from U(1.0, 2.6), we follow the same training procedure as in Sect. 3.1.1. We evaluate the performance of TGM-ONets on testing datasets with different values of \(\delta\) sampled from U(1.0, 2.9). The prediction accuracy for both the training and testing datasets is presented in Fig. 34a, from which we can see that the maximum prediction errors for \(\phi\) and \(\sigma\) are bounded by \(6.0\,\times \,10^{-4}\) and \(3.0\,\times \,10^{-4}\) in the training datasets, and by \(8.0\,\times \,10^{-4}\) and \(2.0\,\times \,10^{-3}\) in the testing datasets. The average prediction errors for \(\phi\) and \(\sigma\) are under \(3.0\,\times \,10^{-4}\) in the training datasets and \(5.0\,\times \,10^{-4}\) in the testing datasets. Predictions for two specific cases, \(\delta\) = 1.4 (in-distribution) and \(\delta\) = 2.9 (out-of-distribution), are illustrated in Fig. 34b.
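A minimal sketch of this sampling setup is given below: aspect ratios \(\delta\) are drawn from U(1.0, 2.6) and mapped to sharp-interface ellipse images that stand in for the branch-net inputs. The grid size, random seed, indicator-function representation, and helper name are illustrative assumptions, not the paper's preprocessing pipeline.

```python
import numpy as np

rng = np.random.default_rng(0)

# Training aspect ratios delta ~ U(1.0, 2.6); testing would use U(1.0, 2.9)
deltas_train = rng.uniform(1.0, 2.6, size=1000)

def initial_phi(delta, R=0.05, n=64):
    """Sharp-interface ellipse indicator: an illustrative stand-in for
    the initial tumor-cell density (y-semiaxis = delta * R)."""
    x, y = np.meshgrid(np.linspace(0, 1, n), np.linspace(0, 1, n))
    inside = ((x - 0.5) / R)**2 + ((y - 0.5) / (delta * R))**2 <= 1.0
    return inside.astype(float)

# Assemble a small batch of branch-net inputs from the first 8 samples
branch_inputs = np.stack([initial_phi(d) for d in deltas_train[:8]])
```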

Fig. 34

Prediction for tumor cells and nutrient dynamics for mild tumor cases mapping from the initial density of tumor cells with varying \(\delta\). a Prediction errors for training and testing datasets. The blue lines represent the mean of prediction errors in training datasets. The red lines represent the mean of prediction errors in testing datasets. The shaded region represents the region encompassed by the maximum and minimum of the prediction errors. b Predictions of the tumor morphologies \(\phi\) at different times. left: \(\delta\) = 1.4; right: \(\delta\) = 2.9 (\(\delta\): the ratio of the y-semiaxis to the x-semiaxis)

1.2 C.2 Forecast the mild tumor growth using the initial density of tumor cells with varying positions within the domain

In this subsection, we use TGM-ONets to learn the mapping from the initial density of tumor cells with varying positions within the domain to the solutions for tumor cells and nutrients on the entire computational domain for mild tumor cases. The growth rate and the length of the minor axis of the initial ellipsoidal tumor R remain 1.5 1/day and 0.05 mm. Let \((x^{*}, y^{*})\) denote the center position of the tumor cells and nutrients; we sample 1000 values of \(x^{*}\) and \(y^{*}\) from a uniform distribution U(0.4, 0.6). Assuming we have 8 cases of data recording the density of tumor cells and nutrients every 0.5 days up to 70.5 days with different values of \(x^{*}\) and \(y^{*}\) sampled from U(0.4, 0.6), we follow the same training procedure as in Sect. 3.1.1. We evaluate the performance of TGM-ONets on testing datasets with different values of \(x^{*}\) and \(y^{*}\) sampled from U(0.4, 0.6). The prediction accuracy for both the training and testing datasets is presented in Fig. 35a, from which we can see that the maximum prediction errors for \(\phi\) and \(\sigma\) are bounded by \(1.0\,\times \,10^{-3}\) in the training datasets and \(1.5\,\times \,10^{-1}\) in the testing datasets. The average prediction errors for \(\phi\) and \(\sigma\) are under \(1.0\,\times \,10^{-2}\) in the training datasets and \(8.0\,\times \,10^{-2}\) in the testing datasets. Predictions for two specific cases, \((x^{*}, y^{*})\) = (0.58, 0.42) (in-distribution) and \((x^{*}, y^{*})\) = (0.42, 0.58) (in-distribution), are illustrated in Fig. 35b.

Fig. 35

Prediction for tumor cells and nutrient dynamics for mild tumor cases mapping from the initial density of tumor cells with varying positions within the computation domain. a Prediction errors for training and testing datasets. The blue lines represent the mean of prediction errors in training datasets. The red lines represent the mean of prediction errors in testing datasets. The shaded region represents the region encompassed by the maximum and minimum of the prediction errors. b Predictions of the tumor morphologies \(\phi\) at different times. left: \((x^{*}, y^{*})\) = (0.58, 0.42); right: \((x^{*}, y^{*})\) = (0.42, 0.58) (\((x^{*}, y^{*})\): the center position of tumor cells and nutrients within the computation domain)

1.3 C.3 Forecast the mild tumor growth using the initial density of tumor cells with varying length of the minor axis of the initial ellipsoidal tumor centered at (0.5,0.5)

In this subsection, we use TGM-ONets to learn the mapping from the initial density of tumor cells with varying lengths of the minor axis of the initial ellipsoidal tumor R and a larger ratio of the y-semiaxis to the x-semiaxis (\(\delta\) = 3) to the solutions for tumor cells and nutrients on the entire computational domain. The growth rate remains 1.5 1/day. We sample 1000 values of R from a uniform distribution U(0.06, 0.20). Assuming we have 9 cases of data with different values of R sampled from U(0.06, 0.20), plus an additional case with R = 0.22 mm, recording the density of tumor cells and nutrients every 0.5 days up to 70.5 days, we follow the same training procedure as in Sect. 3.1.1. We evaluate the performance of TGM-ONets on testing datasets with different values of R sampled from U(0.06, 0.23). The prediction accuracy for both the training and testing datasets is presented in Fig. 36a, from which we can see that the maximum prediction errors for \(\phi\) and \(\sigma\) are bounded by \(2.0\,\times \,10^{-3}\) and \(6.0\,\times \,10^{-4}\) in the training datasets, and by \(1.3\,\times \,10^{-2}\) and \(1.0\,\times \,10^{-2}\) in the testing datasets. The average prediction errors for \(\phi\) and \(\sigma\) are under \(1.0\,\times \,10^{-3}\) in the training datasets and \(5.0\,\times \,10^{-3}\) in the testing datasets. Predictions for two specific cases, R = 0.07 mm (in-distribution) and R = 0.23 mm (out-of-distribution), are illustrated in Fig. 36b.

Fig. 36

Prediction for tumor cells and nutrient dynamics for mild tumor cases mapping from the initial density of tumor cells with varying lengths of the minor axis of the initial ellipsoidal tumor R and a larger \(\delta\) = 3. a Prediction errors for training and testing datasets. The blue lines represent the mean of prediction errors in training datasets. The red lines represent the mean of prediction errors in testing datasets. The shaded region represents the region encompassed by the maximum and minimum of the prediction errors. b Predictions of the tumor morphologies \(\phi\) at different times. left: R = 0.07 mm; right: R = 0.23 mm. (R: the length of the minor axis of the initial ellipsoidal tumor)

1.4 C.4 Forecast the mild tumor growth using the initial density of tumor cells with varying length of the minor axis of the initial ellipsoidal tumor not centered at (0.5,0.5)

In this subsection, we use TGM-ONets to learn the mapping from the off-center initial density of tumor cells with varying lengths of the minor axis of the initial ellipsoidal tumor R to the solutions for tumor cells and nutrients on the entire computational domain. The growth rate remains 1.5 1/day. We sample 1000 values of R from a uniform distribution U(0.06, 0.20). Assuming we have 9 cases of data with different values of R sampled from U(0.06, 0.20), plus an additional case with R = 0.22 mm, recording the density of tumor cells and nutrients every 0.5 days up to 70.5 days, we follow the same training procedure as in Sect. 3.1.1. We evaluate the performance of TGM-ONets on testing datasets with different values of R sampled from U(0.06, 0.23). The prediction accuracy for both the training and testing datasets is presented in Fig. 37a, from which we can see that the maximum prediction errors for \(\phi\) and \(\sigma\) are bounded by \(3.0\,\times \,10^{-3}\) in the training datasets and \(5.0\,\times \,10^{-2}\) in the testing datasets. The average prediction errors for \(\phi\) and \(\sigma\) are under \(1.0\,\times \,10^{-3}\) in the training datasets and \(2.0\,\times \,10^{-2}\) in the testing datasets. Predictions for two specific cases, R = 0.07 mm (in-distribution) and R = 0.23 mm (out-of-distribution), are illustrated in Fig. 37b.

Fig. 37

Prediction for tumor cells and nutrient dynamics for mild tumor cases mapping from the off-center initial density of tumor cells with varying scaling factors of R. a Prediction errors for training and testing datasets. The blue lines represent the mean of prediction errors in training datasets. The red lines represent the mean of prediction errors in testing datasets. The shaded region represents the region encompassed by the maximum and minimum of the prediction errors. b Predictions of the tumor morphologies \(\phi\) at different times. left: R = 0.07 mm; right: R = 0.23 mm. (R: the length of the minor axis of the initial ellipsoidal tumor)

1.5 C.5 Forecast the mild tumor growth using the initial density of tumor cells with varying radii of the initial circular tumor

In this subsection, we use TGM-ONets to learn the mapping from the initial density of tumor cells with varying radii of the initial circular tumor R to the solutions for tumor cells and nutrients on the entire computational domain for mild tumor cases. The growth rate remains 1.5 1/day. We sample 1000 values of R from a uniform distribution U(0.06, 0.20). Assuming we have 9 cases of data with different values of R sampled from U(0.06, 0.20), plus an additional case with R = 0.22 mm, recording the density of tumor cells and nutrients every 0.5 days up to 70.5 days, we follow the same training procedure as in Sect. 3.1.1. We evaluate the performance of TGM-ONets on testing datasets with different values of R sampled from U(0.06, 0.23). The prediction accuracy for both the training and testing datasets is presented in Fig. 38a, from which we can see that the maximum prediction errors for \(\phi\) and \(\sigma\) are bounded by \(8.0\,\times \,10^{-4}\) and \(5.0\,\times \,10^{-4}\) in the training datasets, and by \(1.0\,\times \,10^{-2}\) in the testing datasets. The average prediction errors for \(\phi\) and \(\sigma\) are under \(4.0\,\times \,10^{-4}\) in the training datasets and \(5.0\,\times \,10^{-3}\) in the testing datasets. Predictions for two specific cases, R = 0.07 mm (in-distribution) and R = 0.23 mm (out-of-distribution), are illustrated in Fig. 38b.

Fig. 38

Prediction for tumor cells and nutrient dynamics for mild tumor cases mapping from the circular initial density of tumor cells with varying scaling factors of R. a Prediction errors for training and testing datasets. The blue lines represent the mean of prediction errors in training datasets. The red lines represent the mean of prediction errors in testing datasets. The shaded region represents the region encompassed by the maximum and minimum of the prediction errors. b Predictions of the tumor morphologies \(\phi\) at different times. left: R = 0.07 mm; right: R = 0.23 mm. (R: the radius of the initial circular tumor)

1.6 C.6 Forecast the mild tumor growth using the initial density of tumor cells with varying oblique angle (\(\theta\)) and ratios of the y-semiaxes to the x-semiaxes (\(\delta\) = 2 or 4)

In this subsection, we use TGM-ONets to learn the mapping from the initial density of tumor cells with varying oblique angle (\(\theta\)) and ratios of the y-semiaxes to the x-semiaxes (\(\delta\) = 2 or 4) to the solutions for tumor cells and nutrients on the entire computational domain. The growth rate remains 1.5 1/day. We sample 1000 values of \(\theta\) from a uniform distribution \(U(0, 2\pi )\). Assuming we have 8 cases of data with different values of \(\theta\) sampled from \(U(0, 2\pi )\), we follow the same training procedure as in Sect. 3.1.1. We evaluate the performance of TGM-ONets on testing datasets with different values of \(\theta\) sampled from \(U(0, 2\pi )\). The prediction accuracy for both the training and testing datasets is presented in Fig. 39a, from which we can see that the maximum prediction errors for \(\phi\) and \(\sigma\) are around \(3.0\,\times \,10^{-3}\) and \(1.5\,\times \,10^{-3}\) in the training datasets, and \(2.0\,\times \,10^{-1}\) and \(8.0\,\times \,10^{-2}\) in the testing datasets. The average prediction errors for \(\phi\) and \(\sigma\) are under \(2.0\,\times \,10^{-3}\) in the training datasets and \(1.0\,\times \,10^{-1}\) in the testing datasets. Predictions for two specific cases, (\(\theta\) = 0.3\(\pi\), \(\delta\) = 4) (in-distribution) and (\(\theta\) = 0.1\(\pi\), \(\delta\) = 2) (in-distribution), are illustrated in Fig. 39b.
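The oblique initial condition can be sketched by rotating the ellipse coordinates by \(\theta\) before evaluating the indicator, as below; the sharp-interface representation, grid size, and random seed are illustrative assumptions rather than the paper's data pipeline.

```python
import numpy as np

def rotated_ellipse(theta, delta, R=0.05, n=64):
    """Indicator of an ellipse with minor semi-axis R and aspect ratio
    delta, rotated by angle theta about the domain center (a sharp-
    interface illustration of the oblique initial condition)."""
    x, y = np.meshgrid(np.linspace(0, 1, n), np.linspace(0, 1, n))
    dx, dy = x - 0.5, y - 0.5
    # Rotate coordinates by -theta so the ellipse axes align with x/y
    xr = np.cos(theta) * dx + np.sin(theta) * dy
    yr = -np.sin(theta) * dx + np.cos(theta) * dy
    return ((xr / R)**2 + (yr / (delta * R))**2 <= 1.0).astype(float)

rng = np.random.default_rng(1)
thetas = rng.uniform(0.0, 2 * np.pi, size=8)   # theta ~ U(0, 2*pi)
samples = [rotated_ellipse(t, delta=4.0) for t in thetas]
```

Since the rotation only reorients the ellipse, the enclosed area (up to grid discretization) is the same for every \(\theta\), which gives a quick sanity check on the construction.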

Fig. 39

Prediction for tumor cells and nutrient dynamics for mild tumor cases mapping from the initial density of tumor cells with varying oblique angle \(\theta\) and ratios of the y-semiaxes to the x-semiaxes (\(\delta\) = 2 or 4). a Prediction errors for training and testing datasets. The blue lines represent the mean of prediction errors in training datasets. The red lines represent the mean of prediction errors in testing datasets. The shaded region represents the region encompassed by the maximum and minimum of the prediction errors. b Predictions of the tumor morphologies \(\phi\) at different times. left: \(\theta\) = 0.3\(\pi\), \(\delta\) = 4; right: \(\theta\) = 0.1\(\pi\), \(\delta\) = 2. (\(\theta\): oblique angle of the initial density of tumor cells, \(\delta\): the ratio of the y-semiaxis to the x-semiaxis)

1.7 C.7 Forecast the aggressive tumor growth using the initial density of tumor cells with varying ratio of the y-semiaxis to the x-semiaxis (\(\delta\))

For aggressive tumor cases, here we use TGM-ONets to learn the mapping from the ellipsoidal initial density of tumor cells with varying \(\delta\) to the solutions for tumor cells and nutrients on the entire computational domain. The growth rate, the length of the minor axis of the initial ellipsoidal tumor R, and \(\gamma _{c}\) remain 1.0 1/day, 0.05 mm, and 17.5, respectively. We sample 1000 values of \(\delta\) from a uniform distribution U(1.0, 3.0). Assuming we have 8 cases of data recording the density of tumor cells and nutrients every 0.5 days up to 200.5 days with different values of \(\delta\) sampled from U(1.0, 3.0), we follow the same training procedure as in Sect. 3.1.1. We evaluate the performance of TGM-ONets on testing datasets with different values of \(\delta\) sampled from U(1.0, 3.0). The prediction accuracy for both the training and testing datasets is presented in Fig. 40a, from which we can see that the maximum prediction errors for \(\phi\) and \(\sigma\) are bounded by \(4.0\,\times \,10^{-3}\) and \(8.0\,\times \,10^{-3}\) in both the training and testing datasets. The average prediction errors for \(\phi\) and \(\sigma\) are under \(5.0\,\times \,10^{-3}\) in both the training and testing datasets. Predictions for two specific cases, \(\delta\) = 1.4 (in-distribution) and \(\delta\) = 2.7 (in-distribution), are illustrated in Fig. 40b.

Fig. 40

Prediction for tumor cells and nutrient dynamics for aggressive tumor cases mapping from the ellipsoidal initial density of tumor cells with varying \(\delta\). a Prediction errors for training and testing datasets. The blue lines represent the mean of prediction errors in training datasets. The red lines represent the mean of prediction errors in testing datasets. The shaded region represents the region encompassed by the maximum and minimum of the prediction errors. b Predictions of the tumor morphologies \(\phi\) at different times. left: \(\delta\) = 1.4; right: \(\delta\) = 2.7. (\(\delta\): the ratio of the y-semiaxis to the x-semiaxis)

1.8 C.8 Forecast the aggressive tumor growth using the initial density of tumor cells with varying positions within the domain

In this subsection, we use TGM-ONets to learn the mapping from the initial density of tumor cells with varying positions within the domain to the solutions for tumor cells and nutrients on the entire computational domain for aggressive tumor cases. The growth rate, the length of the minor axis of the initial ellipsoidal tumor R, and \(\gamma _{c}\) remain 1.0 1/day, 0.05 mm, and 17.5, respectively. Let \((x^{*}, y^{*})\) denote the center position of the tumor cells and nutrients; we sample 1000 values of \(x^{*}\) and \(y^{*}\) from a uniform distribution U(0.4, 0.6). Assuming we have 10 cases of data recording the density of tumor cells and nutrients every 0.5 days up to 200.5 days with different values of \(x^{*}\) and \(y^{*}\) sampled from U(0.4, 0.6), we follow the same training procedure as in Sect. 3.1.1. We evaluate the performance of TGM-ONets on testing datasets with different values of \(x^{*}\) and \(y^{*}\) sampled from U(0.4, 0.6). The prediction accuracy for both the training and testing datasets is presented in Fig. 41a, from which we can see that the maximum prediction errors for \(\phi\) and \(\sigma\) are bounded by \(3.0\,\times \,10^{-3}\) in the training datasets and \(3.0\,\times \,10^{-2}\) in the testing datasets. The average prediction errors for \(\phi\) and \(\sigma\) are under \(2.0\,\times \,10^{-3}\) in the training datasets and \(2.0\,\times \,10^{-2}\) in the testing datasets. Predictions for two specific cases, \((x^{*}, y^{*})\) = (0.54, 0.54) (in-distribution) and \((x^{*}, y^{*})\) = (0.54, 0.46) (in-distribution), are illustrated in Fig. 41b.

Fig. 41

Prediction for tumor cells and nutrient dynamics for aggressive tumor cases mapping from the ellipsoidal initial density of tumor cells with varying positions within the computation domain. a Prediction errors for training and testing datasets. The blue lines represent the mean of prediction errors in training datasets. The red lines represent the mean of prediction errors in testing datasets. The shaded region represents the region encompassed by the maximum and minimum of the prediction errors. b Predictions of the tumor morphologies \(\phi\) at different times. left: (\(x^{*}, y^{*}\)) = (0.54, 0.54); right: (\(x^{*}, y^{*}\)) = (0.54, 0.46). ((\(x^{*}, y^{*}\)): the center position of tumor cells and nutrients within the computation domain)

D Significance tests for the difference of prediction errors obtained by TGM-ONets using different input functions for the branch net

In this section, we conduct the Kruskal–Wallis and Dunn's tests to check whether the differences in prediction errors obtained by TGM-ONets with varying input functions are statistically significant. We average the prediction errors over time to compute the statistics for the Kruskal–Wallis and Dunn's tests.
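For reference, the Kruskal–Wallis H statistic used here can be sketched from pooled ranks as below (the tie correction factor is omitted for brevity); in practice one would use an established implementation such as scipy.stats.kruskal, with Dunn's test as the post-hoc comparison. The toy error groups are illustrative, not data from the paper.

```python
import numpy as np

def kruskal_wallis_H(*groups):
    """Kruskal-Wallis H statistic computed from pooled ranks,
    using midranks for ties (tie correction factor omitted)."""
    sizes = [len(g) for g in groups]
    pooled = np.concatenate([np.asarray(g, dtype=float) for g in groups])
    order = np.argsort(pooled, kind="mergesort")
    ranks = np.empty(pooled.size)
    ranks[order] = np.arange(1, pooled.size + 1)
    for v in np.unique(pooled):                 # assign midranks to ties
        tie = pooled == v
        ranks[tie] = ranks[tie].mean()
    N = pooled.size
    H, start = 0.0, 0
    for n_i in sizes:
        R_i = ranks[start:start + n_i].sum()    # rank sum of group i
        H += R_i**2 / n_i
        start += n_i
    return 12.0 / (N * (N + 1)) * H - 3.0 * (N + 1)

# Toy groups of time-averaged prediction errors for three input functions
errors_a = [0.011, 0.013, 0.012, 0.010]
errors_b = [0.012, 0.014, 0.011, 0.013]
errors_c = [0.031, 0.033, 0.030, 0.032]
H = kruskal_wallis_H(errors_a, errors_b, errors_c)
```

H near zero indicates the groups share a common distribution of ranks; a large H (compared against a chi-squared threshold) flags a significant difference, which the post-hoc Dunn's test then localizes to specific pairs.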

For mild tumors, we vary the input function from the initial density of tumor cells to the growth rate. The results of the Kruskal–Wallis test are summarized in the first two rows of Table 44, from which we can see that the p-values are below \(1.0\,\times \,10^{-2}\) only in the training datasets, indicating that the \(\epsilon _{\phi }\) and \(\epsilon _{\sigma }\) obtained by TGM-ONets in the training datasets using the initial density of tumor cells and the growth rate as inputs for the branch net are significantly different. However, the generalization ability of TGM-ONets on unseen datasets of mild tumors is consistent, irrespective of the input function. The medians of \(\epsilon _{\phi }\) and \(\epsilon _{\sigma }\) in the training datasets are summarized in Table 45. These results suggest that TGM-ONets fit the training datasets for mild tumors better when mapping from the initial density of tumor cells.

For aggressive tumor cases, we vary the input function from the initial density of tumor cells to the nutrient uptake. The results of the Kruskal–Wallis test are summarized in the last two rows of Table 44, and the medians of \(\epsilon _{\phi }\) and \(\epsilon _{\sigma }\) in the training datasets are summarized in Table 46. These results also suggest that the performance of TGM-ONets on unseen datasets is consistent irrespective of the input function, while the fit to the training datasets is significantly better for aggressive tumors mapping from the initial density of tumor cells. We infer that the CBAM blocks used in the CNN-based branch nets likely account for the better fit to the training datasets for mild and aggressive tumors mapping from the initial density of tumor cells.

Table 44 Results of Kruskal–Wallis test for comparisons between TGM-ONets with varying input functions
Table 45 Median of prediction errors in training datasets for mild tumors with varying input functions
Table 46 Median of prediction errors in training datasets for aggressive tumors with varying input functions

E Long-time predictions using TGM-ONets

In this appendix, we present additional results on long-time prediction using TGM-ONets. The three scenarios are the same as in Sect. 3.2: (1) sparse observations are available at the last time step in the testing domain, (2) sparse observations are available at the early stage in the testing domain, and (3) no additional observations are available in the testing domain.

Tables 47 and 48 show the mean relative \(L^2\) error for \(\phi\) and \(\sigma\) in long-time predictions for mild tumors mapping from the initial density of tumor cells in scenario 1, respectively. Tables 49 and 50 show the mean relative \(L^2\) error for \(\phi\) and \(\sigma\) in long-time predictions for aggressive tumors mapping from the initial density of tumor cells in scenario 1, respectively. We see that as the prediction time and the initial tumor size increase, the mean relative \(L^2\) errors increase slightly. However, even at T = 270 with R > 0.2 mm, the maximum mean relative \(L^2\) errors for \(\phi\) and \(\sigma\) remain below approximately \(7\,\times \,10^{-2}\).
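Assuming the standard definition of the relative \(L^2\) error, \(\Vert \hat{u} - u \Vert _2 / \Vert u \Vert _2\), the tabulated metric can be sketched as follows; the sample fields are illustrative placeholders.

```python
import numpy as np

def relative_l2_error(pred, true):
    """Relative L2 error ||pred - true||_2 / ||true||_2, the metric
    assumed for the phi and sigma entries in the tables."""
    pred = np.asarray(pred, dtype=float)
    true = np.asarray(true, dtype=float)
    return float(np.linalg.norm(pred - true) / np.linalg.norm(true))

# A prediction that is uniformly 1% high has relative L2 error 0.01
```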

Tables 51 and 52 show the mean relative \(L^2\) error for \(\phi\) and \(\sigma\), respectively, in long-time predictions for mild tumors mapping from the initial density of tumor cells in scenario 2. Tables 53 and 54 show the corresponding errors for aggressive tumors mapping from the initial density of tumor cells in scenario 2.

Tables 55 and 56 show the mean relative \(L^2\) error for \(\phi\) and \(\sigma\) in long-time predictions for mild tumors mapping from the initial density of tumor cells in scenario 3, respectively. Tables 57 and 58 show the mean relative \(L^2\) error for \(\phi\) and \(\sigma\) in long-time predictions for aggressive tumors mapping from the initial density of tumor cells in scenario 3, respectively.

Tables 59 and 60 show the mean relative \(L^2\) error for \(\phi\) and \(\sigma\) in long-time prediction for mild tumors mapping from the concomitant changes in the initial density of tumor cells and growth rate in scenario 1, respectively. Tables 61 and 62 show the mean relative \(L^2\) error for \(\phi\) and \(\sigma\) in long-time prediction for aggressive tumors mapping from the concomitant changes in the initial density of tumor cells and nutrient uptake in scenario 1, respectively.

Tables 63 and 64 show the mean relative \(L^2\) error for \(\phi\) and \(\sigma\) in long-time prediction for mild tumors mapping from the concomitant changes in the initial density of tumor cells and growth rate in scenario 2, respectively. Tables 65 and 66 show the mean relative \(L^2\) error for \(\phi\) and \(\sigma\) in long-time prediction for aggressive tumors mapping from the concomitant changes in the initial density of tumor cells and nutrient uptake in scenario 2, respectively.

Tables 67 and 68 show the mean relative \(L^2\) error for \(\phi\) and \(\sigma\) in long-time prediction for mild tumors mapping from the concomitant changes in the initial density of tumor cells and growth rate in scenario 3, respectively. Tables 69 and 70 show the mean relative \(L^2\) error for \(\phi\) and \(\sigma\) in long-time prediction for aggressive tumors mapping from the concomitant changes in the initial density of tumor cells and nutrient uptake in scenario 3, respectively.

Table 47 Long-time prediction for mild tumor \(\phi\) in scenario 1: Mean relative \(L^2\) error for \(\phi\) of the mild tumor growth mapping from the initial density of tumor cells using additional snapshots at T = 200 and T = 270
Table 48 Long-time prediction for mild tumor \(\sigma\) in scenario 1: Mean relative \(L^2\) error for \(\sigma\) of the mild tumor growth mapping from the initial density of tumor cells using additional snapshots at T = 200 and T = 270
Table 49 Long-time prediction for aggressive tumor \(\phi\) in scenario 1: Mean relative \(L^2\) error for \(\phi\) of the aggressive tumor growth mapping from the initial density of tumor cells using additional snapshots at T = 200, 300 and 400
Table 50 Long-time prediction for aggressive tumor \(\sigma\) in scenario 1: Mean relative \(L^2\) error for \(\sigma\) of the aggressive tumor growth mapping from the initial density of tumor cells using additional snapshots at T = 200, 300 and 400
Table 51 Long-time prediction for mild tumor \(\phi\) in scenario 2: Mean relative \(L^2\) error for \(\phi\) of the mild tumor growth mapping from the initial density of tumor cells using an additional snapshot at T = 80
Table 52 Long-time prediction for mild tumor \(\sigma\) in scenario 2: Mean relative \(L^2\) error for \(\sigma\) of the mild tumor growth mapping from the initial density of tumor cells using an additional snapshot at T = 80
Table 53 Long-time prediction for aggressive tumor \(\phi\) in scenario 2: Mean relative \(L^2\) error for \(\phi\) of the aggressive tumor growth mapping from the initial density of tumor cells using additional snapshots at T = 230 and T = 280
Table 54 Long-time prediction for aggressive tumor \(\sigma\) in scenario 2: Mean relative \(L^2\) error for \(\sigma\) of the aggressive tumor growth mapping from the initial density of tumor cells using additional snapshots at T = 230 and T = 280
Table 55 Long-time prediction for mild tumor \(\phi\) in scenario 3: Mean relative \(L^2\) error for \(\phi\) of the mild tumor growth mapping from the initial density of tumor cells using no additional snapshots in testing domain
Table 56 Long-time prediction for mild tumor \(\sigma\) in scenario 3: Mean relative \(L^2\) error for \(\sigma\) of the mild tumor growth mapping from the initial density of tumor cells using no additional snapshots in testing domain
Table 57 Long-time prediction for aggressive tumor \(\phi\) in scenario 3: Mean relative \(L^2\) error for \(\phi\) of the aggressive tumor growth mapping from the initial density of tumor cells using no additional snapshots in testing domain
Table 58 Long-time prediction for aggressive tumor \(\sigma\) in scenario 3: Mean relative \(L^2\) error for \(\sigma\) of the aggressive tumor growth mapping from the initial density of tumor cells using no additional snapshots in testing domain
Table 59 Long-time prediction for mild tumor \(\phi\) accommodating multiple input functions in scenario 1: Mean relative \(L^2\) error for \(\phi\) of the mild tumor growth mapping from the initial density of tumor cells and growth rate using additional snapshots at T = 200 and T = 270
Table 60 Long-time prediction for mild tumor \(\sigma\) accommodating multiple input functions in scenario 1: Mean relative \(L^2\) error for \(\sigma\) of the mild tumor growth mapping from the initial density of tumor cells and growth rate using additional snapshots at T = 200 and T = 270
Table 61 Long-time prediction for aggressive tumor \(\phi\) accommodating multiple input functions in scenario 1: Mean relative \(L^2\) error for \(\phi\) of the aggressive tumor growth mapping from the initial density of tumor cells and nutrient uptake using additional snapshots at T = 200, 300 and 400
Table 62 Long-time prediction for aggressive tumor \(\sigma\) accommodating multiple input functions in scenario 1: Mean relative \(L^2\) error for \(\sigma\) of the aggressive tumor growth mapping from the initial density of tumor cells and nutrient uptake using 3 additional snapshots at T = 200, 300 and 400
Table 63 Long-time prediction for mild tumor \(\phi\) accommodating multiple input functions in scenario 2: Mean relative \(L^2\) error for \(\phi\) of the mild tumor growth mapping from the initial density of tumor cells and growth rate using an additional snapshot at T = 80
Table 64 Long-time prediction for mild tumor \(\sigma\) accommodating multiple input functions in scenario 2: Mean relative \(L^2\) error for \(\sigma\) of the mild tumor growth mapping from the initial density of tumor cells and growth rate using an additional snapshot at T = 80
Table 65 Long-time prediction for aggressive tumor \(\phi\) accommodating multiple input functions in scenario 2: Mean relative \(L^2\) error for \(\phi\) of the aggressive tumor growth mapping from the initial density of tumor cells and nutrient uptake using 2 additional snapshots at T = 230 and T = 280
Table 66 Long-time prediction for aggressive tumor \(\sigma\) accommodating multiple input functions in scenario 2: Mean relative \(L^2\) error for \(\sigma\) of the aggressive tumor growth mapping from the initial density of tumor cells and nutrient uptake using 2 additional snapshots at T = 230 and T = 280
Table 67 Long-time prediction for mild tumor \(\phi\) accommodating multiple input functions in scenario 3: Mean relative \(L^2\) error for \(\phi\) of the mild tumor growth mapping from the initial density of tumor cells and growth rate using no additional snapshots in testing domain
Table 68 Long-time prediction for mild tumor \(\sigma\) accommodating multiple input functions in scenario 3: Mean relative \(L^2\) error for \(\sigma\) of the mild tumor growth mapping from the initial density of tumor cells and growth rate using no additional snapshots in testing domain
Table 69 Long-time prediction for aggressive tumor \(\phi\) accommodating multiple input functions in scenario 3: Mean relative \(L^2\) error for \(\phi\) of the aggressive tumor growth mapping from the initial density of tumor cells and nutrient uptake \(\gamma _{c}\) using no additional snapshots in testing domain
Table 70 Long-time prediction for aggressive tumor \(\sigma\) accommodating multiple input functions in scenario 3: Mean relative \(L^2\) error for \(\sigma\) of the aggressive tumor growth mapping from the initial density of tumor cells and nutrient uptake using no additional snapshots in testing domain

F Ranges of prediction errors in training datasets for examining the robustness of TGM-ONets

In this subsection, we provide the ranges of prediction errors in training datasets for examining the robustness of TGM-ONets.

For the results of examining the effects of the number of training snapshots, Fig. 42 shows the ranges of prediction errors in training datasets for mild tumor cases mapping from the initial density of tumor cells, Fig. 43 shows the ranges of prediction errors in training datasets for aggressive tumor cases mapping from the initial density of tumor cells, and Fig. 44 shows the ranges of prediction errors in training datasets for aggressive tumor cases mapping from the initial density of tumor cells using sparser data.

For the results of examining the effects of the noisy measurements, Fig. 45 shows the ranges of prediction errors in training datasets for mild tumor cases mapping from the initial density of tumor cells, and Fig. 46 shows the ranges of prediction errors in training datasets for aggressive tumor cases mapping from the initial density of tumor cells.

Fig. 42
figure 42

Prediction errors in training datasets for examining the effects of the number of training points for mild tumor cases mapping from the initial density of tumor cells. The blue lines represent the mean of prediction errors in training datasets. The shaded region represents the region encompassed by the maximum and minimum of the prediction errors

Fig. 43
figure 43

Prediction errors in training datasets for examining the effects of the number of training points for aggressive tumor cases mapping from the initial density of tumor cells. The blue lines represent the mean of prediction errors in training datasets. The shaded region represents the region encompassed by the maximum and minimum of the prediction errors

Fig. 44
figure 44

Prediction errors in training datasets for examining the effects of the number of training points for aggressive tumor cases mapping from the initial density of tumor cells using sparser data. The blue lines represent the mean of prediction errors in training datasets. The shaded region represents the region encompassed by the maximum and minimum of the prediction errors

Fig. 45
figure 45

Prediction errors in training datasets for examining the effects of the noisy measurements for mild tumor cases mapping from the initial density of tumor cells. The blue lines represent the mean of prediction errors in training datasets. The shaded region represents the region encompassed by the maximum and minimum of the prediction errors

Fig. 46
figure 46

Prediction errors in training datasets for examining the effects of the noisy measurements for aggressive tumor cases mapping from the initial density of tumor cells. The blue lines represent the mean of prediction errors in training datasets. The shaded region represents the region encompassed by the maximum and minimum of the prediction errors

Fig. 47
figure 47

Ablation study: Prediction errors for inferring state variables for aggressive tumor cases mapping from the initial density of tumor cells with varying network structures. a The average of prediction errors for training datasets. b The average of prediction errors for testing datasets

Fig. 48
figure 48

Ablation study: Prediction errors for inferring state variables for mild tumor cases mapping from the growth rate with varying network structures. a The average of prediction errors for training datasets. b The average of prediction errors for testing datasets

Fig. 49
figure 49

Grid study: Prediction errors for inferring state variables for mild tumor cases mapping from the nutrient uptake \(\gamma _{c}\) with varying number of total hidden layers. a The average of prediction errors for training datasets. b The average of prediction errors for testing datasets

Fig. 50
figure 50

Grid study: Prediction errors for inferring state variables for mild tumor cases mapping from the initial density of tumor cells with varying number of total hidden layers. a The average of prediction errors for training datasets. b The average of prediction errors for testing datasets

Fig. 51
figure 51

Grid study: Prediction errors for inferring state variables for mild tumor cases mapping from the growth rate \(\rho\) with varying number of total hidden layers. a The average of prediction errors for training datasets. b The average of prediction errors for testing datasets

Fig. 52
figure 52

Grid study: Prediction errors for inferring state variables for aggressive tumor cases mapping from the initial density of tumor cells with varying number of total hidden layers. a The average of prediction errors for training datasets. b The average of prediction errors for testing datasets

G Ablation & grid studies of TGM-ONets

For all cases considered in Sect. 4 and in this subsection, the distributions for the parameters of the mechanistic model for training and testing are the same as in Sect. 3.

In this subsection, we first consider the contributions of the CBAM and MoE blocks for aggressive tumor cases mapping from the initial density of tumor cells, as well as for mild tumor cases mapping from the growth rate \(\rho\). We use the same settings as in the first ablation study in Sect. 4. Model performances are summarized in Figs. 47 and 48. We also conduct one-sided Wilcoxon tests to check whether the prediction errors given by the vanilla PI-DeepONet are significantly greater than those given by our proposed methods. The null hypothesis and the alternative hypothesis are the same as in Sect. 4. We average the prediction errors over time for each training and test sample to compute the statistics for the one-sided Wilcoxon tests. The results are summarized in Tables 71 and 72, from which we can see that the p-values are lower than \(1.0\,\times \,10^{-2}\) for \(\epsilon _{\phi }\) and \(\epsilon _{\sigma }\) on the testing datasets for both mild and aggressive tumor cases. These results further showcase the improved generalization of TGM-ONets to unseen datasets achieved by our proposed methods.
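A paired one-sided Wilcoxon signed-rank comparison of this kind can be sketched with SciPy. The arrays below are hypothetical stand-ins in which the baseline errors are systematically inflated relative to the proposed model; they are not the paper's measurements.

```python
import numpy as np
from scipy.stats import wilcoxon

rng = np.random.default_rng(2)
# Hypothetical paired per-sample errors (averaged over time), with the
# baseline made ~35% worse on average than TGM-ONets for illustration.
err_tgm = rng.lognormal(mean=-4.0, sigma=0.2, size=40)
err_vanilla = err_tgm * rng.lognormal(mean=0.3, sigma=0.1, size=40)

# One-sided test. H0: the median of the paired differences is <= 0;
# H1: vanilla PI-DeepONet errors are greater than TGM-ONets errors.
stat, p_value = wilcoxon(err_vanilla, err_tgm, alternative="greater")
print(f"W = {stat:.1f}, p = {p_value:.2e}")
# A p-value below 1e-2 rejects H0 in favor of the proposed model.
```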

Additionally, we investigate the effect of the total number of hidden layers used in TGM-ONets for mild tumors mapping from the nutrient uptake \(\gamma _{c}\), mild tumors mapping from the initial density of tumor cells, mild tumors mapping from the growth rate \(\rho\), and aggressive tumors mapping from the initial density of tumor cells. Model performances are summarized in Figs. 49, 50, 51 and 52. We also conduct the Kruskal–Wallis test and Dunn's test to check whether the differences in the prediction errors given by TGM-ONets with varying numbers of total hidden layers are statistically significant. We average the prediction errors over time for each training and test sample to compute the statistics for the Kruskal–Wallis and Dunn's tests. The results of the Kruskal–Wallis test are summarized in Tables 73–76, from which we can see that the effect of the total number of hidden layers varies from case to case. For mild tumors mapping from the nutrient uptake \(\gamma _{c}\), the p-values are lower than \(1.0\,\times \,10^{-2}\) on the training and testing datasets. For mild tumors mapping from the initial density of tumor cells, the p-values are lower than \(1.0\,\times \,10^{-2}\) on the training datasets. For mild tumors mapping from the growth rate \(\rho\), the p-values are greater than \(1.0\,\times \,10^{-2}\) on the training and testing datasets. For aggressive tumors mapping from the initial density of tumor cells, the p-values are lower than \(1.0\,\times \,10^{-2}\) only on the training datasets. These results indicate that increasing the total number of hidden layers does enhance performance, but the enhancement can sometimes be relatively minor and may not offset the increased computational cost associated with the larger number of model parameters. Further results of Dunn's test for each case are summarized in Tables 77–83.
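Dunn's post hoc test compares pairwise mean ranks after a significant Kruskal–Wallis result. Below is a simplified, self-contained sketch (no tie correction, Bonferroni adjustment) on synthetic stand-in error groups; in practice a library implementation such as `posthoc_dunn` from scikit-posthocs would typically be used instead.

```python
import itertools
import numpy as np
from scipy.stats import norm

def dunn_posthoc(groups):
    """Pairwise Dunn z-tests on pooled ranks (simplified: no tie correction,
    Bonferroni adjustment). groups: list of 1-D arrays of per-sample errors.
    Returns {(i, j): adjusted p-value}.
    """
    data = np.concatenate(groups)
    ranks = data.argsort().argsort() + 1.0  # ranks in the pooled sample
    n_total = len(data)
    sizes, mean_ranks, start = [], [], 0
    for g in groups:
        sizes.append(len(g))
        mean_ranks.append(ranks[start:start + len(g)].mean())
        start += len(g)
    n_pairs = len(groups) * (len(groups) - 1) // 2
    pvals = {}
    for i, j in itertools.combinations(range(len(groups)), 2):
        se = np.sqrt(n_total * (n_total + 1) / 12.0
                     * (1.0 / sizes[i] + 1.0 / sizes[j]))
        z = (mean_ranks[i] - mean_ranks[j]) / se
        p = 2.0 * norm.sf(abs(z))           # two-sided p-value
        pvals[(i, j)] = min(1.0, p * n_pairs)  # Bonferroni correction
    return pvals

rng = np.random.default_rng(3)
# Four hypothetical depth settings with progressively smaller errors.
groups = [rng.lognormal(-4.0 - 0.2 * k, 0.2, size=25) for k in range(4)]
pvals = dunn_posthoc(groups)
for pair, p in pvals.items():
    print(pair, f"{p:.3f}")
```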

Quantitative results of the enhancement derived from utilizing the CBAM and MoE blocks are also provided in this section. Tables 84–89 show the mean relative \(L^{2}\) errors for \(\phi\) and \(\sigma\) for mild tumors mapping from the initial density of tumor cells with R = 0.05 mm, 0.18 mm and 0.23 mm using varying architectures of deep operator networks. Tables 90–95 show the mean relative \(L^{2}\) errors for \(\phi\) and \(\sigma\) for aggressive tumors mapping from the nutrient uptake by tumor cells with \(\gamma _{c}\) = 16.1 g/L/day, 16.9 g/L/day and 18.9 g/L/day using varying architectures of deep operator networks. Tables 96–101 show the mean relative \(L^{2}\) errors for \(\phi\) and \(\sigma\) for aggressive tumors mapping from the initial density of tumor cells with \(R = 0.09\) mm, 0.16 mm and 0.21 mm using varying architectures of deep operator networks. Tables 102–107 show the mean relative \(L^{2}\) errors for \(\phi\) and \(\sigma\) for mild tumors mapping from the growth rate with \(\rho\) = 1.2 1/day, 2.1 1/day and 2.7 1/day using varying architectures of deep operator networks. In all the tables, the best results are bolded and the second-best are underlined.

We also provide the ranges of the prediction errors for each case considered in Sect. 4 and in this subsection. From Figs. 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68 and 69, we can see that the ranges of the training errors are roughly no larger than the ranges of the testing errors for all the cases considered in the ablation and grid studies. In Figs. 53, 54 and 65, we can see that TGM-ONets with CBAM and MoE blocks have smaller ranges of prediction errors compared with the vanilla PI-DeepONets. In Figs. 55 and 56, we can see that \(w_{data} = 100\) yields smaller ranges of prediction errors than the other values of \(w_{data}\). In Figs. 57, 66, 67, 68 and 69, we can see that 8 or 9 hidden layers yield smaller ranges of prediction errors than the other settings. In Fig. 58, we can see that 2 experts in the MoE block yield smaller ranges of testing errors, whereas 3 experts yield smaller ranges of training errors. In Fig. 59, we can see that initial learning rates of \(1\,\times \,10^{-3}\), \(6\,\times \,10^{-4}\), and \(2\,\times \,10^{-4}\) yield smaller ranges of prediction errors. In Fig. 60, 1000 decay steps yield smaller ranges of prediction errors. Figure 61 shows that continuous activation functions yield smaller ranges of prediction errors. Figure 62 shows that different numbers of training datasets (i.e., 7, 8, 9, 10) yield roughly the same ranges of prediction errors. Figures 63 and 64 show that the ranges of prediction errors do not change much with or without the boundary loss.

Table 71 Results of Wilcoxon test for comparisons between vanilla PI-DeepONet and TGM-ONets with CBAM and MoE block for aggressive tumors mapping from the initial density of tumor cells
Table 72 Results of Wilcoxon test for comparisons between vanilla PI-DeepONet and TGM-ONets with MoE block for mild tumors mapping from the growth rate \(\rho\)
Table 73 Results of Kruskal–Wallis test for examining the effectiveness of the number of hidden layers for mild tumors mapping from the nutrient uptake \(\gamma _{c}\)
Table 74 Results of Kruskal–Wallis test for examining the effectiveness of the number of hidden layers for mild tumors mapping from the initial density of tumor cells
Table 75 Results of Kruskal–Wallis test for examining the effectiveness of the number of hidden layers for mild tumors mapping from the growth rate \(\rho\)
Table 76 Results of Kruskal–Wallis test for examining the effectiveness of the number of hidden layers for aggressive tumors mapping from the initial density of tumor cells
Table 77 Results of Dunn's test for \(\epsilon _{\phi }\) in training datasets for mild tumors mapping from the nutrient uptake \(\gamma _{c}\) with varying number of total hidden layers. Each element represents the p-value of the corresponding post hoc pairwise test for multiple comparisons
Table 78 Results of Dunn's test for \(\epsilon _{\sigma }\) in training datasets for mild tumors mapping from the nutrient uptake \(\gamma _{c}\) with varying number of total hidden layers. Each element represents the p-value of the corresponding post hoc pairwise test for multiple comparisons
Table 79 Results of Dunn's test for \(\epsilon _{\phi }\) in testing datasets for mild tumors mapping from the nutrient uptake \(\gamma _{c}\) with varying number of total hidden layers. Each element represents the p-value of the corresponding post hoc pairwise test for multiple comparisons
Table 80 Results of Dunn's test for \(\epsilon _{\sigma }\) in testing datasets for mild tumors mapping from the nutrient uptake \(\gamma _{c}\) with varying number of total hidden layers. Each element represents the p-value of the corresponding post hoc pairwise test for multiple comparisons
Table 81 Results of Dunn's test for \(\epsilon _{\phi }\) in training datasets for mild tumor cases mapping from the initial density of tumor cells with varying number of total hidden layers. Each element represents the p-value of the corresponding post hoc pairwise test for multiple comparisons
Table 82 Results of Dunn's test for \(\epsilon _{\phi }\) in training datasets for aggressive tumor cases mapping from the initial density of tumor cells with varying number of total hidden layers. Each element represents the p-value of the corresponding post hoc pairwise test for multiple comparisons
Table 83 Results of Dunn's test for \(\epsilon _{\sigma }\) in training datasets for aggressive tumor cases mapping from the initial density of tumor cells with varying number of total hidden layers. Each element represents the p-value of the corresponding post hoc pairwise test for multiple comparisons
Table 84 Ablation study for mild tumor \(\phi\): Mean relative \(L^2\) error for \(\phi\) of the mild tumor growth mapping from the initial density of tumor cells with R = 0.05 mm (test case)
Table 85 Ablation study for mild tumor \(\sigma\): Mean relative \(L^2\) error for \(\sigma\) of the mild tumor growth mapping from the initial density of tumor cells with R = 0.05 mm (test case)
Table 86 Ablation study for mild tumor \(\phi\): Mean relative \(L^2\) error for \(\phi\) of the mild tumor growth mapping from the initial density of tumor cells with R = 0.18 mm (test case)
Table 87 Ablation study for mild tumor \(\sigma\): Mean relative \(L^2\) error for \(\sigma\) of the mild tumor growth mapping from the initial density of tumor cells with R = 0.18 mm (test case)
Table 88 Ablation study for mild tumor \(\phi\): Mean relative \(L^2\) error for \(\phi\) of the mild tumor growth mapping from the initial density of tumor cells with R = 0.23 mm (test case)
Table 89 Ablation study for mild tumor \(\sigma\): Mean relative \(L^2\) error for \(\sigma\) of the mild tumor growth mapping from the initial density of tumor cells with R = 0.23 mm (test case)
Table 90 Ablation study for aggressive tumor \(\phi\): Mean relative \(L^2\) error for \(\phi\) of the aggressive tumor growth mapping from the nutrient uptake by tumor cells with \(\gamma _{c}\) = 16.1 g/L/day (test case)
Table 91 Ablation study for aggressive tumor \(\sigma\): Mean relative \(L^2\) error for \(\sigma\) of the aggressive tumor growth mapping from the nutrient uptake by tumor cells with \(\gamma _{c}\) = 16.1 g/L/day (test case)
Table 92 Ablation study for aggressive tumor \(\phi\): Mean relative \(L^2\) error for \(\phi\) of the aggressive tumor growth mapping from the nutrient uptake by tumor cells with \(\gamma _{c}\) = 16.9 g/L/day (test case)
Table 93 Ablation study for aggressive tumor \(\sigma\): Mean relative \(L^2\) error for \(\sigma\) of the aggressive tumor growth mapping from the nutrient uptake by tumor cells with \(\gamma _{c}\) = 16.9 g/L/day (test case)
Table 94 Ablation study for aggressive tumor \(\phi\): Mean relative \(L^2\) error for \(\phi\) of the aggressive tumor growth mapping from the nutrient uptake by tumor cells with \(\gamma _{c}\) = 18.9 g/L/day (test case)
Table 95 Ablation study for aggressive tumor \(\sigma\): Mean relative \(L^2\) error for \(\sigma\) of the aggressive tumor growth mapping from the nutrient uptake by tumor cells with \(\gamma _{c}\) = 18.9 g/L/day (test case)
Table 96 Ablation study for aggressive tumor \(\phi\): Mean relative \(L^2\) error for \(\phi\) of the aggressive tumor growth mapping from the initial density of tumor cells with R = 0.09 mm (test case)
Table 97 Ablation study for aggressive tumor \(\sigma\): Mean relative \(L^2\) error for \(\sigma\) of the aggressive tumor growth mapping from the initial density of tumor cells with R = 0.09 mm (test case)
Table 98 Ablation study for aggressive tumor \(\phi\): Mean relative \(L^2\) error for \(\phi\) of the aggressive tumor growth mapping from the initial density of tumor cells with R = 0.16 mm (test case)
Table 99 Ablation study for aggressive tumor \(\sigma\): Mean relative \(L^2\) error for \(\sigma\) of the aggressive tumor growth mapping from the initial density of tumor cells with R = 0.16 mm (test case)
Table 100 Ablation study for aggressive tumor \(\phi\): Mean relative \(L^2\) error for \(\phi\) of the aggressive tumor growth mapping from the initial density of tumor cells with R = 0.21 mm (test case)
Table 101 Ablation study for aggressive tumor \(\sigma\): Mean relative \(L^2\) error for \(\sigma\) of the aggressive tumor growth mapping from the initial density of tumor cells with R = 0.21 mm (test case)
Table 102 Ablation study for mild tumor \(\phi\): Mean relative \(L^2\) error for \(\phi\) of the mild tumor growth mapping from the growth rate with \(\rho\) = 1.2 1/day (test case)
Table 103 Ablation study for mild tumor \(\sigma\): Mean relative \(L^2\) error for \(\sigma\) of the mild tumor growth mapping from the growth rate with \(\rho\) = 1.2 1/day (test case)
Table 104 Ablation study for mild tumor \(\phi\): Mean relative \(L^2\) error for \(\phi\) of the mild tumor growth mapping from the growth rate with \(\rho\) = 2.1 1/day (test case)
Table 105 Ablation study for mild tumor \(\sigma\): Mean relative \(L^2\) error for \(\sigma\) of the mild tumor growth mapping from the growth rate with \(\rho\) = 2.1 1/day (test case)
Table 106 Ablation study for mild tumor \(\phi\): Mean relative \(L^2\) error for \(\phi\) of the mild tumor growth mapping from the growth rate with \(\rho\) = 2.7 1/day (test case)
Table 107 Ablation study for mild tumor \(\sigma\): Mean relative \(L^2\) error for \(\sigma\) of the mild tumor growth mapping from the growth rate with \(\rho\) = 2.7 1/day (test case)
Fig. 53
figure 53

Ablation study: Prediction errors for inferring state variables for mild tumor cases mapping from the initial density of tumor cells with varying network structures. a Prediction errors for training datasets. The blue lines represent the mean of prediction errors in training datasets. b Prediction errors for testing datasets. The red lines represent the mean of prediction errors in testing datasets. The shaded region represents the region encompassed by the maximum and minimum of the prediction errors

Fig. 54
figure 54

Ablation study: Prediction errors for inferring state variables for aggressive tumor cases mapping from the nutrient uptake by tumor cells with varying network structures. a Prediction errors for training datasets. The blue lines represent the mean of prediction errors in training datasets. b Prediction errors for testing datasets. The red lines represent the mean of prediction errors in testing datasets. The shaded region represents the region encompassed by the maximum and minimum of the prediction errors

Fig. 55
figure 55

Grid study: Prediction errors for inferring state variables for mild tumor cases mapping from the initial density of tumor cells with varying \(\omega _{data}\). a Prediction errors for training datasets. The blue lines represent the mean of prediction errors in training datasets. b Prediction errors for testing datasets. The red lines represent the mean of prediction errors in testing datasets. The shaded region represents the region encompassed by the maximum and minimum of the prediction errors

Fig. 56
figure 56

Grid study: Prediction errors for inferring state variables for aggressive tumor cases mapping from the initial density of tumor cells with varying \(\omega _{data}\). a Prediction errors for training datasets. The blue lines represent the mean of prediction errors in training datasets. b Prediction errors for testing datasets. The red lines represent the mean of prediction errors in testing datasets. The shaded region represents the region encompassed by the maximum and minimum of the prediction errors

Fig. 57
figure 57

Grid study: Prediction errors for inferring state variables for aggressive tumor cases mapping from the nutrient uptake by tumor cells with varying number of total hidden layers. a Prediction errors for training datasets. The blue lines represent the mean of prediction errors in training datasets. b Prediction errors for testing datasets. The red lines represent the mean of prediction errors in testing datasets. The shaded region represents the region encompassed by the maximum and minimum of the prediction errors

Fig. 58
figure 58

Grid study: Prediction errors for inferring state variables for mild tumor cases mapping from the initial density of tumor cells with varying number of expert networks in MoE block. a Prediction errors for training datasets. The blue lines represent the mean of prediction errors in training datasets. b Prediction errors for testing datasets. The red lines represent the mean of prediction errors in testing datasets. The shaded region represents the region encompassed by the maximum and minimum of the prediction errors

Fig. 59
figure 59

Grid study: Prediction errors for inferring state variables for mild tumor cases mapping from the growth rate with varying initial learning rate. a Prediction errors for training datasets. The blue lines represent the mean of prediction errors in training datasets. b Prediction errors for testing datasets. The red lines represent the mean of prediction errors in testing datasets. The shaded region represents the region encompassed by the maximum and minimum of the prediction errors

Fig. 60
figure 60

Grid study: Prediction errors for inferring state variables for mild tumor cases mapping from the growth rate with varying decay steps for the optimizer. a Prediction errors for training datasets. The blue lines represent the mean of prediction errors in training datasets. b Prediction errors for testing datasets. The red lines represent the mean of prediction errors in testing datasets. The shaded region represents the region encompassed by the maximum and minimum of the prediction errors

Fig. 61
figure 61

Grid study: Prediction errors for inferring state variables for mild tumor cases mapping from the growth rate with varying activation functions. a Prediction errors for training datasets. The blue lines represent the mean of prediction errors in training datasets. b Prediction errors for testing datasets. The red lines represent the mean of prediction errors in testing datasets. The shaded region represents the region encompassed by the maximum and minimum of the prediction errors

Fig. 62

Grid study: Prediction errors for inferring state variables for mild tumor cases mapping from the growth rate with varying number of training datasets. a Prediction errors for training datasets. The blue lines represent the mean of prediction errors in training datasets. b Prediction errors for testing datasets. The red lines represent the mean of prediction errors in testing datasets. The shaded region represents the region encompassed by the maximum and minimum of the prediction errors

Fig. 63

Grid study: Prediction errors for inferring state variables for mild tumor cases mapping from the growth rate with or without boundary loss. a Prediction errors for training datasets. The blue lines represent the mean of prediction errors in training datasets. b Prediction errors for testing datasets. The red lines represent the mean of prediction errors in testing datasets. The shaded region represents the region encompassed by the maximum and minimum of the prediction errors

Fig. 64

Grid study: Prediction errors for inferring state variables for aggressive tumor cases mapping from the nutrient uptake by tumor cells with or without boundary loss. a Prediction errors for training datasets. The blue lines represent the mean of prediction errors in training datasets. b Prediction errors for testing datasets. The red lines represent the mean of prediction errors in testing datasets. The shaded region represents the region encompassed by the maximum and minimum of the prediction errors

Fig. 65

Ablation study: Prediction errors for inferring state variables for aggressive tumor cases mapping from the initial density of tumor cells with varying network structures. a Prediction errors for training datasets. The blue lines represent the mean of prediction errors in training datasets. b Prediction errors for testing datasets. The red lines represent the mean of prediction errors in testing datasets. The shaded region represents the region encompassed by the maximum and minimum of the prediction errors

Fig. 66

Grid study: Prediction errors for inferring state variables for mild tumor cases mapping from the nutrient uptake \(\gamma _{c}\) with varying number of total hidden layers. a Prediction errors for training datasets. The blue lines represent the mean of prediction errors in training datasets. b Prediction errors for testing datasets. The red lines represent the mean of prediction errors in testing datasets. The shaded region represents the region encompassed by the maximum and minimum of the prediction errors

Fig. 67

Grid study: Prediction errors for inferring state variables for mild tumor cases mapping from the initial density of tumor cells with varying number of total hidden layers. a Prediction errors for training datasets. The blue lines represent the mean of prediction errors in training datasets. b Prediction errors for testing datasets. The red lines represent the mean of prediction errors in testing datasets. The shaded region represents the region encompassed by the maximum and minimum of the prediction errors

Fig. 68

Grid study: Prediction errors for inferring state variables for mild tumor cases mapping from the growth rate \(\rho\) with varying number of total hidden layers. a Prediction errors for training datasets. The blue lines represent the mean of prediction errors in training datasets. b Prediction errors for testing datasets. The red lines represent the mean of prediction errors in testing datasets. The shaded region represents the region encompassed by the maximum and minimum of the prediction errors

Fig. 69

Grid study: Prediction errors for inferring state variables for aggressive tumor cases mapping from the initial density of tumor cells with varying number of total hidden layers. a Prediction errors for training datasets. The blue lines represent the mean of prediction errors in training datasets. b Prediction errors for testing datasets. The red lines represent the mean of prediction errors in testing datasets. The shaded region represents the region encompassed by the maximum and minimum of the prediction errors
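Each grid-study panel above uses the same layout: a line for the mean prediction error across runs, with a shaded band spanning the per-run minimum and maximum. A minimal matplotlib sketch of this layout follows; the error data here are synthetic placeholders, not results from the paper.

```python
import numpy as np
import matplotlib
matplotlib.use("Agg")  # headless backend, suitable for scripted figure export
import matplotlib.pyplot as plt

# Hypothetical errors: rows = independent training runs, columns = time snapshots
rng = np.random.default_rng(1)
t = np.linspace(0, 270, 28)
errors = 0.01 + 0.005 * rng.random((10, t.size))

mean_err = errors.mean(axis=0)
lo, hi = errors.min(axis=0), errors.max(axis=0)

plt.plot(t, mean_err, color="tab:blue", label="mean training error")
# Shaded region bounded by the minimum and maximum of the prediction errors
plt.fill_between(t, lo, hi, color="tab:blue", alpha=0.2, label="min-max range")
plt.xlabel("T")
plt.ylabel(r"relative $L^2$ error")
plt.legend()
plt.savefig("grid_study_errors.png")
```

Here `fill_between` draws the min-max envelope; substituting per-snapshot errors from repeated training runs for the synthetic `errors` array reproduces the style of the panels above.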

Fig. 70

Comparison with SOTA models. a Prediction errors for mild tumor case with R = 0.16 mm in scenario 1 (i.e., using a sparse measurement at T = 270). b Prediction errors for aggressive tumor case with R = 0.18 mm in scenario 1 (i.e., using sparse measurements at T = 200, 300 and 400). c Prediction errors for mild tumor case with R = 0.16 mm in scenario 2 (i.e., using no additional measurements in testing domain). d Prediction errors for aggressive tumor case with R = 0.18 mm in scenario 2 (i.e., using no additional measurements in testing domain). In scenario 1, the prediction range is T\(\in\)[0, 270] for mild tumors while T\(\in\)[0, 400] for aggressive tumors. In scenario 2, the prediction range is T\(\in\)[0, 130] for mild tumors while T\(\in\)[0, 230] for aggressive tumors

Fig. 71

Comparison with SOTA models. a Prediction errors for mild tumor case with R = 0.21 mm in scenario 1 (i.e., using 1 snapshot at T = 270). b Prediction errors for aggressive tumor case with R = 0.21 mm in scenario 1 (i.e., using 3 snapshots at T = 200, 300 and 400). c Prediction errors for mild tumor case with R = 0.21 mm in scenario 2 (i.e., using no additional snapshots in testing domain). d Prediction errors for aggressive tumor case with R = 0.21 mm in scenario 2 (i.e., using no additional snapshots in testing domain). In scenario 1, the prediction range is T\(\in\)[0, 270] for mild tumors while T\(\in\)[0, 400] for aggressive tumors. In scenario 2, the prediction range is T\(\in\)[0, 130] for mild tumors while T\(\in\)[0, 230] for aggressive tumors

H. Comparison with three state-of-the-art (SOTA) models

In this subsection, we further compare the ability of TGM-ONets to predict mild tumor growth with R = 0.16 mm and 0.21 mm and aggressive tumor growth with R = 0.18 mm and 0.21 mm against three SOTA models. We consider the same two scenarios as in Sect. 5.

For the first scenario, prediction errors for the mild tumor cases with R = 0.16 mm and 0.21 mm are displayed in Figs. 70a and 71a, and prediction errors for the aggressive tumor cases with R = 0.18 mm and 0.21 mm are displayed in Figs. 70b and 71b.

For the second scenario, prediction errors for the mild tumor cases with R = 0.16 mm and 0.21 mm are displayed in Figs. 70c and 71c, and prediction errors for the aggressive tumor cases with R = 0.18 mm and 0.21 mm are displayed in Figs. 70d and 71d.

From the results presented above, our fine-tuning method provides more stable and accurate predictions than the other three SOTA models, which demonstrates the effectiveness and efficiency of the proposed method.

Quantitative results are also provided for all cases considered in Sect. 5 and in this subsection. Tables 108 and 109 show the mean relative \(L^2\) errors for the mild tumor case with R = 0.08 mm in scenario 1. Tables 110 and 111 show the mean relative \(L^2\) errors for the mild tumor case with R = 0.16 mm in scenario 1. Tables 112 and 113 show the mean relative \(L^2\) errors for the mild tumor case with R = 0.21 mm in scenario 1.

Tables 114 and 115 show the mean relative \(L^2\) errors for the aggressive tumor case with R = 0.05 mm in scenario 1. Tables 116 and 117 show the mean relative \(L^2\) errors for the aggressive tumor case with R = 0.18 mm in scenario 1. Tables 118 and 119 show the mean relative \(L^2\) errors for the aggressive tumor case with R = 0.21 mm in scenario 1.

Tables 120 and 121 show the mean relative \(L^2\) errors for the mild tumor case with R = 0.08 mm in scenario 2. Tables 122 and 123 show the mean relative \(L^2\) errors for the mild tumor case with R = 0.16 mm in scenario 2. Tables 124 and 125 show the mean relative \(L^2\) errors for the mild tumor case with R = 0.21 mm in scenario 2.

Tables 126 and 127 show the mean relative \(L^2\) errors for the aggressive tumor case with R = 0.05 mm in scenario 2. Tables 128 and 129 show the mean relative \(L^2\) errors for the aggressive tumor case with R = 0.18 mm in scenario 2. Tables 130 and 131 show the mean relative \(L^2\) errors for the aggressive tumor case with R = 0.21 mm in scenario 2.
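The tables above all report mean relative \(L^2\) errors between predicted and reference fields. For reference, a minimal NumPy sketch of this metric is given below; the function name and data are illustrative, not taken from the paper's code.

```python
import numpy as np

def mean_relative_l2_error(pred, true):
    """Mean relative L2 error over a batch of predicted fields.

    pred, true: arrays of shape (n_samples, n_points); each row is one
    predicted/reference snapshot of a state variable (e.g. phi or sigma).
    """
    num = np.linalg.norm(pred - true, axis=1)  # ||pred - true||_2 per sample
    den = np.linalg.norm(true, axis=1)         # ||true||_2 per sample
    return float(np.mean(num / den))

# Example: a perfect prediction gives zero error
true = np.random.default_rng(0).random((4, 100))
print(mean_relative_l2_error(true, true))  # → 0.0
```

Averaging over samples and normalizing by the reference norm makes errors comparable across tumor cases whose state variables differ in magnitude.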

Table 108 Comparison with SOTA for mild tumor \(\phi\) (R = 0.08 mm) in scenario 1: Mean relative \(L^2\) error for \(\phi\) of the mild tumor growth mapping from the initial density of tumor cells with different SOTA methods using an additional snapshot at T = 270. TGM-ONets are trained in T\(\in\)[0, 70.5] while tested in T\(\in\)[0, 270]. \(\phi\) and \(\sigma\) are seen during the training of TGM-ONets in T\(\in\)[0, 70.5]
Table 109 Comparison with SOTA for mild tumor \(\sigma\) (R = 0.08 mm) in scenario 1: Mean relative \(L^2\) error for \(\sigma\) of the mild tumor growth mapping from the initial density of tumor cells with different SOTA methods using an additional snapshot at T = 270. TGM-ONets are trained in T\(\in\)[0, 70.5] while tested in T\(\in\)[0, 270]. \(\phi\) and \(\sigma\) are seen during the training of TGM-ONets in T\(\in\)[0, 70.5]
Table 110 Comparison with SOTA for mild tumor \(\phi\) (R = 0.16 mm) in scenario 1: Mean relative \(L^2\) error for \(\phi\) of the mild tumor growth mapping from the initial density of tumor cells with different SOTA methods using an additional snapshot at T = 270. TGM-ONets are trained in T\(\in\)[0, 70.5] while tested in T\(\in\)[0, 270]. \(\phi\) and \(\sigma\) are unseen during the training of TGM-ONets in T\(\in\)[0, 70.5]
Table 111 Comparison with SOTA for mild tumor \(\sigma\) (R = 0.16 mm) in scenario 1: Mean relative \(L^2\) error for \(\sigma\) of the mild tumor growth mapping from the initial density of tumor cells with different SOTA methods using an additional snapshot at T = 270. TGM-ONets are trained in T\(\in\)[0, 70.5] while tested in T\(\in\)[0, 270]. \(\phi\) and \(\sigma\) are unseen during the training of TGM-ONets in T\(\in\)[0, 70.5]
Table 112 Comparison with SOTA for mild tumor \(\phi\) (R = 0.21 mm) in scenario 1: Mean relative \(L^2\) error for \(\phi\) of the mild tumor growth mapping from the initial density of tumor cells with different SOTA methods using an additional snapshot at T = 270. TGM-ONets are trained in T\(\in\)[0, 70.5] while tested in T\(\in\)[0, 270]. \(\phi\) and \(\sigma\) are unseen during the training of TGM-ONets in T\(\in\)[0, 70.5]
Table 113 Comparison with SOTA for mild tumor \(\sigma\) (R = 0.21 mm) in scenario 1: Mean relative \(L^2\) error for \(\sigma\) of the mild tumor growth mapping from the initial density of tumor cells with different SOTA methods using an additional snapshot at T = 270. TGM-ONets are trained in T\(\in\)[0, 70.5] while tested in T\(\in\)[0, 270]. \(\phi\) and \(\sigma\) are unseen during the training of TGM-ONets in T\(\in\)[0, 70.5]
Table 114 Comparison with SOTA for aggressive tumor \(\phi\) (R = 0.05 mm) in scenario 1: Mean relative \(L^2\) error for \(\phi\) of the aggressive tumor growth mapping from the initial density of tumor cells with different SOTA methods using additional snapshots at T = 200, 300 and 400. TGM-ONets are trained in T\(\in\)[0, 200] while tested in T\(\in\)[0, 400]. \(\phi\) and \(\sigma\) are seen during the training of TGM-ONets in T\(\in\)[0, 200]
Table 115 Comparison with SOTA for aggressive tumor \(\sigma\) (R = 0.05 mm) in scenario 1: Mean relative \(L^2\) error for \(\sigma\) of the aggressive tumor growth mapping from the initial density of tumor cells with different SOTA methods using 3 additional snapshots at T = 200, 300 and 400. TGM-ONets are trained in T\(\in\)[0, 200] while tested in T\(\in\)[0, 400]. \(\phi\) and \(\sigma\) are seen during the training of TGM-ONets in T\(\in\)[0, 200]
Table 116 Comparison with SOTA for aggressive tumor \(\phi\) (R = 0.18 mm) in scenario 1: Mean relative \(L^2\) error for \(\phi\) of the aggressive tumor growth mapping from the initial density of tumor cells with different SOTA methods using 3 additional snapshots at T = 200, 300, 400. TGM-ONets are trained in T\(\in\)[0, 200] while tested in T\(\in\)[0, 400]. \(\phi\) and \(\sigma\) are unseen during the training of TGM-ONets in T\(\in\)[0, 200]
Table 117 Comparison with SOTA for aggressive tumor \(\sigma\) (R = 0.18 mm) in scenario 1: Mean relative \(L^2\) error for \(\sigma\) of the aggressive tumor growth mapping from the initial density of tumor cells with different SOTA methods using 3 additional snapshots at T = 200, 300, 400. TGM-ONets are trained in T\(\in\)[0, 200] while tested in T\(\in\)[0, 400]. \(\phi\) and \(\sigma\) are unseen during the training of TGM-ONets in T\(\in\)[0, 200]
Table 118 Comparison with SOTA for aggressive tumor \(\phi\) (R = 0.21 mm) in scenario 1: Mean relative \(L^2\) error for \(\phi\) of the aggressive tumor growth mapping from the initial density of tumor cells with different SOTA methods using 3 additional snapshots at T = 200, 300, 400. TGM-ONets are trained in T\(\in\)[0, 200] while tested in T\(\in\)[0, 400]. \(\phi\) and \(\sigma\) are unseen during the training of TGM-ONets in T\(\in\)[0, 200]
Table 119 Comparison with SOTA for aggressive tumor \(\sigma\) (R = 0.21 mm) in scenario 1: Mean relative \(L^2\) error for \(\sigma\) of the aggressive tumor growth mapping from the initial density of tumor cells with different SOTA methods using 3 additional snapshots at T = 200, 300, 400. TGM-ONets are trained in T\(\in\)[0, 200] while tested in T\(\in\)[0, 400]. \(\phi\) and \(\sigma\) are unseen during the training of TGM-ONets in T\(\in\)[0, 200]
Table 120 Comparison with SOTA for mild tumor \(\phi\) (R = 0.08 mm) in scenario 2: Mean relative \(L^2\) error for \(\phi\) of the mild tumor growth mapping from the initial density of tumor cells with different SOTA methods using no additional snapshots in testing domain. TGM-ONets are trained in T\(\in\)[0, 70.5] while tested in T\(\in\)[0, 130]. \(\phi\) and \(\sigma\) are seen during the training of TGM-ONets in T\(\in\)[0, 70.5]
Table 121 Comparison with SOTA for mild tumor \(\sigma\) (R = 0.08 mm) in scenario 2: Mean relative \(L^2\) error for \(\sigma\) of the mild tumor growth mapping from the initial density of tumor cells with different SOTA methods using no additional snapshots in testing domain. TGM-ONets are trained in T\(\in\)[0, 70.5] while tested in T\(\in\)[0, 130]. \(\phi\) and \(\sigma\) are seen during the training of TGM-ONets in T\(\in\)[0, 70.5]
Table 122 Comparison with SOTA for mild tumor \(\phi\) (R = 0.16 mm) in scenario 2: Mean relative \(L^2\) error for \(\phi\) of the mild tumor growth mapping from the initial density of tumor cells with different SOTA methods using no additional snapshots in testing domain. TGM-ONets are trained in T\(\in\)[0, 70.5] while tested in T\(\in\)[0, 130]. \(\phi\) and \(\sigma\) are unseen during the training of TGM-ONets in T\(\in\)[0, 70.5]
Table 123 Comparison with SOTA for mild tumor \(\sigma\) (R = 0.16 mm) in scenario 2: Mean relative \(L^2\) error for \(\sigma\) of the mild tumor growth mapping from the initial density of tumor cells with different SOTA methods using no additional snapshots in testing domain. TGM-ONets are trained in T\(\in\)[0, 70.5] while tested in T\(\in\)[0, 130]. \(\phi\) and \(\sigma\) are unseen during the training of TGM-ONets in T\(\in\)[0, 70.5]
Table 124 Comparison with SOTA for mild tumor \(\phi\) (R = 0.21 mm) in scenario 2: Mean relative \(L^2\) error for \(\phi\) of the mild tumor growth mapping from the initial density of tumor cells with different SOTA methods using no additional snapshots in testing domain. TGM-ONets are trained in T\(\in\)[0, 70.5] while tested in T\(\in\)[0, 130]. \(\phi\) and \(\sigma\) are unseen during the training of TGM-ONets in T\(\in\)[0, 70.5]
Table 125 Comparison with SOTA for mild tumor \(\sigma\) (R = 0.21 mm) in scenario 2: Mean relative \(L^2\) error for \(\sigma\) of the mild tumor growth mapping from the initial density of tumor cells with different SOTA methods using no additional snapshots in testing domain. TGM-ONets are trained in T\(\in\)[0, 70.5] while tested in T\(\in\)[0, 130]. \(\phi\) and \(\sigma\) are unseen during the training of TGM-ONets in T\(\in\)[0, 70.5]
Table 126 Comparison with SOTA for aggressive tumor \(\phi\) (R = 0.05 mm) in scenario 2: Mean relative \(L^2\) error for \(\phi\) of the aggressive tumor growth mapping from the initial density of tumor cells with different SOTA methods using no additional snapshots in testing domain. TGM-ONets are trained in T\(\in\)[0, 200] while tested in T\(\in\)[0, 230]. \(\phi\) and \(\sigma\) are seen during the training of TGM-ONets in T\(\in\)[0, 200]
Table 127 Comparison with SOTA for aggressive tumor \(\sigma\) (R = 0.05 mm) in scenario 2: Mean relative \(L^2\) error for \(\sigma\) of the aggressive tumor growth mapping from the initial density of tumor cells with different SOTA methods using no additional snapshots in testing domain. TGM-ONets are trained in T\(\in\)[0, 200] while tested in T\(\in\)[0, 230]. \(\phi\) and \(\sigma\) are seen during the training of TGM-ONets in T\(\in\)[0, 200]
Table 128 Comparison with SOTA for aggressive tumor \(\phi\) (R = 0.18 mm) in scenario 2: Mean relative \(L^2\) error for \(\phi\) of the aggressive tumor growth mapping from the initial density of tumor cells with different SOTA methods using no additional snapshots in testing domain. TGM-ONets are trained in T\(\in\)[0, 200] while tested in T\(\in\)[0, 230]. \(\phi\) and \(\sigma\) are unseen during the training of TGM-ONets in T\(\in\)[0, 200]
Table 129 Comparison with SOTA for aggressive tumor \(\sigma\) (R = 0.18 mm) in scenario 2: Mean relative \(L^2\) error for \(\sigma\) of the aggressive tumor growth mapping from the initial density of tumor cells with different SOTA methods using no additional snapshots in testing domain. TGM-ONets are trained in T\(\in\)[0, 200] while tested in T\(\in\)[0, 230]. \(\phi\) and \(\sigma\) are unseen during the training of TGM-ONets in T\(\in\)[0, 200]
Table 130 Comparison with SOTA for aggressive tumor \(\phi\) (R = 0.21 mm) in scenario 2: Mean relative \(L^2\) error for \(\phi\) of the aggressive tumor growth mapping from the initial density of tumor cells with different SOTA methods using no additional snapshots in testing domain. TGM-ONets are trained in T\(\in\)[0, 200] while tested in T\(\in\)[0, 230]. \(\phi\) and \(\sigma\) are unseen during the training of TGM-ONets in T\(\in\)[0, 200]
Table 131 Comparison with SOTA for aggressive tumor \(\sigma\) (R = 0.21 mm) in scenario 2: Mean relative \(L^2\) error for \(\sigma\) of the aggressive tumor growth mapping from the initial density of tumor cells with different SOTA methods using no additional snapshots in testing domain. TGM-ONets are trained in T\(\in\)[0, 200] while tested in T\(\in\)[0, 230]. \(\phi\) and \(\sigma\) are unseen during the training of TGM-ONets in T\(\in\)[0, 200]

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.


About this article


Cite this article

Chen, Q., Li, H. & Zheng, X. A deep neural network for operator learning enhanced by attention and gating mechanisms for long-time forecasting of tumor growth. Engineering with Computers 41, 423–533 (2025). https://doi.org/10.1007/s00366-024-02003-0
