Abstract
Sediment discharge in rivers is among the most important water and environmental engineering issues. The present study employed the published reliable field data set and three white-box data-driven methods, including the group method of data handling (GMDH), gene expression programming (GEP), and the multivariate adaptive regression splines (MARS) approach for modeling sediment discharge in rivers (Qs). In addition, the artificial neural network (ANN) model, known as the popular and widely used black-box data-driven model, was employed for modeling sediment discharge. The performance of the proposed methods was explored by statistical measures, scatter plots, and Taylor and Violin plots. The main feature of white-box data-driven models is that they provide explicit mathematical expressions for the prediction of Qs. The outcomes of the proposed methods provide better and more competitive results than the earlier study conducted using the model tree (MT) approach. Statical measurements and graphical plots indicated that all proposed methods have similar results for the prediction of sediment discharge. However, GMDH and GEP were more accurate than the MARS and ANN models. For the overall evaluation of the proposed models, the ranking mean (RM) method was used. This method showed that the GMDH model with RM = 1.86 had better performance in estimating sediment discharge, followed by the GEP with RM = 2.29, MARS with RM = 2.86, and ANN with RM = 3. It is worth mentioning that simple, explicit mathematical expressions generated by GMDH and MARS were more straightforward calculations for estimating sediment discharge compared to GEP and ANN.
Similar content being viewed by others
Data availability
The data references are described in the text of the article.
References
Adnan RM, Khosravinia P, Karimi B, Kisi O (2021) Prediction of hydraulics performance in drain envelopes using Kmeans based multivariate adaptive regression spline. Appl Soft Comput 100:107008. https://doi.org/10.1016/j.asoc.2020.107008
Ahmed AN, Van Lam T, Hung ND, Van Thieu N, Kisi O, El-Shafie A (2021) A comprehensive comparison of recent developed meta-heuristic algorithms for streamflow time series forecasting problem. Appl Soft Comput 105:107282. https://doi.org/10.1016/j.asoc.2021.107282
Allahbakhshian-Farsani P, Vafakhah M, Khosravi-Farsani H, Hertig E (2020) Regional flood frequency analysis through some machine learning models in semi-arid regions. Water Resour Manage 34:2887–2909. https://doi.org/10.1007/s11269-020-02589-2
Behnia N, Zare M, Moosavi V, Khajeddin SJ (2022) An inter-comparison of different PSO-optimized artificial intelligence algorithms for thermal-based soil moisture retrieval. Earth Sci Inf 15(1):473–484. https://doi.org/10.1007/s12145-021-00747-7
Benbouhenni H (2021) Two-level DPC strategy based on FNN algorithm of DFIG-DRWT systems using two-level hysteresis controllers for reactive and active powers. Renew Energy Res Appl 2(1):137–146
Essam Y, Huang YF, Birima AH, Ahmed AN, El-Shafie A (2022) Predicting suspended sediment load in Peninsular Malaysia using support vector machine and deep learning algorithms. Sci Rep 12(1):302. https://doi.org/10.1038/s41598-021-04419-w
Fayaz SA, Zaman M, Butt MA (2022) Numerical and Experimental Investigation of Meteorological Data Using Adaptive Linear M5 Model Tree for the Prediction of Rainfall. Rev Comput Eng Res 9(1):1–12. https://doi.org/10.18488/76.v9i1.2961
Ferreira C (2002) Gene expression programming in problem solving. In Soft computing and industry: recent applications (pp. 635–653). London: Springer London. https://doi.org/10.1007/978-1-4471-0123-9_54
Firouz MH, Ghadimi N (2015) Wind energy uncertainties in multi-objective environmental/economic dispatch based on multi-objective evolutionary algorithm. UCT J Res Sci Eng Technol 3(3):8–15. https://doi.org/10.24200/jrset.vol3iss03pp8-15
Friedman JH (1991) Multivariate adaptive regression splines. Ann Stat 19(1):1–67. https://doi.org/10.1214/aos/1176347963
Fung PL, Zaidan MA, Timonen H, Niemi JV, Kousa A, Kuula J, Luoma K, Tarkoma S, Petäjä T, Kulmala M, Hussein T (2021) Evaluation of white-box versus black-box machine learning models in estimating ambient black carbon concentration. J Aerosol Sci 152:105694. https://doi.org/10.1016/j.jaerosci.2020.105694
Ghani Ab, Azamathulla Md (2011) Gene-expression programming for sediment transport in sewer pipe systems. J Pipeline Syst Eng Pract 2(3):102–106. https://doi.org/10.1061/(ASCE)PS.1949-1204.0000076
Ghasemi M, Samadi M, Soleimanian E, Chau KW (2023) A comparative study of black-box and white-box data-driven methods to predict landfill leachate permeability. Environ Monit Assess 195(7):862. https://doi.org/10.1007/s10661-023-11462-9
Gong S, Bai X, Luo G, Li C, Wu L, Chen F, Ran C, Xi H, Zhang S (2023) Climate change has enhanced the positive contribution of rock weathering to the major ions in riverine transport. Global Planet Change 228:104203. https://doi.org/10.1016/j.gloplacha.2023.104203
Gupta D, Hazarika BB, Berlin M, Sharma UM, Mishra K (2021) Artificial intelligence for suspended sediment load prediction: a review. Environ Earth Sci 80(9):346. https://doi.org/10.1007/s12665-021-09625-3
Guven A, Pala A (2022) Comparison of different statistical downscaling models and future projection of areal mean precipitation of a river basin under climate change effect. Water Supply 22(3):2424–2439. https://doi.org/10.2166/ws.2021.372
Hanoon MS, Abdullatif BAA, Ahmed AN, Razzaq A, Birima AH, El-Shafie A (2022) A comparison of various machine learning approaches performance for prediction suspended sediment load of river systems: a case study in Malaysia. Earth Sci Inform, pp.1–14. https://doi.org/10.1007/s12145-021-00689-0
Hayder G, Solihin MI, Kushiar KFB (2021) A performance comparison of various artificial intelligence approaches for estimation of sediment of river systems. J Ecol Eng 22(7):20–27. https://doi.org/10.12911/22998993/137847
He MY, Dong JB, Jin Z, Liu CY, Xiao J, Zhang F, Sun H, Zhao ZQ, Gou LF, Liu WG, Luo CG (2021) Pedogenic processes in loess-paleosol sediments: Clues from Li isotopes of leachate in Luochuan loess. Geochim Cosmochim Acta 299:151–162. https://doi.org/10.1016/j.gca.2021.02.021
Jamei M, Ali M, Malik A, Rai P, Karbasi M, Farooque AA, Yaseen ZM (2023) Designing a decomposition-based multi-phase pre-processing strategy coupled with EDBi-LSTM deep learning approach for sediment load forecasting. Ecol Ind 153:110478. https://doi.org/10.1016/j.ecolind.2023.110478
Jang EK (2017) Sediment Discharge Assessment for Rivers using Model Tree in Data Mining. Thesis (PhD). Yongin, Korea: Myongji University
Jang EK, Ji U, Yeo W (2023) Estimation of sediment discharge using a tree-based model. Hydrol Sci J, (just-accepted). https://doi.org/10.1080/02626667.2023.2221790
Jin W (2022) Cognitive radio spectrum allocation based on IOT and genetic algorithm. J Commer Biotechnol, 27(1). https://doi.org/10.5912/jcb1071
Julien PY (2010) Erosion and sedimentation. Cambridge University Press, Cambridge, UK. https://doi.org/10.1017/CBO9780511806049
Kargar K, Safari MJS, Mohammadi M, Samadianfard S (2019) Sediment transport modeling in open channels using neuro-fuzzy and gene expression programming techniques. Water Sci Technol 79(12):2318–2327. https://doi.org/10.2166/wst.2019.229
Kazemi MS, Banihabib ME, Soltani J (2021) A hybrid SVR-PSO model to predict concentration of sediment in typical and debris floods. Earth Sci Inf 14:365–376. https://doi.org/10.1007/s12145-021-00570-0
Keshtegar B, Heddam S, Kisi O, Zhu SP (2019) Modeling total dissolved gas (TDG) concentration at Columbia river basin dams: high-order response surface method (H-RSM) vs. M5Tree, LSSVM, and MARS. Arab J Geosci 12:1–15. https://doi.org/10.1007/s12517-019-4687-3
Kisi O, Dailr AH, Cimen M, Shiri J (2012) Suspended sediment modeling using genetic programming and soft computing techniques. J Hydrol 450:48–58. https://doi.org/10.1016/j.jhydrol.2012.05.031
Kumar M, Kumar P, Kumar A, Elbeltagi A, Kuriqi A (2022) Modeling stage-discharge-sediment using support vector machine and artificial neural network coupled with wavelet transform. Appl Water Sci 12(5):87. https://doi.org/10.1007/s13201-022-01621-7
Mousavi-Mirkalaei P, Roozbahani A, Banihabib ME, Randhir TO (2022) Forecasting urban water consumption using bayesian networks and gene expression programming. Earth Sci Inf 15(1):623–633. https://doi.org/10.1007/s12145-021-00733-z
Najafzadeh M (2015) Neuro-fuzzy GMDH systems based evolutionary algorithms to predict scour pile groups in clear water conditions. Ocean Eng 99:85–94. https://doi.org/10.1016/j.oceaneng.2015.01.014
Najafzadeh M, Lim SY (2015) Application of improved neuro-fuzzy GMDH to predict scour depth at sluice gates. Earth Sci Inf 8:187–196. https://doi.org/10.1007/s12145-014-0144-8
Najafzadeh M, Saberi-Movahed F (2019) GMDH-GEP to predict free span expansion rates below pipelines under waves. Mar Georesour Geotechnol 37(3):375–392. https://doi.org/10.1080/1064119X.2018.1443355
Najafzadeh M, Tafarojnoruz A (2016) Evaluation of neuro-fuzzy GMDH-based particle swarm optimization to predict longitudinal dispersion coefficient in rivers. Environ Earth Sci 75:1–12. https://doi.org/10.1007/s12665-015-4877-6
Najafzadeh M, Barani GA, Azamathulla HM (2013a) GMDH to predict scour depth around a pier in cohesive soils. Appl Ocean Res 40:35–41. https://doi.org/10.1016/j.apor.2012.12.004
Najafzadeh M, Barani GA, Kermani MRH (2013b) GMDH based back propagation algorithm to predict abutment scour in cohesive soils. Ocean Eng 59:100–106. https://doi.org/10.1016/j.oceaneng.2012.12.006
Najafzadeh M, Barani GA, Kermani MRH (2013c) Abutment scour in clear-water and live-bed conditions by GMDH network. Water Sci Technol 67(5):1121–1128. https://doi.org/10.2166/wst.2013.670
Najafzadeh M, Barani GA, Hessami Kermani MR (2014a) Estimation of pipeline scour due to waves by GMDH. J Pipeline Syst Eng Pract 5(3):06014002. https://doi.org/10.1061/(ASCE)PS.1949-1204.0000171
Najafzadeh M, Barani GA, Hessami-Kermani MR (2014b) Group method of data handling to predict scour at downstream of a ski-jump bucket spillway. Earth Sci Inf 7:231–248. https://doi.org/10.1007/s12145-013-0140-4
Najafzadeh M, Etemad-Shahidi A, Lim SY (2016) Scour prediction in long contractions using ANFIS and SVM. Ocean Eng 111:128–135. https://doi.org/10.1016/j.oceaneng.2015.10.053
Niazkar M, Zakwan M (2021) Application of MGGP, ANN, MHBMO, GRG, and linear regression for developing daily sediment rating curves. Math Probl Eng 2021:1–13. https://doi.org/10.1155/2021/8574063
Piraei R, Niazkar M, Afzali SH (2023) Assessment of data-driven models for estimating total sediment discharge. Earth Sci Inform, pp.1–18. https://doi.org/10.1007/s12145-023-01069-6
Pourderogar H, Harasii H, Alayi R, Delbari SH, Sadeghzadeh M, Javaherbakhsh AR (2020) Modeling and technical analysis of solar tracking system to find optimal angle for maximum power generation using MOPSO algorithm. Renew Energy Res Appl 1(2):211–222
Riasetiawan M, Anggara F, Syahra V, Ashari A, Prastowo BN, Kusumawardani IC, Wahyu P (2023) Coal rank data analytic for ASTM and PSDBMP classification. Int J Innov Res Sci Stud 6(2):374–380. https://doi.org/10.53894/ijirss.v6i2.1469
Saberi-Movahed F, Najafzadeh M, Mehrpooya A (2020) Receiving more accurate predictions for longitudinal dispersion coefficients in water pipelines: training group method of data handling using extreme learning machine conceptions. Water Resour Manage 34(2):529–561. https://doi.org/10.1007/s11269-019-02463-w
Safari MJS (2019) Decision tree (DT), generalized regression neural network (GR) and multivariate adaptive regression splines (MARS) models for sediment transport in sewer pipes. Water Sci Technol 79(6):1113–1122. https://doi.org/10.2166/wst.2019.106
Safari MJS, Ebtehaj I, Bonakdari H, Es-haghi MS (2019) Sediment transport modeling in rigid boundary open channels using generalize structure of group method of data handling. J Hydrol 577:123951. https://doi.org/10.1016/j.jhydrol.2019.123951
Salih SQ, Sharafati A, Khosravi K, Faris H, Kisi O, Tao H, Ali M, Yaseen ZM (2020) River suspended sediment load prediction based on river discharge information: application of newly developed data mining models. Hydrol Sci J 65(4):624–637. https://doi.org/10.1080/02626667.2019.1703186
Samadi M, Jabbar E (2012) Assessment of regression trees and multivariate adaptive regression splines for prediction of scour depth below the ski-jump bucket spillway. J Hydraul 7(3):73–79
Samadi M, Jabbari E, Azamathulla HM (2014) Assessment of M5′ model tree and classification and regression trees for prediction of scour depth below free overfall spillways. Neural Comput Appl 24:357–366. https://doi.org/10.1007/s00521-012-1230-9
Samadi M, Jabbari E, Azamathulla HM, Mojallal M (2015) Estimation of scour depth below free overfall spillways using multivariate adaptive regression splines and artificial neural networks. Eng Appl Comput Fluid Mech 9(1):291–300. https://doi.org/10.1080/19942060.2015.1011826
Samadi M, Afshar MH, Jabbari E, Sarkardeh H (2020a) Application of multivariate adaptive regression splines and classification and regression trees to estimate wave-induced scour depth around pile groups. Iran J Sci Technol Trans Civil Eng 44:447–459. https://doi.org/10.1007/s40996-020-00364-2
Samadi M, Sarkardeh H, Jabbari E (2020b) Explicit data-driven models for prediction of pressure fluctuations occur during turbulent flows on sloping channels. Stoch Env Res Risk Assess 34:691–707. https://doi.org/10.1007/s00477-020-01794-0
Samadi M, Afshar MH, Jabbari E, Sarkardeh H (2021) Prediction of current-induced scour depth around pile groups using MARS, CART, and ANN approaches. Mar Georesour Geotechnol 39(5):577–588. https://doi.org/10.1080/1064119X.2020.1731025
Samui P, Yesilyurt SN, Dalkilic HY, Yaseen ZM, Roy SS, Kumar S (2023) Comparison of different optimized machine learning algorithms for daily river flow forecasting. Earth Sci Inf 16(1):533–548. https://doi.org/10.1007/s12145-022-00896-3
Sayed BT, Al-Mohair HK, Alkhayyat A, Ramírez-Coronel AA, Elsahabi M (2023) Comparing machine-learning-based black box techniques and white box models to predict rainfall-runoff in a northern area of Iraq, the Little Khabur River. Water Sci Technol 87(3):812–822. https://doi.org/10.2166/wst.2023.014
Sepahvand A, Golkarian A, Billa L, Wang K, Rezaie F, Panahi S, Samadianfard S, Khosravi K (2022) Evaluation of deep machine learning-based models of soil cumulative infiltration. Earth Sci Inf 15(3):1861–1877. https://doi.org/10.1007/s12145-022-00830-7
Shafagh Loron R, Samadi M, Shamsai A (2023) Predictive explicit expressions from data-driven models for estimation of scour depth below ski-jump bucket spillways. Water Supply 23(1):304–316. https://doi.org/10.2166/ws.2022.421
Sharghi E, Paknezhad NJ, Najafi H (2021) Assessing the effect of emotional unit of emotional ANN (EANN) in estimation of the prediction intervals of suspended sediment load modeling. Earth Sci Inf 14(1):201–213. https://doi.org/10.1007/s12145-020-00567-1
Torabi M, Sarkardeh H, Mirhosseini SM (2022) Estimating the permeability coefficient of soil using CART and GMDH approaches. Water Supply 22(8):6756–6764. https://doi.org/10.2166/ws.2022.248
Wang W, Wang Y (2022) Research on image capture technology of intelligent terminal and multi exposure fusion to improve the resilience of agriculture production systems. J Commer Biotechnol 27(2):46–56. https://doi.org/10.5912/jcb1045
Yin L, Wang L, Li T, Lu S, Yin Z, Liu X, Li X, Zheng W (2023a) U-Net-STN: A novel end-to-end lake boundary prediction model. Land 12(8):1602. https://doi.org/10.3390/land12081602
Yin L, Wang L, Li T, Lu S, Tian J, Yin Z, Li X, Zheng W (2023b) U-Net-LSTM: Time series-enhanced lake boundary prediction model. Land 12(10):1859. https://doi.org/10.3390/land12101859
Zaidan MA, Wraith D, Boor BE, Hussein T (2019) Bayesian proxy modelling for estimating black carbon concentrations using white-box and black-box models. Appl Sci 9(22):4976. https://doi.org/10.3390/app9224976
Zeyneb T, Nadir M, Boualem R (2022) Modeling of suspended sediment concentrations by artificial neural network and adaptive neuro fuzzy interference system method-study of five largest basins in Eastern Algeria. Water Pract Technol 17(5):1058–1081. https://doi.org/10.2166/wpt.2022.050
Acknowledgements
The authors gratefully acknowledge technical and financial support provided by the Ministry of Education and King Abdulaziz University, Deanship of Scientific Research (DSR), Jeddah, Saudi Arabia.
Funding
This research work was funded by Institutional Fund Projects under grant no. (IFPIP:746-829-1443).
Author information
Authors and Affiliations
Contributions
Marwan Kheimi is the sole author of this work.
Corresponding author
Ethics declarations
Competing interests
The author declares no competing interests.
Additional information
Communicated by: H. Babaie
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Kheimi, M. Data-driven approaches for estimation of sediment discharge in rivers. Earth Sci Inform 17, 761–781 (2024). https://doi.org/10.1007/s12145-023-01191-5
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12145-023-01191-5