Abstract
This study explores the application of machine learning algorithms to the prediction of pan evaporation (Ep), a critical quantity in water resource management for assessing water demand and usage. Specifically, it evaluates two base models, Random Forest (RF) and Multi-Layer Perceptron (MLP), and their counterparts optimized with a Genetic Algorithm (GA), designated GA-RF and GA-MLP, for modeling Ep at a target station using data from adjacent stations. The datasets were split into a training set (70%) and a testing set (30%). Model performance was assessed using three statistical measures: Correlation Coefficient (CC), Scatter Index (SI), and Willmott’s Index of agreement (WI). The optimized models, GA-MLP-5 in particular, showed superior performance, with GA-MLP-5 reaching a CC of 0.8704, an SI of 0.2539, and a WI of 0.9212, indicating that GA can refine RF and MLP models to improve predictive accuracy. Additionally, a sensitivity analysis using GA-RF indicates that Ep at neighboring stations influences the target station to varying degrees, shedding light on the key predictors for effective water management. In conclusion, the study demonstrates that the hybrid models have significant potential for accurate Ep estimation and can be extended to predict other meteorological variables, offering valuable tools for water resource management strategies.
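For readers who want to reproduce the evaluation step, the three measures named above can be sketched as follows. This is a minimal illustration, not the authors' code. It assumes the commonly used definitions: CC as the Pearson correlation, SI as the root-mean-square error divided by the mean observed value, and WI as Willmott's original index of agreement. The function names and the helper `split_70_30` are hypothetical, and the split is taken in row order, which the abstract does not specify.

```python
import math


def correlation_coefficient(obs, pred):
    """Pearson correlation between observed and predicted values (assumed CC)."""
    n = len(obs)
    mean_o = sum(obs) / n
    mean_p = sum(pred) / n
    cov = sum((o - mean_o) * (p - mean_p) for o, p in zip(obs, pred))
    dev_o = math.sqrt(sum((o - mean_o) ** 2 for o in obs))
    dev_p = math.sqrt(sum((p - mean_p) ** 2 for p in pred))
    return cov / (dev_o * dev_p)


def scatter_index(obs, pred):
    """RMSE normalised by the mean observation (assumed SI definition)."""
    n = len(obs)
    rmse = math.sqrt(sum((p - o) ** 2 for o, p in zip(obs, pred)) / n)
    return rmse / (sum(obs) / n)


def willmott_index(obs, pred):
    """Willmott's index of agreement; 1.0 means perfect agreement."""
    mean_o = sum(obs) / len(obs)
    num = sum((p - o) ** 2 for o, p in zip(obs, pred))
    den = sum((abs(p - mean_o) + abs(o - mean_o)) ** 2 for o, p in zip(obs, pred))
    return 1.0 - num / den


def split_70_30(rows, train_fraction=0.7):
    """Hypothetical helper: first 70% of rows for training, the rest for testing.

    Splitting in row order is an assumption; the abstract gives only the ratio.
    """
    cut = round(len(rows) * train_fraction)
    return rows[:cut], rows[cut:]


if __name__ == "__main__":
    # Made-up numbers for illustration only, not data from the study.
    observed = [1.0, 2.0, 3.0]
    predicted = [2.0, 3.0, 4.0]
    print(correlation_coefficient(observed, predicted))
    print(scatter_index(observed, predicted))
    print(willmott_index(observed, predicted))
```

A higher CC and WI and a lower SI indicate better agreement, which is the sense in which the abstract reports GA-MLP-5 as the strongest model.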
Data availability
The authors confirm that the data supporting the findings of this study are available within the article and its Supplementary material. Raw data that support the findings of this study are available from the corresponding author, upon reasonable request.
Funding
Not Applicable.
Author information
Contributions
- Conceptualization: Sadra Shadkani, Sajjad Hashemi
- Data curation: Sadra Shadkani, Sajjad Hashemi
- Formal analysis: Sadra Shadkani, Sajjad Hashemi
- Investigation: Sadra Shadkani, Sajjad Hashemi, Amirreza Pak
- Methodology: Sadra Shadkani, Sajjad Hashemi
- Resources: Sadra Shadkani, Amirreza Pak
- Software: Sadra Shadkani
- Supervision: Sadra Shadkani
- Validation: Sadra Shadkani, Sajjad Hashemi
- Visualization: Amirreza Pak, Sajjad Hashemi
- Writing - original draft: Sadra Shadkani, Sajjad Hashemi, Amirreza Pak
- Writing - review & editing: Sadra Shadkani, Sajjad Hashemi, Amirreza Pak, Alireza Barzgari Lahijan
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Communicated by: H. Babaie
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Shadkani, S., Hashemi, S., Pak, A. et al. Random Forest and Multilayer Perceptron hybrid models integrated with the genetic algorithm for predicting pan evaporation of target site using a limited set of neighboring reference station data. Earth Sci Inform 17, 1261–1280 (2024). https://doi.org/10.1007/s12145-024-01237-2
DOI: https://doi.org/10.1007/s12145-024-01237-2