Skip to main content

Advertisement

Log in

Random Forest and Multilayer Perceptron hybrid models integrated with the genetic algorithm for predicting pan evaporation of target site using a limited set of neighboring reference station data

  • Research
  • Published:
Earth Science Informatics Aims and scope Submit manuscript

Abstract

This study explores the application of machine learning algorithms for the prediction of pan evaporation (Ep), which is a critical factor in water resource management for the assessment of water demand and usage. Specifically, this research evaluates the effectiveness of two base models: Random Forest (RF) and Multi-Layer Perceptron (MLP) and their optimized counterparts using a Genetic Algorithm (GA), designated as GA-RF and GA-MLP, for modeling Ep at a target station using data from adjacent stations. The datasets were split into a training set (70%) and a testing set (30%). The models’ performances were judged using three statistical measures: Correlation Coefficient (CC), Scattered Index (SI), and Willmott’s Index of agreement (WI). The enhanced models, particularly GA-MLP-5, showed superior performance with a CC of 0.8704, SI of 0.2539, and WI of 0.9212, indicating the potent ability of GA to refine RF and MLP models for predictive accuracy. Additionally, sensitivity analysis via the GA-RF indicates the varying influence of Ep from neighboring stations on the target station, shedding light on key predictors for effective water management. Conclusively, this study demonstrates that the hybrid models have significant potential in accurate Ep estimation and can be expanded to predict other meteorological variables, offering valuable tools for water resource management strategies.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

Data availability

The authors confirm that the data supporting the findings of this study are available within the article and its Supplementary material. Raw data that support the findings of this study are available from the corresponding author, upon reasonable request.

Reference

  • Abghari H, Ahmadi H, Besharat S, Rezaverdinejad V (2012) Prediction of daily pan evaporation using wavelet neural networks. Water Resour Manage 26(12):3639–3652

    Google Scholar 

  • Adnan MN, Islam MZ (2016) Optimizing the number of trees in a decision forest to discover a subforest with high ensemble accuracy using a genetic algorithm. Knowl - Based Syst 110:86–97

    Google Scholar 

  • Adnan RM, Malik A, Kumar A, Parmar KS, Kisi O (2019) Pan evaporation modeling by three different neuro-fuzzy intelligent systems using climatic inputs. Arab J Geosci 12:606

    Google Scholar 

  • Alipour A, Yarahmadi J, Mahdavi M (2014) Comparative study of M5 model tree and artificial neural network in estimating reference evapotranspiration using MODIS products. J Climatol 2014:1-11

  • Allawi MF, El-Shafie A (2016) Utilizing RBF-NN and ANFIS methods for multi-lead ahead prediction model of evaporation from reservoir. Water Resour Manag 30(13):4773–4788

    Google Scholar 

  • Behrooz K, Salim H, Abderrazek S, Shun-Peng Z, Nguyen-Thoi T (2019) SVR-RSM: a hybrid heuristic method for modeling monthly pan evaporation. Environ Sci Pollut Res 26:35807–35826

    Google Scholar 

  • Birbal P, Azamathulla H, Leon L, Kumar V, Hosein J (2021) Predictive modelling of the stage–discharge relationship using gene-expression programming. Water Supply 21(7):3503–3514

    CAS  Google Scholar 

  • Breiman L (2001) Random forests. Mach Learn 45(1):5–32

    Google Scholar 

  • Chaplot B (2021) Prediction of rainfall time series using soft computing techniques. Environ Monit Assess 193(11):721

    PubMed  Google Scholar 

  • Chaudhary A, Kolhe S, Kamal R (2016a) An improved random forest classifier for multi-class classification. Inf Process Agric 3(4):215–222

    Google Scholar 

  • Chaudhary A, Kolhe S, Kamal R (2016b) A hybrid ensemble for classification in multiclass datasets: an application to oilseed disease dataset. Comput Electron Agric 124:65–72

    Google Scholar 

  • Chaudhary A, Kolhe S, Kamal R (2020) A particle swarm optimization-based ensemble for vegetable crop disease recognition. Comput Electron Agric 178:1–7

    Google Scholar 

  • Chelani A, Chalapati Rao CV, Phadke KM, Hasan MZ (2002) Prediction of sulphur dioxide concentration using artificial neural networks. Environ Model Soft 17:161–168

    Google Scholar 

  • Chen YY, Cheng Y, Cheng Q, Yu H, Li D (2017) Short-term prediction model for ammonia nitrogen in aquaculture pond water based on optimized LSSVM. Int Agric Eng J 26(3):416–427

    Google Scholar 

  • Deo RC, Ghorbani MA, Samadianfard S, Maraseni T, Bilgili M, Biazar M (2018) Multi-layer perceptron hybrid model integrated with the firefly optimizer algorithm for windspeed prediction of target site using a limited set of neighboring reference station data. Renew Energy 116:309–323

    Google Scholar 

  • Du KL, Swamy MN (2006) Neural networks in a soft computing framework. Springer Science and Business Media L, London, pp 566

  • Fan J, Wu L, Zhang F, Xiang Y, Zheng J (2016) Climate change effects on reference crop evapotranspiration across different climatic zones of China during 1956–2015. J Hydrol 542:923–937

    Google Scholar 

  • Feng Y, Jia Y, Zhang Q, Gong D, Cui N (2018) National-scale assessment of pan evaporation models across different climatic zones of China. J Hydrol 564:314–328

    Google Scholar 

  • Ghaemi A, Rezaie-Balf M, Adamowski J, Kisi O, Quilty J (2019) On the applicability of maximum overlap discrete wavelet transform integrated with MARS and M5 model tree for monthly pan evaporation prediction. Agric For Meteorol 278:107647

    Google Scholar 

  • Goldberg DE, Holland JH (1989) Genetic algorithms and machine learning. Mach Lear 3:95–99

  • Gundalia MJ, Dholakia MB (2013) Estimation of pan evaporation using mean air temperature and radiation for monsoon season in Junagadh region. Int J Eng Res Appl 3(6):64–70

    Google Scholar 

  • Haddadi F, Moazenzadeh R, Mohammadi B (2022) Estimation of actual evapotranspiration: A novel hybrid method based on remote sensing and artificial intelligence. J Hydro 609:127774

    Google Scholar 

  • Hassan MA, Khalil A, Kaseb S, Kassem MA (2017) Exploring the potential of tree-based ensemble methods in solar radiation modeling. Appl Energ 203:897–916

    ADS  Google Scholar 

  • Holland JH (1992) Adaptation in natural and artificial systems: an introductory analysis with applications to biology, control, and artificial intelligence. Ann Arbor 6:126–137

    Google Scholar 

  • Jayathilake T, Gunathilake MB, Wimalasiri EM, Rathnayake U (2023) Wetland water level prediction in the context of machine learning techniques: Where do we stand? Environments 10(5–75):1–17

    Google Scholar 

  • Keshtegar B, Piri J, Kisi O (2016) A nonlinear mathematical modeling of daily pan evaporation based on conjugate gradient method. Comput Electron Agric 127:120–130

    Google Scholar 

  • Kim S, Singh VP, Seo Y (2014) Evaluation of pan evaporation modeling with two different neural networks and weather station data. Theor Appl Climatol 117(1–2):1–13

    ADS  Google Scholar 

  • Kisi O (2009) Daily pan evaporation modelling using multi-layer perceptrons and radial basis neural networks. Hydrol Process 23(2):213–223

    ADS  Google Scholar 

  • Kisi O (2015) Pan evaporation modeling using least square support vector machine, multivariate adaptive regression splines and M5 model tree. J Hydrol 528:312–320

    Google Scholar 

  • Kisi O, Heddam S (2019) Evaporation modelling by heuristic regression approaches using only temperature data. Hydrol Sci J 64(6):653–672

    CAS  Google Scholar 

  • Kisi O, Genc O, Dinc S, Zounemat-Kermani M (2016) Daily pan evaporation modeling using chi-squared automatic interaction detector, neural networks, classification and regression tree. Comput Electron Agric 122:112–117

    Google Scholar 

  • Ließ M, Glaser B, Huwe B (2012) Uncertainty in the spatial prediction of soil texture: comparison of regression tree and Random Forest models. Geoder 170:70–79

    ADS  Google Scholar 

  • Lin GF, Lin HY, Wu MC (2013) Development of a support-vector‐machine‐based model for daily pan evaporation estimation. Hydrol Process 27(22):3115–3127

    Google Scholar 

  • Lu X, Ju Y, Wu L, Fan J, Zhang F, Li Z (2018) Daily pan evaporation modeling from local and cross-station data using three tree-based machine learning models. J Hydrol 566:668–684

    Google Scholar 

  • Majhi B, Naidu D, Mishra AP, Satapathy SC (2019) Improved prediction of daily pan evaporation using Deep-LSTM model. Neural Comput Appl 32:7823–7838

    Google Scholar 

  • Malik A, Kumar A, Kisi O (2017) Monthly pan-evaporation estimation in Indian central Himalayas using different heuristic approaches and climate-based models. Comput Electron Agric 143:302–313

    Google Scholar 

  • Mohammadi B (2023) Modeling various drought time scales via a merged artificial neural network with a firefly algorithm. Hydrology 10(3):58

    Google Scholar 

  • Nawar S, Mouazen AM (2017) Comparison between random forests, artificial neural networks and gradient boosted machines methods of on-line Vis-NIR spectroscopy measurements of soil total nitrogen and total carbon. Sens 17(10):2428

    ADS  Google Scholar 

  • Nourani V, Molajou A, Tajbakhsh AD, Najafi H (2019) A wavelet-based data mining technique for suspended sediment load modeling. Water Resour Manag 33(5):1769–1784

    Google Scholar 

  • Piri J, Amin S, Moghaddamnia A, Han D, Remesun D (2009) Daily pan evaporation modelling in a hot and dry climate. J Hydrol Eng 14:803–811

    Google Scholar 

  • Prasad AM, Iverson LR, Liaw A (2006) Newer classification and regression tree techniques: Bagging and Random Forests for ecological prediction. Ecosystems 9(2):181–199

  • Qasem SN, Samadianfard S, Kheshtgar S, Jarhan S, Kisi O, Shamshirband S, Chau KW (2019) Modeling monthly pan evaporation using wavelet support vector regression and wavelet artificial neural networks in arid and humid climates. Eng Appl Computat Fluid Mech 13(1):177–187

    Google Scholar 

  • Rahimikhoob A (2009) Estimating daily pan evaporation using artificial neural network in a semi-arid environment. Theor Appl Climatol 98(1):101–105

    ADS  Google Scholar 

  • Ravansalar M, Rajaee T, Kisi O (2017) Wavelet-linear genetic programming: a new approach for modeling monthly streamflow. J Hydrol 549:461–475

    Google Scholar 

  • Rodriguez-Galiano V, Ghimire B, Rogan J, Chica-Olmo M, Rigol-Sanchez J (2012) An assessment of the effectiveness of a random forest classifier for land-cover classification. ISPRS J Photogramm Remote Sens 67:93–104

    ADS  Google Scholar 

  • Samadianfard S, Jarahan S, Sadri HN (2018) Application of support vector regression integrated with firefly optimization algorithm for predicting global solar radiation. J Energy Syst 2(4):180–189

  • Samadianfard S, Majnooni-Heris A, Qasem SN, Kisi O, Shamshirband S, Chau KW (2019a) Daily global solar radiation modeling using data-driven techniques and empirical equations in a semi-arid climate. Eng Appl Comput Fluid Mech 13(1):142–157

    Google Scholar 

  • Samadianfard S, Jarhan S, Salwana E, Mosavi A, Shamshirband S, Akib S (2019b) Support Vector regression integrated with fruit fly optimization algorithm for river flow forecasting in Lake Urmia basin. Water 11:19–34

    Google Scholar 

  • Samadianfard S, Hashemi S, Kargar K, Izadyar M, Mostafaeipour A, Mosavi A, Shamshirband S (2020) Wind speed prediction using a hybrid model of the multi-layer perceptron and whale optimization algorithm. Ener Rep 6:1147–1159

    Google Scholar 

  • Samadianfard S, Kargar K, Shadkani S, Abbaspour A, SadeghSafar MJ (2021) Hybrid models for suspended sediment prediction: optimized random forest and multi-layer perceptron through genetic algorithm and stochastic gradient descent methods. Neural Comput Appli 34:3033–3051

    Google Scholar 

  • Schwefel HP (1993). Evolution and Optimum Seeking: The Sixth Generation. John Wiley & Sons Ltd., Hoboken

  • Sebbar A, Heddam S, Djemili L (2019) Predicting daily pan evaporation (Epan) from dam reservoirs in the mediterranean regions of Algeria: OPELM vs OSELM. Environ Processes 6(1):309–319

    Google Scholar 

  • Taylor KE (2001) Summarizing multiple aspects of model performance in a single diagram. J Geophys Res Atmos 106(D7):7183–7192

  • Wang L, Niu Z, Kisi O, Li Ca, Yu D (2017) Pan evaporation modeling using four different heuristic approaches. Comput Electron Agr 140:203–213

    Google Scholar 

  • Wu L, Huang G, Fan J, Ma X, Zhou H, Zeng W (2020) Hybrid extreme learning machine with meta-heuristic algorithms for monthly pan evaporation prediction. Comput Electron Agr 168:105115

    Google Scholar 

  • Xu L, Liu S, Li D (2017) Prediction of water temperature in prawn cultures based on a mechanism model optimized by an improved artificial bee colony. Comput Electron Agric 140:397–408

    Google Scholar 

  • Yang H, Hassan SG, Wang L, Li D (2017) Fault diagnosis method for water quality monitoring and control equipment in aquaculture based on multiple SVM combined with DS evidence theory. Comput Electron Agric 141:96–108

    Google Scholar 

  • Yaseen ZM, Al-Juboori AM, Beyaztas U, Al-Ansari N, Chau KW, Qi C, Shahid S (2020) Prediction of evaporation in arid and semi-arid regions: a comparative study using different machine learning models. Eng Appl Comput Fluid Mech 14(1):70–89

    Google Scholar 

  • Yu H, Chen Y, Hassan SG, Li D (2016) Prediction of the temperature in a Chinese solar greenhouse based on LSSVM optimized by improved PSO. Comput Electron Agric 122:94–102

    Google Scholar 

  • Zhou Z-H, Wu J, Tang W (2002) Ensembling neural networks: many could be better than all. Artif Intell 137:239–263

    MathSciNet  Google Scholar 

  • Zhu S, Bonacci O, Oskoruš D, Hadzima-Nyarko H, Wu S (2009) Long term variations of river temperature and the influence of air temperature and river discharge: case study of Kupa River watershed in Croatia. J Hydrol Hydromech 67(4):305–313

    Google Scholar 

  • Zhu S, Hadzima-Nyarko M, Gao A, Wang F, Wu J, Wu S (2019a) Two hybrid data-driven models for modeling water-air temperature relationship in rivers. Environ Sci Pollut Res 26:12622–12630

    Google Scholar 

  • Zhu S, Nyarko EK, Hadzima-Nyarko M, Heddam S, Wu S (2019b) Assessing the performance of a suite of machine learning models for daily river water temperature prediction. PeerJ 7:e7065

    PubMed  PubMed Central  Google Scholar 

  • Zounemat-Kermani M, Seo Y, Kim S, Ghorbani MA, Samadianfard S, Naghshara S, Kim NW, Singh VP (2019) Can decomposition approaches always enhance soft computing models? Predicting the dissolved oxygen concentration in the St. Johns River, Florida. Appl Sci 9:25–34

    Google Scholar 

Download references

Funding

Not Applicable.

Author information

Authors and Affiliations

Authors

Contributions

-Conceptualization: Sadra Shadkani, Sajjad Hashemi-Data curation: Sadra Shadkani, Sajjad Hashemi-Formal analysis: Sadra Shadkani, Sajjad Hashemi-Investigation: Sadra Shadkani, Sajjad Hashemi, Amirreza Pak-Methodology: Sadra Shadkani, Sajjad Hashemi-Resources: Sadra Shadkani, Amirreza Pak-Software: Sadra Shadkani-Supervision: Sadra Shadkani-Validation: Sadra Shadkani, Sajjad Hashemi-Visualization: Amirreza Pak, Sajjad Hashemi-Writing - original draft: Sadra Shadkani, Sajjad Hashemi, Amirreza Pak, -Writing - review & editing: Sadra Shadkani, Sajjad Hashemi, Amirreza Pak, Alireza Barzgari Lahijan

Corresponding author

Correspondence to Sadra Shadkani.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Communicated by: H. Babaie

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Shadkani, S., Hashemi, S., Pak, A. et al. Random Forest and Multilayer Perceptron hybrid models integrated with the genetic algorithm for predicting pan evaporation of target site using a limited set of neighboring reference station data. Earth Sci Inform 17, 1261–1280 (2024). https://doi.org/10.1007/s12145-024-01237-2

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s12145-024-01237-2

Keywords

Navigation