Abstract
Accurate prediction of the water level will help prevent overexploiting groundwater and help control water resources. On the other hand, water level predicting is a highly dynamic and non-linear process dependent on complex factors. Therefore, developing models to predict water levels to optimize water resources management in the reservoir is essential. Thus, this work recommends various supervised machine learning algorithms for predicting water levels with groundwater level correlation. The predicting models have Linear Regression (LR), Support Vector Machines (SVM), Gaussian Processes Regression (GPR), and Neural Network (NN). This study includes four scenarios; The first scenario (SC1) uses lag 1; second scenario (SC2) uses lag 1 and lag 2; third scenario (SC3) uses lag 1, lag 2, and lag 11 and the fourth scenario (SC4) uses lag 1, lag 2, lag 11 and lag 12. These scenarios have been determined using the autocorrelation function (ACF), and these lags represent the month. The results showed that for SC1, SC2, and SC4, all model performance in GPR gave good results where the highest R equal to 0.71 in SC1, 0.78 in SC2, and 0.73 in SC4 using the Matern 5/2 GPR model. For SC3, the Stepwise LR model gave a better result with an R of 0.79. It can be concluded that Matern 5/2 of Gaussian Processes Regression Models is a reliable model to predict water level as the method gave a high performance in each scenario (except SC3) with a relatively fastest training time. The NN model had the worst performance to the other three models since it has the highest MAE values, RMSE, and lowest value of R in almost all four scenarios of input combinations. These results obtained in this study serves as an excellent benchmark for future water level prediction using the GPR and LR with four scenarios created.
Similar content being viewed by others
References
Aakash P, Kinjal Mistree MS (2017) Machine learning techniques for rainfall prediction: a review. Int Conf Innov Inf Embed Commun Syst Vol 3
Abbaspour KC, Yang J, Maximov I, Siber R, Bogner K, Mieleitner J, Zobrist J, Srinivasan R (2007) Modelling hydrology and water quality in the pre-alpine/alpine thur watershed using SWAT. J Hydrol 333(2–4):413–430. https://doi.org/10.1016/j.jhydrol.2006.09.014
Adhikary SK, Muttil N, Yilmaz AG (2018) Improving streamflow forecast using optimal rain gauge network-based input to artificial neural network models. Hydrol Res 49:1559–1577. https://doi.org/10.2166/nh.2017.108
Ahmed AN, Othman FB, Afan HA, Ibrahim RK, Fai CM, Hossain MS, Ehteram M, El-Shafie A (2019) Machine learning methods for better water quality prediction. J Hydrol 578:124084. https://doi.org/10.1016/j.jhydrol.2019.124084
Ali MZ, Shabbir MNSK, Liang X, Zhang Y, Hu T (2019) Machine learning-based fault diagnosis for single- and multi-faults in induction motors using measured stator currents and vibration signals. IEEE Trans Ind Appl 55(3):2378–2391. https://doi.org/10.1109/TIA.2019.2895797
Berg A, Findell K, Lintner B, Giannini A, Seneviratne SI, Van Den Hurk B, Lorenz R, Pitman A, Hagemann S, Meier A, Cheruy F, Ducharne A, Malyshev S, Milly PCD (2016) Land-atmosphere feedbacks amplify aridity increase over land under global warming. Nat Clim Chang 6(9):869–874. https://doi.org/10.1038/nclimate3029
Berne BJ, Boon JP, Rice SA (1966) On the calculation of autocorrelation functions of dynamical variables. J Chem Phys 45(4):1086–1096. https://doi.org/10.1063/1.1727719
Browne MW (1975) Predictive validity of a linear regression equation. Br J Math Stat Psychol 28(1):79–87. https://doi.org/10.1111/j.2044-8317.1975.tb00550.x
Canchala T, Alfonso-Morales W, Carvajal-Escobar Y, Cerón WL, Caicedo-Bravo E (2020) Monthly rainfall anomalies forecasting for southwestern colombia using artificial neural networks approaches. Water (Switzerland) 12(9):2628. https://doi.org/10.3390/w12092628
Caral C, Ece K, Arslan B, Akbulut A (2019) Benchmarking of regression algorithms and time series analysis techniques for sales forecasting. Balk J Electr Comput Eng 7(1):20–26. https://doi.org/10.17694/bajece.494920
ÇELİK R (2018) Impact of dams on groundwater static water level changes: a case study kralkızı and dicle dam watershed. Uluslararası Muhendis. Arastirma ve Gelistirme Derg 10(2):119–126. https://doi.org/10.29137/umagd.442483
Chang FJ, Chang YT (2006) Adaptive neuro-fuzzy inference system for prediction of water level in reservoir. Adv Water Resour 29(1):1–10. https://doi.org/10.1016/j.advwatres.2005.04.015
Chau KW, Wu CL, Li YS (2005) Comparison of several flood forecasting models in Yangtze River. J Hydrol Eng 10(6):485–491. https://doi.org/10.1061/(asce)1084-0699(2005)10:6(485)
Cheng CT, Feng ZK, Niu WJ, Liao SL (2015) Heuristic methods for reservoir monthly inflow forecasting: a case study of xinfengjiang reservoir in Pearl River, China. Water (Switzerland) 7(8):4477–4495. https://doi.org/10.3390/w7084477
Daliakopoulos IN, Coulibaly P, Tsanis IK (2005) Groundwater level forecasting using artificial neural networks. J Hydrol 309(1–4):229–240. https://doi.org/10.1016/j.jhydrol.2004.12.001
Damian DC (2019) A critical review on artificial intelligence models in hydrological forecasting how reliable are artificial intelligence Models. Int J Eng Tech Res 8:365–378.
Famiglietti JS (2014) The global groundwater crisis. Nat Clim Change 4(11):945–948. https://doi.org/10.1038/nclimate2425
Gao W, Karbasi M, Hasanipanah M, Zhang X, Guo J (2018) Developing GPR model for forecasting the rock fragmentation in surface mines. Eng Comput 34(2):339–345. https://doi.org/10.1007/s00366-017-0544-8
Gong Y, Zhang Y, Lan S, Wang H (2016) A comparative study of artificial neural networks, support vector machines and adaptive neuro fuzzy inference system for forecasting groundwater levels near Lake Okeechobee, Florida. Water Resour Manag 30(1):375–391. https://doi.org/10.1007/s11269-015-1167-8
Hampel FR, Ronchetti EM, Rousseeuw PJ, Stahel WA (2011) Robust statistics: the approach based on influence functions. John and Wiley Sons, Vol 196
Hung NQ, Babel MS, Weesakul S, Tripathi NK (2009) An artificial neural network model for rainfall forecasting in Bangkok, Thailand. Hydrol Earth Syst Sci 13(8):1413–1425. https://doi.org/10.5194/hess-13-1413-2009
Hyndman RJ, Koehler AB (2006) Another look at measures of forecast accuracy. Int J Forecast 22(4):679–688. https://doi.org/10.1016/j.ijforecast.2006.03.001
Khai WJ, Alraih M, Ahmed AN, Fai CM, El-Shafie A (2019) Daily forecasting of dam water levels using machine learning. Int J Civ Eng Technol 10(6):314–323
Karami H, Mousavi SF, Farzin S, Ehteram M, Singh VP, Kisi O (2018) Improved krill algorithm for reservoir operation. Water Resour Manag 32(10):3353–3372. https://doi.org/10.1007/s11269-018-1995-4
Kaya YZ, Üneş F, Demirci M (2018) Groundwater level prediction using artificial neural network and m5 tree models. Air and Water Components Environ Conf Proc 195–201. https://doi.org/10.24193/awc2018_23
Lee J, Kim CG, Lee JE, Kim NW, Kim H (2018) Application of artificial neural networks to rainfall forecasting in the Geum River Basin, Korea. Water (Switzerland) 10(10):1448. https://doi.org/10.3390/w10101448
Maimunah, Yeni M, Kumala D (2019) The influence of water level fluctuation reservoir stability of the earth dam. IOP Conference Series: Mater Sci Eng 506:012032. https://doi.org/10.1088/1757-899X/506/1/012032
Meyer D, Leisch F, Hornik K (2003) The support vector machine under test. Neurocomputing 55(1–2):169–186. https://doi.org/10.1016/S0925-2312(03)00431-4
Nagelkerke NJD (1991) A Note on a General Definition of the Coefficient of Determination. Biometrika 78(3):691–692
Neal RM (2012) Bayesian learning for neural networks, vol 118. Springer Science and Business Media
Noble WS (2006) What is a support vector machine? Nat Biotechnol 24(12):1565–1567. https://doi.org/10.1038/nbt1206-1565
Noori R, Hoshyaripour G, Ashrafi K, Araabi BN (2010) Uncertainty analysis of developed ANN and ANFIS models in prediction of carbon monoxide daily concentration. Atmos Environ 44(4):476–482. https://doi.org/10.1016/j.atmosenv.2009.11.005
Noori R, Yeh HD, Abbasi M, Kachoosangi FT, Moazami S (2015) Uncertainty analysis of support vector machine for online prediction of five-day biochemical oxygen demand. J Hydrol 527:833–843. https://doi.org/10.1016/j.jhydrol.2015.05.046
Phanindra BR, Pralhad RN, Raj AAB (2020) Machine learning based classification of ducted and non-ducted propeller type quadcopter. Int Conf Adv Comput Commun Syst 1296–1301. https://doi.org/10.1109/ICACCS48705.2020.9074307
Rasmussen CE (1997) Evaluation of gaussian processes and other methods for non-linear regression. Dissertation, University of Toronto, Canada
Reza M, Harun S, Askari M (2018) Streamflow forecasting in Bukit Merah watershed by using ARIMA and ANN. Portal J Tek Sipil 9(1). https://doi.org/10.30811/portal.v9i1.612
Ronchetti E, Field C, Blanchard W (1997) Robust linear model selection by cross-validation. J Am Stat Assoc 92(439):1017–1023. https://doi.org/10.1080/01621459.1997.10474057
Rumelhart DE, Widrow B, Lehr MA (1994) The basic ideas in neural networks. Commun ACM 37(3):87–93
Sapitang M, Ridwan WM, Kushiar KF, Ahmed AN, El-Shafie A (2020) Machine learning application in reservoir water level forecasting for sustainable hydropower generation strategy. Sustain 12 (15):6121. https://doi.org/10.3390/su12156121
Siegelmann HT, Sontag ED (1995) On the computational power of neural nets. J Comput Syst Sci 50(1):132–150. https://doi.org/10.1006/jcss.1995.1013
Tan LK, McLaughlin RA, Lim E, Abdul Aziz YF, Liew YM (2017) Fully automated segmentation of the left ventricle in cine cardiac MRI using neural network regression. J Magn Reson Imaging 48(1):140–152. https://doi.org/10.1002/jmri.25932
Taylor KE (2001) Summarizing multiple aspects of model performance in a single diagram. J Geophys Res 106(D7):7183–7192. https://doi.org/10.1029/2000JD900719
Trenberth KE (2011) Changes in precipitation with climate change. Clim Res 47(1–2):123–138. https://doi.org/10.3354/cr00953
Tu JV (1996) Advantages and disadvantages of using artificial neural networks versus logistic regression for predicting medical outcomes. J Clin Epidemiol 49(11):1225–1231. https://doi.org/10.1016/S0895-4356(96)00002-9
Wang W, Van Gelder PHAJM, Vrijling JK, Ma J (2006) Forecasting daily streamflow using hybrid ANN models. J Hydrol 324(1–4):383–399. https://doi.org/10.1016/j.jhydrol.2005.09.032
Wilson AG, Knowles DA, Ghahramani Z (2011) Gaussian Process Regression Networks. In: Proceedings of the 29th International Conference on Machine Learning, ICML 2012, pp 599–606
Wu WY, Lo MH, Wada Y, Famiglietti JS, Reager JT, Yeh PJF, Ducharne A, Yang ZL (2020) Divergent effects of climate change on future groundwater availability in key mid-latitude aquifers. Nat Commun 11(1):1–9.https://doi.org/10.1038/s41467-020-17581-y
Zhang N, Xiong J, Zhong J, Leatham K (2018) Gaussian process regression method for classification for high-dimensional data with limited samples. Int Conf Inf Sci Technol 358–363
Zieba A, Ramza P (2011) Standard deviation of the mean of autocorrelated observations estimated with the use of the autocorrelation function estimated from the data. Metrol Meas Syst 18(4):529–542. https://doi.org/10.2478/v10178-011-0052-x
Zhou N, Pierre JW, Trudnowski D (2012) A stepwise regression method for estimating dominant electromechanical modes. IEEE Trans Power Syst 27(2):1051–1059. https://doi.org/10.1109/TPWRS.2011.2172004
Acknowledgements
This research was supported by the Ministry of Education (MOE) through Fundamental Research Grant Scheme (FRGS/1/2020/TK0/UNITEN/02/16).
Author information
Authors and Affiliations
Contributions
Data curation, Michelle Sapitang and Wanie M. Ridwan Formal analysis, Michelle Sapitang and Wanie M. Ridwan; Methodology, Ali Najah Ahmed, Chow Ming Fai, and Ahmed El-Shafie; Writing – original draft, Michelle Sapitang; Writing – review & editing, Michelle Sapitang and Ali Najah Ahmed.
Corresponding author
Ethics declarations
Conflicts of interest
The authors declare no conflict of interest.
Additional information
Communicated by: H. Babaie
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Sapitang, M., Ridwan, W.M., Ahmed, A.N. et al. Groundwater level as an input to monthly predicting of water level using various machine learning algorithms. Earth Sci Inform 14, 1269–1283 (2021). https://doi.org/10.1007/s12145-021-00654-x
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12145-021-00654-x