Skip to main content

Advertisement

Log in

Proposing a Hybrid Genetic Algorithm based Parsimonious Random Forest Regression (H-GAPRFR) technique for solar irradiance forecasting with feature selection and parameter optimization

  • Research Article
  • Published:
Earth Science Informatics Aims and scope Submit manuscript

Abstract

Machine learning has sparked a wide set of solar prediction experiments due to its recent success, which is one of the most common solutions for solar irradiance forecasting problems specifically. However, while using machine learning regression algorithms, additional attention must be paid to feature selection as well as effective parameter optimization. As a result, this work provides a parsimonious based genetic algorithm that incorporates feature selection integrated with random forest regression. The impact of wind speed on the solar irradiance problem has also been investigated. The performance of the proposed H-GAPRFR algorithm is tested for the location Madurai, India, with eight meteorological data variables over a year and has been validated through statistical metrics such as RMSE, MAE, and coefficient of Determination. In comparison to conventional Support Vector Regression (SVR) and Random Forest (RF) regression techniques, the suggested H-GAPRFR model reduced RMSE by 64.18% and 7.43%, respectively. Second, the suggested H-GAPRFR algorithm was used to investigate the effect of wind speed. From the analysis, it is clear that the proposed model considering wind speed improves the prediction accuracy by further reducing RMSE and MAE by 10.2% and 6.5%, respectively. As a result, it is indicated that in the problem of solar irradiance estimation, the parsimonious model with feature selection can produce improved prediction accuracy.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16

Similar content being viewed by others

Abbreviations

ANN:

Artificial Neural Network

ARMA:

Autoregressive Moving Average model

CV:

Cross Validation

GAP-RFR:

GA Parsimonious Random Forest

GAP-SVR:

GA Parsimonious Support Vector Regression

GHI:

Global Horizontal Irradiation

GIS:

Geographic Information Systems

GW:

Gigawatts

MAE:

Mean Absolute Error

MAPE:

Mean Absolute Percentage Error

PV:

Photovoltaic

R2:

Coefficient of Determination

RBF:

Radial Basis Function

RF:

Random Forest

RMSE:

Root Mean Square Error

SVM:

Support Vector Machine

SVR:

Support Vector Regression

TWh:

Terawatt-hours

References

  • Altan A, Karasu S, Zio E (2021) A new hybrid model for wind speed forecasting combining long short-term memory neural network, decomposition methods and grey wolf optimizer. Appl Soft Comput 100:106996. https://doi.org/10.1016/j.asoc.2020.106996

  • Assouline D, Mohajeri N, Scartezzini J-L (2018) Large-scale rooftop solar photovoltaic technical potential estimation using Random Forests. Appl Energy 217:189–211. https://doi.org/10.1016/j.apenergy.2018.02.118

  • Ayodele TR, Ogunjuyigbe ASO, Amedu A, Munda JL (2019) Prediction of global solar irradiation using hybridized k-means and support vector regression algorithms. Renew Energy Focus 29:78–93. https://doi.org/10.1016/j.ref.2019.03.003

  • Babar B, Luppino LT, Boström T, Anfinsen SN (2020) Random forest regression for improved mapping of solar irradiance at high latitudes. Sol Energy 198:81–92. https://doi.org/10.1016/j.solener.2020.01.034

  • Bas E, Egrioglu E, Kolemen E (2022) Training simple recurrent deep artificial neural network for forecasting using particle swarm optimization. Granul Comput 7(2):411–420. https://doi.org/10.1007/s41066-021-00274-2

  • Breiman L (2001) Random Forests. Machine Learning 45:5–32. http://dx.doi.org/10.1023/A:1010933404324

  • Carneiro TC, Rocha PAC, Carvalho PCM, Fernández-Ramírez LM (2022) Ridge regression ensemble of machine learning models applied to solar and wind forecasting in Brazil and Spain. Appl Energy 314:118936. https://doi.org/10.1016/j.apenergy.2022.118936

  • Choubin B, Moradi E, Golshan M, Adamowski J, Sajedi-Hosseini F, Mosavi A (2019) An ensemble prediction of flood susceptibility using multivariate discriminant analysis, classification and regression trees, and support vector machines. Sci Total Environ 651:2087–2096. https://doi.org/10.1016/j.scitotenv.2018.10.064

  • Fjelkestam Frederiksen CA, Cai Z (2022) Novel machine learning approach for solar photovoltaic energy output forecast using extra-terrestrial solar irradiance. Appl Energy 306:118152. https://doi.org/10.1016/j.apenergy.2021.118152

  • Fouilloy A et al (2018) Solar irradiation prediction with machine learning: Forecasting models selection method depending on weather variability. Energy 165:620–629. https://doi.org/10.1016/j.energy.2018.09.116

  • Ghimire S, Bhandari B, Casillas-Pérez D, Deo RC, Salcedo-Sanz S (2022) Hybrid deep CNN-SVR algorithm for solar radiation prediction problems in Queensland, Australia. Eng Appl Artif Intell 112:104860. https://doi.org/10.1016/j.engappai.2022.104860

  • Ghimire S, Nguyen-Huy T, Deo RC, Casillas-Pérez D, Salcedo-Sanz S (2022) Efficient daily solar radiation prediction with deep learning 4-phase convolutional neural network, dual stage stacked regression and support vector machine CNN-REGST hybrid model. Sustain Mater Technol 32:e00429. https://doi.org/10.1016/j.susmat.2022.e00429

  • Jaihuni M et al (2022) A novel recurrent neural network approach in forecasting short term solar irradiance. ISA Trans 121(63–74). https://doi.org/10.1016/j.isatra.2021.03.043

  • Karasu S, Altan A (2019) 11th International Conference on Electrical and Electronics Engineering (ELECO), pp 8–11. https://doi.org/10.23919/ELECO47770.2019.8990664

  • Karasu S, Altan A, Bekiros S, Ahmad W (2020) A new forecasting model with wrapper-based feature selection approach using multi-objective optimization technique for chaotic crude oil time series. Energy 212:118750. https://doi.org/10.1016/j.energy.2020.118750

  • Karasu S, Altan A (2022) Crude oil time series prediction model based on LSTM network with chaotic Henry gas solubil?ity optimization. Energy 242:122964. https://doi.org/10.1016/j.energy.2021.122964

  • Khan W, Walker S, Zeiler W (2022) Improved solar photovoltaic energy generation forecast using deep learning-based ensemble stacking approach. Energy 240:122812. https://doi.org/10.1016/j.energy.2021.122812

  • Li Z, Xie Y, Li X, Zhao W (2021) Prediction and application of porosity based on support vector regression model optimized by adaptive dragonfly algorithm. Energy Sources Part Recovery Util Environ Eff 43(9):1073–1086. https://doi.org/10.1080/15567036.2019.1634775

  • Li G, Li M, Taylor R, Hao Y, Besagni G, Markides CN (2022) Solar energy utilisation: Current status and roll-out potential. Appl Therm Eng 209:118285. https://doi.org/10.1016/j.applthermaleng.2022.118285

  • Liu L, Zhan M, Bai Y (2019) A recursive ensemble model for forecasting the power output of photovoltaic systems. Sol Energy 189:291–298. https://doi.org/10.1016/j.solener.2019.07.061

  • Mahdi Miraftabzadeh S, Longo M, Foiadelli F (2020) A-day-ahead photovoltaic power prediction based on long short term memory algorithm. In: International Conference on Smart Energy Systems and Technologies (SEST), Sep. 2020, pp 1–6. https://doi.org/10.1109/SEST48500.2020.9203481

  • Markovics D, Mayer MJ (2022) Comparison of machine learning methods for photovoltaic power forecasting based on numerical weather prediction. Renew Sustain Energy Rev 161:112364. https://doi.org/10.1016/j.rser.2022.112364

  • Martinez-de-Pison FJ, Gonzalez-Sendino R, Aldama A, Ferreiro-Cabello J, Fraile-Garcia E (2019) Hybrid methodology based on Bayesian optimization and GA-PARSIMONY to search for par?simony models by combining hyperparameter optimization and feature selection. Neurocomputing 354:20–26. https://doi.org/10.1016/j.neucom.2018.05.136

  • Martinez-de-Pison FJ, Ferreiro J, Fraile E, Pernia-Espinoza A (2021) A comparative study of six model complexity metrics to search for parsimonious models with GAparsimony R Package. Neuro?computing 452:317–332. https://doi.org/10.1016/j.neucom.2020.02.135

  • Mathew A, Sreekumar S, Khandelwal S, Kumar R (2019) Prediction of land surface temperatures for surface urban heat island assessment over Chandigarh city using support vector regres?sion model. Sol Energy 186:404–415. https://doi.org/10.1016/j.solener.2019.04.001

  • Mas’ud AA (2022) Comparison of three machine learning models for the prediction of hourly PV output power in Saudi Arabia. Ain Shams Eng J 13(4):101648. https://doi.org/10.1016/j.asej.2021.11.017

  • Mertens S (2022) Design of wind and solar energy supply, to match energy demand. Clean Eng Technol 6:100402. https://doi.org/10.1016/j.clet.2022.100402

  • Mghouchi YE (2022) On the prediction of daily global solar radiation using temperature as input. An application of hybrid machine learners to the six climatic Moroccan zones. Energy Convers Manag X 13:100157. https://doi.org/10.1016/j.ecmx.2021.100157

  • Nespoli A, Niccolai A, Ogliari E, Perego G, Collino E, Ronzio D (2022) Machine Learning techniques for solar irradiation nowcasting: Cloud type classification forecast through satellite data and imagery. Appl Energy 305:117834. https://doi.org/10.1016/j.apenergy.2021.117834

  • Prasad R, Ali M, Kwan P, Khan H (2019) Designing a multi-stage multivariate empirical mode decomposition coupled with ant colony optimization and random forest model to forecast monthly solar radiation. Appl Energy 236:778–792. https://doi.org/10.1016/j.apenergy.2018.12.034

  • Qu Y, Xu J, Sun Y, Liu D (2021) A temporal distributed hybrid deep learning model for day-ahead distributed PV power forecasting. Appl Energy 304:117704. https://doi.org/10.1016/j.apenergy.2021.117704

  • Ramirez-Vergara J, Bosman LB, Wollega E, Leon-Salas WD (2022) Review of forecasting methods to support photovoltaic predictive maintenance. Clean Eng Technol 8:100460. https://doi.org/10.1016/j.clet.2022.100460

  • Sanz-Garcia A, Fernandez-Ceniceros J, Antonanzas-Torres F, Pernia-Espinoza AV, Martinez-de-Pison FJ (2015) GA-PARSIMONY: A GA-SVR approach with feature selection and parameter optimization to obtain parsimonious solutions for predicting temperature settings in a continuous annealing furnace. Appl Soft Comput 35:13–28. https://doi.org/10.1016/j.asoc.2015.06.012

  • Ser JD et al (2022) Randomization-based machine learning in renewable energy prediction problems: Critical literature review, new results and perspectives. Appl Soft Comput 118:108526. https://doi.org/10.1016/j.asoc.2022.108526

  • Visser L, AlSkaif T, van Sark W (2022) Operational day-ahead solar power forecasting for aggregated PV systems with a varying spatial distribution. Renew Energy 183:267–282. https://doi.org/10.1016/j.renene.2021.10.102

  • Wang F, Xuan Z, Zhen Z, Li K, Wang T, Shi M (2020) A day-ahead PV power forecasting method based on LSTM-RNN model and time correlation modification under partial daily pattern prediction framework. Energy Convers Manag 212:112766. https://doi.org/10.1016/j.enconman.2020.112766

  • Wolff B, Kühnert J, Lorenz E, Kramer O, Heinemann D (2016) Comparing support vector regression for PV power forecasting to a physical modeling approach using measurement, numerical weather prediction, and cloud motion data. Sol Energy 135:197–208. https://doi.org/10.1016/j.solener.2016.05.051

  • Yagli GM, Yang D, Srinivasan D (2019) Automatic hourly solar forecasting using machine learning models. Renew Sustain Energy Rev 105:487–498. https://doi.org/10.1016/j.rser.2019.02.006

  • Yasmeen R, Yao X, Ul Haq Padda I, Shah WUH, Jie W (2022) Exploring the role of solar energy and foreign direct investment for clean environment: Evidence from top 10 solar energy con?suming countries. Renew Energy 185:147–158. https://doi.org/10.1016/j.renene.2021.12.048

  • Zhao W et al (2021) A point prediction method based automatic machine learning for day-ahead power output of multi-region photovoltaic plants. Energy 223:120026. https://doi.org/10.1016/j.energy.2021.120026

  • Zhu T, Li Y, Li Z, Guo Y, Ni C (2022) Inter-hour forecast of solar radiation based on long short-term memory with attention mechanism and genetic algorithm. Energies 15(3):1062. https://doi.org/10.3390/en15031062

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Josalin Jemima J..

Ethics declarations

Competing interests

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Additional information

Communicated by H. Babaie.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary Material 1

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

J., J., D., N., S., C. et al. Proposing a Hybrid Genetic Algorithm based Parsimonious Random Forest Regression (H-GAPRFR) technique for solar irradiance forecasting with feature selection and parameter optimization. Earth Sci Inform 15, 1925–1942 (2022). https://doi.org/10.1007/s12145-022-00839-y

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s12145-022-00839-y

Keywords

Navigation