Abstract
Effective short-term load forecasting (STLF) is essential for optimizing electricity grid operations. This study focuses on refining STLF for day-ahead predictions using Bayesian multiple linear regression (BMLR). This study’s originality lies in its innovative use of BMLR combined with data clustering techniques to improve prediction accuracy, a method not previously explored in existing literature. We address the critical issue of input data clustering, highlighting its impact on prediction accuracy. Four clustering methods based on temporality were examined, with clustering by weekday and hour proving most effective for BMLR-based STLF. Predictors included historical load, temperature, season, weekday, and hour, selected using the Akaike information criterion (AIC). Linear regression assumptions were verified, and solutions were proposed for deviations, notably addressing heteroscedasticity. Autocorrelation in residuals was addressed to improve forecasting efficiency. Time-cross validation and performance metrics demonstrated model effectiveness. Second-degree polynomial terms are included for better fitting. Clustering by weekday and hour is optimal for BMLR-based STLF, aiding accurate load forecasts. The main objectives of this research are to determine the optimal clustering method for BMLR in STLF and to provide practical insights into the application of Bayesian techniques in load forecasting. This research significantly contributes to the field of STLF by providing practical insights into data clustering and model refinement, offering valuable perspectives for enhanced energy management and grid stability.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Data availability
Data will be available upon the request to the corresponding author.
References
Hong T, Fan S (2016) Probabilistic electric load forecasting: A tutorial review. Int J Forecast 32(3). https://doi.org/10.1016/j.ijforecast.2015.11.011
Fallah SN, Ganjkhani M, Shamshirband S, wing Chau K (2019) Computational intelligence on short-term load forecasting: A methodological overview. https://doi.org/10.3390/en12030393
Do LPC, Lin KH, Molnár P (2016) Electricity consumption modelling: A case of Germany. Econ Model 55. https://doi.org/10.1016/j.econmod.2016.02.010
Tziolis G, Lopez-Lorente J, Baka MI, Koumis A, Livera A, Theocharides S, Makrides G, Georghiou GE (2024) Direct short-term net load forecasting in renewable integrated microgrids using machine learning: A comparative assessment. Sust Energ Grids Netw 37(101):256
Ribeiro GT, Mariani VC, dos Santos Coelho L (2019) Enhanced ensemble structures using wavelet neural networks applied to short-term load forecasting. Eng Appl Artif Intell 82:272–281
Wazirali R, Yaghoubi E, Abujazar MSS, Ahmad R, Vakili AH (2023) State-of-the-art review on energy and load forecasting in microgrids using artificial neural networks, machine learning, and deep learning techniques. Electr Power Syst Res 225(109):792
Zhang L, Jánošík D (2024) Enhanced short-term load forecasting with hybrid machine learning models: Catboost and xgboost approaches. Expert Syst Appl 241(122):686
Xiao L, Bai Q, Wang B (2024) A dynamic multi-model transfer based short-term load forecasting. Appl Soft Comput 159:111627 (2024)
Jahani A, Zare K, Khanli LM (2023) Short-term load forecasting for microgrid energy management system using hybrid spm-lstm. Sust Cities Soc 98(104):775
Junior MY, Freire RZ, Seman LO, Stefenon SF, Mariani VC, dos Santos Coelho L (2024) Optimized hybrid ensemble learning approaches applied to very short-term load forecasting. Int J Electr Power Energy Syst 155(109):579
Ribeiro MHDM, da Silva RG, Ribeiro GT, Mariani VC, dos Santos Coelho L (2023) Cooperative ensemble learning model improves electric short-term load forecasting. Chaos, Solitons Fractals 166(112):982
Amral N, Özveren CS, King D (2007) in Proceedings of the Universities Power Engineering Conference. https://doi.org/10.1109/UPEC.2007.4469121
Wang Y, Zhang N, Tan Y, Hong T, Kirschen DS, Kang C (2019) Combining Probabilistic Load Forecasts. IEEE Trans Smart Grid 10(4). https://doi.org/10.1109/TSG.2018.2833869
Baur L, Ditschuneit K, Schambach M, Kaymakci C, Wollmann T, Sauer A (2024) Explainability and interpretability in electric load forecasting using machine learning techniques–a review. Energy and AI p 100358
Acquah MA, Jin Y, Oh BC, Son YG, Kim SY (2023) Spatiotemporal sequence-to-sequence clustering for electric load forecasting. IEEE Access 11:5850–5863. https://doi.org/10.1109/ACCESS.2023.3235724
Wang Q, Chen Z, Wu C (2021) in 2021 IEEE Sustainable Power and Energy Conference (iSPEC) (IEEE, 2021), pp 2417–2424
Nightingale JS, Wang Y, Zobiri F, Mustafa MA (2022) in 2022 IEEE PES Innovative Smart Grid Technologies Conference Europe (ISGT-Europe) (IEEE, 2022), pp 1–5
Zeng W, Li J, Sun C, Cao L, Tang X, Shu S, Zheng J (2023) Ultra short-term power load forecasting based on similar day clustering and ensemble empirical mode decomposition. Energies 16(4):1989
Klimberg RK, Ratick S, Smith H (2017) in Advances in Business and Management Forecasting, vol 12 (Emerald Publishing Limited, 2017), pp 87–101
Wang X, Lee WJ, Huang H, Szabados RL, Wang DY, Van Olinda P (2016) Factors that impact the accuracy of clustering-based load forecasting. IEEE Trans Ind Appl 52(5):3625–3630
Structural ensemble regression for cluster-based aggregate electricity demand forecasting. Electr 3(4):480–504. https://doi.org/10.3390/electricity3040025. https://www.mdpi.com/2673-4826/3/4/25/pdf?version=1666359931
Xiao JW, Fang H, Wang YW (2024) Short-term residential load forecasting via pooling-ensemble model with smoothing clustering. IEEE Trans Artif Intell
Arvanitidis AI, Bargiotas D, Daskalopulu A, Kontogiannis D, Panapakidis IP, Tsoukalas LH (2022) Clustering informed mlp models for fast and accurate short-term load forecasting. Energies 15(4):1295
Shaukat MA, Shaukat HR, Qadir Z, Munawar HS, Kouzani AZ, Mahmud MP (2021) Cluster analysis and model comparison using smart meter data. Sensors 21(9):3157
Ding C, Li Y, Wang Y, Cao S, Gu D (2023) in International Conference on Signal Processing, Computer Networks, and Communications (SPCNC 2022), vol 12626 (SPIE, 2023), pp 293–298
Wang Y, Von Krannichfeldt L, Hug G (2021) in 2021 IEEE Madrid PowerTech (IEEE, 2021), pp. 1–6
Zha W, Ji Y, Liang C (2024) Short-term load forecasting method based on secondary decomposition and improved hierarchical clustering. Results Eng 22:101993
van den Bergh D, Clyde MA, Gupta ARN, de Jong T, Gronau QF, Marsman M, Ly A, Wagenmakers EJ (2021) A tutorial on Bayesian multi-model linear regression with BAS and JASP. Behav Res Methods 53(6). https://doi.org/10.3758/s13428-021-01552-2
Permai SD, Tanty H (2018) Linear regression model using bayesian approach for energy performance of residential building. Procedia Comput Sci 135:671–677
Wang S, Sun X, Lall U (2017) A hierarchical bayesian regression model for predicting summer residential electricity demand across the usa. Energy 140:601–611
Trierweiler Ribeiro G, Guilherme Sauer J, Fraccanabbia N, Cocco Mariani V, dos Santos Coelho L (2020) Bayesian optimized echo state network applied to short-term load forecasting. Energies 13(9):2390
Kruschke JK (2014) Doing Bayesian data analysis: A tutorial with R, JAGS, and Stan, second edition. https://doi.org/10.1016/B978-0-12-405888-0.09999-2
Baldwin SA, Larson MJ (2017) An introduction to using Bayesian linear regression with clinical data. Behav Res Ther 98. https://doi.org/10.1016/j.brat.2016.12.016
McElreath R (2018) Statistical rethinking: A bayesian course with examples in R and stan. https://doi.org/10.1201/9781315372495
Gelman A, Carlin JB, Stern HS, Dunson DB, Vehtari A, Rubin DB (2013) Bayesian data analysis. Chapman and Hall/CRC
Kendall MG (1961) The Advanced Theory of Statistics. Rev Mex Sociol 23(1) https://doi.org/10.2307/3538355
Permai SD, Tanty H (2018) in Procedia Computer Science, vol 135. https://doi.org/10.1016/j.procs.2018.08.219
Hyndman RJ, George A (2014) Forecasting: Principles and Practice. Principles of Optimal Design (September)
Osborne JW, Waters E (2003) Four assumptions of multiple regression that researchers should always test. Pract Assess Res Eval 8(2)
Roziqin MC, Basuki A, Harsono T (2017) in 2016 International Conference on Knowledge Creation and Intelligent Computing, KCIC 2016. https://doi.org/10.1109/KCIC.2016.7883649
Prasad AK, Ahadi M, Thakur BS, Roy S (2016) in Proceedings of 2015 IEEE MTT-S International Conference on Numerical Electromagnetic and Multiphysics Modeling and Optimization, NEMO 2015. https://doi.org/10.1109/NEMO.2015.7415055
Chen Y, He P, Chen W, Zhao F (2018) in Proceedings of 2018 IEEE 3rd Advanced Information Technology, Electronic and Automation Control Conference, IAEAC 2018. https://doi.org/10.1109/IAEAC.2018.8577769
Ostertagová E (2012) in Procedia Engineering, vol 48. https://doi.org/10.1016/j.proeng.2012.09.545
Brownlee J (2020) Probability for Machine Learning : Discover how to harness uncertainty with Python
Trevor Hastie JF, Tibshirani R (2000) The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd edn, vol 20
ENTSOE (2022) Central Collection and Publication of Electricity Generation, Transportation and Consumption Data and Information for the Pan-European Market. https://transparency.entsoe.eu/. Accessed 19 Jan 2022
Visual Crossing Corporation (2022) Visual Crossing Weather. https://www.visualcrossing.com/weather-data. Accessed Feb 16 2022
Urosevic V, Dimitrijevic S, in, (2021) 29th Telecommunications Forum, TELFOR 2021 - Proceedings (2021 29th Telecommunications Forum (TELFOR). Belgrade 2021:4. https://doi.org/10.1109/TELFOR52709.2021.9653206
Zhang N, Li Z, Zou X, Quiring SM (2019) Comparison of three short-term load forecast models in Southern California. Energy 189. https://doi.org/10.1016/j.energy.2019.116358
(2024) U.s. energy information administration - eia - independent statistics and analysis. https://www.eia.gov/todayinenergy/detail.php?id=29012. Accessed 21 Sept 2024
(2024) U.s. energy information administration - eia - independent statistics and analysis. https://www.eia.gov/todayinenergy/detail.php?id=43295. Accessed 21 Sept 2024
Timeanddate (2022) Timeanddate.com. https://www.timeanddate.com/. Accessed 9 Feb 2022
Çevik HH, Çunkaş M (2015) Short-term load forecasting using fuzzy logic and ANFIS. Neural Comput & Applic 26(6). https://doi.org/10.1007/s00521-014-1809-4
Ružić S, Vučković A, Nikolić N (2003) Weather Sensitive Method for Short Term Load Forecasting in Electric Power Utility of Serbia. IEEE Trans Power Syst 18(4). https://doi.org/10.1109/TPWRS.2003.811172
Ranaweera DK, Hubele NF, Karady GG (1996) Fuzzy logic for short term load forecasting. Int J Electr Power Energy Syst 18(4). https://doi.org/10.1016/0142-0615(95)00060-7
Chenthur Pandian S, Duraiswamy K, Rajan CCA, Kanagaraj N (2006) Fuzzy approach for short term load forecasting. Electr Power Syst Res 76(6-7). https://doi.org/10.1016/j.epsr.2005.09.018
Wei J, Chen Z, He F, Qi Q, Jiang J (2023) Short-term Load Forecasting Method Based on Daily Load Curve Clustering and SVM-WD. J Phys Conf Ser 2447(1). https://doi.org/10.1088/1742-6596/2447/1/012007
Betancourt M (2018) A conceptual introduction to Hamiltonian Monte Carlo. Ann Rev Stat Appl 5:65–80
Gelman A, Rubin DB (1992) Inference from iterative simulation using multiple sequences. Stat Sci 7(4). https://doi.org/10.1214/ss/1177011136
Peng FL, Qiao YK, Yang C (2023) Reliability Estimation for the Joint Waterproof Facilities of Utility Tunnels Based on an Improved Bayesian Weibull Model. Appl Sci (Switzerland) 13(1). https://doi.org/10.3390/app13010611
Rosopa PJ, Schaffer MM, Schroeder AN (2013) Managing heteroscedasticity in general linear models. Psychol Methods 18(3). https://doi.org/10.1037/a0032553
Breusch TS, Pagan AR (1979) A Simple Test for Heteroscedasticity and Random Coefficient Variation. Econometrica 47(5). https://doi.org/10.2307/1911963
Durbin J, Watson GS (1950) Testing for Serial Correlation in Least Squares Regression: I. Biometrika 37(3/4). https://doi.org/10.2307/2332391
Dai X, Fu G, Reese R, Zhao S, Shang Z (2022) An approach of bayesian variable selection for ultrahigh-dimensional multivariate regression. Stat 11(1):e476
Celani A, Pagnottoni P, Jones G (2024) Bayesian variable selection for matrix autoregressive models. Stat Comput 34(2):91
PyMC (2022) Introductory Overview of PyMC. https://www.pymc.io/projects/docs/en/stable/learn/core_notebooks/pymc_overview.html. Accessed 16 Jun 2022
Author information
Authors and Affiliations
Corresponding authors
Ethics declarations
Conflict of interest
The authors have no relevant financial or nonfinancial interests to disclose. The authors have no conflicts of interest to declare that are relevant to the content of this article. The authors certify that they have no affiliations with or involvement in any organization or entity with any financial interest or nonfinancial interest in the subject matter or materials discussed in this manuscript.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Urošević, V., M. Savić, A. Temporal clustering for accurate short-term load forecasting using Bayesian multiple linear regression. Appl Intell 55, 19 (2025). https://doi.org/10.1007/s10489-024-05887-z
Accepted:
Published:
DOI: https://doi.org/10.1007/s10489-024-05887-z