Abstract
Accurately forecasting sales has significant ramifications for producers, distributors, and investors. Sales forecasting accuracy enables businesses to enhance their manufacturing, distribution, and promotional activities. The current research intends to examine the implications of feature selection methods on enhancing the accuracy of seasonal sales forecasts. The author evaluates different feature selection methods in combination with predictive models, aiming to determine their impact on the effectiveness of predictions. Within this research, a diverse set of eight classifiers has been utilized: namely Naïve Bayes, Logistic Regression, Neural Network, Random Forest, J48, IBK, SVM, and K Star. Alongside these classifiers, four distinct feature selection techniques-namely Gainratio, Infogain, Relief, and CFS have also been employed. The effectiveness of these strategies was evaluated individually as well as collaboratively. The outcome of the proposed novel methodology DAAESF engendered a notable advancement in accuracy rates. Combining feature selection techniques with Neural Network led to a 32% accuracy enhancement compared to other classifiers for cement sales prediction, while Naïve Bayes experienced a decline in performance from 19.55% to 32.55% due to its distinct functions. Additionally, feature selection notably improved prediction accuracy across classifiers, with Neural Network achieving up to 22.68% improvement using CFS, and SVM showing gains of 26.3% with Infogain, highlighting the critical role of feature selection in model optimization. Naïve Bayes and J48 exhibited mixed results across datasets and feature selection methods. Additionally, to substantiate the robustness and validity of the observed outcomes, the Friedman test was judiciously applied.




















Similar content being viewed by others
Data Availability
The data used to support the findings of this study are available from the corresponding author upon request.
References
Montoya-Torres JR, Muñoz-Villamizar A, Mejia-Argueta C. Mapping research in logistics and supply chain management during covid-19 pandemic. Int J Log Res Appl. 2023;26(4):421–41.
Kukkar A, Sharma A, Fan J, Zhang M. Data mining applications in university information management system development; 2022.
Singh S, Ramkumar K, Kukkar A. Machine learning techniques and implementation of different ml algorithms. In: 2021 2nd global conference for advancement in technology (GCAT). IEEE; 2021. pp. 1–6.
Kumar A, Kumar Y, Kukkar A. A feature selection model for prediction of software defects. Int J Embedded Syst. 2020;13(1):28–39.
Borucka A. Seasonal methods of demand forecasting in the supply chain as support for the company’s sustainable growth. Sustainability. 2023;15(9):7399.
Feizabadi J. Machine learning demand forecasting and supply chain performance. Int J Log Res Appl. 2022;25(2):119–42.
Kaur G, Goyal S, Kaur H. Brief review of various machine learning algorithms. In: Proceedings of the international conference on innovative computing & communication (ICICC); 2021.
Goswami K, Kandali AB. Machine learning algorithms for predicting electrical load demand: an evaluation and comparison. Sādhanā. 2024;49(1):1–14.
Chatziloizos G-M, Gunopulos D, Konstantinou K. Deep learning for stock market prediction using sentiment and technical analysis. SN Comput Sci. 2024;5(5):446.
Chatziloizos G-M, Gunopulos D, Konstantinou K. Deep learning for stock market prediction using sentiment and technical analysis. SN Comput Sci. 2024;5(5):446.
Zhu X, Ninh A, Zhao H, Liu Z. Demand forecasting with supply-chain information and machine learning: evidence in the pharmaceutical industry. Prod Oper Manage. 2021;30(9):3231–52.
Pereira MM, Frazzon EM. A data-driven approach to adaptive synchronization of demand and supply in omni-channel retail supply chains. Int J Inf Manage. 2021;57: 102165.
Nguyen HD, Tran KP, Thomassey S, Hamad M. Forecasting and anomaly detection approaches using lstm and lstm autoencoder techniques with the applications in supply chain management. Int J Inf Manage. 2021;57: 102282.
Knoll D, Prüglmeier M, Reinhart G. Predicting future inbound logistics processes using machine learning. Proc CIRP. 2016;52:145–50.
Budak A, Ustundag A, Guloglu B. A forecasting approach for truckload spot market pricing. Transport Res A Policy Pract. 2017;97:55–68.
Ji S, Wang X, Zhao W, Guo D. An application of a three-stage xgboost-based model to sales forecasting of a cross-border e-commerce enterprise. Math Probl Eng. 2019. https://doi.org/10.1155/2019/8503252.
Cheriyan S, Ibrahim S, Mohanan S, Treesa S. Intelligent sales prediction using machine learning techniques. In: 2018 international conference on computing, electronics & communications engineering (iCCECE), IEEE; 2018. pp. 53–58.
Mohamed-Iliasse M, Loubna B, Abdelaziz B. Is machine learning revolutionizing supply chain? In: 2020 5th International conference on logistics operations management (GOL). IEEE; 2020. pp. 1–10.
Bousqaoui H, Achchab S, Tikito K. Machine learning applications in supply chains: an emphasis on neural network applications. In: 2017 3rd International conference of cloud computing technologies and applications (CloudTech). IEEE; 2017. pp. 1–7.
Gupta G, Gupta KL, Kansal G. Megamart sales prediction using machine learning techniques. In: Proceedings of third international conference on computing, communications, and cyber-security: IC4S 2021, Springer; 2022. pp. 437–446.
Albadrani A, Zohdy MA, Olawoyin R. An approach to optimize future inbound logistics processes using machine learning algorithms. In: 2020 IEEE international conference on electro information technology (EIT). IEEE; 2020. pp. 402–406.
Htun HH, Biehl M, Petkov N. Survey of feature selection and extraction techniques for stock market prediction. Fin Innov. 2023;9(1):26.
Kaur G, Kaur H, Goyal S. Correlation analysis between different parameters to predict cement logistics. Innov Syst Softw Eng. 2023;19(1):117–27.
Lei Y, Qiaoming H, Tong Z, et al. Research on supply chain financial risk prevention based on machine learning. Comput Intell Neurosci. 2023. https://doi.org/10.1155/2023/6531154.
Cheriyan S, Ibrahim S, Mohanan S, Treesa S. Intelligent sales prediction using machine learning techniques. In: 2018 International conference on computing, electronics & communications engineering (iCCECE). IEEE; 2018. pp. 53–58.
Mohamed-Iliasse M, Loubna B, Abdelaziz B. Is machine learning revolutionizing supply chain? In: 2020 5th International conference on logistics operations management (GOL). IEEE; 2020. pp. 1–10.
Bousqaoui H, Achchab S, Tikito K. Machine learning applications in supply chains: an emphasis on neural network applications. In: 2017 3rd International conference of cloud computing technologies and applications (CloudTech). IEEE; 2017. pp. 1–7.
Pallathadka H, Mustafa M, Sanchez DT, Sajja GS, Gour S, Naved M. Impact of machine learning on management, healthcare and agriculture. Mater Today Proc. 2023;80:2803–6.
Prahara PJ, Hariadi TK. Improved feature selection algorithm of electricity price forecasting using svm. In: 2022 2nd international conference on electronic and electrical engineering and intelligent system (ICE3IS). IEEE; 2022. pp. 34–39.
Kaur G, Kaur H. Prediction of the cause of accident and accident prone location on roads using data mining techniques. In: 2017 8th International conference on computing, communication and networking technologies (ICCCNT). IEEE; 2017. pp. 1–7.
Bindal R, Sarangi P, Kaur G, Dhiman G. An approach for automatic recognition system for Indian vehicles numbers using k-nearest neighbours and decision tree classifier; 2019.
Sharma A, Mishra PK. Performance analysis of machine learning based optimized feature selection approaches for breast cancer diagnosis. Int J Inf Technol. 2022;14(1):1–12.
Reddy EMK, Gurrala A, Hasitha VB, Kumar KVR. Introduction to naive bayes and a review on its subtypes with applications. In: Bayesian Reason. Gaussian Process. Mach. Learn. Appl.; 2022. pp. 1–14.
Karki S, Hadikusumo B. Machine learning for the identification of competent project managers for construction projects in Nepal. Constr Innov. 2023;23(1):1–18.
Wu D, Wang Q, Olson DL. Industry classification based on supply chain network information using graph neural networks. Appl Soft Comput. 2023;132: 109849.
Banik S, Islam MR, Rahman KN, Rahman MA. A comparative analysis of machine learning algorithms to predict backorder in supply chain management. SSRN. 2023. https://doi.org/10.2139/ssrn.4444976.
Luo J. Application of machine learning in supply chain management. In: 2022 3rd international conference on big data economy and information management (BDEIM 2022). Atlantis Press; 2023. pp. 489–498.
Esmaeili M, Olfat L, Amiri M, Raeesi Vanani I. Classification and allocation of suppliers to customers in resilience supply chains using machine learning. J Ind Manage Perspect. 2023;13(3):39–70.
Ghasemkhani B, Aktas O, Birant D. Balanced k-star: an explainable machine learning method for internet-of-things-enabled predictive maintenance in manufacturing. Machines. 2023;11(3):322.
Khosravi K, Golkarian A, Omidvar E, Hatamiafkoueieh J, Shirali M. Snow water equivalent prediction in a mountainous area using hybrid bagging machine learning approaches. Acta Geophys. 2023;71(2):1015–31.
Nguyen HD, Tran KP, Thomassey S, Hamad M. Forecasting and anomaly detection approaches using lstm and lstm autoencoder techniques with the applications in supply chain management. Int J Inf Manage. 2021;57: 102282.
Hasan MR. Addressing seasonality and trend detection in predictive sales forecasting: a machine learning perspective. J Bus Manage Stud. 2024;6(2):100–9.
Soltaninejad M, Aghazadeh R, Shaghaghi S, Zarei M. Using machine learning techniques to forecast Mehram company’s sales: a case study. J Bus Manage Stud. 2024;6(2):42–53.
Kaur G, Kaur H, Goyal S. Strategic feature selection for precision augmentation in cement sales forecasting. In: 2023 Seventh international conference on image information processing (ICIIP). IEEE; 2023. pp. 765–770.
Funding
No funding, grants, or other aid was received during the preparation of this manuscript.
Author information
Authors and Affiliations
Contributions
All the authors have equally contributed to this work.
Corresponding author
Ethics declarations
Conflict of Interest
The authors declare that they have no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Kaur, G., Kaur, H. & Goyal, S. Data Analytics Approach for Enhanced Sales Forecasting (DAAESF): Feature Selection and Classifier Integration Analysis. SN COMPUT. SCI. 5, 1158 (2024). https://doi.org/10.1007/s42979-024-03483-z
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s42979-024-03483-z