Skip to main content

Advertisement

Log in

Data Analytics Approach for Enhanced Sales Forecasting (DAAESF): Feature Selection and Classifier Integration Analysis

  • Original Research
  • Published:
SN Computer Science Aims and scope Submit manuscript

Abstract

Accurately forecasting sales has significant ramifications for producers, distributors, and investors. Sales forecasting accuracy enables businesses to enhance their manufacturing, distribution, and promotional activities. The current research intends to examine the implications of feature selection methods on enhancing the accuracy of seasonal sales forecasts. The author evaluates different feature selection methods in combination with predictive models, aiming to determine their impact on the effectiveness of predictions. Within this research, a diverse set of eight classifiers has been utilized: namely Naïve Bayes, Logistic Regression, Neural Network, Random Forest, J48, IBK, SVM, and K Star. Alongside these classifiers, four distinct feature selection techniques-namely Gainratio, Infogain, Relief, and CFS have also been employed. The effectiveness of these strategies was evaluated individually as well as collaboratively. The outcome of the proposed novel methodology DAAESF engendered a notable advancement in accuracy rates. Combining feature selection techniques with Neural Network led to a 32% accuracy enhancement compared to other classifiers for cement sales prediction, while Naïve Bayes experienced a decline in performance from 19.55% to 32.55% due to its distinct functions. Additionally, feature selection notably improved prediction accuracy across classifiers, with Neural Network achieving up to 22.68% improvement using CFS, and SVM showing gains of 26.3% with Infogain, highlighting the critical role of feature selection in model optimization. Naïve Bayes and J48 exhibited mixed results across datasets and feature selection methods. Additionally, to substantiate the robustness and validity of the observed outcomes, the Friedman test was judiciously applied.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17
Fig. 18
Fig. 19
Fig. 20

Similar content being viewed by others

Data Availability

The data used to support the findings of this study are available from the corresponding author upon request.

References

  1. Montoya-Torres JR, Muñoz-Villamizar A, Mejia-Argueta C. Mapping research in logistics and supply chain management during covid-19 pandemic. Int J Log Res Appl. 2023;26(4):421–41.

    Article  Google Scholar 

  2. Kukkar A, Sharma A, Fan J, Zhang M. Data mining applications in university information management system development; 2022.

  3. Singh S, Ramkumar K, Kukkar A. Machine learning techniques and implementation of different ml algorithms. In: 2021 2nd global conference for advancement in technology (GCAT). IEEE; 2021. pp. 1–6.

  4. Kumar A, Kumar Y, Kukkar A. A feature selection model for prediction of software defects. Int J Embedded Syst. 2020;13(1):28–39.

    Article  Google Scholar 

  5. Borucka A. Seasonal methods of demand forecasting in the supply chain as support for the company’s sustainable growth. Sustainability. 2023;15(9):7399.

    Article  Google Scholar 

  6. Feizabadi J. Machine learning demand forecasting and supply chain performance. Int J Log Res Appl. 2022;25(2):119–42.

    Article  Google Scholar 

  7. Kaur G, Goyal S, Kaur H. Brief review of various machine learning algorithms. In: Proceedings of the international conference on innovative computing & communication (ICICC); 2021.

  8. Goswami K, Kandali AB. Machine learning algorithms for predicting electrical load demand: an evaluation and comparison. Sādhanā. 2024;49(1):1–14.

    Article  MathSciNet  Google Scholar 

  9. Chatziloizos G-M, Gunopulos D, Konstantinou K. Deep learning for stock market prediction using sentiment and technical analysis. SN Comput Sci. 2024;5(5):446.

    Article  Google Scholar 

  10. Chatziloizos G-M, Gunopulos D, Konstantinou K. Deep learning for stock market prediction using sentiment and technical analysis. SN Comput Sci. 2024;5(5):446.

    Article  Google Scholar 

  11. Zhu X, Ninh A, Zhao H, Liu Z. Demand forecasting with supply-chain information and machine learning: evidence in the pharmaceutical industry. Prod Oper Manage. 2021;30(9):3231–52.

    Article  Google Scholar 

  12. Pereira MM, Frazzon EM. A data-driven approach to adaptive synchronization of demand and supply in omni-channel retail supply chains. Int J Inf Manage. 2021;57: 102165.

    Article  Google Scholar 

  13. Nguyen HD, Tran KP, Thomassey S, Hamad M. Forecasting and anomaly detection approaches using lstm and lstm autoencoder techniques with the applications in supply chain management. Int J Inf Manage. 2021;57: 102282.

    Article  Google Scholar 

  14. Knoll D, Prüglmeier M, Reinhart G. Predicting future inbound logistics processes using machine learning. Proc CIRP. 2016;52:145–50.

    Article  Google Scholar 

  15. Budak A, Ustundag A, Guloglu B. A forecasting approach for truckload spot market pricing. Transport Res A Policy Pract. 2017;97:55–68.

    Article  Google Scholar 

  16. Ji S, Wang X, Zhao W, Guo D. An application of a three-stage xgboost-based model to sales forecasting of a cross-border e-commerce enterprise. Math Probl Eng. 2019. https://doi.org/10.1155/2019/8503252.

    Article  Google Scholar 

  17. Cheriyan S, Ibrahim S, Mohanan S, Treesa S. Intelligent sales prediction using machine learning techniques. In: 2018 international conference on computing, electronics & communications engineering (iCCECE), IEEE; 2018. pp. 53–58.

  18. Mohamed-Iliasse M, Loubna B, Abdelaziz B. Is machine learning revolutionizing supply chain? In: 2020 5th International conference on logistics operations management (GOL). IEEE; 2020. pp. 1–10.

  19. Bousqaoui H, Achchab S, Tikito K. Machine learning applications in supply chains: an emphasis on neural network applications. In: 2017 3rd International conference of cloud computing technologies and applications (CloudTech). IEEE; 2017. pp. 1–7.

  20. Gupta G, Gupta KL, Kansal G. Megamart sales prediction using machine learning techniques. In: Proceedings of third international conference on computing, communications, and cyber-security: IC4S 2021, Springer; 2022. pp. 437–446.

  21. Albadrani A, Zohdy MA, Olawoyin R. An approach to optimize future inbound logistics processes using machine learning algorithms. In: 2020 IEEE international conference on electro information technology (EIT). IEEE; 2020. pp. 402–406.

  22. Htun HH, Biehl M, Petkov N. Survey of feature selection and extraction techniques for stock market prediction. Fin Innov. 2023;9(1):26.

    Article  Google Scholar 

  23. Kaur G, Kaur H, Goyal S. Correlation analysis between different parameters to predict cement logistics. Innov Syst Softw Eng. 2023;19(1):117–27.

    Article  Google Scholar 

  24. Lei Y, Qiaoming H, Tong Z, et al. Research on supply chain financial risk prevention based on machine learning. Comput Intell Neurosci. 2023. https://doi.org/10.1155/2023/6531154.

    Article  Google Scholar 

  25. Cheriyan S, Ibrahim S, Mohanan S, Treesa S. Intelligent sales prediction using machine learning techniques. In: 2018 International conference on computing, electronics & communications engineering (iCCECE). IEEE; 2018. pp. 53–58.

  26. Mohamed-Iliasse M, Loubna B, Abdelaziz B. Is machine learning revolutionizing supply chain? In: 2020 5th International conference on logistics operations management (GOL). IEEE; 2020. pp. 1–10.

  27. Bousqaoui H, Achchab S, Tikito K. Machine learning applications in supply chains: an emphasis on neural network applications. In: 2017 3rd International conference of cloud computing technologies and applications (CloudTech). IEEE; 2017. pp. 1–7.

  28. Pallathadka H, Mustafa M, Sanchez DT, Sajja GS, Gour S, Naved M. Impact of machine learning on management, healthcare and agriculture. Mater Today Proc. 2023;80:2803–6.

    Article  Google Scholar 

  29. Prahara PJ, Hariadi TK. Improved feature selection algorithm of electricity price forecasting using svm. In: 2022 2nd international conference on electronic and electrical engineering and intelligent system (ICE3IS). IEEE; 2022. pp. 34–39.

  30. Kaur G, Kaur H. Prediction of the cause of accident and accident prone location on roads using data mining techniques. In: 2017 8th International conference on computing, communication and networking technologies (ICCCNT). IEEE; 2017. pp. 1–7.

  31. Bindal R, Sarangi P, Kaur G, Dhiman G. An approach for automatic recognition system for Indian vehicles numbers using k-nearest neighbours and decision tree classifier; 2019.

  32. Sharma A, Mishra PK. Performance analysis of machine learning based optimized feature selection approaches for breast cancer diagnosis. Int J Inf Technol. 2022;14(1):1–12.

    Google Scholar 

  33. Reddy EMK, Gurrala A, Hasitha VB, Kumar KVR. Introduction to naive bayes and a review on its subtypes with applications. In: Bayesian Reason. Gaussian Process. Mach. Learn. Appl.; 2022. pp. 1–14.

  34. Karki S, Hadikusumo B. Machine learning for the identification of competent project managers for construction projects in Nepal. Constr Innov. 2023;23(1):1–18.

    Article  Google Scholar 

  35. Wu D, Wang Q, Olson DL. Industry classification based on supply chain network information using graph neural networks. Appl Soft Comput. 2023;132: 109849.

    Article  Google Scholar 

  36. Banik S, Islam MR, Rahman KN, Rahman MA. A comparative analysis of machine learning algorithms to predict backorder in supply chain management. SSRN. 2023. https://doi.org/10.2139/ssrn.4444976.

    Article  Google Scholar 

  37. Luo J. Application of machine learning in supply chain management. In: 2022 3rd international conference on big data economy and information management (BDEIM 2022). Atlantis Press; 2023. pp. 489–498.

  38. Esmaeili M, Olfat L, Amiri M, Raeesi Vanani I. Classification and allocation of suppliers to customers in resilience supply chains using machine learning. J Ind Manage Perspect. 2023;13(3):39–70.

    Google Scholar 

  39. Ghasemkhani B, Aktas O, Birant D. Balanced k-star: an explainable machine learning method for internet-of-things-enabled predictive maintenance in manufacturing. Machines. 2023;11(3):322.

    Article  Google Scholar 

  40. Khosravi K, Golkarian A, Omidvar E, Hatamiafkoueieh J, Shirali M. Snow water equivalent prediction in a mountainous area using hybrid bagging machine learning approaches. Acta Geophys. 2023;71(2):1015–31.

    Article  Google Scholar 

  41. Nguyen HD, Tran KP, Thomassey S, Hamad M. Forecasting and anomaly detection approaches using lstm and lstm autoencoder techniques with the applications in supply chain management. Int J Inf Manage. 2021;57: 102282.

    Article  Google Scholar 

  42. Hasan MR. Addressing seasonality and trend detection in predictive sales forecasting: a machine learning perspective. J Bus Manage Stud. 2024;6(2):100–9.

    Article  Google Scholar 

  43. Soltaninejad M, Aghazadeh R, Shaghaghi S, Zarei M. Using machine learning techniques to forecast Mehram company’s sales: a case study. J Bus Manage Stud. 2024;6(2):42–53.

    Article  Google Scholar 

  44. Kaur G, Kaur H, Goyal S. Strategic feature selection for precision augmentation in cement sales forecasting. In: 2023 Seventh international conference on image information processing (ICIIP). IEEE; 2023. pp. 765–770.

Download references

Funding

No funding, grants, or other aid was received during the preparation of this manuscript.

Author information

Authors and Affiliations

Authors

Contributions

All the authors have equally contributed to this work.

Corresponding author

Correspondence to Gagandeep Kaur.

Ethics declarations

Conflict of Interest

The authors declare that they have no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Kaur, G., Kaur, H. & Goyal, S. Data Analytics Approach for Enhanced Sales Forecasting (DAAESF): Feature Selection and Classifier Integration Analysis. SN COMPUT. SCI. 5, 1158 (2024). https://doi.org/10.1007/s42979-024-03483-z

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s42979-024-03483-z

Keywords

Navigation