Skip to main content

A Comparative Study of Machine Learning Classification Models on Customer Behavior Data

  • Conference paper
  • First Online:
Soft Computing in Data Science (SCDS 2023)

Abstract

Competent marketers aim to accurately predict consumer desires and needs. With the advancement of machine learning, different machine learning models could be applied to solve various challenges, including precisely determining consumer behavior. Meanwhile, discount coupons are a frequent marketing approach to boost sales and encourage recurring business from existing customers. Accordingly, the current study seeks to analyze customer behavior by assessing an in-vehicle coupon recommendation system dataset as the case study. The dataset, which was obtained from the University of California-Irvine (UCI) machine learning repository, could predict consumer decisions as to whether accept the discount coupon influenced by demographic and environmental factors, such as driving destinations, age, current time, and weather. This study also compared six machine learning classification models, including Bayesian Network (BayesNet), Naïve Bayes, Instance-Bases Learning with Parameter-K (Lazy-IBK), Tree J48, Random Forest, and RandomTree, on the dataset to identify a suitable model for predicting customer behavior through two test modes, namely cross-validation and percentage split. The model performance was evaluated by analyzing factors, such as accuracy, precision, processing time, recall, and F-measure, for the model development. The findings discovered that Naïve Bayes and Lazy-IBK consumed the least amount of prediction time, although with the lowest accuracy. RandomTree was the highest processing time, whereas Random Forest provided the highest accuracy, precision, recall and F-measure values.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 69.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 89.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Wang, M., Yao, J.: A reliable location design of unmanned vending machines based on customer satisfaction. Electron. Comm. Res. 1–35 (2021)

    Google Scholar 

  2. Grigoroudis, E., Siskos, Y.: Preference disaggregation for measuring and analysing customer satisfaction: the MUSA method. Eur. J. Oper. Res. 143(1), 148–170 (2002)

    Article  MATH  Google Scholar 

  3. Hermawan, D.R., Fatihah, M.F.G., Kurniawati, L., Helen, A.: Comparative study of J48 decision tree classification algorithm, random tree, and random forest on in-vehicle coupon recommendation data. In: 2021 International Conference on Artificial Intelligence and Big Data Analytics, pp. 1–6. IEEE (2021)

    Google Scholar 

  4. Raju, S.S., Dhandayudam, P.: Prediction of customer behaviour analysis using classification algorithms. In: AIP Conference proceeding, vol. 1952, no. 1, pp. 020098–1–020098–7. AIP Publishing LLC, New York, USA (2018)

    Google Scholar 

  5. Kanavos, A., Iakovou, S.A., Sioutas, S., Tampakas, V.: Large scale product recommendation of supermarket ware based on customer behaviour analysis. Big Data Cogn. Comput. 2(2), 11–29 (2018)

    Article  Google Scholar 

  6. Khoa, B.T., Oanh, N.T.T., Uyen, V.T.T., Dung, D.C.H.: Customer loyalty in the Covid-19 pandemic: the application of machine learning in survey data. In: Somani, A.K., Mundra, A., Doss, R., Bhattacharya, S. (eds.) Smart Systems: Innovations in Computing. SIST, vol. 235, pp. 419–429. Springer, Singapore (2022). https://doi.org/10.1007/978-981-16-2877-1_38

    Chapter  Google Scholar 

  7. Sanjay, M., Shruthi, G.: Customer attrition prediction using machine learning algorithms (2022)

    Google Scholar 

  8. Quynh, T. D., Dung, H. T. T.: Prediction of customer behavior using machine learning: A case study. In: Proceedings of the 2nd International Conference on Human-centered Artificial Intelligence (Computing4Human 2021), pp. 168–175. CEUR Workshop Proceedings, Da Nang, Vietnam (2021)

    Google Scholar 

  9. Wang, T., Rudin, C., Doshi-Velez, F., Liu, Y., Klampfl, E., MacNeille, P.: A Bayesian framework for learning rule sets for interpretable classification. The J. Mach. Learn. Res. 18(1), 2357–2393 (2017)

    MathSciNet  MATH  Google Scholar 

  10. UCI Machine Learning Repository: https://archive.ics.uci.edu/ml/datasets/in-vehicle+coupon+recommendation. Last accessed 22 Aug 2022

  11. Ilyas, I.F., Rekatsinas, T.: Machine learning and data cleaning: which serves the other? J. Data Inform. Qual. (JDIQ) 14(3), 1–11 (2022)

    Article  Google Scholar 

  12. Ranjan, M., Bansiya, A.: Data cleaning rules based on conditional functional dependency. Res. J. Eng. Technol. Med. Sci. 4(2), 6–9 (2021)

    Google Scholar 

  13. Beniwal, S., Arora, J.: Classification and feature selection techniques in data mining. Int. J. Eng. Res. Technol. (IJERT) 1(6), 1–6 (2012)

    Google Scholar 

  14. Erkal, B., Ayyıldız, T.E.: Using machine learning methods in early diagnosis of breast cancer. In: 2021 Medical Technologies Congress (TIPTEKNO), pp 1–3. IEEE (2021)

    Google Scholar 

  15. Choudhury, S., Bhowal, A.: Comparative analysis of machine learning algorithms along with classifiers for network intrusion detection. In: 2015 International Conference on Smart Technologies and Management for Computing, Communication, Controls, Energy and Materials (ICSTM), pp 89–95. IEEE (2015)

    Google Scholar 

  16. Wang, K.: Network data management model based on Naïve Bayes classifier and deep neural networks in heterogeneous wireless networks. Comput. Electr. Eng. 75, 135–145 (2019)

    Article  Google Scholar 

  17. Ye, Z., Song, P., Zheng, D., Zhang, X., Wu, J.: A Naive Bayes model on lung adenocarcinoma projection based on tumor microenvironment and weighted gene co-expression network analysis. Infect. Dis. Model. 7(3), 498–509 (2022)

    Google Scholar 

  18. Chellam, A., Ramanathan, L., Ramani, S.: Intrusion detection in computer networks using lazy learning algorithm. Procedia Comput. Sci. 132, 928–936 (2018)

    Article  Google Scholar 

  19. Panigrahi, R., Borah, S.: Rank allocation to J48 group of decision tree classifiers using binary and multiclass intrusion detection datasets. Procedia Comput. Sci. 132, 323–332 (2018)

    Article  Google Scholar 

  20. Mohan, L., Jain, S., Suyal, P.,Kumar, A.: Data mining classification techniques for intrusion detection system. In: 2020 12th International Conference on Computational Intelligence and Communication Networks (CICN), pp. 351–355. IEEE (2020)

    Google Scholar 

  21. Van Nguyen, T., Zhou, L., Chong, A.Y.L., Li, B., Pu, X.: Predicting customer demand for remanufactured products: a data-mining approach. Eur. J. Oper. Res. 281(3), 543–558 (2020)

    Article  Google Scholar 

Download references

Acknowledgment

The authors would like to thank the Universiti Teknologi MARA for funding the research project under MyRa Grant Scheme File No.: 600-RMC/GPM LPHD 5/3 (180/2021). The authors gratefully acknowledge the College of Computing, Informatics and Media, Universiti Teknologi MARA, Cawangan Negeri Sembilan for supporting the publication of this paper.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Nur Ida Aniza Rusli .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Rusli, N.I.A., Zulkifle, F.A., Ramli, I.S. (2023). A Comparative Study of Machine Learning Classification Models on Customer Behavior Data. In: Yusoff, M., Hai, T., Kassim, M., Mohamed, A., Kita, E. (eds) Soft Computing in Data Science. SCDS 2023. Communications in Computer and Information Science, vol 1771. Springer, Singapore. https://doi.org/10.1007/978-981-99-0405-1_16

Download citation

  • DOI: https://doi.org/10.1007/978-981-99-0405-1_16

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-99-0404-4

  • Online ISBN: 978-981-99-0405-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics