skip to main content
10.1145/3695652.3695715acmotherconferencesArticle/Chapter ViewAbstractPublication PagesimmsConference Proceedingsconference-collections
research-article

F-Ensemble: A Full Fusion Ensemble Model for Predicting E-commerce User Purchasing Behavior

Published: 15 October 2024 Publication History

Abstract

Mining historical behavioral data of e-commerce users to help platforms identify users with the highest potential consumption value is a hot topic in the e-commerce field. However, current research has not adequately considered the impact of imbalanced user purchasing behavior data on model prediction. The structure of existing prediction models is relatively simple, the user behavior features involved are not comprehensive enough, and the prediction accuracy needs to be improved. Therefore, this paper proposes an F-Ensemble model based on a full fusion ensemble learning strategy to predict the purchasing behavior of users. This model uses decision tree, multilayer perceptron, random forest, and AdaBoost as base models, integrating both single models and ensemble learning models to predict the purchasing behavior of users. The experimental results show that the Macro-F1 score of the F-Ensemble model based on the full fusion ensemble learning strategy reaches 0.86485, representing a 5.30% improvement over the worst-performing decision tree model. Additionally, the F-Ensemble model significantly improves the F1 score of purchase behavior prediction under imbalanced data conditions, achieving a score of 0.73283, which is 13.39% higher than the performance of the worst-performing decision tree model. This result further validates the effectiveness of the F-Ensemble model. In conclusion, the F-Ensemble model constructed in this paper enhances the prediction accuracy of e-commerce users' purchasing behavior and holds practical value by reducing platform management costs.

References

[1]
Vipin Jain, Bindoo Malviya, and Satyendra Arya. 2021. An overview of electronic commerce (e-Commerce). Journal of Contemporary Issues in Business and Government 27, 3, 665–670.
[2]
Donald G. Morrison. 1979. Purchase intentions and purchase behavior. Journal of Marketing 43, 2, 65–74. https://doi.org/10.1177/002224297904300207
[3]
Zhi-Ping Fan, Yu-Jie Che, and Zhen-Yu Chen. 2017. Product sales forecasting using online reviews and historical sales data: A method combining the Bass model and sentiment analysis. Journal of Business Research 74, 90–100. https://doi.org/10.1016/j.jbusres.2017.01.010
[4]
Yu-Ting Wen, Pei-Wen Yeh, Tzu-Hao Tsai, Wen-Chih Peng, and Hong-Han Shuai. 2018. Customer purchase behavior prediction from payment datasets. In Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining (WSDM '18). ACM, New York, NY, USA, 628–636. https://doi.org/10.1145/3159652.3159707
[5]
Yongzheng Zhang, and Marco Pennacchiotti. 2013. Predicting purchase behaviors from social media. In Proceedings of the 22nd International Conference on World Wide Web. ACM, New York, NY, USA, 1521–1532. https://doi.org/10.1145/2488388.2488521
[6]
Jing Xu, Jie Wang, Ye Tian, Jiangpeng Yan, Xiu Li, and Xin Gao. 2020. SE-stacking: Improving user purchase behavior prediction by information fusion and ensemble learning. PLoS ONE 15, 11, e0242629. https://doi.org/10.1371/journal.pone.0242629
[7]
Andrés Martínez, Claudia Schmuck, Sergiy Pereverzyev Jr, Clemens Pirker, and Markus Haltmeier. 2020. A machine learning framework for customer purchase prediction in the non-contractual setting. European Journal of Operational Research 281, 3, 588–596. https://doi.org/10.1016/j.ejor.2018.04.034
[8]
Yezheng Liu, Tingting Zhu, and Yuanchun Jiang. 2019. Purchase prediction based on a non-parametric Bayesian method. https://doi.org/10.24251/HICSS.2019.160
[9]
Dennis Koehn, Stefan Lessmann, and Markus Schaal. 2020. Predicting online shopping behaviour from clickstream data using deep learning. Expert Systems with Applications 150, 113342. https://doi.org/10.1016/j.eswa.2020.113342
[10]
Neha Chaudhuri, Gaurav Gupta, Vallurupalli Vamsi, and Indranil Bose. 2021. On the platform but will they buy? Predicting customers' purchase behavior using deep learning. Decision Support Systems 149, 113622. https://doi.org/10.1016/j.dss.2021.113622
[11]
Kiran Maharana, Surajit Mondal, and Bhushankumar Nemade. 2022. A review: Data pre-processing and data augmentation techniques. Global Transitions Proceedings 3, 1, 91–99. https://doi.org/10.1016/j.gltp.2022.04.020
[12]
T. Daniya, M. Geetha, and K. S. Kumar. 2020. Classification and regression trees with Gini index. Advances in Mathematics: Scientific Journal 9, 10, 8237–8247. https://doi.org/10.37418/amsj.9.10.53
[13]
Luis B. Almeida. 2020. Multilayer perceptrons. In Handbook of Neural Computation. CRC Press, C1–2.
[14]
Massimo Aria, Corrado Cuccurullo, and Agostino Gnasso. 2021. A comparison among interpretative proposals for Random Forests. Machine Learning with Applications 6, 100094. https://doi.org/10.1016/j.mlwa.2021.100094
[15]
Wenyang Wang, and Dongchu Sun. 2021. The improved AdaBoost algorithms for imbalanced data classification. Information Sciences 563, 358–374. https://doi.org/10.1016/j.ins.2021.03.042
[16]
Bohdan Pavlyshenko. 2018. Using stacking approaches for machine learning models. In 2018 IEEE Second International Conference on Data Stream Mining & Processing (DSMP), 255–258. IEEE. https://doi.org/10.1109/DSMP.2018.8478522
[17]
Fadi Thabtah, Suhel Hammoud, Firuz Kamalov, and Amanda Gonsalves. 2020. Data imbalance in classification: Experimental evaluation. Information Sciences 513, 429–441. https://doi.org/10.1016/j.ins.2019.11.004
[18]
Xue Ying. 2019. An overview of overfitting and its solutions. In Journal of Physics: Conference Series, Vol. 1168, 022022. IOP Publishing. https://doi.org/10.1088/1742-6596/1168/2/022022
[19]
Utkarsh Mahadeo Khaire, and R. Dhanalakshmi. 2022. Stability of feature selection algorithm: A review. Journal of King Saud University-Computer and Information Sciences 34, 4, 1060–1073. https://doi.org/10.1016/j.jksuci.2019.06.012
[20]
Zuguang Gu. 2022. Complex heatmap visualization. Imeta 1, 3, e43. https://doi.org/10.1002/imt2.43

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
IMMS '24: Proceedings of the 2024 7th International Conference on Information Management and Management Science
August 2024
465 pages
ISBN:9798400716997
DOI:10.1145/3695652
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 15 October 2024

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Class imbalance
  2. Electronic commerce
  3. Machine learning
  4. User behavior prediction

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Funding Sources

  • General Subjects of Shanghai Philosophy and Social Science Planning

Conference

IMMS 2024

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 24
    Total Downloads
  • Downloads (Last 12 months)24
  • Downloads (Last 6 weeks)10
Reflects downloads up to 08 Mar 2025

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media