Abstract
In recent years, the rapid development of e-commerce has brought great convenience to people. Compared with traditional business environment, e-commerce is more dynamic and complex, which brings many challenges. Data mining technology can help people better deal with these challenges. Traditional data mining technology cannot effectively use the massive data in the electricity supplier, it relies on the time-consuming and labour-consuming characteristic engineering, and the obtained model is not scalable. Convolutional neural network can effectively use a large amount of data, and can automatically extract effective features from the original data, with higher availability. In this paper, convolutional neural network is used to mine e-commerce data to achieve the prediction of commodity sales. First, this article combines the inherent nature of the relevant merchandise information with the original cargo log data that can be converted into a specific “data frame” format. Raw log data includes items sold over a long period of time, price, quantity view, browse, search, search, times collected, number of items added to cart, and many other metrics. Then, convolutional neural network is applied to extract effective features on the data frame. Finally, the final layer of the convolutional neural network uses these features to predict sales of goods. This method can automatically extract effective features from the original structured time series data by convolutional neural network, and further use these features to achieve sales forecast. The validity of the proposed algorithm is verified on the real e-commerce data set.











Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Lin, J., Luo, Z., Cheng, X., et al. (2019). Understanding the interplay of social commerce affordances and swift guanxi: An empirical study. Information & Management,56(2), 213–224.
Ramos, A. L., Mazzinghy, D. B., Barbosa, V. S. B., et al. (2019). Evaluation of an iron ore price forecast using a geometric Brownian motion mode. REM-International Engineering Journal,72(1), 9–15.
Yao, Y., & Wang, H. Y. (2019). Optimal subsampling for softmax regression. Statistical Papers,60(2), 235–249.
Vijaya, J., & Sivasankar, E. (2019). An efficient system for customer churn prediction through particle swarm optimization based feature selection model with simulated annealing. Cluster Computing,22(5), 10757–10768.
Stripling, E., vanden Broucke, S., Antonio, K., et al. (2018). Profit maximizing logistic model for customer churn prediction using genetic algorithms. Swarm and Evolutionary Computation,40, 116–130.
Kannadath, B. S., Cen, P., Rowe, J., et al. (2018). Decision tree analysis of pancreatic cyst fluid data for the detection of mucinous cysts: 73. American Journal of Gastroenterology,113, S41–S42.
Bell, D., & Mgbemena, C. (2018). Data-driven agent-based exploration of customer behavior. Simulation,94(3), 195–212.
Sivasankar, E., & Vijaya, J. (2019). A study of feature selection techniques for predicting customer retention in telecommunication sector. International Journal of Business Information Systems,31(1), 1–26.
Wager, S., & Athey, S. (2018). Estimation and inference of heterogeneous treatment effects using random forests. Journal of the American Statistical Association,113(523), 1228–1242.
Mau, S., Pletikosa, I., & Wagner, J. (2018). Forecasting the next likely purchase events of insurance customers: A case study on the value of data-rich multichannel environment. International Journal of Bank Marketing,36(6), 1125–1144.
Mahdavinejad, M. S., Rezvan, M., Barekatain, M., et al. (2018). Machine learning for internet of things data analysis: A survey. Digital Communications and Networks,4(3), 161–175.
Rao, H., Shi, X., Rodrigue, A. K., et al. (2019). Feature selection based on artificial bee colony and gradient boosting decision tree. Applied Soft Computing,74, 634–642.
Wang, J., Lin, L., Zhang, H., et al. (2017). A novel confidence estimation method for heterogeneous implicit feedback. Frontiers of Information Technology & Electronic Engineering,18(11), 1817–1827.
Hanson, J., Paliwal, K., Litfin, T., et al. (2018). Accurate prediction of protein contact maps by coupling residual two-dimensional bidirectional long short-term memory with convolutional neural networks. Bioinformatics,34(23), 4039–4045.
Liberis, E., Veličković, P., Sormanni, P., et al. (2018). Parapred: antibody paratope prediction using convolutional and recurrent neural networks. Bioinformatics,34(17), 2944–2950.
Pham, D. H., & Le, A. C. (2018). Learning multiple layers of knowledge representation for aspect based sentiment analysis. Data & Knowledge Engineering,114, 26–39.
Qiu, X., Suganthan, P. N., & Amaratunga, G. A. J. (2019). Fusion of multiple indicators with ensemble incremental learning techniques for stock price forecasting. Journal of Banking and Financial Technology,3(1), 33–42.
Khaled, A., Ouchani, S., & Chohra, C. (2019). Recommendations-based on semantic analysis of social networks in learning environments. Computers in Human Behavior,101, 435–449.
Kuzovkin, D., Pouli, T., Meur, O. L., et al. (2019). Context in photo albums: Understanding and modeling user behavior in clustering and selection. ACM Transactions on Applied Perception (TAP),16(2), 1–20.
Tong, Wu, Liu, Xinwang, & Qin, Jindong. (2017). A linguistic solution for double large-scale group decision-making in E-commerce. Computers & Industrial Engineering,116, 97–112.
Xu, S. X., & Huang, G. Q. (2017). Efficient multi-attribute multi-unit auctions for B2B E-commerce logistics. Production & Operations Management,26(2), 292–304.
Wang, Dong, Zha, Yong, & Bi, Gongbing. (2018). A meta-analysis of satisfaction-loyalty relationship in e-commerce: sample and measurement characteristics as moderators. Wireless Personal Communications,103(1), 941–962.
Zhu, L., Li, M., Zhang, Z., et al. (2018). Big data mining of users’ energy consumption patterns in the wireless smart grid. IEEE Wireless Communications,25(1), 84–89.
Wu, P. J., & Lin, K. C. (2018). Unstructured big data analytics for retrieving e-commerce logistics knowledge. Telematics and Informatics,35(1), 237–244.
Ortega, J. A., Losada, E., Besteiro, R., et al. (2018). Validation of an autoregressive integrated moving average model for the prediction of animal zone temperature in a weaned piglet building. Biosystems Engineering,174, 231–238.
Guo, Z., Zhao, X., Chen, Y., et al. (2019). Short-term passenger flow forecast of urban rail transit based on GPR and KRR. IET Intelligent Transport Systems,13(9), 1374–1382.
Borkar, T. S., & Karam, L. J. (2019). DeepCorrect: Correcting DNN models against image distortions. IEEE Transactions on Image Processing,28(12), 6022–6034.
Rout, J. K., Choo, K. K. R., Dash, A. K., et al. (2018). A model for sentiment and emotion analysis of unstructured social media text. Electronic Commerce Research,18(1), 181–199.
Gysel, P., Pimentel, J., Motamedi, M., et al. (2018). Ristretto: A framework for empirical study of resource-efficient inference in convolutional neural networks. IEEE Transactions on Neural Networks and Learning Systems,29(11), 5784–5789.
Wan, S., Liang, Y., Zhang, Y., et al. (2018). Deep multi-layer perceptron classifier for behavior analysis to estimate parkinson’s disease severity using smartphones. IEEE Access,6, 36825–36833.
Manogaran, G., & Lopez, D. (2018). Health data analytics using scalable logistic regression with stochastic gradient descent. International Journal of Advanced Intelligence Paradigms,10(1–2), 118–132.
Jiang, X., Pang, Y., Li, X., et al. (2018). Deep neural networks with elastic rectified linear units for object recognition. Neurocomputing,275, 1132–1139.
Yadav, S., & Bist, A. S. (2018). Learning overcomplete representations using leaky linear decoders. International Journal of Digital Information and Wireless Communications,8(3), 174–180.
Seinen, C., & Khouider, B. (2018). Improving the Jacobian free Newton–Krylov method for the viscous–plastic sea ice momentum equation. Physica D: Nonlinear Phenomena,376, 78–93.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
On behalf of all authors, the corresponding author states that there is no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Pan, H., Zhou, H. Study on convolutional neural network and its application in data mining and sales forecasting for E-commerce. Electron Commer Res 20, 297–320 (2020). https://doi.org/10.1007/s10660-020-09409-0
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10660-020-09409-0