Abstract
In the era of big data, huge number of product reviews has been posted to online social media. Accordingly, mining consumers’ sentiments about products can generate valuable business intelligence for enhancing management’s decision-making. The main contribution of our research is the design of a novel methodology that extracts consumers’ sentiments over topics of product reviews (i.e., product aspects) to enhance sales predicting performance. In particular, consumers’ daily sentiments embedded in the online reviews over latent topics are extracted through the joint sentiment topic model. Finally, the sentiment distributions together with other quantitative features are applied to predict sales volume of the following period. Based on a case study conducted in one the largest e-commerce companies in China, our empirical tests show that sentiments over topics together with other quantitative features can more accurately predict sales volume when compared with using quantitative features alone.
Similar content being viewed by others
References
Alfaro, C., Cano-Montero, J., Gómez, J., Moguerza, J. M., & Ortega, F. (2016). A multi-stage method for content classification and opinion mining on weblog comments. Annals of Operations Research, 236(1), 197–213.
Amrouche, N., & Yan, R. (2016). A manufacturer distribution issue: How to manage an online and a traditional retailer. Annals of Operations Research, 244(2), 257–294.
Ansuj, A. P., Camargo, M. E., Radharamanan, R., & Petry, D. G. (1996). Sales forecasting using time series and neural networks. Computers and Industrial Engineering, 31(1), 421–424.
Berger, J., Sorensen, A. T., & Rasmussen, S. J. (2010). Positive effects of negative publicity: When negative reviews increase sales. Marketing Science, 29(5), 815–827.
Blei, D. M., Ng, A. Y., & Jordan, M. I. (2003). Latent dirichlet allocation. Journal of Machine Learning Research, 3, 993–1022.
Boiy, E., & Moens, M. F. (2009). A machine learning approach to sentiment analysis in multilingual Web texts. Information Retrieval, 12(5), 526–558.
Chan, S. W., & Franklin, J. (2011). A text-based decision support system for financial sequence prediction. Decision Support Systems, 52(1), 189–198.
Choi, T. M., Hui, C. L., Liu, N., Ng, S. F., & Yu, Y. (2014). Fast fashion sales forecasting with limited data and time. Decision Support Systems, 59, 84–92.
Chung, C., Niu, S. C., & Sriskandarajah, C. (2012). A sales forecast model for short-life-cycle products: New releases at Blockbuster. Production and Operations Management, 21(5), 851–873.
Dang, Y., Zhang, Y., & Chen, H. (2010). A lexicon-enhanced method for sentiment classification: An experiment on online product reviews. IEEE Intelligent Systems, 25(4), 46–53.
Das, S. R., & Chen, M. Y. (2007). Yahoo! for Amazon: Sentiment extraction from small talk on the web. Management Science, 53(9), 1375–1388.
Daskin, M. S., Coullard, C. R., & Shen, Z. J. M. (2002). An inventory-location model: Formulation, solution algorithm and computational results. Annals of Operations Research, 110(1–4), 83–106.
Dumais, S. T., Furnas, G. W., Landauer, T. K., Deerwester, S., & Harshman, R. (1988). Using latent semantic analysis to improve access to textual information. In Proceedings of the SIGCHI conference on human factors in computing systems, (pp. 281–285). ACM.
Forman, C., Ghose, A., & Wiesenfeld, B. (2008). Examining the relationship between reviews and sales: The role of reviewer identity disclosure in electronic markets. Information Systems Research, 19(3), 291–313.
Gandomi, A., & Haider, M. (2015). Beyond the hype: Big data concepts, methods, and analytics. International Journal of Information Management, 35(2), 137–144.
Grandon, E. E., & Pearson, J. M. (2004). Electronic commerce adoption: An empirical study of small and medium US businesses. Information and Management, 42(1), 197–216.
Ha, S. H., yong Bae, S., & Son, L. K. (2015). Impact of online consumer reviews on product sales: Quantitative analysis of the source effect. Applied Mathematics and Information Sciences, 9(2L), 373–387.
Han, S., Fu, Y., Cao, B., & Luo, Z. (2016). Pricing and bargaining strategy of e-retail under hybrid operational patterns. Annals of Operations Research. doi:10.1007/s10479-016-2214-4.
Häubl, G., & Trifts, V. (2000). Consumer decision making in online shopping environments: The effects of interactive decision aids. Marketing Science, 19(1), 4–21.
Hofmann, T. (1999). Probabilistic latent semantic indexing. In Proceedings of the 22nd annual international ACM SIGIR conference on research and development in information retrieval, (pp. 50–57). ACM.
Hu, N., Liu, L., & Zhang, J. J. (2008). Do online reviews affect product sales? The role of reviewer characteristics and temporal effects. Information Technology and Management, 9(3), 201–214.
Huang, T., & Van Mieghem, J. A. (2014). Clickstream data and inventory management: Model and empirical analysis. Production and Operations Management, 23(3), 333–347.
Johnson, E. J., Moe, W. W., Fader, P. S., Bellman, S., & Lohse, G. L. (2004). On the depth and dynamics of online search behavior. Management Science, 50(3), 299–308.
Klopping, I. M., & McKinney, E. (2004). Extending the technology acceptance model and the task-technology fit model to consumer e-commerce. Information Technology, Learning, and Performance Journal, 22(1), 35.
Kuo, R. J., & Xue, K. C. (1999). Fuzzy neural networks with application to sales forecasting. Fuzzy Sets and Systems, 108(2), 123–143.
Lau, R. Y., Zhang, W., Bruza, P. D., & Wong, K. F. (2011, October). Learning domain-specific sentiment lexicons for predicting product sales. In Proceedings of the 8th international conference on e-business engineering, (pp. 131–138). IEEE.
Lee, P. M. (2002). Behavioral model of online purchasers in e-commerce environment. Electronic Commerce Research, 2(1–2), 75–85.
Liao, Z., & Cheung, M. T. (2001). Internet-based e-shopping and consumer attitudes: An empirical study. Information and Management, 38(5), 299–306.
Limayem, M., Khalifa, M., & Frini, A. (2000). What makes consumers buy from Internet? A longitudinal study of online shopping. IEEE Transactions on Systems, Man, and Cybernetics-Part A: Systems and Humans, 30(4), 421–432.
Lin, C., & He, Y. (2009, November). Joint sentiment/topic model for sentiment analysis. In Proceedings of the 18th ACM conference on information and knowledge management, (pp. 375–384). ACM.
Lin, C., He, Y., Everson, R., & Ruger, S. (2012). Weakly supervised joint sentiment-topic detection from text. IEEE Transactions on Knowledge and Data Engineering, 24(6), 1134–1145.
Liu, Y., Li, H., Peng, G., Lv, B., & Zhang, C. (2015). Online purchaser segmentation and promotion strategy selection: Evidence from Chinese e-commerce market. Annals of Operations Research, 233(1), 263–279.
Luxhøj, J. T., Riis, J. O., & Stensballe, B. (1996). A hybrid econometric-neural network modeling approach for sales forecasting. International Journal of Production Economics, 43(2), 175–192.
Mei, Q., Ling, X., Wondra, M., Su, H., & Zhai, C. (2007, May). Topic sentiment mixture: modeling facets and opinions in weblogs. In Proceedings of the 16th international conference on World Wide Web, (pp. 171–180). ACM.
Melville, P., Gryc, W., & Lawrence, R. D. (2009). Sentiment analysis of blogs by combining lexical knowledge with text classification. In Proceedings of the 15th ACM SIGKDD international conference on knowledge discovery and data mining, (pp. 1275–1284). ACM.
Mudambi, S. M., & Schuff, D. (2010). What makes a helpful review? A study of customer reviews on Amazon.com. MIS Quarterly, 34(1), 185–200.
Nasukawa, T., & Yi, J. (2003, October). Sentiment analysis: Capturing favorability using natural language processing. In Proceedings of the 2nd international conference on knowledge capture, (pp. 70–77). ACM.
Pang, B., Lee, L., & Vaithyanathan, S. (2002). Thumbs up?: Sentiment classification using machine learning techniques. In Proceedings of the ACL-02 conference on empirical methods in natural language processing, (Vol. 10, pp. 79–86). Association for Computational Linguistics.
Park, D. H., & Kim, S. (2009). The effects of consumer knowledge on message processing of electronic word-of-mouth via online consumer reviews. Electronic Commerce Research and Applications, 7(4), 399–410.
Prabowo, R., & Thelwall, M. (2009). Sentiment analysis: A combined approach. Journal of Informetrics, 3(2), 143–157.
Raju, C. V. L., Narahari, Y., & Ravikumar, K. (2006). Learning dynamic prices in electronic retail markets with customer segmentation. Annals of Operations Research, 143(1), 59–75.
Schafer, J. B., Konstan, J. A., & Riedl, J. (2001). E-commerce recommendation applications. In Applications of data mining to electronic commerce, (pp. 115–153). Springer US.
Schumaker, R. P., & Chen, H. (2010). A discrete stock price prediction engine based on financial news. Computer, 43(1), 51–56.
Schumaker, R. P., Zhang, Y., Huang, C. N., & Chen, H. (2012). Evaluating sentiment in financial news articles. Decision Support Systems, 53(3), 458–464.
Srinivasan, S. S., Anderson, R., & Ponnavolu, K. (2002). Customer loyalty in e-commerce: An exploration of its antecedents and consequences. Journal of Retailing, 78(1), 41–50.
Sun, Z. L., Choi, T. M., Au, K. F., & Yu, Y. (2008). Sales forecasting using extreme learning machine with applications in fashion retailing. Decision Support Systems, 46(1), 411–419.
Sundaram, D. S., Mitra, K., & Webster, C. (1998). Word-of-mouth communications: A motivational analysis. Advances in Consumer Research, 25(1), 527–531.
Taboada, M., Brooke, J., Tofiloski, M., Voll, K., & Stede, M. (2011). Lexicon-based methods for sentiment analysis. Computational Linguistics, 37(2), 267–307.
Tan, S., & Zhang, J. (2008). An empirical study of sentiment analysis for Chinese documents. Expert Systems with Applications, 34(4), 2622–2629.
Thiesing, F. M., & Vornberger, O. (1997). Sales forecasting using neural networks. In Proceedings of international conference on neural networks, (Vol. 4, pp. 2125–2128). IEEE.
Vanauer, M., Böhle, C., & Hellingrath, B. (2015). Guiding the introduction of big data in organizations: A methodology with business-and data-driven ideation and enterprise architecture management-based implementation. In Proceedings of the 48th Hawaii international conference on system sciences, (pp. 908–917). IEEE.
Wan, X. (2008). Using bilingual knowledge and ensemble techniques for unsupervised Chinese sentiment analysis. In Proceedings of the conference on empirical methods in natural language processing, (pp. 553–561). Association for Computational Linguistics.
Wang, J. C., Chiu, C. C., & Tang, J. J. (2005). The correlation study of eWOM and product sales predictions through SNA perspectives: an exploratory investigation by Taiwan’s cellular phone market. In Proceedings of the 7th international conference on electronic commerce, (pp. 666–673). ACM.
Wilson, T., Wiebe, J., & Hoffmann, P. (2005, October). Recognizing contextual polarity in phrase-level sentiment analysis. In Proceedings of the conference on human language technology and empirical methods in natural language processing, (pp. 347–354). Association for Computational Linguistics.
Xiao, B., & Benbasat, I. (2007). E-commerce product recommendation agents: Use, characteristics, and impact. MIS Quarterly, 31(1), 137–209.
Xiao, B., Feng, Y., & Roche, E. (2005). Uncertain switching costs and purchase decisions in electronic markets. Annals of Operations Research, 135(1), 179–196.
Ye, Q., Lin, B., & Li, Y. J. (2005). Sentiment classification for Chinese reviews: A comparison between SVM and semantic approaches. In: Proceedings of international conference on machine learning and cybernetics, (Vol. 4, pp. 2341–2346). IEEE.
Yuan, H., Xu, W., & Wang, M. (2014). Can online user behavior improve the performance of sales prediction in e-commerce?. In Proceedings of IEEE international conference on systems, man, and cybernetics, (pp. 2347–2352). IEEE.
Zhu, F., & Zhang, X. (2010). Impact of online consumer reviews on sales: The moderating role of product and consumer characteristics. Journal of Marketing, 74(2), 133–148.
Acknowledgements
This work was supported in part by National Natural Science Foundation of China (Grant No. 71301163), Humanities and Social Sciences Foundation of the Ministry of Education (No. 14YJA630075, 15YJA630068), Hebei Social Science Fund (HB13GL021), and the Fundamental Research Funds for the Central Universities, and the Research Funds of Renmin University of China (No. 15XNLQ08). Lau’s work was supported by a grant from the Research Grants Council of the Hong Kong Special Administrative Region, China (Project: CityU 11502115), and the Shenzhen Municipal Science and Technology R&D Funding - Basic Research Program (Project No. JCYJ20160229165300897).
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Yuan, H., Xu, W., Li, Q. et al. Topic sentiment mining for sales performance prediction in e-commerce. Ann Oper Res 270, 553–576 (2018). https://doi.org/10.1007/s10479-017-2421-7
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10479-017-2421-7