Abstract
This paper presents a method of analyzing and visualizing e-commerce sites’ product reviews considering customers’ characteristics. Typically, product review data are unstructured and have no fixed format. Such data can be considered as valuable contents or assets generated by customers. Indeed, they can perform significant roles for customers’ benefit. They provide potential consumers with helpful product information (e.g., characteristics) according to which they can decide whether to buy or not. Also, e-commerce companies can understand customers’ experiences or opinions on a product and reflect them when developing marketing strategies or redesigning the product. In order to provide valuable information to customers from massive and unstructured review data, we propose a comprehensive framework that encompasses the processes of collecting, storing, preprocessing and analyzing review data and, finally, deriving implications therefrom. Text mining and personalization techniques can be applied to the process of extracting and visualizing the review data, as appropriate. Thus, the proposed system, as it considers customer profiles when analyzing and visualizing product reviews, enables customers to utilize review data conveniently in accordance with their interests.
Similar content being viewed by others
References
Cho, G.H., Lim, S.Y., Hur, S.: An analysis of the research methodologies and techniques in the industrial engineering using text mining. J. Korean Inst. Ind. Eng. 40(1), 52–59 (2014)
Collobert, R., Weston, J.: A unified architecture for natural language processing: deep neural networks with multitask learning. In: Proceedings of the 25th International Conference on Machine learning, Helsinki, Finland, pp. 160–167 (2008)
Decker, R., Trusov, M.: Estimating aggregate consumer preferences from online product reviews. Int. J. Res. Mark. 27(4), 293–307 (2010)
Doan, A., Naughton, J.F., Ramakrishnan, R., Baid, A., Chai, X., et al.: Information extraction challenges in managing unstructured data. ACM SIGMOD Rec. 37(4), 14–20 (2009)
Gulalkari, N.: Implementing Apriori Algorithm in R. https://datascienceplus.com/implementing-apriori-algorithm-in-r/ (2017). Accessed 12 Jan 2018
Holton, C.: Identifying disgruntled employee systems fraud risk through text mining: a simple solution for a multi-billion dollar problem. Decis. Support Syst. 46(4), 853–864 (2009)
Hu, M., Liu, B.: Mining and summarizing customer reviews. In: Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 168–177 (2004)
Huang, G., He, J., Zhang, Y., et al.: Mining streams of short text for analysis of world-wide event evolutions. World Wide Web 18(5), 1201–1217 (2015)
Jeon, H.: Package ‘KoNLP’. https://cran.r-project.org/web/packages/KoNLP/KoNLP.pdf (2016). Accessed 12 Jan 2018
Kangale, A., Kumar, S.K., Naeem, M.A., Williams, M., Tiwari, M.K.: Mining consumer reviews to generate ratings of different product attributes while producing feature-based review-summary. Int. J. Syst. Sci. 47(13), 3272–3286 (2016)
Kim, J., Kim, D.: A study on the method for extracting the purpose-specific customized information from online product reviews based on text mining. J. Soc. e-Bus. Stud. 21(2), 151–161 (2016)
Kim, J., Kim, D.: A method for extracting organized information from online product reviews based on text mining. ICIC Express Lett. B 7(10), 2211–2216 (2016)
Kim, J., Lee, W., Song, J.J., et al.: Optimized combinatorial clustering for stochastic processes. Clust. Comput. 20(2), 1135–1148 (2017)
Liu, B., Zhang, L.: A survey of opinion mining and sentiment analysis. In: Mining Text Data, pp. 415–463. Springer, New York (2012)
Marceli, M.: Introduction to text mining with R for information professionals. Code4Lib J. 33. http://journal.code4lib.org/articles/11626 (2016). Accessed 14 Sep 2017
Mooney, R.J., Bunescu, R.: Mining knowledge from text using information extraction. ACM SIGKDD Explor. Newsl. Nat. Lang. Process. Text Min. 7(1), 3–10 (2005)
Park, S.S.H., Song, J.J., Lee, J.J.H., et al.: How to measure similarity for multiple categorical data sets? Multimed. Tools Appl. 74(10), 3489–3505 (2005)
Rajaraman, K., Tan, A.H.: Topic detection, tracking, and trend analysis using self-organizing neural networks. In: Proceedings of the 5th Pacific–Asia Conference on Knowledge Discovery and Data Mining, pp. 102–107 (2001)
Ren, Z.J., Cohen, M.A., Ho, T.H., Terwiesch, C.: Information sharing in a long-term supply chain relationship: the role of customer review strategy. Oper. Res. 58(1), 81–93 (2010)
Riaz, S., Fatima, M., Kamran, M., Wasif Nisar, M.: Opinion mining on large scale data using sentiment analysis and k-means clustering. Clust. Comput. (2017). https://doi.org/10.1007/s10586-017-1077-z
Silge, J., Robinson, D.: Welcome to text mining with R. http://tidytextmining.com/ (2017). Accessed 14 Sep 2017
Sumathy, K.L., Chidambaram, M.: Text mining: concepts, applications, tools and issues—an overview. Int. J. Comput. Appl. 80(4), 29–32 (2013)
Wang, N., Zeng, J., Ye, M., Chen, M.: Text mining and sustainable clusters from unstructured data in cloud computing. Clust. Comput. (2017). https://doi.org/10.1007/s10586-017-0909-1
Woolley, A.W., Chabris, C.F., Pentland, A., Hashmi, N., Malone, T.W.: Evidence for a collective intelligence factor in the performance of human groups. Science 330(6004), 686–688 (2010)
Yadollahi, A., Shahraki, A.G., Zaiane, O.R.: Current state of text sentiment analysis from opinion to emotion mining. ACM Comput. Surv. 50(2), 25:1–25:33 (2017)
Yune, H.J., Kim, H.J., Chang, J.Y.: An efficient search method of product reviews using opinion mining techniques. J. KIISE Comput. Pract. Lett. 16(2), 222–226 (2010)
Acknowledgements
This work was supported by Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education (NRF-2017R1D1A1B05029080).
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Kim, J., Kim, D. Analyzing and visualizing comprehensive and personalized online product reviews. Cluster Comput 22 (Suppl 1), 2115–2128 (2019). https://doi.org/10.1007/s10586-018-2645-6
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10586-018-2645-6