Abstract
With the development of e-commerce, many products are now being sold worldwide, and manufacturers are eager to obtain a better understanding of customer behavior in various regions. To achieve this goal, most previous efforts have focused mainly on questionnaires, which are time-consuming and costly. The tremendous volume of product reviews on e-commerce Web sites has seen a new trend emerge, whereby manufacturers attempt to understand user preferences by analyzing online reviews. Following this trend, this paper addresses the problem of studying customer behavior by exploiting recently developed opinion mining techniques. This work is novel for three reasons. First, questionnaire-based investigation is automatically enabled by employing algorithms for template-based question generation and opinion mining-based answer extraction. Using this system, manufacturers are able to obtain reports of customer behavior featuring a much larger sample size, more direct information, a higher degree of automation, and a lower cost. Second, international customer behavior study is made easier by integrating tools for multilingual opinion mining. Third, this is the first time an automatic questionnaire investigation has been conducted to compare customer behavior in China and America, where product reviews are written and read in Chinese and English, respectively. Our study on digital cameras, smartphones, and tablet computers yields three findings. First, Chinese customers follow the Doctrine of the Mean and often use euphemistic expressions, while American customers express their opinions more directly. Second, Chinese customers care more about general feelings, while American customers pay more attention to product details. Third, Chinese customers focus on external features, while American customers care more about the internal features of products.







Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Notes
We select the brands with the most two quantity of reviews from Amazon as our data set.
Overall sentiment classification here is similar to document-level sentiment classification.
References
Agarwal B, Poria S, Mittal N, Gelbukh A, Hussain A. Concept-level sentiment analysis with dependency-based semantic parsing: a novel approach. Cogn Comput. 2015;7(4):487–99.
Bagwell LS, Bernheim BD. Veblen effects in a theory of conspicuous consumption. Am Econ Rev. 1996;86(3):349–73.
Blair-Goldensohn S, Hannan K, McDonald R, Neylon T, Reis GA, Reynar J. Building a sentiment summarizer for local service reviews. In: WWW workshop on NLP in the information explosion era, 2008;14.
Blei DM, Ng AY, Jordan MI. Latent dirichlet allocation. J Mach Learn Res. 2003;3:993–1022.
Boulding W, Lee E, Staelin R. Mastering the mix: Do advertising, promotion, and sales force activities lead to differentiation? J Mark Res. 1994;159–72.
Brown JC, Frishkoff GA, Eskenazi M. Automatic question generation for vocabulary assessment. In: Proceedings of the conference on human language technology and empirical methods in natural language processing. Association for Computational Linguistics; 2005. p. 819–26.
Cambria E, Hussain A, Havasi C, Eckl C. Senticspace: visualizing opinions and sentiments in a multi-dimensional vector space. In: Knowledge-based and intelligent information and engineering systems. Berlin: Springer; 2010. p. 385–93.
Cambria E, Schuller B, Liu B, Wang H, Havasi C. Statistical approaches to concept-level sentiment analysis. IEEE Intell Syst. 2013;28(3):6–9.
Chatterjee P. Online reviews: do consumers use them? Adv Consum Res. 2001;28:129–33.
Chikersal P, Poria S, Cambria E, Gelbukh A, Siong CE. Modelling public sentiment in twitter: Using linguistic patterns to enhance supervised learning. In: Computational linguistics and intelligent text processing. Berlin: Springer; 2015. p. 49–65.
Cortes C, Vapnik V. Support-vector networks. Mach Learn. 1995;20(3):273–97.
Das D, Bandyopadhyay S. Sentence-level emotion and valence tagging. Cogn Comput. 2012;4(4):420–35.
Davidov D, Rappoport A. Unsupervised discovery of generic relationships using pattern clusters and its evaluation by automatically generated sat analogy questions. In: ACL. Citeseer; 2008. p. 692–700.
Denecke K. Using sentiwordnet for multilingual sentiment analysis. In: Data engineering workshop, 2008. ICDEW 2008. IEEE 24th international conference on. IEEE; 2008. p. 507–12.
Ding X, Liu B, Yu PS. A holistic lexicon-based approach to opinion mining. In: Proceedings of the 2008 international conference on web search and data mining. ACM; 2008. p. 231–40
Gangemi A, Presutti V, Reforgiato Recupero D. Frame-based detection of opinion holders and topics: a model and a tool. Comput Intell Mag IEEE. 2014;9(1):20–30.
Hoffman M, Bach FR, Blei DM. Online learning for latent dirichlet allocation. In: Advances in neural information processing systems. 2010; p. 856–64.
Hwang W, Jung HS, Salvendy G. Internationalisation of e-commerce: a comparison of online shopping preferences among Korean, Turkish and US populations. Behav Inf Technol. 2006;25(1):3–18.
Jo Y, Oh AH. Aspect and sentiment unification model for online review analysis. In: Proceedings of the fourth ACM international conference on Web search and data mining. ACM; 2011. p. 815–24.
Lancaster HO, Seneta E. Chi-square distribution. New York: Wiley Online Library; 2005.
Lee J, Park DH, Han I. The effect of negative online consumer reviews on product attitude: an information processing view. Electron Commer Res Appl. 2008;7(3):341–52.
Legge J, et al. The confucian analects, the great learning & the doctrine of the mean. Cosimo, Inc. 2009.
Li S, Lee SYM, Chen Y, Huang CR, Zhou G. Sentiment classification and polarity shifting. In: Proceedings of the 23rd international conference on computational linguistics. Association for Computational Linguistics 2010. p. 635–43.
Li X, Xie H, Chen L, Wang J, Deng X. News impact on stock price return via sentiment analysis. Knowl Based Syst. 2014;69:14–23.
Lin C, He Y. Joint sentiment/topic model for sentiment analysis. In: Proceedings of the 18th ACM conference on information and knowledge management. ACM; 2009. p. 375–384.
Lin CY. Automatic question generation from queries. In: Workshop on the question generation shared task, 2008; p. 156–64.
Lin J, Demner-Fushman D. Methods for automatically evaluating answers to complex questions. Inf Retr. 2006;9(5):565–87.
Liu B. Sentiment analysis and opinion mining: synthesis lectures on human language technologies. San Rafael: Morgan & Claypool Publishers; 2012.
Liu M, Calvo RA, Rus V. Automatic question generation for literature review writing support. In: Intelligent tutoring systems. Berlin: Springer; 2010. p. 45–54.
Maas AL, Daly RE, Pham PT, Huang D, Ng AY, Potts C. Learning word vectors for sentiment analysis. In: Proceedings of the 49th annual meeting of the association for computational linguistics: human language technologies-volume 1. Association for Computational Linguistics; 2011. p. 142–150.
Mahmood MA, Bagchi K, Ford TC. On-line shopping behavior: cross-country empirical research. Int J Electron Commer. 2004;9(1):9–30.
McWilliam G. Low involvement brands: is the brand manager to blame? Mark Intell Plann. 1997;15(2):60–70.
Moghaddam S, Ester M. On the design of lda models for aspect-based opinion mining. In: Proceedings of the 21st ACM international conference on information and knowledge management. ACM; 2012. p. 803–12.
Mullen T, Collier N. Sentiment analysis using support vector machines with diverse information sources. In: EMNLP, vol 4. 2004; p. 412–8.
MY W. Comparing consumers’ on-line shopping behaviors in taiwan and the united states. In: Proceedings of the annual meeting of the international communication association 2010.
Myller N. Automatic generation of prediction questions during program visualization. Electron Notes Theor Comput Sci. 2007;178:43–9.
Nakagawa T, Inui K, Kurohashi S. Dependency tree-based sentiment classification using crfs with hidden variables. In: Human language technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Association for Computational Linguistics 2010. p. 786–94.
Pang B, Lee L, Vaithyanathan S. Thumbs up?: sentiment classification using machine learning techniques. In: Proceedings of the ACL-02 conference on empirical methods in natural language processing-volume 10. Association for Computational Linguistics 2002. p. 79–86.
Papasalouros A, Kanaris K, Kotis K. Automatic generation of multiple choice questions from domain ontologies. In: e-Learning. Citeseer 2008. p. 427–34.
Park DH, Kim S. The effects of consumer knowledge on message processing of electronic word-of-mouth via online consumer reviews. Electron Commer Res Appl. 2009;7(4):399–410.
Park DH, Lee J, Han I. The effect of on-line consumer reviews on consumer purchasing intention: the moderating role of involvement. Int J Electron Commer. 2007;11(4):125–48.
Popescu AM, Etzioni O. Extracting product features and opinions from reviews. In: Natural language processing and text mining. Berlin: Springer; 2007. p. 9–28.
Rill S, Reinel D, Scheidt J, Zicari RV. Politwi: early detection of emerging political topics on twitter and the impact on concept-level sentiment analysis. Knowl Based Syst. 2014;69:24–33.
Salton G, Yu CT. On the construction of effective vocabularies for information retrieval. In: ACM SIGPLAN notices, vol 10. ACM; 1973. p. 48–60.
Sumita E, Sugaya F, Yamamoto S. Measuring non-native speakers’ proficiency of english by using a test with automatically-generated fill-in-the-blank questions. In: Proceedings of the second workshop on building educational applications using NLP. Association for Computational Linguistics 2005. p. 61–8.
Taboada M, Brooke J, Tofiloski M, Voll K, Stede M. Lexicon-based methods for sentiment analysis. Comput Linguist. 2011;37(2):267–307.
Thet TT, Na JC, Khoo CS. Aspect-based sentiment analysis of movie reviews on discussion boards. J Inf Sci. 2010;0165551510388123.
Turney PD. Thumbs up or thumbs down?: semantic orientation applied to unsupervised classification of reviews. In: Proceedings of the 40th annual meeting on association for computational linguistics. Association for Computational Linguistics 2002. p. 417–24.
Vermeulen IE, Seegers D. Tried and tested: the impact of online hotel reviews on consumer consideration. Tour Manag. 2009;30(1):123–7.
Warrington P, Shim S. An empirical investigation of the relationship between product involvement and brand commitment. Psychol Mark. 2000;17(9):761–82.
Wei W, Gulla JA. Sentiment learning on product reviews via sentiment ontology tree. In: Proceedings of the 48th annual meeting of the association for computational linguistics. Association for Computational Linguistics 2010. p. 404–13.
Wei W, Gulla JA, Fu Z. Enhancing negation-aware sentiment classification on product reviews via multi-unigram feature generation. In: Advanced intelligent computing theories and applications. Berlin: Springer; 2010. p. 380–91.
Xia R, Wang T, Hu X, Li S, Zong C. Dual training and dual prediction for polarity classification. In: ACL, vol 2. 2013; p. 521–5.
Xia R, Zong C. Exploring the use of word relation features for sentiment classification. In: Proceedings of the 23rd international conference on computational linguistics: posters. Association for Computational Linguistics 2010. p. 1336–44.
Xia R, Zong C, Hu X, Cambria E. Feature ensemble plus sample selection: domain adaptation for sentiment classification. Intell Syst IEEE. 2013;28(3):10–8.
Xia R, Zong C, Li S. Ensemble of feature sets and classification algorithms for sentiment classification. Inf Sci. 2011;181(6):1138–52.
Xia Y, Cambria E, Hussain A, Zhao H. Word polarity disambiguation using bayesian model and opinion-level features. Cogn Comput. 2015;7(3):369–80.
Yau OH. Chinese cultural values: their dimensions and marketing implications. Eur J Mark. 1988;22(5):44–57.
Yessenalina A, Yue Y, Cardie C. Multi-level structured models for document-level sentiment classification. In: Proceedings of the 2010 conference on empirical methods in natural language processing. Association for Computational Linguistics 2010. p. 1046–56.
Yu J, Zha ZJ, Wang M, Chua TS. Aspect ranking: identifying important product aspects from online consumer reviews. In: Proceedings of the 49th annual meeting of the association for computational linguistics: human language technologies, vol 1, HLT ’11. Association for Computational Linguistics, Stroudsburg, PA, USA 2011. p. 1496–505.
Zhao WX, Jiang J, Yan H, Li X. Jointly modeling aspects and opinions with a maxent-lda hybrid. In: Proceedings of the 2010 conference on empirical methods in natural language processing. Association for Computational Linguistics 2010. p. 56–65.
Zhu F, Zhang X. Impact of online consumer reviews on sales: the moderating role of product and consumer characteristics. J Mark. 2010;74(2):133–48.
Acknowledgments
This work is supported by the Major Projects of National Social Science Fund (No. 13&ZD174), the National Social Science Fund Project (No. 14BTQ033), the Natural Science Foundation of China (Nos. 61305090, 61272233), and the Opening Foundation of Alibaba Research Center for Complex Sciences, Hangzhou Normal University (No. PD12001003002003). We thank the reviewers for the insightful comments.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Ethical Standard
All procedures performed in studies involving human participants were in accordance with the ethical standards of the institutional and/or national research committee and with the 1964 Helsinki Declaration and its later amendments or comparable ethical standards.
Conflicts of Interest
Qingqing Zhou, Rui Xia and Chengzhi Zhang declare that they have no conflict of interest.
Informed Consent
All procedures followed were in accordance with the ethical standards of the responsible committee on human experimentation (institutional and national) and with the Helsinki Declaration of 1975, as revised in 2008 (5). Additional informed consent was obtained from all patients for which identifying information is included in this article.
Human and Animal Rights
This article does not contain any studies with animals performed by any of the authors.
Rights and permissions
About this article
Cite this article
Zhou, Q., Xia, R. & Zhang, C. Online Shopping Behavior Study Based on Multi-granularity Opinion Mining: China Versus America. Cogn Comput 8, 587–602 (2016). https://doi.org/10.1007/s12559-016-9384-x
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12559-016-9384-x