Skip to main content
Log in

Online Shopping Behavior Study Based on Multi-granularity Opinion Mining: China Versus America

  • Published:
Cognitive Computation Aims and scope Submit manuscript

Abstract

With the development of e-commerce, many products are now being sold worldwide, and manufacturers are eager to obtain a better understanding of customer behavior in various regions. To achieve this goal, most previous efforts have focused mainly on questionnaires, which are time-consuming and costly. The tremendous volume of product reviews on e-commerce Web sites has seen a new trend emerge, whereby manufacturers attempt to understand user preferences by analyzing online reviews. Following this trend, this paper addresses the problem of studying customer behavior by exploiting recently developed opinion mining techniques. This work is novel for three reasons. First, questionnaire-based investigation is automatically enabled by employing algorithms for template-based question generation and opinion mining-based answer extraction. Using this system, manufacturers are able to obtain reports of customer behavior featuring a much larger sample size, more direct information, a higher degree of automation, and a lower cost. Second, international customer behavior study is made easier by integrating tools for multilingual opinion mining. Third, this is the first time an automatic questionnaire investigation has been conducted to compare customer behavior in China and America, where product reviews are written and read in Chinese and English, respectively. Our study on digital cameras, smartphones, and tablet computers yields three findings. First, Chinese customers follow the Doctrine of the Mean and often use euphemistic expressions, while American customers express their opinions more directly. Second, Chinese customers care more about general feelings, while American customers pay more attention to product details. Third, Chinese customers focus on external features, while American customers care more about the internal features of products.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

Notes

  1. http://www.camarketing.com.

  2. We select the brands with the most two quantity of reviews from Amazon as our data set.

  3. https://catalog.ldc.upenn.edu/LDC2002L27

  4. Overall sentiment classification here is similar to document-level sentiment classification.

  5. https://github.com/fxsjy/jieba.

  6. https://github.com/fxsjy/jieba.

  7. http://www.nltk.org/.

  8. http://ir.dlut.edu.cn/EmotionOntologyDownload.aspx?utm_source=weibolife.

  9. https://hlt.fbk.eu/technologies/sentiwords.

  10. http://radimrehurek.com/gensim/.

References

  1. Agarwal B, Poria S, Mittal N, Gelbukh A, Hussain A. Concept-level sentiment analysis with dependency-based semantic parsing: a novel approach. Cogn Comput. 2015;7(4):487–99.

    Article  Google Scholar 

  2. Bagwell LS, Bernheim BD. Veblen effects in a theory of conspicuous consumption. Am Econ Rev. 1996;86(3):349–73.

    Google Scholar 

  3. Blair-Goldensohn S, Hannan K, McDonald R, Neylon T, Reis GA, Reynar J. Building a sentiment summarizer for local service reviews. In: WWW workshop on NLP in the information explosion era, 2008;14.

  4. Blei DM, Ng AY, Jordan MI. Latent dirichlet allocation. J Mach Learn Res. 2003;3:993–1022.

    Google Scholar 

  5. Boulding W, Lee E, Staelin R. Mastering the mix: Do advertising, promotion, and sales force activities lead to differentiation? J Mark Res. 1994;159–72.

  6. Brown JC, Frishkoff GA, Eskenazi M. Automatic question generation for vocabulary assessment. In: Proceedings of the conference on human language technology and empirical methods in natural language processing. Association for Computational Linguistics; 2005. p. 819–26.

  7. Cambria E, Hussain A, Havasi C, Eckl C. Senticspace: visualizing opinions and sentiments in a multi-dimensional vector space. In: Knowledge-based and intelligent information and engineering systems. Berlin: Springer; 2010. p. 385–93.

  8. Cambria E, Schuller B, Liu B, Wang H, Havasi C. Statistical approaches to concept-level sentiment analysis. IEEE Intell Syst. 2013;28(3):6–9.

    Article  Google Scholar 

  9. Chatterjee P. Online reviews: do consumers use them? Adv Consum Res. 2001;28:129–33.

    Google Scholar 

  10. Chikersal P, Poria S, Cambria E, Gelbukh A, Siong CE. Modelling public sentiment in twitter: Using linguistic patterns to enhance supervised learning. In: Computational linguistics and intelligent text processing. Berlin: Springer; 2015. p. 49–65.

  11. Cortes C, Vapnik V. Support-vector networks. Mach Learn. 1995;20(3):273–97.

    Google Scholar 

  12. Das D, Bandyopadhyay S. Sentence-level emotion and valence tagging. Cogn Comput. 2012;4(4):420–35.

    Article  Google Scholar 

  13. Davidov D, Rappoport A. Unsupervised discovery of generic relationships using pattern clusters and its evaluation by automatically generated sat analogy questions. In: ACL. Citeseer; 2008. p. 692–700.

  14. Denecke K. Using sentiwordnet for multilingual sentiment analysis. In: Data engineering workshop, 2008. ICDEW 2008. IEEE 24th international conference on. IEEE; 2008. p. 507–12.

  15. Ding X, Liu B, Yu PS. A holistic lexicon-based approach to opinion mining. In: Proceedings of the 2008 international conference on web search and data mining. ACM; 2008. p. 231–40

  16. Gangemi A, Presutti V, Reforgiato Recupero D. Frame-based detection of opinion holders and topics: a model and a tool. Comput Intell Mag IEEE. 2014;9(1):20–30.

    Article  Google Scholar 

  17. Hoffman M, Bach FR, Blei DM. Online learning for latent dirichlet allocation. In: Advances in neural information processing systems. 2010; p. 856–64.

  18. Hwang W, Jung HS, Salvendy G. Internationalisation of e-commerce: a comparison of online shopping preferences among Korean, Turkish and US populations. Behav Inf Technol. 2006;25(1):3–18.

    Article  Google Scholar 

  19. Jo Y, Oh AH. Aspect and sentiment unification model for online review analysis. In: Proceedings of the fourth ACM international conference on Web search and data mining. ACM; 2011. p. 815–24.

  20. Lancaster HO, Seneta E. Chi-square distribution. New York: Wiley Online Library; 2005.

    Book  Google Scholar 

  21. Lee J, Park DH, Han I. The effect of negative online consumer reviews on product attitude: an information processing view. Electron Commer Res Appl. 2008;7(3):341–52.

    Article  Google Scholar 

  22. Legge J, et al. The confucian analects, the great learning & the doctrine of the mean. Cosimo, Inc. 2009.

  23. Li S, Lee SYM, Chen Y, Huang CR, Zhou G. Sentiment classification and polarity shifting. In: Proceedings of the 23rd international conference on computational linguistics. Association for Computational Linguistics 2010. p. 635–43.

  24. Li X, Xie H, Chen L, Wang J, Deng X. News impact on stock price return via sentiment analysis. Knowl Based Syst. 2014;69:14–23.

    Article  Google Scholar 

  25. Lin C, He Y. Joint sentiment/topic model for sentiment analysis. In: Proceedings of the 18th ACM conference on information and knowledge management. ACM; 2009. p. 375–384.

  26. Lin CY. Automatic question generation from queries. In: Workshop on the question generation shared task, 2008; p. 156–64.

  27. Lin J, Demner-Fushman D. Methods for automatically evaluating answers to complex questions. Inf Retr. 2006;9(5):565–87.

    Article  Google Scholar 

  28. Liu B. Sentiment analysis and opinion mining: synthesis lectures on human language technologies. San Rafael: Morgan & Claypool Publishers; 2012.

    Google Scholar 

  29. Liu M, Calvo RA, Rus V. Automatic question generation for literature review writing support. In: Intelligent tutoring systems. Berlin: Springer; 2010. p. 45–54.

  30. Maas AL, Daly RE, Pham PT, Huang D, Ng AY, Potts C. Learning word vectors for sentiment analysis. In: Proceedings of the 49th annual meeting of the association for computational linguistics: human language technologies-volume 1. Association for Computational Linguistics; 2011. p. 142–150.

  31. Mahmood MA, Bagchi K, Ford TC. On-line shopping behavior: cross-country empirical research. Int J Electron Commer. 2004;9(1):9–30.

    Google Scholar 

  32. McWilliam G. Low involvement brands: is the brand manager to blame? Mark Intell Plann. 1997;15(2):60–70.

    Article  Google Scholar 

  33. Moghaddam S, Ester M. On the design of lda models for aspect-based opinion mining. In: Proceedings of the 21st ACM international conference on information and knowledge management. ACM; 2012. p. 803–12.

  34. Mullen T, Collier N. Sentiment analysis using support vector machines with diverse information sources. In: EMNLP, vol 4. 2004; p. 412–8.

  35. MY W. Comparing consumers’ on-line shopping behaviors in taiwan and the united states. In: Proceedings of the annual meeting of the international communication association 2010.

  36. Myller N. Automatic generation of prediction questions during program visualization. Electron Notes Theor Comput Sci. 2007;178:43–9.

    Article  Google Scholar 

  37. Nakagawa T, Inui K, Kurohashi S. Dependency tree-based sentiment classification using crfs with hidden variables. In: Human language technologies: the 2010 annual conference of the North American Chapter of the Association for Computational Linguistics. Association for Computational Linguistics 2010. p. 786–94.

  38. Pang B, Lee L, Vaithyanathan S. Thumbs up?: sentiment classification using machine learning techniques. In: Proceedings of the ACL-02 conference on empirical methods in natural language processing-volume 10. Association for Computational Linguistics 2002. p. 79–86.

  39. Papasalouros A, Kanaris K, Kotis K. Automatic generation of multiple choice questions from domain ontologies. In: e-Learning. Citeseer 2008. p. 427–34.

  40. Park DH, Kim S. The effects of consumer knowledge on message processing of electronic word-of-mouth via online consumer reviews. Electron Commer Res Appl. 2009;7(4):399–410.

    Article  Google Scholar 

  41. Park DH, Lee J, Han I. The effect of on-line consumer reviews on consumer purchasing intention: the moderating role of involvement. Int J Electron Commer. 2007;11(4):125–48.

    Article  Google Scholar 

  42. Popescu AM, Etzioni O. Extracting product features and opinions from reviews. In: Natural language processing and text mining. Berlin: Springer; 2007. p. 9–28.

  43. Rill S, Reinel D, Scheidt J, Zicari RV. Politwi: early detection of emerging political topics on twitter and the impact on concept-level sentiment analysis. Knowl Based Syst. 2014;69:24–33.

    Article  Google Scholar 

  44. Salton G, Yu CT. On the construction of effective vocabularies for information retrieval. In: ACM SIGPLAN notices, vol 10. ACM; 1973. p. 48–60.

  45. Sumita E, Sugaya F, Yamamoto S. Measuring non-native speakers’ proficiency of english by using a test with automatically-generated fill-in-the-blank questions. In: Proceedings of the second workshop on building educational applications using NLP. Association for Computational Linguistics 2005. p. 61–8.

  46. Taboada M, Brooke J, Tofiloski M, Voll K, Stede M. Lexicon-based methods for sentiment analysis. Comput Linguist. 2011;37(2):267–307.

    Article  Google Scholar 

  47. Thet TT, Na JC, Khoo CS. Aspect-based sentiment analysis of movie reviews on discussion boards. J Inf Sci. 2010;0165551510388123.

  48. Turney PD. Thumbs up or thumbs down?: semantic orientation applied to unsupervised classification of reviews. In: Proceedings of the 40th annual meeting on association for computational linguistics. Association for Computational Linguistics 2002. p. 417–24.

  49. Vermeulen IE, Seegers D. Tried and tested: the impact of online hotel reviews on consumer consideration. Tour Manag. 2009;30(1):123–7.

    Article  Google Scholar 

  50. Warrington P, Shim S. An empirical investigation of the relationship between product involvement and brand commitment. Psychol Mark. 2000;17(9):761–82.

    Article  Google Scholar 

  51. Wei W, Gulla JA. Sentiment learning on product reviews via sentiment ontology tree. In: Proceedings of the 48th annual meeting of the association for computational linguistics. Association for Computational Linguistics 2010. p. 404–13.

  52. Wei W, Gulla JA, Fu Z. Enhancing negation-aware sentiment classification on product reviews via multi-unigram feature generation. In: Advanced intelligent computing theories and applications. Berlin: Springer; 2010. p. 380–91.

  53. Xia R, Wang T, Hu X, Li S, Zong C. Dual training and dual prediction for polarity classification. In: ACL, vol 2. 2013; p. 521–5.

  54. Xia R, Zong C. Exploring the use of word relation features for sentiment classification. In: Proceedings of the 23rd international conference on computational linguistics: posters. Association for Computational Linguistics 2010. p. 1336–44.

  55. Xia R, Zong C, Hu X, Cambria E. Feature ensemble plus sample selection: domain adaptation for sentiment classification. Intell Syst IEEE. 2013;28(3):10–8.

    Article  CAS  Google Scholar 

  56. Xia R, Zong C, Li S. Ensemble of feature sets and classification algorithms for sentiment classification. Inf Sci. 2011;181(6):1138–52.

    Article  Google Scholar 

  57. Xia Y, Cambria E, Hussain A, Zhao H. Word polarity disambiguation using bayesian model and opinion-level features. Cogn Comput. 2015;7(3):369–80.

    Article  Google Scholar 

  58. Yau OH. Chinese cultural values: their dimensions and marketing implications. Eur J Mark. 1988;22(5):44–57.

    Article  Google Scholar 

  59. Yessenalina A, Yue Y, Cardie C. Multi-level structured models for document-level sentiment classification. In: Proceedings of the 2010 conference on empirical methods in natural language processing. Association for Computational Linguistics 2010. p. 1046–56.

  60. Yu J, Zha ZJ, Wang M, Chua TS. Aspect ranking: identifying important product aspects from online consumer reviews. In: Proceedings of the 49th annual meeting of the association for computational linguistics: human language technologies, vol 1, HLT ’11. Association for Computational Linguistics, Stroudsburg, PA, USA 2011. p. 1496–505.

  61. Zhao WX, Jiang J, Yan H, Li X. Jointly modeling aspects and opinions with a maxent-lda hybrid. In: Proceedings of the 2010 conference on empirical methods in natural language processing. Association for Computational Linguistics 2010. p. 56–65.

  62. Zhu F, Zhang X. Impact of online consumer reviews on sales: the moderating role of product and consumer characteristics. J Mark. 2010;74(2):133–48.

    Article  Google Scholar 

Download references

Acknowledgments

This work is supported by the Major Projects of National Social Science Fund (No. 13&ZD174), the National Social Science Fund Project (No. 14BTQ033), the Natural Science Foundation of China (Nos. 61305090, 61272233), and the Opening Foundation of Alibaba Research Center for Complex Sciences, Hangzhou Normal University (No. PD12001003002003). We thank the reviewers for the insightful comments.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Chengzhi Zhang.

Ethics declarations

Ethical Standard

All procedures performed in studies involving human participants were in accordance with the ethical standards of the institutional and/or national research committee and with the 1964 Helsinki Declaration and its later amendments or comparable ethical standards.

Conflicts of Interest

Qingqing Zhou, Rui Xia and Chengzhi Zhang declare that they have no conflict of interest.

Informed Consent

All procedures followed were in accordance with the ethical standards of the responsible committee on human experimentation (institutional and national) and with the Helsinki Declaration of 1975, as revised in 2008 (5). Additional informed consent was obtained from all patients for which identifying information is included in this article.

Human and Animal Rights

This article does not contain any studies with animals performed by any of the authors.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zhou, Q., Xia, R. & Zhang, C. Online Shopping Behavior Study Based on Multi-granularity Opinion Mining: China Versus America. Cogn Comput 8, 587–602 (2016). https://doi.org/10.1007/s12559-016-9384-x

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s12559-016-9384-x

Keywords

Navigation