Abstract
Online product reviews are considered a significant information resource useful for both potential customers and product manufacturers. In order to extract the fundamental product aspects and their associated sentiments from those reviews of plain texts, aspect-based sentiment analysis has emerged and has been regarded as a promising technology. This paper proposes a novel model to realize aspect-based sentiment summarization in an integrative way: composing the system with consistently designed feature extraction and clustering, collocation orientation disambiguation, and sentence sentiment strength calculation. Collocations of product features and opinion words are initially extracted through pattern-based bootstrapping. A novel confidence estimation method considering two measurements, Prevalence and Reliability, is exploited to assess both patterns and features. The obtained features are further clustered into aspects. Each cluster is assigned a weight based on arithmetic means of feature similarities and confidences. The orientations of dynamic sentiment ambiguous adjectives (DSAAs) are then determined within opinion collocations. Finally, sentiment strengths of opinion clauses for each aspect are computed according to a set of fine-grained and stratified scoring formulae. Experimental results on a benchmark data set validates the effectiveness of the proposed model.




Similar content being viewed by others
References
Agichtein E (2000) Confidence estimation methods for partially supervised information extraction. In: Proceedings 6th SIAM international conference on data mining, pp 539–543
Baccianella S, Esuli A, Sebastiani F (2009) Multi-facet rating of product reviews. Adv Inf Retr 5478:461–472
Beineke P, Hastie T, Manning C, Vaithyanathan S (2004) Exploring sentiment summarization. In: AAAI spring symposium on exploring attitude and affect in text: theories and applications
Brin S (1998) Extracting patterns and relations from the world wide web. In: Proceedings WebDB workshop at 6th international conference on extending database technology, pp 172–183
Carenini G, Ng RT, Zwart E (2005) Extracting knowledge from evaluative text. In: Proceedings 3rd international conference on knowledge capture, pp 11–18
Ding X, Liu B, Yu PS (2008) A holistic lexicon-based approach to opinion mining. In: Proceedings 1st international conference on web search and web data mining, pp 231–239
Greenwood MA, Stevenson M (2006) Improving semi-supervised acquisition of relation extraction patterns. In: Proceedings workshop on information extraction beyond the document, pp 29–35
Hu M, Liu B (2004) Mining and summarizing customer reviews. In: Proceedings 10th international conference on knowledge discovery and data minning, pp 168–177
Jo Y, Oh A (2010) Aspect and sentiment unification model for online review analysis. In: Proceedings 4th ACM international conference on web search and data mining, pp 815–824
Lakkaraju H, Bhattacharyya C, Bhattacharya I (2011a) Exploiting coherence for the simultaneous discovery of latent facets and associated sentiments. In: Proceedings international conference on data mining, pp 498–509
Lakkaraju H, Bhattacharyya C, Bhattacharya I (2011b) Exploiting coherence for the simultaneous discovery of latent facets and associated sentiments. In: Proceedings 2011 SIAM international conference on data mining, pp 498–509
Lu Y, Zhai C, Sundaresan N (2009) Rated aspect summarization of short comments. In: Proceedings 18th international conference on world wide web, pp 131– 140
Mei Q, Ling X, Wondra M, Su H, Zhai C (2007) Topic sentiment mixture: modeling facets and opinions in weblogs. In: Proceedings 16th international conference on world wide web, pp 171– 180
Moghaddam S, Ester M (2010) Opinion digger: an unsupervised opinion miner from unstructured product reviews. In: Proceedings 19th international conference on information and knowledge management, pp 1825–1828
Pang B, Lee L (2005) Seeing stars: exploiting class relationships for sentiment categorization with respect to rating scales. In: Proceedings 43rd annual meeting of the association for computational linguistics, pp 115–124
Pang B, Lee L, Vaithyanathan S (2002) Thumbs up?: sentiment classification using machine learning techniques. In: Proceedings ACL-02 conference on empirical methods in natural language processing-volume, vol 10, pp 79–86
Pantel P, Pennacchiotti M (2006) Espresso: Leveraging generic patterns for automatically harvesting semantic relations. In: Proceedings 46th annual meeting of the association for computational linguistics, pp 113–120
Popescu A, Etzioni O (2005) Extracting product features and opinions from reviews. In: Proceedings human language technology conference and conference on empirical methods in natural language processing, pp 339–346
Qu L, Ifrim G, Weikum G (2010) The bag-of-opinions method for review rating prediction from sparse text patterns. In: Proceedings 23rd international conference on computational linguistics, pp 913– 921
Riloff E, Jones R (1999) Learning dictionaries for information extraction by multi-level bootstrapping. In: Proceedings 16th national conference on artificial intelligence, pp 474– 479
Sudo SKS, Grishman R (2003) An improved extraction pattern representation model for automatic ie pattern acquisition. In: Proceedings 43rd annual meeting of the association for computational linguistics, pp 224–231
Thet T T, Na J C, Khoo C S (2010) Aspect-based sentiment analysis of movie reviews on discussion boards. J Inf Sci 36:823–848
Titov I, McDonald R (2008a) A joint model of text and aspect ratings for sentiment summarization. In: Proceedings 46th annual meeting of the association for computational linguistics, pp 308– 316
Titov I, McDonald R (2008b) Modeling online reviews with multi-grain topic models. In: Proceedings 17th international conference on world wide web, pp 111–120
Wilson T, Wiebe J, Hoffmann P (2005) Recognizing contextual polarity in phrase-level sentiment analysis. In: Proceedings human language technology conference and conference on empirical methods in natural language processing, pp 347–354
Wu Y, Wen M (2010) Disambiguating dynamic sentiment ambiguous adjectives. In: Proceedings 23rd international conference on computational linguistics, pp 1191– 1199
Xu F, Uszkoreit H, Li H (2007) A seed-driven bottom-up machine learning framework for extracting relations of various complexity. In: Proceedings 47th annual meeting of the association for computational linguistics, pp 584–591
Xu F, Uszkoreit H, Krause S, Li H (2010) Boosting relation extraction with limited closed-world knowledge. In: Proceedings 23rd international conference on computational linguistics, pp 1354– 1362
Yangarber R (2001) Scenarion customization for information extraction. PhD thesis, Department of Computer Science, Graduate School of Arts and Science. New York University, New York
Zhuang L, Jing F, Zhu X (2006) Movie review mining and summarization. In: Proceedings 15th international conference on information and knowledge management, pp 43– 50
Acknowledgments
This work was supported by 111 Project of China under Grant No. B08004, key project of ministry of science and technology of China under Grant No. 2011ZX03002-005-01, National Natural Science Foundation of China (61273217) and the Ph.D. Programs Foundation of Ministry of Education of China (20130005110004).
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Li, Y., Qin, Z., Xu, W. et al. A holistic model of mining product aspects and associated sentiments from online reviews. Multimed Tools Appl 74, 10177–10194 (2015). https://doi.org/10.1007/s11042-014-2158-0
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-014-2158-0