Exploring overall opinions for document level sentiment classification with structural SVM

Pu, Xiaojia; Wu, Gangshan; Yuan, Chunfeng

doi:10.1007/s00530-017-0550-0

Special Issue Paper
Published: 19 April 2017

Multimedia Systems, Volume 25, pages 21–33 (2019)

Abstract

As a fundamental task of sentiment analysis, document-level sentiment classification aims to predict a user’s overall sentiment (e.g., positive or negative) towards the target of a document. A document usually consists of opinion sentences that address different aspects with different sentiments, so the overall opinion towards the whole target should play a more important role in document sentiment prediction. However, most existing methods treat all sentences of the document equally and therefore tend to fail when the sentiments of most aspect opinion sentences disagree with the overall sentiment. To address this, we propose a novel method for document sentiment classification that explicitly exploits overall opinion sentences. First, multiple features are used to recognize candidate overall opinion sentences; then a structural SVM encodes these sentences for document sentiment classification. Experiments on several publicly available datasets, including product reviews and movie reviews, show the effectiveness of our method.
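The two-step idea in the abstract can be illustrated with a minimal sketch. It is not the authors' formulation, which the abstract does not spell out. It assumes a latent-variable view in which the index `h` of the overall opinion sentence is hidden, and a hypothetical joint feature map `psi(x, y, h) = y * sentences[h]`. The candidate list, the feature vectors, and the weight vector `w` are all placeholders that a real system would obtain from the candidate recognition step and from training.

```python
from typing import List, Sequence, Tuple

Vector = Sequence[float]


def dot(a: Vector, b: Vector) -> float:
    """Plain dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def predict(sentences: List[Vector], candidates: List[int], w: Vector) -> Tuple[int, int]:
    """Jointly pick a document label and a latent overall-opinion sentence.

    Assumed (hypothetical) joint feature map: psi(x, y, h) = y * sentences[h],
    with y in {+1, -1} and h restricted to the candidate sentence indices.
    Returns the (y, h) pair that maximizes w . psi(x, y, h).
    """
    if not candidates:
        raise ValueError("at least one candidate sentence is required")
    best_score, best_y, best_h = None, 0, -1
    for h in candidates:
        s = dot(w, sentences[h])
        for y in (1, -1):
            score = y * s
            if best_score is None or score > best_score:
                best_score, best_y, best_h = score, y, h
    return best_y, best_h
```

The design point this sketch tries to capture is that only the selected sentence contributes to the document score, so aspect sentences whose sentiment disagrees with the overall opinion cannot outvote it.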

Notes

This is a review of Canon S100 in the dataset: http://www.cs.uic.edu/~liub/FBS/Reviews-9-products.rar.
https://github.com/oscartackstrom/sentence-sentiment-data.
https://www.cs.uic.edu/~liub/FBS/sentiment-analysis.html.
http://snap.stanford.edu/data/web-Amazon.html.
http://www.cs.cornell.edu/people/pabo/movie-review-data/.
http://ai.stanford.edu/~amaas/data/sentiment.
http://mpqa.cs.pitt.edu/lexicons/subj_lexicon
The improvement of our method is statistically significant (paired t test, p < 0.05).
Hyperparameter tuning has a great influence on the performance of LSTM; we follow [40, 41], and our result is comparable with [40] and better than [41].
The architecture of GRU differs across works, e.g., [31, 40, 41]; we follow [40, 41], and after fine-tuning our result is better than both of them.
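
The paired t test mentioned in the notes above compares two methods on the same evaluation units (for example, the same cross-validation folds). A minimal sketch using only the Python standard library follows; the function name and its inputs are illustrative assumptions, and the notes do not say which units the authors paired.

```python
import math
from statistics import mean, stdev
from typing import Sequence, Tuple


def paired_t_statistic(a: Sequence[float], b: Sequence[float]) -> Tuple[float, int]:
    """Return the paired t statistic and its degrees of freedom.

    a[i] and b[i] must be scores of two methods on the same unit i.
    t = mean(d) / (stdev(d) / sqrt(n)), where d[i] = a[i] - b[i].
    """
    if len(a) != len(b) or len(a) < 2:
        raise ValueError("need two equal-length samples with at least 2 pairs")
    diffs = [x - y for x, y in zip(a, b)]
    sd = stdev(diffs)  # sample standard deviation (n - 1 in the denominator)
    if sd == 0:
        raise ValueError("differences have zero variance; t is undefined")
    t = mean(diffs) / (sd / math.sqrt(len(diffs)))
    return t, len(diffs) - 1
```

The p value is then read from the t distribution with the returned degrees of freedom; a statistics library can compute it directly.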

References

Sang, J., Xu, C.: Right buddy makes the difference: an early exploration of social relation analysis in multimedia applications. In: ACM International Conference on Multimedia, pp. 19–28. ACM (2012)
Pang, B., Lee, L.: Opinion mining and sentiment analysis. Found. Trends Inf. Retr. 2(1–2), 1–135 (2008)
McAuley, J., Leskovec, J.: Hidden factors and hidden topics: understanding rating dimensions with review text. In: Proceedings of the 7th ACM conference on Recommender systems, pp. 165–172. ACM (2013)
Sang, J., Xu, C., Liu, J.: User-aware image tag refinement via ternary semantic analysis. IEEE Trans. Multimed. 14(3), 883–895 (2012)
Sang, J.: User-centric cross-osn multimedia computing. In: Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM ’15, Brisbane, Australia, October 26–30, 2015, pp. 1333–1334 (2015). doi:10.1145/2733373.2807423
Liu, B.: Sentiment analysis and opinion mining. Synth. Lect. Hum. Lang. Technol. 5(1), 1–167 (2012). doi:10.2200/S00416ED1V01Y201204HLT016
Fang, Q., Xu, C., Sang, J., Hossain, M.S., Ghulam, M.: Word-of-mouth understanding: Entity-centric multimodal aspect-opinion mining in social media. IEEE Trans. Multimed. 17(12), 2281–2296 (2015). doi:10.1109/TMM.2015.2491019
Turney, P.: Thumbs up or thumbs down? semantic orientation applied to unsupervised classification of reviews. In: Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics (ACL), pp. 417–424 (2002)
Taboada, M., Brooke, J., Tofiloski, M., Voll, K., Stede, M.: Lexicon-based methods for sentiment analysis. Comput. Linguist. 37(2), 267–307 (2011)
Pang, B., Lee, L., Vaithyanathan, S.: Thumbs up?: sentiment classification using machine learning techniques. In: Proceedings of the ACL-02 conference on empirical methods in natural language processing, vol. 10, pp. 79–86. Association for Computational Linguistics (2002)
Wang, S., Manning, C.: Baselines and bigrams: simple, good sentiment and topic classification. In: Proceedings of the 50th annual meeting of the association for computational linguistics (volume 2: Short Papers), pp. 90–94. Association for Computational Linguistics, Jeju Island, Korea (2012)
Yu, C.N.J., Joachims, T.: Learning structural svms with latent variables. In: Proceedings of the 26th annual international conference on machine learning, pp. 1169–1176. ACM (2009)
Hu, M., Liu, B.: Mining and summarizing customer reviews. In: Proceedings of the Tenth ACM SIGKDD international conference on knowledge discovery and data mining, KDD ’04. ACM, New York, pp. 168–177 (2004). doi:10.1145/1014052.1014073
Ding, X., Liu, B., Yu, P.S.: A holistic lexicon-based approach to opinion mining. In: Proceedings of the 2008 International Conference on Web Search and Data Mining, WSDM ’08, pp. 231–240. ACM, New York (2008). doi:10.1145/1341531.1341561
Kiritchenko, S., Zhu, X., Mohammad, S.M.: Sentiment analysis of short informal texts. J. Artif. Int. Res. 50(1), 723–762 (2014)
Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)
Lu, Y., Zhai, C.: Opinion integration through semi-supervised topic modeling. In: Proceedings of the 17th International Conference on World Wide Web, WWW ’08, pp. 121–130. ACM, New York (2008). doi:10.1145/1367497.1367514
Sang, J., Xu, C.: Browse by chunks: topic mining and organizing on web-scale social media. ACM Trans. Multimed. Comput. Commun. Appl. 7(1), 30 (2011)
Lakkaraju, H., Bhattacharyya, C., Bhattacharya, I., Merugu, S.: Exploiting coherence for the simultaneous discovery of latent facets and associated sentiments. In: Proceedings of the 2011 SIAM International Conference on Data Mining (2011)
Lin, C., He, Y.: Joint sentiment/topic model for sentiment analysis. In: Proceedings of the 18th ACM conference on Information and knowledge management, pp. 375–384. ACM (2009)
Pang, B., Lee, L.: A sentimental education: Sentiment analysis using subjectivity summarization based on minimum cuts. In: Proceedings of the 42nd Annual Meeting of the Association for Computational Linguistics (ACL) (2004)
McDonald, R., Hannan, K., Neylon, T., Wells, M., Reynar, J.: Structured models for fine-to-coarse sentiment analysis. In: Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics, pp. 432–439. Association for Computational Linguistics, Prague, Czech Republic (2007)
Yessenalina, A., Yue, Y., Cardie, C.: Multi-level structured models for document-level sentiment classification. In: Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, pp. 1046–1056. Association for Computational Linguistics, Cambridge, MA (2010)
Mikolov, T., Sutskever, I., Chen, K., Corrado, G.S., Dean, J.: Distributed representations of words and phrases and their compositionality. In: Burges, C., Bottou, L., Welling, M., Ghahramani, Z., Weinberger, K. (eds.) Advances in neural information processing systems, vol. 26, pp. 3111–3119. Curran Associates Inc, New York, USA (2013)
Baroni, M., Dinu, G., Kruszewski, G.: Don’t count, predict! a systematic comparison of context-counting vs. context-predicting semantic vectors. In: Proceedings of the ACL 2014, pp. 238–247. Association for Computational Linguistics (2014)
Maas, A.L., Daly, R.E., Pham, P.T., Huang, D., Ng, A.Y., Potts, C.: Learning word vectors for sentiment analysis. In: Proceedings of the 49th annual meeting of the association for computational linguistics: human language technologies, pp. 142–150. Association for Computational Linguistics, Portland, Oregon, USA (2011)
Labutov, I., Lipson, H.: Re-embedding words. In: Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, pp. 489–493 (2013)
Tang, D., Wei, F., Yang, N., Zhou, M., Liu, T., Qin, B.: Learning sentiment-specific word embedding for twitter sentiment classification. In: Proceedings of the ACL 2014, pp. 1555–1565 (2014)
Le, Q., Mikolov, T.: Distributed representations of sentences and documents. In: Proceedings of the 31st international conference on machine learning, pp. 1188–1196 (2014)
Li, J., Luong, M.T., Jurafsky, D.: A hierarchical neural autoencoder for paragraphs and documents. In: Proceedings of the ACL 2015, pp. 1106–1115 (2015)
Tang, D., Qin, B., Liu, T.: Document modeling with gated recurrent neural network for sentiment classification. In: Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, pp. 1422–1432. Association for Computational Linguistics (2015)
Jin, W., Ho, H.H., Srihari, R.K.: Opinionminer: a novel machine learning system for web opinion mining and extraction. In: Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining, pp. 1195–1204. ACM (2009)
Jakob, N., Gurevych, I.: Extracting opinion targets in a single-and cross-domain setting with conditional random fields. In: Proceedings of the 2010 conference on empirical methods in natural language processing, pp. 1035–1045. Association for Computational Linguistics (2010)
Moghaddam, S., Ester, M.: On the design of lda models for aspect-based opinion mining. In: Proceedings of the 21st ACM International Conference on Information and Knowledge Management, CIKM ’12, pp. 803–812. ACM, New York (2012). doi:10.1145/2396761.2396863
Becker, I., Aharonson, V.: Last but definitely not least: On the role of the last sentence in automatic polarity-classification. In: Proceedings of the ACL 2010 Conference Short Papers, pp. 331–335 (2010)
Joachims, T., Finley, T., Yu, C.N.J.: Cutting-plane training of structural svms. Mach. Learn. 77(1), 27–59 (2009). doi:10.1007/s10994-009-5108-8
Täckström, O., McDonald, R.: Discovering fine-grained sentiment with latent variable structured prediction models. In: Advances in information retrieval, pp. 368–374. Springer, Heidelberg (2011)
Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997)
Cho, K., Van Merriënboer, B., Gulcehre, C., Bahdanau, D., Bougares, F., Schwenk, H., Bengio, Y.: Learning phrase representations using rnn encoder-decoder for statistical machine translation. arXiv preprint arXiv:1406.1078 (2014)
Zhang, Y., Er, M.J., Venkatesan, R., Wang, N., Pratama, M.: Sentiment classification using comprehensive attention recurrent models. In: 2016 International joint conference on neural networks (IJCNN), pp. 1562–1569 (2016). doi:10.1109/IJCNN.2016.7727384
Gao, Y., Glowacka, D.: Deep gate recurrent neural network. JMLR Workshop Conf. Proc. 6, 350–365 (2016)

Acknowledgements

We thank the reviewers for their helpful suggestions. This work is supported by the National Science Foundation of China under Grant No. 61321491 and by the Collaborative Innovation Center of Novel Software Technology and Industrialization.

Author information

Authors and Affiliations

State Key Laboratory for Novel Software Technology, Department of Computer Science and Technology, Nanjing University, Nanjing, China
Xiaojia Pu, Gangshan Wu & Chunfeng Yuan

Corresponding author

Correspondence to Xiaojia Pu.

About this article

Cite this article

Pu, X., Wu, G. & Yuan, C. Exploring overall opinions for document level sentiment classification with structural SVM. Multimedia Systems 25, 21–33 (2019). https://doi.org/10.1007/s00530-017-0550-0

Published: 19 April 2017
Issue Date: 14 February 2019
DOI: https://doi.org/10.1007/s00530-017-0550-0
