Application of Semi-supervised Learning to Evaluative Expression Classification

Suzuki, Yasuhiro; Takamura, Hiroya; Okumura, Manabu

doi:10.1007/11671299_52

Application of Semi-supervised Learning to Evaluative Expression Classification

Yasuhiro Suzuki¹⁷,
Hiroya Takamura¹⁸ &
Manabu Okumura¹⁸

Conference paper

1421 Accesses
11 Citations

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 3878))

Abstract

We propose to use semi-supervised learning methods to classify evaluative expressions, that is, tuples of subjects, their attributes, and evaluative words, that indicate either favorable or unfavorable opinions towards a specific subject. Due to its characteristics, the semi-supervised method that we use can classify evaluative expressions in a corpus by their polarities. This can be accomplished starting from a very small set of seed training examples and using contextual information in the sentences to which the expressions belong. Our experimental results with actual Weblog data show that this bootstrapping approach can improve the accuracy of methods for classifying favorable and unfavorable opinions.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Pang, B., Lee, L., Vaithyanathan, S.: Thumbs up? sentiment classification using machine learning techniques. In: EMNLP 2002, pp. 76–86 (2002)
Google Scholar
Dave, K., Lawrence, S., Pennock, D.M.: Mining the peanut gallery: Opinion extraction and semantic classification of product reviews. In: 12th WWW Conference, pp. 519–528 (2003)
Google Scholar
Cristianini, N., Shawe-Taylor, J.: An Introduction to Support Vector Machines (and other kernel-based learning methods). Cambridge University Press, Cambridge (2000)
Google Scholar
Nigam, K., McCallum, A., Thrun, S., Mitchell, T.: Text classification from labeled and unlabeled documents using EM. Machine Learning 39, 103–134 (2000)
Article MATH Google Scholar
Dempster, A.P., Laird, N.M., Rubin, D.B.: Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society Series B 39, 1–38 (1977)
MATH MathSciNet Google Scholar
Hatzivassiloglou, V., McKeown, K.R.: Predicting the semantic orientation of adjectives. In: 35th ACL, pp. 174–181 (1997)
Google Scholar
Turney, P.D.: Thumbs up? thumbs down? semantic orientation applied to unsupervised classification of reviews. In: 40th ACL, pp. 417–424 (2002)
Google Scholar
Kamps, J., Marx, M., Mokken, R.J., de Rijke, M.: Using wordnet to measure semantic orientations of adjectives. In: 4th LREC, pp. 1115–1118 (2004)
Google Scholar
Kim, S.M., Hovy, E.: Determining the sentiment of opinions. In: 20th COLING, pp. 1367–1373 (2004)
Google Scholar
Kudo, T., Matsumoto, Y.: A boosting algorithm for classification of semi-structured text. In: EMNLP 2004, pp. 301–308 (2004)
Google Scholar
Wilson, T., Wiebe, J., Hwa, R.: Just how mad are you? finding strong and weak opinion clauses. In: 19th AAAI (2004)
Google Scholar
Pang, B., Lee, L.: A sentimental education: Sentiment analysis using subjectivity summarization based on minimum cuts. In: 42nd ACL, pp. 271–278 (2004)
Google Scholar
Riloff, E., Wiebe, J.: Learning extraction patterns for subjective expressions. In: EMNLP 2003, pp. 105–112 (2003)
Google Scholar
Blum, A., Mitchell, T.: Combining labeled and unlabeled data with co-training. In: Proceedings of the Workshop on Computational Learning Theory, pp. 92–100 (1998)
Google Scholar
Hofmann, T., Puzicha, J.: Statistical models for co-occurrence data. Technical Report AIM-1625, Artifical Intelligence Laboratory, Massachusetts Institute of Technology (1998)
Google Scholar
Jaakkola, T., Haussler, D.: Exploiting generative models in discriminative classifiers. NIPS 11, 487–493 (1998)
Google Scholar
Takamura, H., Okumura, M.: A comparative study on the use of labeled and unlabeled data for large margin classifiers. In: 1st IJCNLP 2004, pp. 620–625 (2004)
Google Scholar
Nanno, T., Fujiki, T., Suzuki, Y., Okumura, M.: Automatically collecting, monitoring, and mining japanese weblogs. In: 13th WWW Conference, pp. 320–321 (2004)
Google Scholar
Ikehara, S., Miyazaki, M., Shirai, S., Yokoo, A., Nakaiwa, H., Ogura, K., Ooyama, Y., Hayashi, Y.: Goi-Taikei – A Japanese Lexicon. Iwanami Shoten (1997)
Google Scholar
Wiebe, J.: Instructions for annotating opinions in newspaper articles. Technical report, University of Pittsburgh Technical Report, TR-02-101 (2002)
Google Scholar
Tanaka, Y., Takamura, H., Okumura, M.: Extraction and classification of facemarks with kernel methods. In: IUI 2005, pp. 28–34 (2005)
Google Scholar

Download references

Author information

Authors and Affiliations

Tokyo Institute of Technology, Interdisciplinary Graduate School of Science and Engineering, 4259 Nagatsuta Midori-ku, Yokohama, 226-8503, Japan
Yasuhiro Suzuki
Precision and Intelligence Laboratory, Tokyo Institute of Technology, 4259 Nagatsuta Midori-ku, Yokohama, 226-8503, Japan
Hiroya Takamura & Manabu Okumura

Authors

Yasuhiro Suzuki
View author publications
You can also search for this author in PubMed Google Scholar
Hiroya Takamura
View author publications
You can also search for this author in PubMed Google Scholar
Manabu Okumura
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

National Polytechnic Institute, Center for Computing Research, 07738, Mexico City, México
Alexander Gelbukh

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Suzuki, Y., Takamura, H., Okumura, M. (2006). Application of Semi-supervised Learning to Evaluative Expression Classification. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2006. Lecture Notes in Computer Science, vol 3878. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11671299_52

Download citation

DOI: https://doi.org/10.1007/11671299_52
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-32205-4
Online ISBN: 978-3-540-32206-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics