Learning to Classify Subjective Sentences from Multiple Domains Using Extended Subjectivity Lexicon and Subjective Predicates

Orimaye, Sylvester Olubolu

doi:10.1007/978-3-642-45068-6_17

Sylvester Olubolu Orimaye²⁰

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 8281))

Included in the following conference series:

Asia Information Retrieval Symposium

1474 Accesses
1 Citations

Abstract

We investigate the performance of subjective predicates and other extended predictive features on subjectivity classification in and across different domains. Our approach constructs a semi-supervised subjective classifier based on an extended subjectivity lexicon that includes subjective annotations resulting from a manually annotated subjectivity corpus, a list of manually constructed subjectivity clues, and a set of subjective predicates learned from a large collection of likely subjective sentences. Using the extended lexicon, we extracted high precision subjective sentences from multiple domains and constructed in-domain and cross-domain subjectivity classifiers. Experimental results on multiple datasets show that the proposed technique performed comparatively better than a high precision subjectivity classification baseline and has improved cross-domain accuracy. We report 97.7% precision, 73.4% recall and 83.8% F-Measure for in-domain subjectivity classification and a accuracy level of 84.6% for cross-domain subjectivity classification.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Wiebe, J., Wilson, T., Cardie, C.: Annotating expressions of opinions and emotions in language. Language Resources and Evaluation 39(2/3), 165–210 (2005)
Article Google Scholar
Liu, B.: Sentiment analysis and opinion mining. Synthesis Lectures on Human Language Technologies 5(1), 1–167 (2012)
Article Google Scholar
Wiebe, J.M., Bruce, R.F., O’Hara, T.P.: Development and use of a gold-standard data set for subjectivity classifications. In: Proceedings of the 37th Annual Meeting of the Association for Computational Linguistics on Computational Linguistics, ACL 1999, pp. 246–253. Association for Computational Linguistics, Stroudsburg (1999)
Chapter Google Scholar
Pang, B., Lee, L.: A sentimental education: sentiment analysis using subjectivity summarization based on minimum cuts. In: Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics, Barcelona, Spain, p. 271. Association for Computational Linguistics (2004)
Google Scholar
Lambov, D., Dias, G., Noncheva, V.: High-level features for learning subjective language across domains. In: Proceedings of International AAAI Conference on Weblogs and Social Media, ICWSM (2009)
Google Scholar
Liu, B.: Sentiment analysis and subjectivity. In: Handbook of Natural Language Processing, 2nd edn. (2010)
Google Scholar
Wiebe, J.M.: Learning subjective adjectives from corpora (2000)
Google Scholar
Hatzivassiloglou, V., McKeown, K.R.: Predicting the semantic orientation of adjectives. In: Proceedings of the Eighth Conference on European Chapter of the Association for Computational Linguistics, Madrid, Spain, pp. 174–181. Association for Computational Linguistics (1997)
Google Scholar
Turney, P., Littman, M.L.: Unsupervised learning of semantic orientation from a hundred-billion-word corpus. Technical report (2002)
Google Scholar
Vechtomova, O.: Using subjective adjectives in opinion retrieval from blogs (2008)
Google Scholar
Riloff, E., Wiebe, J.: Learning extraction patterns for subjective expressions. In: Proceedings of the 2003 Conference on Empirical Methods in Natural Language Processing, EMNLP 2003, pp. 105–112. Association for Computational Linguistics, Stroudsburg (2003)
Google Scholar
Riloff, E.M.: Automatically generating extraction patterns from untagged text (1996)
Google Scholar
Wilson, T., Wiebe, J., Hoffmann, P.: Recognizing contextual polarity in phrase-level sentiment analysis. In: Proceedings of the Conference on Human Language Technology and Empirical Methods in Natural Language Processing, Vancouver, British Columbia, Canada, pp. 347–354. Association for Computational Linguistics (2005)
Google Scholar
Wiebe, J.M.: Recognizing subjective sentences: a computational investigation of narrative text. PhD thesis, Buffalo, NY, USA (1990)
Google Scholar
Blitzer, J., Dredze, M., Pereira, F.: Biographies, bollywood, boom-boxes and blenders: Domain adaptation for sentiment classification. Association of Computational Linguistics (ACL) (2007)
Google Scholar
Domingos, P., Pazzani, M.: On the optimality of the simple bayesian classifier under zero-one loss. Machine Learning 29, 103–130 (1997)
Article MATH Google Scholar
Macdonald, C., Santos, R.L., Ounis, I., Soboroff, I.: Blog track research at trec. SIGIR Forum 44(1), 58–75 (2010)
Article Google Scholar
Torsten, Z., Christof, M., Iryna, G.: Extracting lexical semantic knowledge from wikipedia and wiktionary (2009)
Google Scholar
Peng, F., Schuurmans, D., Wang, S.: Augmenting naive bayes classifiers with statistical language models. Information Retrieval 7(3), 317–345 (2004)
Article Google Scholar
Cui, H., Mittal, V., Datar, M.: Comparative experiments on sentiment classification for online product reviews. American Association for Artificial Intelligence (AAAI) (2006)
Google Scholar

Download references

Author information

Authors and Affiliations

Faculty of Information Technology, MONASH University Sunway Campus, Malaysia
Sylvester Olubolu Orimaye

Authors

Sylvester Olubolu Orimaye
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Institute for Infocomm Research, Human Language Technology, 1 Fusionopolis Way #21-01, Connexis South, 138632, Singapore
Rafael E. Banchs , Min Zhang & Sheng Gao , &
Yahoo Labs, Avinguda Diagonal 177, 08018, Barcelona, Spain
Fabrizio Silvestri
Microsoft Research Asia, No. 5, Danling Street, Haidian District, 100080, Beijing, China
Tie-Yan Liu
Institute for Infocomm Research, Human Language Technology, 1 Fusionopolis Way #21-01, Connexis South,, 138632, Singapore
Jun Lang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Orimaye, S.O. (2013). Learning to Classify Subjective Sentences from Multiple Domains Using Extended Subjectivity Lexicon and Subjective Predicates. In: Banchs, R.E., Silvestri, F., Liu, TY., Zhang, M., Gao, S., Lang, J. (eds) Information Retrieval Technology. AIRS 2013. Lecture Notes in Computer Science, vol 8281. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-45068-6_17

Download citation

DOI: https://doi.org/10.1007/978-3-642-45068-6_17
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-45067-9
Online ISBN: 978-3-642-45068-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics