Abstract
We investigate the performance of subjective predicates and other extended predictive features on subjectivity classification in and across different domains. Our approach constructs a semi-supervised subjective classifier based on an extended subjectivity lexicon that includes subjective annotations resulting from a manually annotated subjectivity corpus, a list of manually constructed subjectivity clues, and a set of subjective predicates learned from a large collection of likely subjective sentences. Using the extended lexicon, we extracted high precision subjective sentences from multiple domains and constructed in-domain and cross-domain subjectivity classifiers. Experimental results on multiple datasets show that the proposed technique performed comparatively better than a high precision subjectivity classification baseline and has improved cross-domain accuracy. We report 97.7% precision, 73.4% recall and 83.8% F-Measure for in-domain subjectivity classification and a accuracy level of 84.6% for cross-domain subjectivity classification.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Wiebe, J., Wilson, T., Cardie, C.: Annotating expressions of opinions and emotions in language. Language Resources and Evaluation 39(2/3), 165–210 (2005)
Liu, B.: Sentiment analysis and opinion mining. Synthesis Lectures on Human Language Technologies 5(1), 1–167 (2012)
Wiebe, J.M., Bruce, R.F., O’Hara, T.P.: Development and use of a gold-standard data set for subjectivity classifications. In: Proceedings of the 37th Annual Meeting of the Association for Computational Linguistics on Computational Linguistics, ACL 1999, pp. 246–253. Association for Computational Linguistics, Stroudsburg (1999)
Pang, B., Lee, L.: A sentimental education: sentiment analysis using subjectivity summarization based on minimum cuts. In: Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics, Barcelona, Spain, p. 271. Association for Computational Linguistics (2004)
Lambov, D., Dias, G., Noncheva, V.: High-level features for learning subjective language across domains. In: Proceedings of International AAAI Conference on Weblogs and Social Media, ICWSM (2009)
Liu, B.: Sentiment analysis and subjectivity. In: Handbook of Natural Language Processing, 2nd edn. (2010)
Wiebe, J.M.: Learning subjective adjectives from corpora (2000)
Hatzivassiloglou, V., McKeown, K.R.: Predicting the semantic orientation of adjectives. In: Proceedings of the Eighth Conference on European Chapter of the Association for Computational Linguistics, Madrid, Spain, pp. 174–181. Association for Computational Linguistics (1997)
Turney, P., Littman, M.L.: Unsupervised learning of semantic orientation from a hundred-billion-word corpus. Technical report (2002)
Vechtomova, O.: Using subjective adjectives in opinion retrieval from blogs (2008)
Riloff, E., Wiebe, J.: Learning extraction patterns for subjective expressions. In: Proceedings of the 2003 Conference on Empirical Methods in Natural Language Processing, EMNLP 2003, pp. 105–112. Association for Computational Linguistics, Stroudsburg (2003)
Riloff, E.M.: Automatically generating extraction patterns from untagged text (1996)
Wilson, T., Wiebe, J., Hoffmann, P.: Recognizing contextual polarity in phrase-level sentiment analysis. In: Proceedings of the Conference on Human Language Technology and Empirical Methods in Natural Language Processing, Vancouver, British Columbia, Canada, pp. 347–354. Association for Computational Linguistics (2005)
Wiebe, J.M.: Recognizing subjective sentences: a computational investigation of narrative text. PhD thesis, Buffalo, NY, USA (1990)
Blitzer, J., Dredze, M., Pereira, F.: Biographies, bollywood, boom-boxes and blenders: Domain adaptation for sentiment classification. Association of Computational Linguistics (ACL) (2007)
Domingos, P., Pazzani, M.: On the optimality of the simple bayesian classifier under zero-one loss. Machine Learning 29, 103–130 (1997)
Macdonald, C., Santos, R.L., Ounis, I., Soboroff, I.: Blog track research at trec. SIGIR Forum 44(1), 58–75 (2010)
Torsten, Z., Christof, M., Iryna, G.: Extracting lexical semantic knowledge from wikipedia and wiktionary (2009)
Peng, F., Schuurmans, D., Wang, S.: Augmenting naive bayes classifiers with statistical language models. Information Retrieval 7(3), 317–345 (2004)
Cui, H., Mittal, V., Datar, M.: Comparative experiments on sentiment classification for online product reviews. American Association for Artificial Intelligence (AAAI) (2006)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Orimaye, S.O. (2013). Learning to Classify Subjective Sentences from Multiple Domains Using Extended Subjectivity Lexicon and Subjective Predicates. In: Banchs, R.E., Silvestri, F., Liu, TY., Zhang, M., Gao, S., Lang, J. (eds) Information Retrieval Technology. AIRS 2013. Lecture Notes in Computer Science, vol 8281. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-45068-6_17
Download citation
DOI: https://doi.org/10.1007/978-3-642-45068-6_17
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-45067-9
Online ISBN: 978-3-642-45068-6
eBook Packages: Computer ScienceComputer Science (R0)