Abstract
This paper addresses the problem of supervised sentiment detection using classifiers which are derived from word features. We argue that, while the literature has suggested the use of lexical features is inappropriate for sentiment detection, a careful and thorough evaluation reveals a less clear-cut state of affairs. We present results from five classifiers using word-based features on three tasks, and show that the variation between classifiers can often be as great as has been reported between different feature sets with a fixed classifier. We are thus led to conclude that classifier choice plays at least as important a role as feature choice, and that in many cases word-based classifiers perform well on the sentiment detection task.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Pang, B., Lee, L., Vaithyanathan, S.: Thumbs up? sentiment classification using machine learning techniques. In: Proceedings of the 2002 Conference on Empirical Methods in Natural Language Processing (EMNLP) (2002)
Turney, P., Littman, M.: Measuring praise and criticism: Inference of semantic orientation from association. ACM Transactions on Information Systems 21(4), 315–346 (2003)
Efron, M.: Cultural orientation: Classifying subjective documents by cociation (sic) analysis. In: Proceedings of the AAAI Fall Symposium on Style and Meaning in Language, Art, Music, and Design, pp. 41–48 (2004)
Mullen, T., Malouf, R.: A preliminary investigation into sentiment analysis for informal political discourse. In: Proceedings of the AAAI Workshop on Analysis of Weblogs (2006)
Thomas, M., Pang, B., Lee, L.: Get out the vote: Determining support or opposition from Congressional floor-debate transcripts. In: Proceedings of EMNLP, pp. 327–335 (2006)
Banko, M., Brill, E.: Mitigating the paucity of data problem: Exploring the effect of training corpus size on classifier performance for nlp. In: Proceedings of the Conference on Human Language Technology (2001)
Stone, P.J., Dunphy, D.C., Smith, M.S., Ogilvie, D.M.: Associates: The General Inquirer: A Computer Approach to Content Analysis. MIT Press, Cambridge (1966)
Lewis, D.D.: Naïve (Bayes) at forty: The independence assumption in information retrieval. In: Proceedings of ECML 1998, pp. 4–15 (1998)
Robertson, S.E., Jones, K.S.: Relevance weighting of search terms. Document retrieval systems, 143–160 (1988)
McCallum, A., Nigam, K.: A comparison of event models for naïve bayes text classification. In: Proceedings AAAI 1998 Workshop on Learning for Text Categorization (1998)
Guthrie, L., Walker, E., Guthrie, J.: Document classification by machine: theory and practice. In: Proceedings COLING 1994, pp. 1059–1063 (1994)
Jansche, M.: Parametric models of linguistic count data. In: ACL 2003, pp. 288–295 (2003)
Dumais, S., Platt, J., Heckerman, D., Sahami, M.: Inductive learning algorithms and representations for text categorization. In: CIKM 1998, pp. 148–155 (1998)
Yang, Y., Liu, X.: A re-examination of text categorization methods. In: 22nd Annual International SIGIR, Berkley, pp. 42–49 (August 1999)
Joachims, T.: Making large-scale svm learning practical. Advances in Kernel Methods– Support Vector Learning (1999)
Rennie, J.D.M., Rifkin, R.: Improving multiclass text classification with the Support Vector Machine. Technical report, Massachusetts Insititute of Technology, Artificial Intelligence Laboratory (2001)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Allison, B. (2008). Sentiment Detection Using Lexically-Based Classifiers. In: Sojka, P., Horák, A., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2008. Lecture Notes in Computer Science(), vol 5246. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-87391-4_5
Download citation
DOI: https://doi.org/10.1007/978-3-540-87391-4_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-87390-7
Online ISBN: 978-3-540-87391-4
eBook Packages: Computer ScienceComputer Science (R0)