Abstract
One approach to assessing the overall opinion polarity (OvOP) of reviews, a concept defined in this paper, is to use supervised machine learning. This paper examines how lexical feature selection and feature generalization, applied to reviews, affect the precision of two probabilistic classifiers (Naïve Bayes and Markov Model) in identifying OvOP. Feature generalization based on hypernymy as provided by WordNet and feature selection based on part-of-speech (POS) tags are evaluated. A ranking criterion is introduced, based on a function of the probability of having positive or negative polarity, which makes it possible to achieve 100% precision at 10% recall. Movie reviews are used for training and testing the probabilistic classifiers, which achieve 80% precision.
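The trade-off between precision and recall described above can be illustrated with a minimal sketch. The abstract does not state the exact ranking function, so this example assumes one plausible choice: reviews are ranked by the absolute log-odds of positive versus negative polarity under a Laplace-smoothed Naïve Bayes model, and only the most confidently scored fraction is classified. The function names, the toy data, and the definition of recall as correct decisions over all test reviews are assumptions made for illustration, not the chapter's method. POS-based feature selection and hypernym generalization are not included.

```python
import math
from collections import Counter

def train_nb(docs):
    """Count token frequencies per class; docs is a list of (tokens, label)."""
    counts = {"pos": Counter(), "neg": Counter()}
    n_docs = Counter()
    for tokens, label in docs:
        counts[label].update(tokens)
        n_docs[label] += 1
    vocab = set(counts["pos"]) | set(counts["neg"])
    return counts, n_docs, vocab

def log_odds(model, tokens):
    """log P(pos | tokens) - log P(neg | tokens), with add-one smoothing."""
    counts, n_docs, vocab = model
    score = math.log(n_docs["pos"]) - math.log(n_docs["neg"])  # class priors
    denom = {c: sum(counts[c].values()) + len(vocab) for c in counts}
    for t in tokens:
        if t not in vocab:
            continue  # tokens unseen in training are skipped
        score += math.log((counts["pos"][t] + 1) / denom["pos"])
        score -= math.log((counts["neg"][t] + 1) / denom["neg"])
    return score

def precision_recall_at_cutoff(model, test_docs, fraction):
    """Classify only the top `fraction` of reviews, ranked by |log-odds|.

    Assumed definitions: precision = correct / classified,
    recall = correct / all test reviews.
    """
    scored = [(log_odds(model, toks), label) for toks, label in test_docs]
    scored.sort(key=lambda s: abs(s[0]), reverse=True)  # most confident first
    k = max(1, round(fraction * len(scored)))
    correct = sum(
        1 for s, label in scored[:k] if ("pos" if s > 0 else "neg") == label
    )
    return correct / k, correct / len(scored)

# Hypothetical toy data, for illustration only.
TRAIN = [
    (["great", "fun"], "pos"),
    (["great", "moving"], "pos"),
    (["dull", "boring"], "neg"),
    (["dull", "slow"], "neg"),
]
TEST = [
    (["great", "fun"], "pos"),
    (["dull", "boring"], "neg"),
    (["fun"], "pos"),
    (["great", "slow"], "neg"),
]

model = train_nb(TRAIN)
for fraction in (0.5, 1.0):
    print(fraction, precision_recall_at_cutoff(model, TEST, fraction))
```

On this toy data, classifying only the more confident half gives a precision of 1.0 at a recall of 0.5, while classifying every review gives 0.75 for both. This mirrors, at a much smaller scale, the abstract's point that restricting decisions to highly ranked reviews buys precision at the cost of recall.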
Copyright information
© 2006 Springer
About this chapter
Cite this chapter
Salvetti, F., Reichenbach, C., Lewis, S. (2006). Opinion Polarity Identification of Movie Reviews. In: Shanahan, J.G., Qu, Y., Wiebe, J. (eds) Computing Attitude and Affect in Text: Theory and Applications. The Information Retrieval Series, vol 20. Springer, Dordrecht. https://doi.org/10.1007/1-4020-4102-0_23
DOI: https://doi.org/10.1007/1-4020-4102-0_23
Publisher Name: Springer, Dordrecht
Print ISBN: 978-1-4020-4026-9
Online ISBN: 978-1-4020-4102-0
eBook Packages: Computer Science, Computer Science (R0)