Skip to main content

Part of the book series: The Information Retrieval Series ((INRE,volume 20))

  • 1435 Accesses

Abstract

One approach to the assessment of overall opinion polarity (OvOP) of reviews, a concept defined in this paper, is the use of supervised machine learning mechanisms. In this paper, the impact of lexical feature selection and feature generalization, applied to reviews, on the precision of two probabilistic classifiers (Naïve Bayes and Markov Model) with respect to OvOP identification is observed. Feature generalization based on hypernymy as provided by WordNet, and feature selection based on part-ofspeech (POS) tags are evaluated. A ranking criterion is introduced, based on a function of the probability of having positive or negative polarity, which makes it possible to achieve 100% precision with 10% recall. Movie reviews are used for training and testing the probabilistic classifiers, which achieve 80% precision.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

12. Bibliography

  • Brill, E. (1995) Transformation-Based Error-Driven Learning and Natural Language Processing: A Case Study in Part-of-Speech Tagging. Computational Linguistics, 21(4):543–565.

    Google Scholar 

  • Duda, R. O. and Hart, P. E. (1973) Pattern Classification and Scene Anaylsis. A Wiley Interscience Publication, New York.

    Google Scholar 

  • Fellbaum, C. (1998) Wordnet: An Electronic Lexical Database. The MIT Press.

    Google Scholar 

  • Hatzivassiloglou, V. and McKeown, K. R. (1997) Predicting the Semantic Orientation of Adjectives. In Cohen, P. R. and Wahlster. W. (Ed.) Proceedings of the Thirty-Fifth Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics. 174–181. Association for Computational Linguistics.

    Google Scholar 

  • Hatzivassiloglou, V., and Wiebe, J. (2000) Effects of Adjective Orientation and Gradability on Sentence Subjectivity. Proceedings of the 18th International Conference in Computational Linguistics.

    Google Scholar 

  • Jurafsky, D. and Martin, J. H. (2000) Speech and Language Processing. Prenctice Hall.

    Google Scholar 

  • Leeper, M. R. (1995) Review of Apollo 13, Usenet rec.arts.movies.reviews.

    Google Scholar 

  • Losee, R. M. (2001) Natural Language Processing in Support of Decision-making: Phrases and Part-of-Speech Tagging. Information Processing and Management, 37(6):769–787.

    Article  MATH  Google Scholar 

  • Marcus, M. P., Santorini, B. and Marcinkiewicz, M. A. (1994) Building a Large Annotated Corpus of English: The Penn Treebank. Computational Linguistics, 19(2):313–330.

    Google Scholar 

  • Pang, B., Lee, L. and Vaithyanathan, S. (2002) Thumbs up? Sentiment Classification using Machine Learning Techniques. Proceedings of the 2002 Conference on Empirical Methods in Natural Language Processing (EMNLP).

    Google Scholar 

  • Shanahan, J. G., Roma, N. (2003) Improving SVM Text Classification Performance through Threshold Adjustment. European Conference on Machine Learning (ECML) 2003, 361–372.

    Google Scholar 

  • Turney, P. (2002) Thumbs up or Thumbs down? Semantic Orientation applied to Unsupervised Classification of Reviews. Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics (ACL02), 417–424.

    Google Scholar 

  • Turney, P. and Littman, M. (2003) Measuring Praise and Criticism: Inference of Semantic Orientation from Association. ACM Transactions on Information Systems (TOIS), 21(4):315–346.

    Google Scholar 

  • Wiebe, J., Bruce, R. F. and O’Hara, T. (1999) Development and Use of a Gold-Standard Data Set for Subjectivity Classifications. Proceedings of the 37th Annual Meeting of the Association for Computational Linguistics (ACL99), 223–287.

    Google Scholar 

  • Wiebe, J. (2000) Learning Subjective Adjectives from Corpora. Proceedings of the 17th National Conference on Artificial Intelligence and 12th Conference on Innovative Application of Artificial Intelligence, 735–740. AAAI Press / The MIT Press.

    Google Scholar 

  • Zhai, C., Jansen, P., Stoica, E., Grot, N., Evans, D.A. (1999) Threshold Calibration in CLARIT Adaptive Filtering. Seventh Text Retrieval Conference (TREC-7), 149–156.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer

About this chapter

Cite this chapter

Salvetti, F., Reichenbach, C., Lewis, S. (2006). Opinion Polarity Identification of Movie Reviews. In: Shanahan, J.G., Qu, Y., Wiebe, J. (eds) Computing Attitude and Affect in Text: Theory and Applications. The Information Retrieval Series, vol 20. Springer, Dordrecht. https://doi.org/10.1007/1-4020-4102-0_23

Download citation

  • DOI: https://doi.org/10.1007/1-4020-4102-0_23

  • Publisher Name: Springer, Dordrecht

  • Print ISBN: 978-1-4020-4026-9

  • Online ISBN: 978-1-4020-4102-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics