DOI: 10.1145/1643823.1643921
research-article

Comparing semantic associations in sentences and paragraphs for opinion detection in blogs

Published: 27 October 2009 Publication History

Abstract

Opinion detection is one of the most interesting and challenging tasks in the field of Information Retrieval. A large body of research already exists in this area, including some distinctive contributions. A review of this research reveals that researchers have worked at different levels of granularity (documents, passages, sentences and words) for the task of opinion detection. In this work we revise our previous approach, which combines document-level heuristics with a semantic-similarity-based method. We evaluate this semantic similarity approach on a large data collection using three different setups involving both sentences and passages, and then compare the performance of our approach across these setups. For evaluation, we use the TREC Blog 2006 collection (148 GB) with the 50 topics of TREC Blog 2006, over a baseline obtained through the Terrier Information System Platform. Results show that our approach improves the baseline opinion MAP by 28.89%, 30.13% and 32.26% using setups one, two and three respectively.
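
The improvements above are relative changes in mean average precision (MAP) over a baseline run. As a rough illustration of how such a figure is conventionally derived, the following sketch computes average precision per topic, averages it into MAP, and reports the percentage change between two runs. It is not the authors' evaluation code (TREC runs are normally scored with the trec_eval tool); the function names, document ids and toy judgments are all hypothetical.

```python
from typing import Dict, List, Set


def average_precision(ranked: List[str], relevant: Set[str]) -> float:
    """Average precision of one ranked list against the relevant doc ids."""
    if not relevant:
        return 0.0
    hits = 0
    precision_sum = 0.0
    for rank, doc_id in enumerate(ranked, start=1):
        if doc_id in relevant:
            hits += 1
            precision_sum += hits / rank  # precision at this relevant hit
    return precision_sum / len(relevant)


def mean_average_precision(
    runs: Dict[str, List[str]], qrels: Dict[str, Set[str]]
) -> float:
    """Mean of per-topic average precision over every topic in qrels."""
    if not qrels:
        return 0.0
    total = sum(
        average_precision(runs.get(topic, []), relevant)
        for topic, relevant in qrels.items()
    )
    return total / len(qrels)


def relative_improvement(baseline: float, system: float) -> float:
    """Percentage change of the system score over the baseline score."""
    return (system - baseline) / baseline * 100.0


# Hypothetical toy data: two topics, opinionated documents as "relevant".
qrels = {"t1": {"a", "c"}, "t2": {"x"}}
baseline_run = {"t1": ["b", "a", "c"], "t2": ["y", "x"]}
reranked_run = {"t1": ["a", "c", "b"], "t2": ["x", "y"]}

baseline_map = mean_average_precision(baseline_run, qrels)
reranked_map = mean_average_precision(reranked_run, qrels)
improvement = relative_improvement(baseline_map, reranked_map)
print(f"baseline MAP={baseline_map:.4f}, reranked MAP={reranked_map:.4f}, "
      f"improvement={improvement:.2f}%")
```

Under this convention, a reported improvement of 28.89% means that the re-ranked opinion MAP is about 1.29 times the baseline MAP, not that MAP rose by 28.89 absolute points.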

      Published In

      MEDES '09: Proceedings of the International Conference on Management of Emergent Digital EcoSystems
      October 2009
      525 pages
      ISBN:9781605588292
      DOI:10.1145/1643823

      Sponsors

      • The French Chapter of ACM Special Interest Group on Applied Computing

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Author Tags

      1. blogs
      2. opinion detection
      3. passages
      4. semantic relatedness
      5. sentences

      Qualifiers

      • Research-article

      Conference

      MEDES '09

      Acceptance Rates

      Overall Acceptance Rate 267 of 682 submissions, 39%
