skip to main content
10.1145/2339530.2339592acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

Entity-centric topic-oriented opinion summarization in twitter

Published: 12 August 2012 Publication History

Abstract

Microblogging services, such as Twitter, have become popular channels for people to express their opinions towards a broad range of topics. Twitter generates a huge volume of instant messages (i.e. tweets) carrying users' sentiments and attitudes every minute, which both necessitates automatic opinion summarization and poses great challenges to the summarization system. In this paper, we study the problem of opinion summarization for entities, such as celebrities and brands, in Twitter. We propose an entity-centric topic-based opinion summarization framework, which aims to produce opinion summaries in accordance with topics and remarkably emphasizing the insight behind the opinions. To this end, we first mine topics from #hashtags, the human-annotated semantic tags in tweets. We integrate the #hashtags as weakly supervised information into topic modeling algorithms to obtain better interpretation and representation for calculating the similarity among them, and adopt Affinity Propagation algorithm to group #hashtags into coherent topics. Subsequently, we use templates generalized from paraphrasing to identify tweets with deep insights, which reveal reasons, express demands or reflect viewpoints. Afterwards, we develop a target (i.e. entity) dependent sentiment classification approach to identifying the opinion towards a given target (i.e. entity) of tweets. Finally, the opinion summary is generated through integrating information from dimensions of topic, opinion and insight, as well as other factors (e.g. topic relevancy, redundancy and language styles) in an unified optimization framework. We conduct extensive experiments on a real-life data set to evaluate the performance of individual opinion summarization modules as well as the quality of the produced summary. The promising experiment results show the effectiveness of the proposed framework and algorithms.

Supplementary Material

JPG File (310_m_talk_6.jpg)
MP4 File (310_m_talk_6.mp4)

References

[1]
L. Barbosa and J. Feng. Robust sentiment detection on twitter from biased and noisy data. In Proc. COLING, pages 36--44, 2010.
[2]
A. Ben-Hur and J. Weston. A user's guide to support vector machines. Methods in Molecular Biology, 609:223--239, 2010.
[3]
D. M. Blei and J. McAuliffe. Supervised topic models. Advances in Neural Information Processing Systems, 20:121--128, 2008.
[4]
C. Callison-Burch. Syntactic constraints on paraphrases extracted from parallel corpora. In Proc. EMNLP, October 2008.
[5]
C. Chang and C. Lin. Libsvm: a library for support vector machines. ACM Transactions on Intelligent Systems and Technology (TIST), 2(3):27, 2011.
[6]
J. Cheung, G. Carenini, and R. Ng. Optimization-based content selection for opinion summarization. In Proceedings of the 2009 Workshop on Language Generation and Summarisation, pages 7--14, 2009.
[7]
Y. Choi and C. Cardie. Learning with compositional semantics as structural inference for subsentential sentiment analysis. In Proc. EMNLP, pages 793--801, 2008.
[8]
D. Davidov, O. Tsur, and A. Rappoport. Enhanced sentiment learning using twitter hashtags and smileys. In Proc. COLING, pages 241--249, 2010.
[9]
B. Frey and D. Dueck. Clustering by passing messages between data points. Science, 315(5814):972, 2007.
[10]
K. Gimpel, N. Schneider, B. O'Connor, D. Das, D. Mills, J. Eisenstein, M. Heilman, D. Yogatama, J. Flanigan, and N. A. Smith. Part-of-speech tagging for twitter: annotation, features, and experiments. In Proc. ACL-HLT, pages 42--47, 2011.
[11]
M. Hu and B. Liu. Mining and summarizing customer reviews. In Proc. KDD, pages 168--177, 2004.
[12]
M. Hu and B. Liu. Mining opinion features in customer reviews. In Proc. AAAI, 2004.
[13]
L. Jiang, M. Yu, M. Zhou, X. Liu, and T. Zhao. Target-dependent twitter sentiment classification. In Proc. ACL-HLT, pages 151--160, 2011.
[14]
H. Kim, k. Ganesan, P. Sondhi, and C. Zhai. Comprehensive review of opinion summarization.
[15]
C. Lin. ROUGE: A package for automatic evaluation of summaries. In Proceedings of the workshop on text summarization branches out (WAS 2004), pages 25--26, 2004.
[16]
B. Liu. Opinion mining. In Encyclopedia of Database Systems, pages 1986--1990. 2009.
[17]
C. Manning, P. Raghavan, and H. Schütze. Introduction to information retrieval. 2008.
[18]
R. McDonald, K. Hannan, T. Neylon, M. Wells, and J. Reynar. Structured models for fine-to-coarse sentiment analysis. In Proc. ACL, pages 432--439, 2007.
[19]
Q. Mei, X. Ling, M. Wondra, H. Su, and C. Zhai. Topic sentiment mixture: modeling facets and opinions in weblogs. In Proc. WWW, pages 171--180, 2007.
[20]
T. Nakagawa, K. Inui, and S. Kurohashi. Dependency tree-based sentiment classification using crfs with hidden variables. In Proc. NAACL-HLT, pages 786--794, 2010.
[21]
B. O'Connor, R. Balasubramanyan, B. Routledge, and N. Smith. From tweets to polls: Linking text sentiment to public opinion time series. In Proc. ICWSM, pages 122--129, 2010.
[22]
B. Pang and L. Lee. Opinion mining and sentiment analysis. Found. Trends Inf. Retr., 2:1--135, January 2008.
[23]
D. Ramage, D. Hall, R. Nallapati, and C. Manning. Labeled lda: A supervised topic model for credit attribution in multi-labeled corpora. In Proc. EMNLP, pages 248--256, 2009.
[24]
M. Resende and R. Werneck. A hybrid heuristic for the p-median problem. Journal of Heuristics, 10(1):59--88, 2004.
[25]
I. Titov and R. McDonald. A joint model of text and aspect ratings for sentiment summarization. In Proc. ACL-HLT, pages 308--316, June 2008.
[26]
I. Titov and R. McDonald. Modeling online reviews with multi-grain topic models. In Proc. WWW, pages 111--120, 2008.
[27]
X. Wang, F. Wei, X. Liu, M. Zhou, and M. Zhang. Topic sentiment analysis in twitter: A graph-based hashtag sentiment classification approach. In Proc. CIKM, 2011.
[28]
L. Zhuang, F. Jing, and X. Zhu. Movie review mining and summarization. In Proc. CIKM, pages 43--50, 2006.

Cited By

View all
  • (2024)Probabilistic Keyphrase Generation From Copy and Generating SpacesIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2023.329078935:11(15956-15970)Online publication date: Nov-2024
  • (2024)Training with One2MultiSeq: CopyBART for social media keyphrase generationThe Journal of Supercomputing10.1007/s11227-024-06050-880:11(15517-15544)Online publication date: 1-Jul-2024
  • (2024)Trends and challenges in sentiment summarization: a systematic review of aspect extraction techniquesKnowledge and Information Systems10.1007/s10115-024-02075-w66:7(3671-3717)Online publication date: 9-May-2024
  • Show More Cited By

Index Terms

  1. Entity-centric topic-oriented opinion summarization in twitter

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    KDD '12: Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining
    August 2012
    1616 pages
    ISBN:9781450314626
    DOI:10.1145/2339530
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 12 August 2012

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. #hashtag
    2. Twitter
    3. opinion summarization
    4. sentiment analysis
    5. topic analysis

    Qualifiers

    • Research-article

    Conference

    KDD '12
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

    Upcoming Conference

    KDD '25

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)16
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 08 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Probabilistic Keyphrase Generation From Copy and Generating SpacesIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2023.329078935:11(15956-15970)Online publication date: Nov-2024
    • (2024)Training with One2MultiSeq: CopyBART for social media keyphrase generationThe Journal of Supercomputing10.1007/s11227-024-06050-880:11(15517-15544)Online publication date: 1-Jul-2024
    • (2024)Trends and challenges in sentiment summarization: a systematic review of aspect extraction techniquesKnowledge and Information Systems10.1007/s10115-024-02075-w66:7(3671-3717)Online publication date: 9-May-2024
    • (2023)Adlandırılmış Varlık Tanıma Modelleri ile Türkçe Sosyal Medya Metinlerinde Küfürlü Sözlerin SansürlenmesiCensorship of Profanity Words in Turkish Social Media Texts with Named Entity Recognition ModelsAfyon Kocatepe University Journal of Sciences and Engineering10.35414/akufemubid.111578623:1(72-88)Online publication date: 1-Mar-2023
    • (2023)Assessing causality among topics and sentiments: The case of the G20 discussion on TwitterJournal of Information Science10.1177/01655515231160034Online publication date: 30-Mar-2023
    • (2022)GCN-based document representation for keyphrase generation enhanced by maximizing mutual informationKnowledge-Based Systems10.1016/j.knosys.2022.108488243:COnline publication date: 11-May-2022
    • (2022)Mining association rules from COVID-19 related twitter data to discover word patterns, topics and inferencesInformation Systems10.1016/j.is.2022.102054109:COnline publication date: 1-Nov-2022
    • (2022)Topic and Reference Guided Keyphrase Generation from Social MediaKnowledge Science, Engineering and Management10.1007/978-3-031-10986-7_12(140-154)Online publication date: 19-Jul-2022
    • (2021)Preserve Integrity in Realtime Event SummarizationACM Transactions on Knowledge Discovery from Data10.1145/344234415:3(1-29)Online publication date: 3-May-2021
    • (2021)A Survey on Opinion Reason Mining and Interpreting Sentiment VariationsIEEE Access10.1109/ACCESS.2021.30639219(39636-39655)Online publication date: 2021
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media