skip to main content
10.1145/1242572.1242596acmconferencesArticle/Chapter ViewAbstractPublication PagesthewebconfConference Proceedingsconference-collections
Article

Topic sentiment mixture: modeling facets and opinions in weblogs

Published: 08 May 2007 Publication History

Abstract

In this paper, we define the problem of topic-sentiment analysis on Weblogs and propose a novel probabilistic model to capture the mixture of topics and sentiments simultaneously. The proposed Topic-Sentiment Mixture (TSM) model can reveal the latent topical facets in a Weblog collection, the subtopics in the results of an ad hoc query, and their associated sentiments. It could also provide general sentiment models that are applicable to any ad hoc topics. With a specifically designed HMM structure, the sentiment models and topic models estimated with TSM can be utilized to extract topic life cycles and sentiment dynamics. Empirical experiments on different Weblog datasets show that this approach is effective for modeling the topic facets and sentiments and extracting their dynamics from Weblog collections. The TSM model is quite general; it can be applied to any text collections with a mixture of topics and sentiments, thus has many potential applications, such as search result summarization, opinion tracking, and user behavior prediction.

References

[1]
D. M. Blei, A. Y. Ng, and M. I. Jordan. Latent dirichlet allocation. J. Mach. Learn. Res., 3:993--1022, 2003.
[2]
Y. Choi, C. Cardie, E. Riloff, and S. Patwardhan. Identifying sources of opinions with conditional random fields and extraction patterns. In Proceedings of HLT-EMNLP 2005, 2005.
[3]
A. P. Dempster, N. M. Laird, and D. B. Rubin. Maximum likelihood from incomplete data via the EM algorithm. Journal of Royal Statist. Soc. B, 39:1--38, 1977.
[4]
K. Eguchi and V. Lavrenko. Sentiment retrieval using generative models. In Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, pages 345--354, July 2006.
[5]
C. Engstrom. Topic dependence in sentiment classification. masters thesis. university of cambridge. 2004.
[6]
D. Gruhl, R. Guha, R. Kumar, J. Novak, and A. Tomkins. The predictive power of online chatter. In Proceedings of KDD '05, pages 78--87, 2005.
[7]
D. Gruhl, R. Guha, D. Liben-Nowell, and A. Tomkins. Information diffusion through blogspace. In Proceedings of the 13th International Conference on World Wide Web, pages 491--501, 2004.
[8]
M. A. Hearst. Clustering versus faceted categories for information exploration. Commun. ACM, 49(4):59--61, 2006.
[9]
T. Hofmann. Probabilistic latent semantic indexing. In Proceedings of SIGIR '99, pages 50--57, 1999.
[10]
R. Krovetz. Viewing morphology as an inference process. In Proceedings of SIGIR '93, pages 191--202, 1993.
[11]
R. Kumar, J. Novak, P. Raghavan, and A. Tomkins. On the bursty evolution of blogspace. In Proceedings of the 12th International Conference on World Wide Web, pages 568--576, 2003.
[12]
W. Li and A. McCallum. Pachinko allocation: Dag-structured mixture models of topic correlations. In ICML '06: Proceedings of the 23rd international conference on Machine learning, pages 577--584, 2006.
[13]
B. Liu, M. Hu, and J. Cheng. Opinion observer: analyzing and comparing opinions on the web. In WWW '05: Proceedings of the 14th international conference on World Wide Web, pages 342--351, 2005.
[14]
G. J. McLachlan and T. Krishnan. The EM Algorithm and Extensions. Wiley, 1997.
[15]
Q. Mei, C. Liu, H. Su, and C. Zhai. A probabilistic approach to spatiotemporal theme pattern mining on weblogs. In WWW '06: Proceedings of the 15th international conference on World Wide Web, pages 533--542, 2006.
[16]
Q. Mei and C. Zhai. Discovering evolutionary theme patterns from text: an exploration of temporal text mining. In Proceedings of KDD '05, pages 198--207, 2005.
[17]
Q. Mei and C. Zhai. A mixture model for contextual text mining. In KDD '06: Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 649--655, 2006.
[18]
G. Mishne and M. de Rijke. MoodViews: Tools for blog mood analysis. In AAAI 2006 Spring Symposium on Computational Approaches to Analysing Weblogs (AAAI-CAAW 2006), pages 153--154, 2006.
[19]
G. Mishne and N. Glance. Predicting movie sales from blogger sentiment. In AAAI 2006 Spring Symposium on Computational Approaches to Analysing Weblogs (AAAI-CAAW 2006), 2006.
[20]
Opinmind. http://www.opinmind.com.
[21]
B. Pang and L. Lee. A sentimental education: Sentiment analysis using subjectivity summarization based on minimum cuts. In Proceedings of the ACL, pages 271--278, 2004.
[22]
B. Pang and L. Lee. Seeing stars: Exploiting class relationships for sentiment categorization with respect to rating scales. In Proceedings of the ACL, pages 115--124, 2005.
[23]
B. Pang, L. Lee, and S. Vaithyanathan. Thumbs up? Sentiment classification using machine learning techniques. In Proceedings of the 2002 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 79--86, 2002.
[24]
L. Rabiner. A tutorial on hidden markov models and selected applications in speech recognition. Proc. of the IEEE, 77(2):257--285, Feb. 1989.
[25]
T. Tao and C. Zhai. Regularized estimation of mixture models for robust pseudo-relevance feedback. In SIGIR '06: Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval, pages 162--169, 2006.
[26]
J. Wiebe, T. Wilson, and C. Cardie. Annotating expressions of opinions and emotions in language. Language Resources and Evaluation (formerly Computers and the Humanities), 39, 2005.
[27]
J. Yi, T. Nasukawa, R. C. Bunescu, and W. Niblack. Sentiment analyzer: Extracting sentiments about a given topic using natural language processing techniques. In Proceedings of ICDM 2003, pages 427--434, 2003.
[28]
C. Zhai, A. Velivelli, and B. Yu. A cross-collection mixture model for comparative text mining. In Proceedings of KDD '04, pages 743--748, 2004.

Cited By

View all
  • (2025)Medical Sentimental Analysis Techniques for Pharmaceutical Exploration Using Deep Learning TechniquesHumanizing Technology With Emotional Intelligence10.4018/979-8-3693-7011-7.ch013(281-306)Online publication date: 31-Jan-2025
  • (2024)A Survey of Cutting-edge Multimodal Sentiment AnalysisACM Computing Surveys10.1145/365214956:9(1-38)Online publication date: 25-Apr-2024
  • (2024)Navigating the Post-API DilemmaProceedings of the ACM Web Conference 202410.1145/3589334.3645503(2476-2484)Online publication date: 13-May-2024
  • Show More Cited By

Index Terms

  1. Topic sentiment mixture: modeling facets and opinions in weblogs

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    WWW '07: Proceedings of the 16th international conference on World Wide Web
    May 2007
    1382 pages
    ISBN:9781595936547
    DOI:10.1145/1242572
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 08 May 2007

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. mixture model
    2. sentiment analysis
    3. topic models
    4. topic-sentiment mixture
    5. weblogs

    Qualifiers

    • Article

    Conference

    WWW'07
    Sponsor:
    WWW'07: 16th International World Wide Web Conference
    May 8 - 12, 2007
    Alberta, Banff, Canada

    Acceptance Rates

    Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)51
    • Downloads (Last 6 weeks)4
    Reflects downloads up to 02 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2025)Medical Sentimental Analysis Techniques for Pharmaceutical Exploration Using Deep Learning TechniquesHumanizing Technology With Emotional Intelligence10.4018/979-8-3693-7011-7.ch013(281-306)Online publication date: 31-Jan-2025
    • (2024)A Survey of Cutting-edge Multimodal Sentiment AnalysisACM Computing Surveys10.1145/365214956:9(1-38)Online publication date: 25-Apr-2024
    • (2024)Navigating the Post-API DilemmaProceedings of the ACM Web Conference 202410.1145/3589334.3645503(2476-2484)Online publication date: 13-May-2024
    • (2024)Extractive Multi-Document Summarization Using Transformer-Based Neural Networks2024 4th International Conference on Sustainable Expert Systems (ICSES)10.1109/ICSES63445.2024.10763280(551-558)Online publication date: 15-Oct-2024
    • (2024)Joint modeling of causal phrases-sentiments-aspects using Hierarchical Pitman Yor ProcessInformation Processing & Management10.1016/j.ipm.2024.10375361:4(103753)Online publication date: Jul-2024
    • (2023)Topic modeling methods for short textsJournal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology10.3233/JIFS-22383445:2(1971-1990)Online publication date: 1-Jan-2023
    • (2023)Beneath the Tip of the Iceberg: Current Challenges and New Directions in Sentiment Analysis ResearchIEEE Transactions on Affective Computing10.1109/TAFFC.2020.303816714:1(108-132)Online publication date: 1-Jan-2023
    • (2023)Audio Scene Recognition Algorithms - Based on Topic Models and Graph Attention Networks2023 4th International Conference on Computer, Big Data and Artificial Intelligence (ICCBD+AI)10.1109/ICCBD-AI62252.2023.00036(165-169)Online publication date: 15-Dec-2023
    • (2023)Extracting marketing information from product reviews: a comparative study of latent semantic analysis and probabilistic latent semantic analysisJournal of Marketing Analytics10.1057/s41270-023-00218-611:4(662-676)Online publication date: 8-Apr-2023
    • (2023)Game theory and MCDM-based unsupervised sentiment analysis of restaurant reviewsApplied Intelligence10.1007/s10489-023-04471-153:17(20152-20173)Online publication date: 31-Mar-2023
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media