skip to main content
10.1145/2380718.2380750acmconferencesArticle/Chapter ViewAbstractPublication PageswebsciConference Proceedingsconference-collections
research-article

TweetLDA: supervised topic classification and link prediction in Twitter

Published: 22 June 2012 Publication History

Abstract

L-LDA is a new supervised topic model for assigning "topics" to a collection of documents (e.g., Twitter profiles). User studies have shown that L-LDA effectively performs a variety of tasks in Twitter that include not only assigning topics to profiles, but also re-ranking feeds, and suggesting new users to follow. Building upon these promising qualitative results, we here run an extensive quantitative evaluation of L-LDA. We test the extent to which, compared to the competitive baseline of Support Vector Machines (SVM), L-LDA is effective at two tasks: 1) assigning the correct topics to profiles; and 2) measuring the similarity of a profile pair. We find that L-LDA generally performs as well as SVM, and it clearly outperforms SVM when training data is limited, making it an ideal classification technique for infrequent topics and for (short) profiles of moderately active users. We have also built a web application that uses L-LDA to classify any given profile and graphically map predominant topics in specific geographic regions.

References

[1]
Blei, D. M., Ng, A. Y., and Jordan, M. I. Latent Dirichlet Allocation. Journal Machine Learning Research (March 2003).
[2]
Puniyani, K., Eisenstein, J., Cohen, S., and Xing, E. P. Social links from latent topics in Microblogs. In Proceedings of the Workshop on Computational Linguistics in a World of Social Media NAACL HLT (June 2010).
[3]
Quercia, D., Ellis, J., Capra, L., and Crowcroft, J. In the Mood for Being Influential on Twitter. In Proceedings of the 3rd IEEE Conference on Social Computing (SocialCom) (October 2011).
[4]
Ramage, D., Dumais, S., and Liebling, D. Characterizing Microblogs with Topic Models. In AAAI Conference on Weblogs and Social Media (ICWSM) (May 2010).
[5]
Ramage, D., Hall, D., Nallapati, R., and Manning, C. D. Labeled LDA: A Supervised Topic Model for Credit Attribution in Multi-Labeled Corpora. In Conference on Empirical Methods on Natural Language Processing (EMNLP) (August 2009).

Cited By

View all
  • (2024)Social Behavior Analysis in Exclusive Enterprise Social Networks by FastHANDACM Transactions on Knowledge Discovery from Data10.1145/364655218:6(1-32)Online publication date: 12-Apr-2024
  • (2024)Learning Joint Topic Representation for Detecting Drift in Social Media TextInternational Journal of Uncertainty, Fuzziness and Knowledge-Based Systems10.1142/S021848852450024732:06(955-983)Online publication date: 21-Oct-2024
  • (2023)Mind your own business and communicate the same! – signaling content that makes investors interestedJournal of Entrepreneurship in Emerging Economies10.1108/JEEE-09-2022-028316:4(1023-1042)Online publication date: 19-Jan-2023
  • Show More Cited By

Index Terms

  1. TweetLDA: supervised topic classification and link prediction in Twitter

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    WebSci '12: Proceedings of the 4th Annual ACM Web Science Conference
    June 2012
    531 pages
    ISBN:9781450312288
    DOI:10.1145/2380718
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 22 June 2012

    Permissions

    Request permissions for this article.

    Check for updates

    Qualifiers

    • Research-article

    Conference

    WebSci '12
    Sponsor:
    WebSci '12: Web Science 2012
    June 22 - 24, 2012
    Illinois, Evanston

    Acceptance Rates

    Overall Acceptance Rate 245 of 933 submissions, 26%

    Upcoming Conference

    Websci '25
    17th ACM Web Science Conference
    May 20 - 24, 2025
    New Brunswick , NJ , USA

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)44
    • Downloads (Last 6 weeks)3
    Reflects downloads up to 02 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Social Behavior Analysis in Exclusive Enterprise Social Networks by FastHANDACM Transactions on Knowledge Discovery from Data10.1145/364655218:6(1-32)Online publication date: 12-Apr-2024
    • (2024)Learning Joint Topic Representation for Detecting Drift in Social Media TextInternational Journal of Uncertainty, Fuzziness and Knowledge-Based Systems10.1142/S021848852450024732:06(955-983)Online publication date: 21-Oct-2024
    • (2023)Mind your own business and communicate the same! – signaling content that makes investors interestedJournal of Entrepreneurship in Emerging Economies10.1108/JEEE-09-2022-028316:4(1023-1042)Online publication date: 19-Jan-2023
    • (2023)Topic Models with Sentiment Priors Based on Distributed RepresentationsJournal of Mathematical Sciences10.1007/s10958-023-06525-8273:4(639-652)Online publication date: 24-Jun-2023
    • (2023)The migrant perspective: Measuring migrants' movements and interests using geolocated tweetsPopulation, Space and Place10.1002/psp.2732Online publication date: 27-Nov-2023
    • (2022)TSPS: A Topic based Shortest Path Set algorithm for influence maximizationIntelligent Data Analysis10.3233/IDA-21579026:2(469-480)Online publication date: 14-Mar-2022
    • (2022)Text Classification Using Document-Relational Graph Convolutional NetworksIEEE Access10.1109/ACCESS.2022.322182010(123205-123211)Online publication date: 2022
    • (2022)Topical and emotional expressions regarding extreme weather disasters on social media: a comparison of posts from official media and the publicHumanities and Social Sciences Communications10.1057/s41599-022-01457-19:1Online publication date: 28-Nov-2022
    • (2022)A model for generating a user dynamic profile on social mediaJournal of King Saud University - Computer and Information Sciences10.1016/j.jksuci.2022.08.03634:10(9132-9145)Online publication date: Nov-2022
    • (2022)Presenting a new motif-based link prediction for predicting activities in FacebookComputer Communications10.1016/j.comcom.2021.11.016184(137-148)Online publication date: Feb-2022
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media