skip to main content
10.1145/3106426.3117760acmconferencesArticle/Chapter ViewAbstractPublication PageswiConference Proceedingsconference-collections
research-article

A methodology for community detection in Twitter

Published: 23 August 2017 Publication History

Abstract

The microblogging service Twitter is one of the world's most popular online social networks and assembles a huge amount of data produced by interactions between users. A careful analysis of this data allows identifying groups of users who share similar traits, opinions, and preferences. We call community detection the process of user group identification, which grants valuable insights not available upfront. In order to extract useful knowledge from Twitter data many methodologies have been proposed, which define the attributes to be used in community detection problems by manual and empirical criteria - oftentimes guided by the aimed type of community and what the researcher attaches importance to. However, such approach cannot be generalized because it is well known that the task of finding out an appropriate set of attributes leans on context, domain, and data set. Aiming to the advance of community detection domain, reduce computational cost and improve the quality of related researches, this paper proposes a standard methodology for community detection in Twitter using feature selection methods. Results of the present research directly affect the way community detection methodologies have been applied to Twitter and quality of outcomes produced.

References

[1]
Jonathan Baarsch and Emre Celebi. 2012. Investigation of Internal Validity Measures for K-Means Clustering. Proceedings of the International MultiConference of Engineers and Computer Scientists - IMECS'12 I (2012).
[2]
Mohamed Bakillah, Ren-Yu Li, and Steve H.L. Liang. 2015. Geo-located community detection in Twitter with enhanced fast-greedy optimization of modularity: the case study of typhoon Haiyan. International Journal of Geographical Information Science 29, 2 (2015), 258--279.
[3]
J. O. Breen. 2012. Mining twitter for airline consumer sentiment. Practical text mining and statistical analysis for non-structured text data applications (2012), 133.
[4]
David Darmon, Elisa Omodei, and Joshua Garland. 2015. Followers Are Not Enough: A Question-Oriented Approach to Community Detection in Online Social Networks. PLoS ONE 10(8):e0134860 (2015). https://doi.org/
[5]
Vanessa Friedemann. 2015. Clustering a Customer Base Using Twitter Data. (2015).
[6]
Hamid Khobzi and Babak Teimourpour. 2015. LCP segmentation: A framework for evaluation of user engagement in online social networks. Computers in Human Behavior 50 (2015), 101--107.
[7]
Kathrin Kirchner, Jelena Zec, and Boris Delibašić. 2016. Facilitating data preprocessing by a generic framework: a proposal for clustering. Artificial Intelligence Review 45, 3 (2016), 271--297.
[8]
Jon Kleinberg. 2002. An impossibility theorem for clustering. Proc. 2002 Conf. Advances in Neural Information Processing Systems 15 (2002), 463--470.
[9]
Alron Jam Lam. 2016. Improving Twitter Community Detection through Contextual Sentiment Analysis. Proceedings of the ACL 2016 Student Research Workshop (2016). https://doi.org/
[10]
Fábio Lobato, Márcia Pinheiro, Antonio Jacob, Olaf Reinhold, and Ádamo Santana. 2016. Social CRM: Biggest Challenges to Make it Work in the Real World. In International Conference on Business Information Systems. Springer, 221--232.
[11]
Ingrid Nascimento, Adamo Santana, and Marcia Pinheiro. 2016. Analise de Comunidades em Redes Sociais utilizando Mineração de Dados: um estudo de caso nas redes da UFPA. In Anais da II Conferência de Social CRM @Belém. Belem, PA, Brasil.
[12]
Mert Ozer, Nyunsu Kim, and Hasan Davulcu. 2016. Community detection in political Twitter networks using Nonnegative Matrix Factorization methods. 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM) (2016). https://doi.org/
[13]
Sebastian Raschka. 2015. Python Machine Learning (1 ed.). Packt Publishing Ltd., Livery Place, 35 Livery Street, Birmingham B3 2PB, UK.
[14]
Olaf Reinhold and Rainer Alt. 2012. Social customer relationship management: state of the art and learnings from current projects. (2012).
[15]
Didi Surian, Dat Quoc Nguyen, Georgina Kennedy, and Adam Dunn. 2016. Characterizing Twitter Discussions About HPV Vaccines Using Topic Modeling and Community Detection. Journal of Medical Internet Research 18(8):e232 (2016).
[16]
L. Talavera. 2005. An Evaluation of Filter and Wrapper Methods for Feature Selection in Categorical Clustering. Lecture Notes in Computer Science Advances in Intelligent Data Analysis IV (2005), 440--451.
[17]
Twitter. 2016. Twitter Usage. (2016). https://about.twitter.com/company
[18]
Yang Zhang, Yao WU, and Qing YANG. 2012. Community Discovery in Twitter Based on User Interests. Journal of Computational Information Systems 8(3) (2012), 991--1000.

Cited By

View all
  • (2024)Applications and Resources for Social Media Sentiment Analysis: A Strategic Planning Case StudyManagement Engineering in Emerging Economies10.1007/978-3-031-54485-9_6(121-156)Online publication date: 9-May-2024
  • (2023)Market Overview in Social CRM: An Analysis of Job Advertisements2023 IEEE International Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT)10.1109/WI-IAT59888.2023.00092(555-562)Online publication date: 26-Oct-2023
  • (2023)A Methodology to Quickly Perform Opinion Mining and Build Supervised Datasets Using Social Networks MechanicsIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2023.325082235:9(9797-9808)Online publication date: 1-Sep-2023
  • Show More Cited By
  1. A methodology for community detection in Twitter

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    WI '17: Proceedings of the International Conference on Web Intelligence
    August 2017
    1284 pages
    ISBN:9781450349512
    DOI:10.1145/3106426
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 23 August 2017

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Twitter
    2. community detection
    3. feature selection
    4. methodology

    Qualifiers

    • Research-article

    Funding Sources

    • Amazon Foundation for Studies and Research (Fapespa)
    • German Academic Exchange Service (DAAD)
    • National Council for the Improvement of Higher Education (CAPES)
    • National Council for Scientific and Technological Development (CNPq)

    Conference

    WI '17
    Sponsor:

    Acceptance Rates

    WI '17 Paper Acceptance Rate 118 of 178 submissions, 66%;
    Overall Acceptance Rate 118 of 178 submissions, 66%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)19
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 08 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Applications and Resources for Social Media Sentiment Analysis: A Strategic Planning Case StudyManagement Engineering in Emerging Economies10.1007/978-3-031-54485-9_6(121-156)Online publication date: 9-May-2024
    • (2023)Market Overview in Social CRM: An Analysis of Job Advertisements2023 IEEE International Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT)10.1109/WI-IAT59888.2023.00092(555-562)Online publication date: 26-Oct-2023
    • (2023)A Methodology to Quickly Perform Opinion Mining and Build Supervised Datasets Using Social Networks MechanicsIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2023.325082235:9(9797-9808)Online publication date: 1-Sep-2023
    • (2023)Social Network Analysis of Irish Covid-19 Interactions on Telegram2023 31st Irish Conference on Artificial Intelligence and Cognitive Science (AICS)10.1109/AICS60730.2023.10470867(1-8)Online publication date: 7-Dec-2023
    • (2023)Modularity-based approach for tracking communities in dynamic social networksKnowledge-Based Systems10.1016/j.knosys.2023.111067281:COnline publication date: 3-Dec-2023
    • (2022)Evaluating Methods for Efficient Community Detection in Social NetworksInformation10.3390/info1305020913:5(209)Online publication date: 19-Apr-2022
    • (2022)Integration of Sentiment Analysis of Social Media in the Strategic Planning Process to Generate the Balanced ScorecardApplied Sciences10.3390/app12231230712:23(12307)Online publication date: 1-Dec-2022
    • (2022)Twitter Big Data as a Resource for Exoskeleton Research: A Large-Scale Dataset of about 140,000 Tweets from 2017–2022 and 100 Research QuestionsAnalytics10.3390/analytics10200071:2(72-97)Online publication date: 23-Sep-2022
    • (2022)Analyzing Public Discussions About #SaudiWomenCanDrive Using Network ScienceIEEE Access10.1109/ACCESS.2021.314007310(4739-4749)Online publication date: 2022
    • (2021)Aspect-Based Community Detection of Cultural Heritage Streaming Data2021 12th International Conference on Information, Intelligence, Systems & Applications (IISA)10.1109/IISA52424.2021.9555549(1-4)Online publication date: 12-Jul-2021
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media