skip to main content
10.1145/1569901.1570068acmconferencesArticle/Chapter ViewAbstractPublication PagesgeccoConference Proceedingsconference-collections
research-article

Agglomerative genetic algorithm for clustering in social networks

Published:08 July 2009Publication History

ABSTRACT

Size and complexity of data repositories collaboratively created by Web users generate a need for new processing approaches. In this paper, we study the problem of detection of fine-grained communities of users in social networks, which can be defined as clustering with a large number of clusters. The practical size of social networks makes the traditional evolutionary based clustering approaches, which represent the entire clustering solution as one individual, hard to apply. We propose an Agglomerative Clustering Genetic Algorithm (ACGA): a population of clusters evolves from the initial state in which each cluster represents one user to a high quality clustering solution. Each step of the evolutionary process is performed locally, engaging only a small part of the social network limited to two clusters and their direct neighborhood. This makes the algorithm practically useful independently of the size of the network. Evaluation on two social network models indicates that ACGA is potentially able to detect communities with accuracy comparable or better than two typical centralized clustering algorithms even though ACGA works under much stricter conditions.

References

  1. Robert Busa-Fekete, Andras Kocsor, and Csaba Bagyinka. A multi-stack based phylogenetic tree building method. Bioinformatics Research and Applications, pages 49--60, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. R. I. M. Dunbar. Coevolution of neocortical size, group size and language in humans. Behavioral and Brain Sciences, 16(4):681--735, 1993.Google ScholarGoogle ScholarCross RefCross Ref
  3. Zhidan Feng, Xiaowei Xu, Nurcan Yuruk, and Thomas Schweiger. A novel similarity-based modularity function for graph partitioning. Data Warehousing and Knowledge Discovery, pages 385--396, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Rong Ge, Martin Ester, Byron J. Gao, Zengjian Hu, Binay Bhattacharya, and Boaz Ben-Moshe. Joint cluster analysis of attribute data and relationship data: The connected k-center problem, algorithms and applications. ACM Trans. Knowl. Discov. Data, 2(2):1--35, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. M. Girvan and M. E. J. Newman. Community structure in social and biological networks. PNAS, 99(12):7821--7826, 2002.Google ScholarGoogle ScholarCross RefCross Ref
  6. J. Handl and J. Knowles. An evolutionary approach to multiobjective clustering. Evolutionary Computation, IEEE Transactions on, 11(1):56--76, Feb. 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Julia Handl and Joshua Knowles. On semi-supervised clustering via multiobjective optimization. In GECCO '06: Proceedings of the 8th annual conference on Genetic and evolutionary computation, pages 1465--1472, New York, NY, USA, 2006. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Ji He, Ah-Hwee Tan, Chew-Lim Tan, and Sam-Yuan Sung. On Quantitative Evaluation of Clustering Systems. Kluwer Academic Publishers, 2003.Google ScholarGoogle Scholar
  9. Yi Hong, Sam Kwong, Hui Xiong, and Qingsheng Ren. Genetic-guided semi-supervised clustering algorithm with instance-level constraints. In GECCO '08: Proceedings of the 10th annual conference on Genetic and evolutionary computation, pages 1381--1388, New York, NY, USA, 2008. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Lawrence Hubert and Phipps Arabie. Comparing partitions. J. of Classification, 2(1):193--218, 1985.Google ScholarGoogle ScholarCross RefCross Ref
  11. Jure Leskovec, Jon Kleinberg, and Christos Faloutsos. Graph evolution: Densification and shrinking diameters. ACM Trans. Knowl. Discov. Data, 1(1):2, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Nobukazu Matake, Tomoyuki Hiroyasu, Mitsunori Miki, and Tomoharu Senda. Multiobjective clustering with automatic k-determination for large-scale data. In GECCO '07: Proceedings of the 9th annual conference on Genetic and evolutionary computation, pages 861--868, New York, NY, USA, 2007. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. M E Newman and M Girvan. Finding and evaluating community structure in networks. Phys Rev E Stat Nonlin Soft Matter Phys, 69(2):026113.1--15, Feb 2004.Google ScholarGoogle Scholar
  14. Clara Pizzuti. Ga-net: A genetic algorithm for community detection in social networks. In PPSN, volume 5199 of Lecture Notes in Computer Science, pages 1081--1090. Springer, 2008.Google ScholarGoogle Scholar
  15. Peter Rousseeuw. Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. J. Comput. Appl. Math., 20(1):53--65, 1987. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Jianbo Shi and J. Malik. Normalized cuts and image segmentation. IEEE Trans. on Pattern Analysis and Machine Intelligence, 22(8):888--905, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Mursel Tasgin and Haluk Bingol. Community detection in complex networks using genetic algorithm. In ECCS '06: Proc. of the European Conference on Complex Systems, Apr 2006.Google ScholarGoogle Scholar
  18. A. Sima Uyar and Sule Gunduz Oguducu. A new graph-based evolutionary approach to sequence clustering. In ICMLA '05: Proceedings of the Fourth International Conference on Machine Learning and Applications, pages 273--278. IEEE, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Agglomerative genetic algorithm for clustering in social networks

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      GECCO '09: Proceedings of the 11th Annual conference on Genetic and evolutionary computation
      July 2009
      2036 pages
      ISBN:9781605583259
      DOI:10.1145/1569901

      Copyright © 2009 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 8 July 2009

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      Overall Acceptance Rate1,669of4,410submissions,38%

      Upcoming Conference

      GECCO '24
      Genetic and Evolutionary Computation Conference
      July 14 - 18, 2024
      Melbourne , VIC , Australia

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader