skip to main content
10.1145/1557019.1557127acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

Parallel community detection on large networks with propinquity dynamics

Published:28 June 2009Publication History

ABSTRACT

Graphs or networks can be used to model complex systems. Detecting community structures from large network data is a classic and challenging task. In this paper, we propose a novel community detection algorithm, which utilizes a dynamic process by contradicting the network topology and the topology-based propinquity, where the propinquity is a measure of the probability for a pair of nodes involved in a coherent community structure. Through several rounds of mutual reinforcement between topology and propinquity, the community structures are expected to naturally emerge. The overlapping vertices shared between communities can also be easily identified by an additional simple postprocessing. To achieve better efficiency, the propinquity is incrementally calculated. We implement the algorithm on a vertex-oriented bulk synchronous parallel(BSP) model so that the mining load can be distributed on thousands of machines. We obtained interesting experimental results on several real network data.

Skip Supplemental Material Section

Supplemental Material

p997-zhang.mp4

mp4

137.9 MB

References

  1. S. Boccaletti, M. Ivanchenko, V. Latora, A. Pluchino, and A. Rapisarda. Detecting complex network modularity by dynamical clustering. Phys Rev E Stat Nonlin Soft Matter Phys, 75(4), 2007.Google ScholarGoogle Scholar
  2. A. Capocci, V. D. P. Servedio, G. Caldarelli, and F. Colaiori. Detecting communities in large networks. Physica A: Statistical and Theoretical Physics, 352(2-4):669--676, July 2005.Google ScholarGoogle ScholarCross RefCross Ref
  3. A. Clauset, M. E. J. Newman, and C. Moore. Finding community structure in very large networks. Physical Review E, 70:066111, 2004.Google ScholarGoogle ScholarCross RefCross Ref
  4. L. Donetti and M. A. Munoz. Detecting network communities: a new systematic and efficient algorithm. Journal of Statistical Mechanics: Theory and Experiment, 2004(10):P10012, 2004.Google ScholarGoogle ScholarCross RefCross Ref
  5. J. Duch and A. Arenas. Community detection in complex networks using extremal optimization. Physical Review E, 72:027104, 2005.Google ScholarGoogle ScholarCross RefCross Ref
  6. S. Fortunato and C. Castellano. Community structure in graphs. Chapter of Springer's Encyclopedia of Complexity and System Science, Dec 2007.Google ScholarGoogle Scholar
  7. S. Fortunato, V. Latora, and M. Marchiori. Method to find community structures based on information centrality. Phys. Rev. E, 70(5):056104, Nov 2004.Google ScholarGoogle ScholarCross RefCross Ref
  8. M. Girvan and M. E. Newman. Community structure in social and biological networks. Proc Natl Acad Sci U S A, 99(12):7821--7826, June 2002.Google ScholarGoogle ScholarCross RefCross Ref
  9. R. Guimer¼a, M. Sales-Pardo, and L. A. N. Amaral. Modularity from fluctuations in random graphs and complex networks. Phys. Rev. E, 70(2):025101, Aug 2004.Google ScholarGoogle ScholarCross RefCross Ref
  10. B. W. Kernighan and S. Lin. An efficient heuristic procedure for partitioning graphs. The Bell system technical journal, 49(1):291--307, 1970.Google ScholarGoogle Scholar
  11. M. E. J. Newman. Fast algorithm for detecting community structure in networks. Phys Rev E Stat Nonlin Soft Matter Phys, 69(6), 2004.Google ScholarGoogle Scholar
  12. M. E. J. Newman. Finding community structure in networks using the eigenvectors of matrices. Phys Rev E Stat Nonlin Soft Matter Phys, 74(3), 2006.Google ScholarGoogle Scholar
  13. M. E. J. Newman and M. Girvan. Finding and evaluating community structure in networks. Physical Review E, 69:026113, 2004.Google ScholarGoogle ScholarCross RefCross Ref
  14. G. Palla, I. Derenyi, I. Farkas, and T. Vicsek. Uncovering the overlapping community structure of complex networks in nature and society. Nature, 435(7043):814--818.Google ScholarGoogle ScholarCross RefCross Ref
  15. J. Pei, D. Jiang, and A. Zhang. On mining cross-graph quasi-cliques. In Proceedings of the eleventh ACM SIGKDD international conference on knowledge discovery in data mining, pages 228--238, New York, NY, USA, 2005. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. P. Pons and M. Latapy. Computing communities in large networks using random walks. Journal of Graph Algorithms and Applications, 10(2):191--218, Dec 2006.Google ScholarGoogle ScholarCross RefCross Ref
  17. F. Radicchi, C. Castellano, F. Cecconi, V. Loreto, and D. Parisi. Defining and identifying communities in networks. In Proceedings of the National Academy of Science of the United States of America, volume 101-9, pages 2658--2663, 2004.Google ScholarGoogle ScholarCross RefCross Ref
  18. J. Reichardt and S. Bornholdt. Statistical mechanics of community detection. Phys Rev E Stat Nonlin Soft Matter Phys, Mar 2006.Google ScholarGoogle ScholarCross RefCross Ref
  19. Wikipedia. Bulk synchronous parallel | wikipedia, the free encyclopedia, 2008. {Online; accessed 23-December-2008}.Google ScholarGoogle Scholar
  20. Z. Zeng, J. Wang, L. Zhou, and G. Karypis. Out-of-core coherent closed quasi-clique mining from large dense graph databases. ACM Trans. Database Syst., 32(2):13, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Parallel community detection on large networks with propinquity dynamics

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      KDD '09: Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
      June 2009
      1426 pages
      ISBN:9781605584959
      DOI:10.1145/1557019

      Copyright © 2009 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 28 June 2009

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      Overall Acceptance Rate1,133of8,635submissions,13%

      Upcoming Conference

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader