skip to main content
10.1145/2983323.2983836acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

Efficient Computation of Importance Based Communities in Web-Scale Networks Using a Single Machine

Published: 24 October 2016 Publication History

Abstract

Finding decompositions of a graph into a family of communities is crucial to understanding its underlying structure. Algorithms for finding communities in networks often rely only on structural information and search for cohesive subsets of nodes. In practice however, we would like to find communities that are not only cohesive, but also influential or important. In order to capture such communities, Li, Qin, Yu, and Mao introduced a novel community model called "k-influential community" based on the concept of $k$-core, with numerical values representing "influence" assigned to the nodes. They formulate the problem of finding the top-r most important communities as finding r connected k-core subgraphs ordered by the lower-bound of their importance. In this paper, our goal is to scale-up the computation of top-r, k-core communities to web-scale graphs of tens of billions of edges. We feature several fast new algorithms for this problem. With our implementations, we show that we can efficiently handle massive networks using a single consumer-level machine within a reasonable amount of time.

References

[1]
T. Akiba, Y. Iwata, and Y. Yoshida. Linear-time enumeration of maximal k-edge-connected subgraphs in large networks by random contraction. In CIKM, 2013.
[2]
R. D. Alba. A graph-theoretic definition of a sociometric clique. Journal of Mathematical Sociology, 3(1):113--126, 1973.
[3]
V. Batagelj and M. Zaversnik. An o (m) algorithm for cores decomposition of networks. arXiv preprint cs/0310049, 2003.
[4]
P. Boldi and S. Vigna. The webgraph framework i: compression techniques. In WWW, 2004.
[5]
L. Chang, J. X. Yu, L. Qin, X. Lin, C. Liu, and W. Liang. Efficiently computing k-edge connected components via graph decomposition. In SIGMOD, 2013.
[6]
J. Cheng, Y. Ke, S. Chu, and M. T. Özsu. Efficient core decomposition in massive networks. In ICDE, 2011.
[7]
J. Cheng, Y. Ke, A. W.-C. Fu, J. X. Yu, and L. Zhu. Finding maximal cliques in massive networks. TODS, 36(4):21, 2011.
[8]
W. Cui, Y. Xiao, H. Wang, and W. Wang. Local search of communities in large graphs. In SIGMOD, 2014.
[9]
S. Fortunato. Community detection in graphs. Physics Reports, 486(3):75--174, 2010.
[10]
E. Gregori, L. Lenzini, and C. Orsini. k-dense communities in the internet as-level topology graph. Computer Networks, 57(1):213--227, 2013.
[11]
X. Huang, H. Cheng, L. Qin, W. Tian, and J. X. Yu. Querying k-truss community in large and dynamic graphs. In SIGMOD, 2014.
[12]
X. Huang, L. V. Lakshmanan, J. X. Yu, and H. Cheng. Approximate closest community search in networks. PVLDB, 9(4), 2015.
[13]
W. Khaouid, M. Barsky, V. Srinivasan, and A. Thomo. K-core decomposition of large networks on a single pc. PVLDB, 9(1):13--23, 2015.
[14]
I. Koch. Enumerating all connected maximal common subgraphs in two graphs. Theoretical Computer Science, 250(1):1--30, 2001.
[15]
A. Kyrola, G. Blelloch, and C. Guestrin. Graphchi: Large-scale graph computation on just a pc. In 10th USENIX Symposium on Operating Systems Design and Implementation (OSDI), 2012.
[16]
R.-H. Li, L. Qin, J. X. Yu, and R. Mao. Influential community search in large networks. PVLDB, 8(5):509--520, 2015.
[17]
R.-H. Li, J. X. Yu, and R. Mao. Efficient core maintenance in large dynamic graphs. TKDE, 26(10):2453--2465, 2014.
[18]
R. D. Luce. Connectivity and generalized cliques in sociometric group structure. Psychometrika, 15(2):169--190, 1950.
[19]
A. E. Sarıyüce, B. Gedik, G. Jacques-Silva, K.-L. Wu, and Ü. V. Çatalyürek. Streaming algorithms for k-core decomposition. PVLDB, 6(6):433--444, 2013.
[20]
S. B. Seidman. Network structure and minimum degree. Social networks, 5(3):269--287, 1983.
[21]
S. B. Seidman and B. L. Foster. A graph-theoretic generalization of the clique concept. Journal of Mathematical sociology, 6(1):139--154, 1978.
[22]
M. Sozio and A. Gionis. The community-search problem and how to plan a successful cocktail party. In KDD, 2010.
[23]
J. Wang and J. Cheng. Truss decomposition in massive networks. PVLDB, 5(9):812--823, 2012.
[24]
D. Wen, L. Qin, Y. Zhang, X. Lin, and J. X. Yu. I/O efficient core graph decomposition at web scale. CoRR, abs/1511.00367, 2015.
[25]
J. Yang and J. Leskovec. Defining and evaluating network communities based on ground-truth. Knowl. and Inf. Syst., 42(1):181--213, 2015.
[26]
R. Zhou, C. Liu, J. X. Yu, W. Liang, B. Chen, and J. Li. Finding maximal k-edge-connected subgraphs from a large graph. In EDBT, 2012.

Cited By

View all
  • (2025)Finding Antagonistic Communities in Signed Uncertain GraphsIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2024.349658637:2(655-669)Online publication date: Feb-2025
  • (2025)H-dominant skyline community search in multi-valued networksThe Journal of Supercomputing10.1007/s11227-024-06679-581:1Online publication date: 1-Jan-2025
  • (2024)SACH: Significant-Attributed Community Search in Heterogeneous Information Networks2024 IEEE 40th International Conference on Data Engineering (ICDE)10.1109/ICDE60146.2024.00254(3283-3296)Online publication date: 13-May-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
CIKM '16: Proceedings of the 25th ACM International on Conference on Information and Knowledge Management
October 2016
2566 pages
ISBN:9781450340731
DOI:10.1145/2983323
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 24 October 2016

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. algorithm scalability
  2. big graphs
  3. community discovery
  4. graph compression

Qualifiers

  • Research-article

Conference

CIKM'16
Sponsor:
CIKM'16: ACM Conference on Information and Knowledge Management
October 24 - 28, 2016
Indiana, Indianapolis, USA

Acceptance Rates

CIKM '16 Paper Acceptance Rate 160 of 701 submissions, 23%;
Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

Upcoming Conference

CIKM '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)9
  • Downloads (Last 6 weeks)0
Reflects downloads up to 27 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2025)Finding Antagonistic Communities in Signed Uncertain GraphsIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2024.349658637:2(655-669)Online publication date: Feb-2025
  • (2025)H-dominant skyline community search in multi-valued networksThe Journal of Supercomputing10.1007/s11227-024-06679-581:1Online publication date: 1-Jan-2025
  • (2024)SACH: Significant-Attributed Community Search in Heterogeneous Information Networks2024 IEEE 40th International Conference on Data Engineering (ICDE)10.1109/ICDE60146.2024.00254(3283-3296)Online publication date: 13-May-2024
  • (2023)Influential Community Search over Large Heterogeneous Information NetworksProceedings of the VLDB Endowment10.14778/3594512.359453216:8(2047-2060)Online publication date: 22-Jun-2023
  • (2023)Efficient Influential Community Search in Large Uncertain GraphsIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2021.313161135:4(3779-3793)Online publication date: 1-Apr-2023
  • (2022)Influential Attributed Communities via Graph Convolutional Network (InfACom-GCN)Information10.3390/info1310046213:10(462)Online publication date: 28-Sep-2022
  • (2022)Discovering top-weighted k-truss communities in large graphsJournal of Big Data10.1186/s40537-022-00588-19:1Online publication date: 3-Apr-2022
  • (2022)Incremental Influential Community Detection in Large NetworksProceedings of the 34th International Conference on Scientific and Statistical Database Management10.1145/3538712.3538724(1-12)Online publication date: 6-Jul-2022
  • (2022)Nucleus Decomposition in Probabilistic Graphs: Hardness and Algorithms2022 IEEE 38th International Conference on Data Engineering (ICDE)10.1109/ICDE53745.2022.00021(218-231)Online publication date: May-2022
  • (2022)Influence propagation in social networks: Interest-based community ranking modelJournal of King Saud University - Computer and Information Sciences10.1016/j.jksuci.2020.08.00434:5(2231-2243)Online publication date: May-2022
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media