ABSTRACT
With the development of Internet, we are gradually entering the era of big data. The size of network is increasing and the structure of network is becoming more complex. Community is a unique network structure with great research value. The task of community analysis goes through two separate phases: first, detection of meaningful community structure from a network, and second, evaluation of the appropriateness of the detected community structure. With the popularity of network research, many community detection algorithms emerged which can be grouped in categories, based on different criteria. In order to applicate community detection algorithms in real-world network analysis, we need to measure the performance of the algorithm. The performance depends on two points, that is, whether the algorithm can give the result of community division in an acceptable time, and whether the algorithm can reveal the community structure of the network with high quality. In recent years, systems used to analyze network and detect community are mushrooming. However, existing systems rarely have evaluation function, either providing social network analysis or providing data analysis service. We need a tool to evaluate community detection algorithms. In response to the challenge, the Community Evaluation System (CES) is proposed to meet the demands of community detection algorithms' evaluation. CES can evaluate community detection algorithms with multiple metrics. It uses B/S mode, integrates Spark, Yarn and HDFS technology to support the operation of large-scale data, and experiments prove that it is effective.
- Lee V E, Ning R, Jin R, and C. Aggarwal. A Survey of Algorithms for Dense Subgraph Discovery[M]// Managing and Mining Graph Data. 2010:303--336. Google ScholarCross Ref
- Chakraborty T, Dalmia A, Mukherjee A, Ganguly N. Metrics for Community Analysis: A Survey[J]. 2016.Google Scholar
- Fortunato S, Hric D. Community detection in networks: A user guide[J]. Physics Reports, 2016, 659:1--44. Google ScholarCross Ref
- Fortunato S. Community detection in graphs[J]. Physics Reports, 2010, 486(3-5):75--174. Google ScholarCross Ref
- Wang M, Wang C, Yu J X, and Zhang J. Community detection in social networks: an in-depth benchmarking study with a procedure-oriented framework[J]. Proceedings of the Vldb Endowment, 2015, 8(10):998--1009. Google ScholarDigital Library
- Liu Y, Wu B, Wang H, et al. BPGM: A Big Graph Mining Tool[J]. Tsinghua Science and Technology, 2014, 19(1):33--38. Google ScholarCross Ref
- Kang U, Tsourakakis C E, Faloutsos C. PEGASUS: A Peta-Scale Graph Mining System Implementation and Observations[C]// Ninth IEEE International Conference on Data Mining. IEEE Computer Society, 2009:229--238.Google Scholar
- Yu L, Zheng J, Shen W C, et al. BC-PDM: data mining, social network analysis and text mining system based on cloud computing[C]// ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 2012:1496--1499.Google Scholar
- Leskovec J, Sosi. SNAP: A General-Purpose Network Analysis and Graph-Mining Library[J]. Acm Transactions on Intelligent Systems & Technology, 2016, 8(1):1. Google ScholarDigital Library
- Batagelj V, Mrvar A. Pajek[J]. Encyclopedia of Social Network Analysis & Mining, 2014, 39(6):114--115.Google Scholar
- Pedregosa F, Varoquaux G, Gramfort A, et al. Scikit-learn: Machine Learning in Python[J]. Journal of Machine Learning Research, 2013, 12(10):2825--2830.Google Scholar
- CES: A System for Community Evaluation
Recommendations
Defining and evaluating network communities based on ground-truth
Nodes in real-world networks organize into densely linked communities where edges appear with high concentration among the members of the community. Identifying such communities of nodes has proven to be a challenging task due to a plethora of ...
On community detection in real-world networks and the importance of degree assortativity
KDD '13: Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data miningGraph clustering, often addressed as community detection, is a prominent task in the domain of graph data mining with dozens of algorithms proposed in recent years. In this paper, we focus on several popular community detection algorithms with low ...
A new metric to compare local community detection algorithms in social networks using geodesic distance
AbstractCommunity detection problem is a well-studied problem in social networks. One major question to this problem is how to evaluate different community detection algorithms. This issue is even more challenging in the problem of local community ...
Comments