ABSTRACT
Computing the (α, β)-core of a bipartite graph for given α and β is a fundamental problem in bipartite graph analysis, with applications such as online group recommendation and fraudster detection. The existing solution must traverse the entire bipartite graph once for each computation. Because real bipartite graphs can be very large and requests to compute the (α, β)-core can be issued frequently in real applications, the existing solution is too expensive. In this paper, we present an efficient algorithm based on a novel index that runs in time linear in the result size; the algorithm is therefore optimal, since outputting the result alone takes at least linear time. We prove that the index requires only O(m) space, where m is the number of edges in the bipartite graph. Moreover, we devise an efficient index construction algorithm with time complexity O(δ·m), where δ is bounded by √m and is much smaller than √m in practice. We also discuss efficient algorithms for maintaining the index when the bipartite graph is dynamically updated, as well as a parallel implementation of the index construction algorithm. Experimental results on real and synthetic graphs (with more than 1 billion edges) demonstrate that our algorithms achieve up to 5 orders of magnitude speedup for computing the (α, β)-core and up to 3 orders of magnitude speedup for index construction, compared with existing techniques.
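To make the cost of the non-indexed approach concrete, the following is a minimal sketch of a whole-graph peeling computation. It is not the index-based algorithm of this paper. It assumes the usual definition of the (α, β)-core as the maximal subgraph in which every upper-layer vertex has degree at least α and every lower-layer vertex has degree at least β. The names `build_adjacency` and `alpha_beta_core` and the edge-list input format are hypothetical and chosen only for illustration.

```python
from collections import defaultdict, deque


def build_adjacency(edges):
    """Build adjacency sets for both layers from (upper, lower) edge pairs."""
    upper_adj, lower_adj = defaultdict(set), defaultdict(set)
    for u, v in edges:
        upper_adj[u].add(v)
        lower_adj[v].add(u)
    return upper_adj, lower_adj


def alpha_beta_core(edges, alpha, beta):
    """Peel vertices violating the degree constraints (assumed definition).

    Returns the upper and lower vertex sets of the (alpha, beta)-core.
    Every query scans the whole graph, which is the cost an index avoids.
    """
    upper_adj, lower_adj = build_adjacency(edges)
    deg_u = {u: len(ns) for u, ns in upper_adj.items()}
    deg_l = {v: len(ns) for v, ns in lower_adj.items()}
    removed_u, removed_l = set(), set()
    queue = deque()

    # Seed the queue with every vertex that already violates its threshold.
    for u, d in deg_u.items():
        if d < alpha:
            removed_u.add(u)
            queue.append(("U", u))
    for v, d in deg_l.items():
        if d < beta:
            removed_l.add(v)
            queue.append(("L", v))

    # Removing a vertex lowers its neighbours' degrees, which may cascade.
    while queue:
        side, x = queue.popleft()
        if side == "U":
            for v in upper_adj[x]:
                if v in removed_l:
                    continue
                deg_l[v] -= 1
                if deg_l[v] < beta:
                    removed_l.add(v)
                    queue.append(("L", v))
        else:
            for u in lower_adj[x]:
                if u in removed_u:
                    continue
                deg_u[u] -= 1
                if deg_u[u] < alpha:
                    removed_u.add(u)
                    queue.append(("U", u))

    return set(upper_adj) - removed_u, set(lower_adj) - removed_l


# Toy graph: u3 is attached only to v1.
EDGES = [("u1", "v1"), ("u1", "v2"), ("u2", "v1"), ("u2", "v2"), ("u3", "v1")]
core_upper, core_lower = alpha_beta_core(EDGES, 2, 2)
```

On this toy graph, `u3` has degree 1 and is peeled for α = 2, leaving `u1`, `u2`, `v1`, and `v2`. Each call touches every edge at least once to build degrees, so its cost depends on the graph size rather than on the size of the result.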