Abstract
Maximal biclique enumeration is a fundamental problem in analysing bipartite graphs, which has a wide range of real applications, such as web mining, recommendation systems, and social network analysis. As real-world bipartite graphs constantly evolve over time, it is useful and necessary to incrementally maintain maximal bicliques in dynamic bipartite graphs. Existing solutions for this problem suffer from the major issue of enumerating duplicate maximal bicliques, which inevitably bring huge computation overhead. In this paper, we devise a novel framework to efficiently maintain maximal bicliques when the graph evolves with edge insertions. There are two major steps, i.e., new maximal biclique enumeration and subsumed maximal biclique enumeration. In particular, we sort edges in order when a batch of edges is inserted or removed. Based on the sequence of edges, we construct a recursion search tree to avoid duplicate new maximal bicliques and non-maximal bicliques. Besides, for subsumed maximal biclique eumeration, we first check the maximality of bicliques in an efficient way, and then delete these are no longer maximal. Furthermore, we also show that our techniques can be easily extended to deal with edge deletions. The experiment results demonstrate the efficiency of our techniques, which show a superior performance over the state-of-the-art method.
Similar content being viewed by others
References
Alexe, G., Alexe, S., Crama, Y., Foldes, S., Hammer, P.L., Simeone, B.: Consensus algorithms for the generation of all maximal bicliques. Discret. Appl. Math. 145(1), 11–21 (2004)
Beutel, A., Xu, W., Guruswami, V., Palow, C., Faloutsos, C.: Copycatch: stopping group attacks by spotting lockstep behavior in social networks. In: Proceedings of the 22nd International Conference on World Wide Web, pp 119–130 (2013)
Chen, C.M., Wang, C.J., Tsai, M.F., Yang, Y.H.: Collaborative similarity embedding for recommender systems. In: The World Wide Web Conference, pp 2637–2643 (2019)
Chen, L., Liu, C., Zhou, R., Xu, J., Li, J.: Efficient exact algorithms for maximum balanced biclique search in bipartite graphs. In: Proceedings of the 2021 International Conference on Management of Data, pp 248–260 (2021)
Damaschke, P.: Enumerating maximal bicliques in bipartite graphs with favorable degree sequences. Inf. Process. Lett. 114(6), 317–321 (2014)
Das, A., Tirthapura, S.: Incremental maintenance of maximal bicliques in a dynamic bipartite graph. IEEE Transactions on Multi-Scale Computing Systems 4(3), 231–242 (2018)
Deng, H., Lyu, M.R., King, I.: A generalized co-hits algorithm and its application to bipartite graphs. In: Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp 239–248 (2009)
Dias, V.M., De Figueiredo, C.M., Szwarcfiter, J.L.: Generating bicliques of a graph in lexicographic order. Theor. Comput. Sci. 337(1-3), 240–248 (2005)
Dias, V.M., de Figueiredo, C.M., Szwarcfiter, J.L.: On the generation of bicliques of a graph. Discret. Appl. Math. 155(14), 1826–1832 (2007)
Ding, D., Li, H., Huang, Z., Mamoulis, N.: Efficient fault-tolerant group recommendation using (α, β)-core. In: Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, pp 2047–2050 (2017)
Eppstein, D.: Arboricity and bipartite subgraph listing algorithms. Information Processing Letters 51(4), 207–211 (1994)
Gély, A., Nourine, L., Sadi, B.: Enumeration aspects of maximal cliques and bicliques. Discrete Applied Mathematics 157(7), 1447–1459 (2009)
He, Y., Wang, K., Zhang, W., Lin, X., Zhang, Y.: Exploring cohesive subgraphs with vertex engagement and tie strength in bipartite graphs. Inform. Sci. 572, 277–296 (2021)
Li, Y., Fan, J., Wang, Y., Tan, K.L.: Influence maximization on social graphs: a survey. IEEE Trans. Knowl. Data Eng. 30(10), 1852–1872 (2018)
Li, Y., Fan, J., Ovchinnikov, G., Karras, P.: Maximizing multifaceted network influence. In: 2019 IEEE 35th International Conference on Data Engineering (ICDE), pp 446–457. IEEE (2019)
Liu, B., Yuan, L., Lin, X., Qin, L., Zhang, W., Zhou, J.: Efficient (α, β)-core computation: an index-based approach. In: The World Wide Web Conference, pp 1130–1141 (2019)
Liu, B., Yuan, L., Lin, X., Qin, L., Zhang, W., Zhou, J.: Efficient (α, β)-core computation in bipartite graphs. The VLDB Journal 29(5), 1075–1099 (2020)
Liu, G., Sim, K., Li, J.: Efficient mining of large maximal bicliques. In: International Conference on Data Warehousing and Knowledge Discovery, pp 437–448. Springer (2006)
Makino, K., Uno, T.: New algorithms for enumerating all maximal cliques. In: Scandinavian Workshop on Algorithm Theory, pp 260–272. Springer (2004)
Mukherjee, A.P., Tirthapura, S.: Enumerating maximal bicliques from a large graph using mapreduce. IEEE Trans. Serv. Comput. 10(5), 771–784 (2016)
Prisner, E.: Bicliques in graphs i: bounds on their number. Combinatorica 20(1), 109–117 (2000)
Sanei-Mehri, S.V., Sariyuce, A.E., Tirthapura, S.: Butterfly counting in bipartite networks. In: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 2150–2159 (2018)
Sanei-Mehri, S.V., Zhang, Y., Sariyüce, A.E., Tirthapura, S.: Fleet: butterfly estimation from a bipartite graph stream. In: Proceedings of the 28th ACM International Conference on Information and Knowledge Management, pp 1201–1210 (2019)
Sarıyüce, A.E., Pinar, A.: Peeling bipartite networks for dense subgraph discovery. In: Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining, pp 504–512 (2018)
Svendsen, M., Mukherjee, A.P., Tirthapura, S.: Mining maximal cliques from a large graph using mapreduce: tackling highly uneven subproblem sizes. Journal of Parallel and Distributed Computing 79, 104–114 (2015)
Wang, J., De Vries, A.P., Reinders, M.J.: Unifying user-based and item-based collaborative filtering approaches by similarity fusion. In: Proceedings of the 29th annual international ACM SIGIR conference on research and development in information retrieval, pp 501–508 (2006)
Wang, K., Lin, X., Qin, L., Zhang, W., Zhang, Y.: Vertex priority based butterfly counting for large-scale bipartite networks. Proceedings of the VLDB Endowment 12(10), 1139–1152 (2019)
Wang, K., Lin, X., Qin, L., Zhang, W., Zhang, Y.: Efficient bitruss decomposition for large-scale bipartite graphs. In: 2020 IEEE 36th International Conference on Data Engineering (ICDE), pp 661–672. IEEE (2020)
Wang, K., Lin, X., Qin, L., Zhang, W., Zhang, Y.: Towards efficient solutions of bitruss decomposition for large-scale bipartite graphs. The VLDB Journal, pp 1–24 (2021)
Wang, K., Zhang, W., Lin, X., Zhang, Y., Qin, L., Zhang, Y. : Efficient and effective community search on large-scale bipartite graphs. In: 2021 IEEE 37th International Conference on Data Engineering (ICDE), pp 85–96. IEEE (2021)
Wang, R., Liao, M., Qin, C.: An efficient algorithm for enumerating maximal bicliques from a dynamically growing graph. In: The International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery, pp 329–337. Springer (2019)
Xu, Y., Cheng, J., Fu, A.W.C., Bu, Y.: Distributed maximal clique computation. In: 2014 IEEE International Congress on Big Data, pp 160–167. IEEE (2014)
Yang, Y., Fang, Y., Orlowska, M.E., Zhang, W., Lin, X.: Efficient bi-triangle counting for large bipartite networks. Proceedings of the VLDB Endowment 14(6), 984–996 (2021)
Zhao, X., Xiao, C., Lin, X., Zhang, W., Wang, Y.: Efficient structure similarity searches: a partition-based approach. The VLDB Journal 27(1), 53–78 (2018)
Acknowledgements
This research was supported in part by NSFC (Grant No. 62002108, 61872134), Science and Technology Program of Changsha City (Grant kh2005019), Zhejiang Lab (NO.2021KD0AB02), and the Key Area Research Program of Hunan (2019GK2091).
Author information
Authors and Affiliations
Corresponding authors
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
This article belongs to the Topical Collection: Special Issue on Large Scale Graph Data Analytics
Guest Editors: Xuemin Lin, Lu Qin, Wenjie Zhang, and Ying Zhang
Rights and permissions
About this article
Cite this article
Ma, Z., Liu, Y., Hu, Y. et al. Efficient maintenance for maximal bicliques in bipartite graph streams. World Wide Web 25, 857–877 (2022). https://doi.org/10.1007/s11280-021-00927-x
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11280-021-00927-x