skip to main content
research-article

Linearization of Dependency and Sampling for Participation-based Betweenness Centrality in Very Large B-hypergraphs

Published: 13 March 2020 Publication History

Abstract

A B-hypergraph consisting of nodes and directed hyperedges is a generalization of the directed graph. A directed hyperedge in the B-hypergraph represents a relation from a set of source nodes to a single destination node. We suggest one possible definition of betweenness centrality (BC) in B-hypergraphs, called Participation-based BC (PBC). A PBC score of a node is computed based on the number of the shortest paths in which the node participates. This score can be expressed in terms of dependency on the set of its outgoing hyperedges. In this article, we focus on developing efficient computation algorithms for PBC. We first present an algorithm called ePBC for computing exact PBC scores of nodes, which has a cubic-time complexity. This algorithm, however, can be used for only small-sized B-hypergraphs because of its cubic-time complexity, so we propose linearized PBC (PBC) that is an approximation method of ePBC. PBC that comes with a guaranteed upper bound on its error, uses a linearization of dependency on a set of hyperedges. PBC improves the computing time of ePBC by an order of magnitude (i.e., it requires a quadratic time) while maintaining a high accuracy. PBC works well on small to medium-sized B-hypergraphs, but is not scalable enough for a very large B-hypergraph with more than a million hyperedges. To cope with such a very large B-hypergraph, we propose a very fast heuristic sampling-based method called sampling-based PBC (sPBC). We show through extensive experiments that PBC and sPBC can efficiently estimate PBC scores in various real-world B-hypergraphs with a reasonably good precision@k. The experimental results show that sPBC works efficiently even for a very large B-hypergraph.

References

[1]
Alireza Abbasi, Liaquat Hossain, and Loet Leydesdorff. 2012. Betweenness centrality as a driver of preferential attachment in the evolution of research collaboration networks. Journal of Informetrics 6, 3 (2012), 403--412.
[2]
Tharaka Alahakoon, Rahul Tripathi, Nicolas Kourtellis, Ramanuja Simha, and Adriana Iamnitchi. 2011. K-path centrality: A new centrality measure in social networks. In Proceedings of the 4th Workshop on Social Network Systems. ACM, 1.
[3]
Ziyad AlGhamdi, Fuad Jamour, Spiros Skiadopoulos, and Panos Kalnis. 2017. A benchmark for betweenness centrality approximation algorithms on large graphs. In Proceedings of the 29th International Conference on Scientific and Statistical Database Management. ACM, 6.
[4]
Ted T. Ashburn and Karl B. Thor. 2004. Drug repositioning: Identifying and developing new uses for existing drugs. Nature Reviews Drug Discovery 3, 8 (2004), 673.
[5]
Giorgio Ausiello and Luigi Laura. 2017. Directed hypergraphs: Introduction and fundamental algorithms - A survey. Theoretical Computer Science 658, PB (2017), 293--306.
[6]
David A. Bader, Shiva Kintali, Kamesh Madduri, and Milena Mihail. 2007. Approximating betweenness centrality. In International Workshop on Algorithms and Models for the Web-Graph. Springer, 124--137.
[7]
Peter L. Bartlett, Olivier Bousquet, Shahar Mendelson, et al. 2005. Local rademacher complexities. The Annals of Statistics 33, 4 (2005), 1497--1537.
[8]
Cécile Bothorel and Mohamed Bouklit. 2011. An algorithm for detecting communities in folksonomy hypergraphs. In Proceedings of the 11th International Conference on Innovative Internet Community Services. LNI, 159--168.
[9]
Ulrik Brandes. 2001. A faster algorithm for betweenness centrality. The Journal of Mathematical Sociology 25, 2 (2001), 163--177.
[10]
Ulrik Brandes. 2008. On variants of shortest-path betweenness centrality and their generic computation. Social Networks 30, 2 (2008), 136--145.
[11]
Pablo Carbonell, Davide Fichera, Shashi B. Pandit, and Jean-Loup Faulon. 2012. Enumerating metabolic pathways for the production of heterologous target chemicals in chassis organisms. BMC Systems Biology 6, 1 (2012), 10.
[12]
Linton C. Freeman. 1977. A set of measures of centrality based on betweenness. Sociometry 40, 1 (1977), 35--41.
[13]
Giorgio Gallo, Giustino Longo, Stefano Pallottino, and Sang Nguyen. 1993. Directed hypergraphs and applications. Discrete Applied Mathematics 42, 2–3 (1993), 177--201.
[14]
Jianhang Gao, Qing Zhao, and Ananthram Swami. 2012. The thinnest path problem for secure communications: A directed hypergraph approach. In Proceedings of the 50th Annual Allerton Conference on Communication, Control, and Computing. 847--852.
[15]
Takanori Hayashi, Takuya Akiba, and Yuichi Yoshida. 2015. Fully dynamic betweenness centrality maintenance on massive networks. Proceedings of the VLDB Endowment 9, 2 (2015), 48--59.
[16]
Petter Holme, Beom Jun Kim, Chang No Yoon, and Seung Kee Han. 2002. Attack vulnerability of complex networks. Physical Review E 65, 5 (2002), 056109.
[17]
Woochang Hwang, Young-rae Cho, Aidong Zhang, and Murali Ramanathan. 2006. Bridging centrality: Identifying bridging nodes in scale-free networks. In Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 20--23.
[18]
Swami Iyer, Timothy Killingback, Bala Sundaram, and Zhen Wang. 2013. Attack robustness and centrality of complex networks. PloS One 8, 4 (2013), e59613.
[19]
Livnat Jerby, Tomer Shlomi, and Eytan Ruppin. 2010. Computational reconstruction of tissue-specific metabolic models: Application to human liver metabolism. Molecular Systems Biology 6, 1 (2010), 401.
[20]
Jeff Kahn, Nathan Linial, and Alex Samorodnitsky. 1996. Inclusion-exclusion: Exact and approximate. Combinatorica 16, 4 (1996), 465--477.
[21]
Steffen Klamt, Utz-Uwe Haus, and Fabian Theis. 2009. Hypergraphs and cellular networks. PLoS Computational Biology 5, 5 (2009), e1000385.
[22]
C. Klein, a. Marino, M.-F. Sagot, P. Vieira Milreu, and M. Brilli. 2012. Structural and dynamical analysis of biological networks. Briefings in Functional Genomics 11, 6 (2012), 420--433.
[23]
Eric D. Kolaczyk, David B. Chua, and Marc Barthélemy. 2009. Group betweenness and co-betweenness: Inter-related notions of coalition centrality. Social Networks 31, 3 (2009), 190--203.
[24]
Dániel Kondor, Márton Pósfai, István Csabai, and Gábor Vattay. 2014. Do the rich get richer? An empirical analysis of the Bitcoin transaction network. PloS One 9, 2 (2014), e86197.
[25]
Kwang Hee Lee and Myoung Ho Kim. 2017. Computing betweenness centrality in B-hypergraphs. In Proceedings of the 2017 ACM Conference on Information and Knowledge Management, Singapore, November 06–10, 2017. 2147--2150.
[26]
Min-Joong Lee, Jungmin Lee, Jaimie Yejean Park, Ryan Hyun Choi, and Chin-Wan Chung. 2012. Qube: A quick algorithm for updating betweenness centrality. In Proceedings of the 21st International Conference on World Wide Web. ACM, 351--360.
[27]
Loet Leydesdorff. 2007. Betweenness centrality as an indicator of the interdisciplinarity of scientific journals. Journal of the American Society for Information Science and Technology 58, 9 (2007), 1303--1319.
[28]
Xiaoming Liu, Johan Bollen, Michael L. Nelson, and Herbert Van de Sompel. 2005. Co-authorship networks in the digital library research community. Information Processing 8 Management 41, 6 (2005), 1462--1480.
[29]
M. E. J. Newman. 2005. A measure of betweenness centrality based on random walks. Social Networks 27, 1 (2005), 39--54.
[30]
Lars Relund Nielsen, Kim Allan Andersen, and Daniele Pretolani. 2005. Finding the K shortest hyperpaths. Computers and Operations Research 32, 6 (2005), 1477--1497.
[31]
Hiroyuki Ogata, Susumu Goto, Kazushige Sato, Wataru Fujibuchi, Hidemasa Bono, and Minoru Kanehisa. 1999. KEGG: Kyoto encyclopedia of genes and genomes. Nucleic Acids Research 27, 1 (1999), 29--34.
[32]
Rami Puzis, Yuval Elovici, and Shlomi Dolev. 2007. Fast algorithm for successive computation of group betweenness centrality. Physical Review E 76, 5 (2007), 056709.
[33]
Rami Puzis, Manish Purohit, and V. S. Subrahmanian. 2013. Betweenness computation in the single graph representation of hypergraphs. Social Networks 35, 4 (2013), 561--572.
[34]
Stephen Ranshous, Cliff A. Joslyn, Sean Kreyling, Kathleen Nowak, Nagiza F. Samatova, Curtis L. West, and Samuel Winters. 2017. Exchange pattern mining in the bitcoin transaction directed hypergraph. In Proceedings of the International Conference on Financial Cryptography and Data Security. Springer, 248--263.
[35]
Matteo Riondato and Evgenios M. Kornaropoulos. 2016. Fast approximation of betweenness centrality through sampling. Data Mining and Knowledge Discovery 30, 2 (2016), 438--475.
[36]
Matteo Riondato and Eli Upfal. 2016. ABRA: Approximating betweenness centrality in static and dynamic graphs with rademacher averages. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 1145--1154.
[37]
Matteo Riondato and Eli Upfal. 2018. ABRA: Approximating betweenness centrality in static and dynamic graphs with rademacher averages. ACM Transactions on Knowledge Discovery from Data 12, 5 (2018), 61.
[38]
Anna Ritz, Brendan Avent, and T. Murali. 2015. Pathway analysis with signaling hypergraphs. IEEE/ACM Transactions on Computational Biology and Bioinformatics 14, 5 (2015). 1042–1055.
[39]
Carl F. Schaefer, Kira Anthony, Shiva Krupa, Jeffrey Buchoff, Matthew Day, Timo Hannay, and Kenneth H. Buetow. 2009. PID: The pathway interaction database. Nucleic Acids Research 37, suppl 1 (2009), D674--D679.
[40]
Shai Shalev-Shwartz and Shai Ben-David. 2014. Understanding Machine Learning: From Theory to Algorithms. CUP.
[41]
Jie Tang, Jing Zhang, Limin Yao, Juanzi Li, Li Zhang, and Zhong Su. 2008. ArnetMiner: Extraction and mining of academic social networks. In Proceeding of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. (2008), 990--998.
[42]
Antonio P. Volpentesta. 2008. Hypernetworks in a directed hypergraph. European Journal of Operational Research 188, 2 (2008), 390--405.
[43]
Yuichi Yoshida. 2014. Almost linear-time algorithms for adaptive betweenness centrality using hypergraph sketches. In Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 1416--1425.
[44]
Hasun Yu, Jinmyung Jung, Seyeol Yoon, Mijin Kwon, Sunghwa Bae, Soorin Yim, Jaehyun Lee, Seunghyun Kim, Yeeok Kang, and Dohoen Lee. 2017. CODA: Integrating multi-level context-oriented directed associations for analysis of drug effects. Scientific Reports 7, 1 (2017), 7519.
[45]
Wen Zhang, Xinrui Liu, Yanlin Chen, Wenjian Wu, Wei Wang, and Xiaohong Li. 2018. Feature-derived graph regularized matrix factorization for predicting drug side effects. Neurocomputing 287, C (2018), 154--162.

Cited By

View all
  • (2022)Centrality Measures in Finding Influential Nodes for the Big-Data NetworkHandbook of Smart Materials, Technologies, and Devices10.1007/978-3-030-84205-5_103(2393-2409)Online publication date: 10-Nov-2022
  • (2021)Graph Neural Networks for Fast Node Ranking ApproximationACM Transactions on Knowledge Discovery from Data10.1145/344621715:5(1-32)Online publication date: 26-Jun-2021

Index Terms

  1. Linearization of Dependency and Sampling for Participation-based Betweenness Centrality in Very Large B-hypergraphs

        Recommendations

        Comments

        Information & Contributors

        Information

        Published In

        cover image ACM Transactions on Knowledge Discovery from Data
        ACM Transactions on Knowledge Discovery from Data  Volume 14, Issue 3
        June 2020
        381 pages
        ISSN:1556-4681
        EISSN:1556-472X
        DOI:10.1145/3388473
        Issue’s Table of Contents
        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        Published: 13 March 2020
        Accepted: 01 December 2019
        Revised: 01 October 2019
        Received: 01 October 2018
        Published in TKDD Volume 14, Issue 3

        Permissions

        Request permissions for this article.

        Check for updates

        Author Tags

        1. B-hypergraph
        2. Directed hypergraph
        3. betweenness centrality
        4. inclusion-exclusion principle
        5. rademacher complexity

        Qualifiers

        • Research-article
        • Research
        • Refereed

        Funding Sources

        • Korea Electric Power Corporation
        • Korea government(MSIT)

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • Downloads (Last 12 months)12
        • Downloads (Last 6 weeks)0
        Reflects downloads up to 07 Mar 2025

        Other Metrics

        Citations

        Cited By

        View all
        • (2022)Centrality Measures in Finding Influential Nodes for the Big-Data NetworkHandbook of Smart Materials, Technologies, and Devices10.1007/978-3-030-84205-5_103(2393-2409)Online publication date: 10-Nov-2022
        • (2021)Graph Neural Networks for Fast Node Ranking ApproximationACM Transactions on Knowledge Discovery from Data10.1145/344621715:5(1-32)Online publication date: 26-Jun-2021

        View Options

        Login options

        Full Access

        View options

        PDF

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        HTML Format

        View this article in HTML Format.

        HTML Format

        Figures

        Tables

        Media

        Share

        Share

        Share this Publication link

        Share on social media