Efficient analytical computation of expected frequency of motifs of small size by marginalization in uncertain network

Authors:

Takayasu Fushimi,

Hiroshi Motoda

ASONAM '21: Proceedings of the 2021 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining

Pages 1 - 8

https://doi.org/10.1145/3487351.3488275

Published: 19 January 2022

Abstract

Counting motifs in an uncertain graph, in which each link is associated with a connection probability, is computationally expensive when the graph is huge because the number of possible worlds is extremely large. A natural approach is to rely on sampling-based approximation methods, but this still requires many sample graphs to obtain accurate results. We propose a novel method that analytically computes the expected frequency of a motif without relying on expensive sampling. Marginalizing the probability of each possible world on a candidate motif drastically reduces the number of possible worlds that need to be considered when the motif is small. Experiments using real-world data confirm that the proposed method is effective and efficient, and that it far outperforms the state-of-the-art sampling-based method: its accuracy is guaranteed, it runs about four orders of magnitude faster, and its speed does not depend on the connection probability.
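
The marginalization idea in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm. It assumes an undirected graph with independent link probabilities and restricts itself to 3-node motifs (triangles and induced 2-paths). The names `expected_3node_motifs`, `brute_force_expectation`, `NODES`, and `PROB` are hypothetical. For each candidate node triple, only the three links among those nodes matter, so all other links are summed out and the expectation follows from at most 2^3 local worlds instead of 2^|E| global ones. The brute-force function enumerates every possible world on a toy graph purely to check the analytical value.

```python
from itertools import combinations, product


def expected_3node_motifs(nodes, prob):
    """Analytical expectation of triangle and induced 2-path counts.

    prob maps frozenset({u, v}) -> link probability; links are assumed
    independent (an assumption of this sketch).
    """
    def p(u, v):
        return prob.get(frozenset((u, v)), 0.0)

    triangles = wedges = 0.0
    for a, b, c in combinations(nodes, 3):
        pab, pbc, pac = p(a, b), p(b, c), p(a, c)
        # All three links present: the triple forms a triangle.
        triangles += pab * pbc * pac
        # Exactly two links present: the triple forms an induced 2-path.
        wedges += (pab * pbc * (1 - pac)
                   + pab * pac * (1 - pbc)
                   + pbc * pac * (1 - pab))
    return triangles, wedges


def brute_force_expectation(nodes, prob):
    """Reference value: enumerate all 2^|E| possible worlds (toy sizes only)."""
    edges = list(prob)
    triangles = wedges = 0.0
    for world in product((0, 1), repeat=len(edges)):
        weight = 1.0
        present = set()
        for edge, bit in zip(edges, world):
            weight *= prob[edge] if bit else 1 - prob[edge]
            if bit:
                present.add(edge)
        for a, b, c in combinations(nodes, 3):
            k = sum(frozenset(pair) in present
                    for pair in ((a, b), (b, c), (a, c)))
            if k == 3:
                triangles += weight
            elif k == 2:
                wedges += weight
    return triangles, wedges


# Hypothetical toy graph: 4 nodes, 5 uncertain links.
NODES = [0, 1, 2, 3]
PROB = {
    frozenset((0, 1)): 0.9,
    frozenset((1, 2)): 0.5,
    frozenset((0, 2)): 0.4,
    frozenset((2, 3)): 0.7,
    frozenset((1, 3)): 0.2,
}

if __name__ == "__main__":
    print("analytical :", expected_3node_motifs(NODES, PROB))
    print("brute force:", brute_force_expectation(NODES, PROB))
```

The analytical function visits each node triple once and never samples, which is why its cost in this sketch does not depend on the link probabilities. A practical implementation would enumerate only connected candidate triples rather than all combinations.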

Cited By

T. Fushimi, K. Saito, and H. Motoda (2022). Efficient computation of expected motif frequency in uncertain graphs by exploiting possible world marginalization and motif transition. Social Network Analysis and Mining 12:1. Online publication date: 3-Sep-2022.
https://doi.org/10.1007/s13278-022-00956-y

Published In

ASONAM '21: Proceedings of the 2021 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining

November 2021

693 pages

ISBN: 9781450391283

DOI: 10.1145/3487351

Editors: Michele Coscia (IT University of Copenhagen, Denmark), Alfredo Cuzzocrea (University of Calabria, Italy), Kai Shu (Illinois Institute of Technology)

General Chairs: Ralf Klamma (RWTH Aachen University, Germany), Sharyn O'Halloran (Columbia University), Jon Rokne (University of Calgary, Canada)

Copyright © 2021 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SIGKDD: ACM Special Interest Group on Knowledge Discovery in Data

In-Cooperation

IEEE CS

Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

Research-article

Funding Sources

JSPS KAKENHI

Conference

ASONAM '21

Sponsor:

SIGKDD

ASONAM '21: International Conference on Advances in Social Networks Analysis and Mining

November 8 - 11, 2021

Virtual Event, Netherlands

Acceptance Rates

ASONAM '21 Paper Acceptance Rate 22 of 118 submissions, 19%;

Overall Acceptance Rate 116 of 549 submissions, 21%
