skip to main content
10.1145/1150402.1150418acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
Article

NeMoFinder: dissecting genome-wide protein-protein interactions with meso-scale network motifs

Published:20 August 2006Publication History

ABSTRACT

Recent works in network analysis have revealed the existence of network motifs in biological networks such as the protein-protein interaction (PPI) networks. However, existing motif mining algorithms are not sufficiently scalable to find meso-scale network motifs. Also, there has been little or no work to systematically exploit the extracted network motifs for dissecting the vast interactomes.We describe an efficient network motif discovery algorithm, NeMoFinder, that can mine meso-scale network motifs that are repeated and unique in large PPI networks. Using NeMoFinder, we successfully discovered, for the first time, up to size-12 network motifs in a large whole-genome S. cerevisiae (Yeast) PPI network. We also show that such network motifs can be systematically exploited for indexing the reliability of PPI data that were generated via highly erroneous high-throughput experimental methods.

References

  1. I. Albert and R. Albert, Conserved network motifs allow protein-protein interaction prediction, Bioinformatics, Volume 20, Number 18, Pages 3346--3352, 2004]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. J. Chen, W. Hsu, M. L. Lee, and S. K. Ng, Discovering and exploiting meso-scale network motifs in protein interactomes, National University of Singapore, TRC6/06, 2006]]Google ScholarGoogle Scholar
  3. M. B. Eisen, P. T. Spellman, P. O. Brown, and D. Botstein, Cluster analysis and display of genome-wide expression patterns, Proc. Natl Acad. Sci. USA, 1998, volume 95, pages 14863--14868]]Google ScholarGoogle ScholarCross RefCross Ref
  4. S. Fortin, The graph isomorphism problem, Technical Report TR96-20, Department of Computing Science, University of Alberta, 1996]]Google ScholarGoogle Scholar
  5. A. Grigoriev, A relationship between gene expression and protein interactions on the proteome scale, Nucleic Acids Res, Volume 29, Number 17, Pages 3513--3519, 2001]]Google ScholarGoogle Scholar
  6. J. Huan, W. Wang, and J. Prins, Efficient mining of frequent subgraph in the presence of isomorphism, ICDM, 2003, pages 549--552]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. J. Huan, W. Wang, J. Prins, and J. Yang, SPIN: Mining maximal frequent subgraphs from graph databases, SIGKDD, 2004]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. A. Inokuchi, T. Washio, and H. Motoda, An Apriori-based algorithm for mining frequent substructures from graph, PKDD, 2000, pages 13--23]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. N. Kashtan, S. Itzkovitz, R. Milo, and U. Alon, Efficient sampling algorithm for estimating subgraph concentrations and detecting network motifs, Bioinformatics, 2004, volume 20, number 11, pages 1746--1758]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. M. Kuramochi and G. Karypis, An efficient algorithm for discovering frequent subgraphs, TKDE, 2001]]Google ScholarGoogle Scholar
  11. M. Kuramochi and G. Karypis, Finding Frequent Patterns in a Large Sparse Graph, In SIAM International Conference on Data Mining, 2004]]Google ScholarGoogle Scholar
  12. S. Maslov and K. Sneppen, Specificity and stability in topology of protein networks, Science, Volume 296, Number 5569, Pages 910--913, 2002]]Google ScholarGoogle Scholar
  13. C. V. Mering, R. Krause, B. Snel, et al, Comparative assessment of largescale data sets of protein-protein interactions, Nature, volume 417, pages 399--403, 2002]]Google ScholarGoogle Scholar
  14. H. W. Mewes, D. Frishman, U. Guldener, et al, MIPS: a database for genomes and protein sequences, Nucleic Acids Res, Volume 30, Number 1, Pages 31--34, 2002]]Google ScholarGoogle ScholarCross RefCross Ref
  15. R. Milo, S. Shen-Orr, S. Itzkovitz, N. Kashtan, D. Chklovskii, and U. Alon, Network Motifs: Simple Building Blocks of Complex Networks, Science, volume 298, pages 824--827, 2002]]Google ScholarGoogle Scholar
  16. R. Saito, H. Suzuki, and Y. Hayashizaki, Interaction generality, a measurement to assess the reliability of a protein-protein interaction, Nucleic Acids Res, 2002, volume 30, pages 1163--1168]]Google ScholarGoogle ScholarCross RefCross Ref
  17. R. Saito, H. Suzuki, and Y. Hayashizaki, Construction of reliable protein-protein interaction networks with a new interaction generality measure, Bioinformatics, 2002, volume 19, pages 756--763]]Google ScholarGoogle Scholar
  18. F. Schreiber and H. Schwobbermeyer, Frequency Concepts and Pattern Detection for the Analysis of Motifs in Networks, Transactions on Computational Systems Biology, volume 3, pages 89--104, LNBI 3737, 2005]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. V. Spirin and L. A. Mirny, Protein complexes and functional modules in molecular networks, PNAS, 2003, volume 100, number 21, pages 12123--12128]]Google ScholarGoogle ScholarCross RefCross Ref
  20. Uetz, P., Giot, L., Cagney, G., et al, A comprehensive analysis of protein-protein interactions in Saccharomyces cerevisiae, Nature, Volume 403, Number 6770, Pages 623--627, 2000]]Google ScholarGoogle Scholar
  21. X. Yan and J. Han, gSpan: Graph-based substructure pattern mining, ICDM, 2002]] Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. NeMoFinder: dissecting genome-wide protein-protein interactions with meso-scale network motifs

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      KDD '06: Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining
      August 2006
      986 pages
      ISBN:1595933395
      DOI:10.1145/1150402

      Copyright © 2006 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 20 August 2006

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • Article

      Acceptance Rates

      Overall Acceptance Rate1,133of8,635submissions,13%

      Upcoming Conference

      KDD '24

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader