Skip to main content

Semantic Clustering for Identifying Overlapping Biological Communities

  • Conference paper
  • First Online:
  • 909 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNBI,volume 10477))

Abstract

Proteins encoded by the genes associated with a common disorder interact together, participate in similar pathways, and share Gene Ontology (GO) terms. Drug discovery for certain disease may arise from a hypothesis that genes contributing to a common disorder have an increased tendency for their products to be linked at various functional levels. This may be induced from experimental studies of protein-protein interactions, co-regulation, co-expression, and annotated semantic information (e.g., those stored in Gene Ontology). Our aim is to improve the quality of aggregation discovery in dense biological interactions by incorporating such information embedded in biological repositories and mapping them in the spectral embedding space.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    http://www.genome.jp/kegg/.

  2. 2.

    http://www.uniprot.org/.

  3. 3.

    http://www.lasige.di.fc.ul.pt/webtools/proteinon/.

  4. 4.

    http://www.cbio.uct.ac.za/ITGOM/tools/itgom.php.

References

  1. Ashburner, M., Ball, C.A., Blake, J.A., et al.: Gene ontology: tool for the unification of biology. Nat. Genet. 25(1), 25–29 (2000)

    Article  Google Scholar 

  2. Azuaje, F., Wang, H., Bodenreider, O.: Ontology-driven similarity approaches to supporting gene functional assessment. In: Proceedings of the ISMB 2005 SIG meeting on Bio-ontologies, pp. 9–10, June 2005

    Google Scholar 

  3. Botstein, D., Chervitz, S.A., Cherry, J.M.: Yeast as a model organism. Science 277(5330), 1259–1260 (1997)

    Article  Google Scholar 

  4. Bezdek, J.C.: Pattern Recognition with Fuzzy Objective Function Algorithms. Kluwer Academic Publishers, Norwell (1981)

    Book  MATH  Google Scholar 

  5. Couto, F.M., Silva, M.J., Coutinho, P.M.: Measuring semantic similarity between gene ontology terms. Data Knowl. Eng. 61(1), 137–152 (2007)

    Article  Google Scholar 

  6. Costanzo, M., et al.: The genetic landscape of a Cell. Science 327(5964), 425–431 (2010)

    Article  Google Scholar 

  7. Filippone, M., Camastra, F., Masulli, F., Rovetta, S.: A survey of kernel and spectral methods for clustering Pattern recognition 41, ISSN: 0031–3203, pp. 176–190 (2008)

    Google Scholar 

  8. Fred, A.L.N., Jain, A.K.: Combining multiple clusterings using evidence accumulation. IEEE Trans. Pattern Anal. Mach. Intell. 27(6), 835–850 (2005)

    Article  Google Scholar 

  9. Havens, T.C., Bezdek, J.C., Leckie, C., Ramamohanarao, K., Palaniswami, M.: A soft modularity function for detecting fuzzy communities in social networks. Fuzzy Syst. IEEE Trans. 21(6), 1170–1175 (2013)

    Article  Google Scholar 

  10. Jiang, J.J., Conrath, D.W.: Semantic similarity based on corpus statistics and lexical taxonomy. In: International Conference on Research in Computational Linguistics, ROCLING X, pp. 19–33 (1997)

    Google Scholar 

  11. Krogan, N., et al.: Global landscape of protein complexes in the yeast saccharomyces cerevisiae. Nature 440, 637–643 (2006)

    Article  Google Scholar 

  12. Lin, D.: An information-theoretic definition of similarity. In: Shavlik, J. (ed.) Fifteenth International Conference on Machine Learning, ICML Madison, pp. 296–304 (1998)

    Google Scholar 

  13. Lord, P.W., Stevens, R.D., Brass, A., Goble, C.A.: Semantic similarity measures as tools for exploring the gene ontology. Pac. Symp. Biocomputing 8, 601–612 (2003)

    MATH  Google Scholar 

  14. Mahmoud, H., Masulli, F., Rovetta, S., Abdullatif, A.: Comparison of methods for community detection in networks. In: Villa, A.E.P., Masulli, P., Pons Rivero, A.J. (eds.) ICANN 2016. LNCS, vol. 9887, pp. 216–224. Springer, Cham (2016). doi:10.1007/978-3-319-44781-0_26

    Chapter  Google Scholar 

  15. Mahmoud, H., Masulli, F., Rovetta, S., Russo, G.: Detecting overlapping protein communities in disease networks. In: di Serio, C., Liò, P., Nonis, A., Tagliaferri, R. (eds.) CIBB 2014. LNCS, vol. 8623, pp. 109–120. Springer, Cham (2015). doi:10.1007/978-3-319-24462-4_10

    Chapter  Google Scholar 

  16. Mahmoud, H., Masulli, F., Rovetta, S., Russo, G.: Community detection in protein-protein interaction networks using spectral and graph approaches. In: Formenti, E., Tagliaferri, R., Wit, E. (eds.) CIBB 2013 2013. LNCS, vol. 8452, pp. 62–75. Springer, Cham (2014). doi:10.1007/978-3-319-09042-9_5

    Google Scholar 

  17. Mahmoud, H., Masulli, F., Rovetta, S.: A fuzzy clustering segmentation approach for feature-based medical image registration. In: Ninth International Meeting on Computational Intelligence Methods for Bioinformatics and Biostatistics CIBB2012, p. 41 (2012)

    Google Scholar 

  18. Mazandu, G.K., Mulder, N.J.: A topology-based metric for measuring term similarity in the gene ontology. Adv. Bioinform. 15, 195–211 (2012)

    Google Scholar 

  19. Mazandu, G.K., Mulder, N.J.: Information content-based gene ontology functional similarity measures: which one to use for a given biological data type? PLoS ONE 9(12), e113859 (2014)

    Article  Google Scholar 

  20. Nepusz, T., Petrczi, A., Ngyessy, L., Bazs, F.: Fuzzy communities and the concept of bridgeness in complex networks. Phys. Rev. E 77(1), 016107 (2008)

    Article  MathSciNet  Google Scholar 

  21. Newman, M.E.J.: Modularity and community structure in networks. proc. Nat. Acad. sci. 103(23), pp. 857–869 (2006)

    Google Scholar 

  22. Ng, J., Jordan, M.I., Weiss, Y.: On spectral clustering: analysis and an algorithm. In: Proceedings of Neural Information Processing Systems, pp. 849–856 (2002)

    Google Scholar 

  23. Rada, R., Mili, H., Bichnell, E., Blettner, M.: Development and application of a metric on semantic nets. IEEE Trans. Syst. Man Cybern. B Cybern. 9, 17–30 (1989)

    Article  Google Scholar 

  24. Rodriguez, M.A., Egenhofer, M.J.: Determining semantic similarity among entity lasses from different ontologies. IEEE Trans. Knowl. Data Eng. 15, 442–456 (2003)

    Article  Google Scholar 

  25. Sevilla, J.L., Segura, V., Podhorski, A., Guruceaga, E., Mato, J.M., Martinez-Cruz, L.A., et al.: Correlation between gene expression and GO semantic similarity. IEEE/ACM Trans. Comput. Biol. Bioinf. 2(4), 330–338 (2005)

    Article  Google Scholar 

  26. Resnik, P.: Using information content to evaluate semantic similarity in a taxonomy. In: Mellish, C.S. (ed.) 14th International Joint Conference on Artificial Intelligence, IJCAI 1, pp. 448–453 (1995)

    Google Scholar 

  27. Zhang, S., Wang, R.S., Zhang, X.S.: Identification of overlapping community structure in complex networks using fuzzy c-means clustering. Phys. A 374(1), 483–490 (2007)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hassan Mahmoud .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Mahmoud, H., Masulli, F., Rovetta, S. (2017). Semantic Clustering for Identifying Overlapping Biological Communities. In: Bracciali, A., Caravagna, G., Gilbert, D., Tagliaferri, R. (eds) Computational Intelligence Methods for Bioinformatics and Biostatistics. CIBB 2016. Lecture Notes in Computer Science(), vol 10477. Springer, Cham. https://doi.org/10.1007/978-3-319-67834-4_19

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-67834-4_19

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-67833-7

  • Online ISBN: 978-3-319-67834-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics