Skip to main content

A Problem-Driven Approach for Building a Bioinformatics GraphDB

  • Conference paper
  • First Online:
Computational Intelligence Methods for Bioinformatics and Biostatistics (CIBB 2016)

Abstract

The development of high throughput technology in biological and medical domains has seen a growing intervention of informatics support. Indeed, the big amount of data produced is difficult to analyse and interpret in terms of time consuming and number of different resources used. In this context, the challenge would be to have an integrated and multi component database with a user friendly interface able to solve biological problems without a priori high-level of bioinformatics knowledge. This need arises from the evidence that biologists have multi-task and multi-levels problems to solve. To this aim, we propose a bottom-up, graph-based approach for integrating bioinformatics resources, usually databases, starting from typical biological scenarios, in order to solve novel bioinformatics problems. The integrated resources can be queried by means of a graph traversal language such as Gremlin.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Amberger, J.S., Bocchini, C.A., Schiettecatte, F., Scott, A.F., Hamosh, A.: Omim. org: online mendelian inheritance in man (omim®), an online catalog of human genes and genetic disorders. Nucleic Acids Res. 43(D1), D789–D798 (2015)

    Article  Google Scholar 

  2. Apache tinkerpop: the gremlin traversal language. https://github.com/tinkerpop/gremlin/wiki

  3. Betel, D., Wilson, M., Gabow, A., Marks, D.S., Sander, C.: The microrna. org resource: targets and expression. Nucleic Acids Res. 36(suppl 1), D149–D153 (2008)

    Google Scholar 

  4. Bhattacharya, A., Ziebarth, J.D., Cui, Y.: Polymirts database 3.0: linking polymorphisms in micrornas and their target sites with human diseases and biological pathways. Nucleic Acids Res. 42(D1), D86–D91 (2014)

    Article  Google Scholar 

  5. Botstein, D., Cherry, J.M., Ashburner, M., Ball, C., Blake, J., Butler, H., Davis, A., Dolinski, K., Dwight, S., Eppig, J., et al.: Gene ontology: tool for the unification of biology. Nat. Genet. 25(1), 25–29 (2000)

    Article  Google Scholar 

  6. Brennecke, J., Stark, A., Russell, R.B., Cohen, S.M.: Principles of microrna-target recognition. PLoS Biol. 3(3), e85 (2005)

    Article  Google Scholar 

  7. Croft, D., Mundo, A.F., Haw, R., Milacic, M., Weiser, J., Wu, G., Caudy, M., Garapati, P., Gillespie, M., Kamdar, M.R., et al.: The reactome pathway knowledgebase. Nucleic Acids Res. 42(D1), D472–D477 (2014)

    Article  Google Scholar 

  8. Dennis Jr., G., Sherman, B.T., Hosack, D.A., Yang, J., Gao, W., Lane, H.C., Lempicki, R.A., et al.: David: database for annotation, visualization, and integrated discovery. Genome Biol. 4(5), P3 (2003)

    Article  Google Scholar 

  9. Dweep, H., Gretz, N.: Mirwalk 2.0: a comprehensive atlas of microrna-target interactions. Nat. Methods 12(8), 697–697 (2015)

    Article  Google Scholar 

  10. Rojas, I., Ortuño, F. (eds.): IWBBIO 2017. LNCS, vol. 10209. Springer, Cham (2017)

    Google Scholar 

  11. Fiannaca, A., La Paglia, L., La Rosa, M., Messina, A., Storniolo, P., Urso, A.: Integrated DB for bioinformatics: a case study on analysis of functional effect of MiRNA SNPs in cancer. In: Renda, M.E., Bursa, M., Holzinger, A., Khuri, S. (eds.) ITBAM 2016. LNCS, vol. 9832, pp. 214–222. Springer, Cham (2016). doi:10.1007/978-3-319-43949-5_17

    Chapter  Google Scholar 

  12. Fiannaca, A., La Rosa, M., La Paglia, L., Rizzo, R., Urso, A.: Analysis of mirna expression profiles in breast cancer using biclustering. BMC Bioinform. 16(Suppl 4), S7 (2015)

    Article  Google Scholar 

  13. Gray, K.A., Yates, B., Seal, R.L., Wright, M.W., Bruford, E.A.: Genenames.org: the HGNC resources in 2015. Nucleic Acids Res. 43(D1), D1079–D1085 (2015)

    Article  Google Scholar 

  14. Huang, D.W., Sherman, B.T., Lempicki, R.A.: Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists. Nucleic Acids Res. 37(1), 1–13 (2009)

    Article  Google Scholar 

  15. Kanehisa, M., Sato, Y., Kawashima, M., Furumichi, M., Tanabe, M.: Kegg as a reference resource for gene and protein annotation. Nucleic Acids Res. 44(D1), D457–D462 (2016)

    Article  Google Scholar 

  16. Kozomara, A., Griffiths-Jones, S.: MiRBase: integrating microRNA annotation and deep-sequencing data. Nucleic Acids Res. 39(Database issue), D152–7 (2011)

    Google Scholar 

  17. Kumar, S., Dudley, J.: Bioinformatics software for biologists in the genomics era. Bioinformatics 23(14), 1713–1717 (2007)

    Article  Google Scholar 

  18. Laganà, A., Forte, S., Giudice, A., Arena, M., Puglisi, P.L., Giugno, R., Pulvirenti, A., Shasha, D., Ferro, A.: Miro: a mirna knowledge base. Database 2009, bap. 008 (2009)

    Google Scholar 

  19. Mi, H., Poudel, S., Muruganujan, A., Casagrande, J.T., Thomas, P.D.: Panther version 10: expanded protein families and functions, and analysis tools. Nucleic Acids Res. 44(D1), D336–D342 (2016)

    Article  Google Scholar 

  20. Rodriguez, M.A.: The gremlin graph traversal machine and language (invited talk). In: Proceedings of the 15th Symposium on Database Programming Languages - DBPL 2015, pp. 1–10. ACM Press, New York (2015)

    Google Scholar 

  21. Romero-Cordoba, S.L., Salido-Guadarrama, I., Rodriguez-Dorantes, M., Hidalgo-Miranda, A.: Mirna biogenesis: biological impact in the development of cancer. Cancer Biol. Ther. 15(11), 1444–1455 (2014)

    Article  Google Scholar 

  22. Rosenbloom, K.R., Armstrong, J., Barber, G.P., Casper, J., Clawson, H., Diekhans, M., Dreszer, T.R., Fujita, P.A., Guruvadoo, L., Haeussler, M., et al.: The UCSC genome browser database: 2015 update. Nucleic Acids Res. 43(D1), D670–D681 (2015)

    Article  Google Scholar 

  23. Schuler, G.D., Epstein, J.A., Ohkawa, H., Kans, J.A.: Entrez: molecular biology database and retrieval system. Methods Enzymol. 266, 141–162 (1996)

    Article  Google Scholar 

  24. Sigrist, C.J.A., de Castro, E., Cerutti, L., Cuche, B.A., Hulo, N., Bridge, A., Bougueleret, L., Xenarios, I.: New and continuing developments at PROSITE. Nucleic Acids Res. 41(Database issue), D344–7 (2013)

    Google Scholar 

  25. Szklarczyk, D., Franceschini, A., Wyder, S., Forslund, K., Heller, D., Huerta-Cepas, J., Simonovic, M., Roth, A., Santos, A., Tsafou, K.P., Kuhn, M., Bork, P., Jensen, L.J., von Mering, C.: STRING v10: protein-protein interaction networks, integrated over the tree of life. Nucleic Acids Res. 43(Database issue), D447–52 (2015)

    Google Scholar 

  26. The Gene ontology consortium: going forward. Nucleic Acids Res. 43(D1), D1049–D1056 (2015)

    Google Scholar 

  27. The UniProt Consortium: UniProt: a hub for protein information. Nucleic Acids Res. 43(D1), D204–D212 (2015)

    Article  Google Scholar 

  28. Xie, B., Ding, Q., Han, H., Wu, D.: MiRCancer: a microRNA-cancer association database constructed by text mining on literature. Bioinformatics 29(5), 638–644 (2013)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Massimo La Rosa .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Fiannaca, A., La Rosa, M., La Paglia, L., Messina, A., Rizzo, R., Urso, A. (2017). A Problem-Driven Approach for Building a Bioinformatics GraphDB. In: Bracciali, A., Caravagna, G., Gilbert, D., Tagliaferri, R. (eds) Computational Intelligence Methods for Bioinformatics and Biostatistics. CIBB 2016. Lecture Notes in Computer Science(), vol 10477. Springer, Cham. https://doi.org/10.1007/978-3-319-67834-4_11

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-67834-4_11

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-67833-7

  • Online ISBN: 978-3-319-67834-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics