skip to main content
10.1145/3107411.3107436acmconferencesArticle/Chapter ViewAbstractPublication PagesbcbConference Proceedingsconference-collections
research-article

Associating Genomics and Clinical Information by Means of Semantic Based Ranking

Published: 20 August 2017 Publication History

Abstract

Relating genomic data with clinical and disease information is a new challenge for life sciences research. High performance computational platforms allow huge quantity of biological data production with new technologies (e.g. Next Generation Sequencing techniques). Nowadays, genomic ontologies describing genes and functions, as well as databases containing diseases groups, are available. We focus on the problem of enriching genomic datasets containing miRNA genes by adding related disease information. The enrichment is performed by using ontologies to find genes-to-diseases associations. Ontologies are used to describe molecular genomic processes and functions, as well as disease classes and experimental details. International Classification of Diseases (ICD) is used for the classification of diseases and clinical information. Diseases are ranked by using a Google Page Rank based algorithm. An application tool called Surf App! has been coded and developed in R and tested on a neurological disease dataset.

References

[1]
Stephen F Altschul, Warren Gish, Webb Miller, Eugene W Myers, and David J Lipman. 1990. Basic local alignment search tool. Journal of molecular biology 215, 3 (1990), 403--410.
[2]
Stephen F Altschul, Thomas L Madden, Alejandro A Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J Lipman. 1997. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic acids research 25, 17 (1997), 3389--3402.
[3]
Michael Ashburner, Catherine A Ball, Judith A Blake, David Botstein, Heather Butler, J Michael Cherry, Allan P Davis, Kara Dolinski, Selina S Dwight, Janan T Eppig, et al. 2000. Gene Ontology: tool for the unification of biology. Nature genetics 25, 1 (2000), 25--29.
[4]
Tim Beck, Robert K Hastings, Sirisha Gollapudi, Robert C Free, and Anthony J Brookes. 2014. GWAS Central: a comprehensive resource for the comparison and interrogation of genome-wide association studies. European Journal of Human Genetics 22, 7 (2014), 949--952.
[5]
Mario Cannataro, Pietro Hiram Guzzi, and Pierangelo Veltri. 2010. Using ontologies for querying and analysing protein-protein interaction data. Procedia Computer Science 1, 1 (2010), 997--1004.
[6]
Mario Cannataro, Domenico Talia, Giuseppe Tradigo, Paolo Trunfio, and Pierangelo Veltri. 2008. SIGMCC: A system for sharing meta patient records in a Peer-to-Peer environment. Future Generation Computer Systems 24, 3 (2008), 222--234.
[7]
GENE CARDS. 2016. The Human Gene Database. (2016). http://www.genecards. org/ Website accessed on Sept. 2016.
[8]
James J Cimino, Elaine J Ayres, Lyubov Remennik, Sachi Rath, Robert Freedman, Andrea Beri, Yang Chen, and Vojtech Huser. 2014. The National Institutes of Health's Biomedical Translational Research Information System (BTRIS): design, contents, functionality and experience to date. Journal of biomedical informatics 52 (2014), 11--27.
[9]
The Gene Ontology Consortium. 2015. Gene Ontology Consortium: going forward. Nucleic Acids Research 43, D1 (2015), D1049--D1056.
[10]
Francesca Cristiano and Pierangelo Veltri. 2016. Methods and Techniques for miRNA data analysis. Microarray Data Analysis: Methods and Applications (2016), 11--23.
[11]
Francesca Cristiano, Pierangelo Veltri, Mattia Prosperi, and Giuseppe Tradigo. 2016. On the identification of long non-coding rnas from RNA-Seq. In Bioinformatics and Biomedicine (BIBM), 2016 IEEE International Conference on. IEEE, 1103--1106.
[12]
dbSNP. 2016. dbSNP Short Genetic Variations. (2016). http://www.ncbi.nlm.nih. gov/SNP/ Website accessed on Sept. 2016.
[13]
Eugene Demidenko. 2015. Microarray enriched gene rank. BioData mining 8, 1 (2015), 2.
[14]
DisGeNET. 2016. Gene-disease association data retrieved from DisGeNET v4.0. (2016). http://www.disgenet.org/ Integrative Biomedical Informatics Group GRIB/IMIM/UPF - Website accessed on Sept. 2016.
[15]
Open Galen. 2017. GALEN. (2017). http://www.opengalen.org/ June 2017.
[16]
Pietro H Guzzi, Maria Teresa Di Martino, Giuseppe Tradigo, Pierangelo Veltri, Pierfrancesco Tassone, Pierosandro Tagliaferri, and Mario Cannataro. 2011. Automatic summarisation and annotation of microarray data. Soft Computing 15, 8 (2011), 1505--1512.
[17]
Partner HealthCare. 2016. Informatics for Integrating Biology and the Bedside. (2016). https://www.i2b2.org/ Website accessed on Sept. 2016.
[18]
HGMD. 2016. The Human Gene Mutation Database. (2016). http://www.hgmd. org/ Website accessed on Sept. 2016.
[19]
Da Wei Huang, Brad T Sherman, and Richard A Lempicki. 2009. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nature protocols 4, 1 (2009), 44--57.
[20]
W. Huang et al. 2009. Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists. 31, 1 (2009), 1--13.
[21]
RStudio Inc. {n. d.}. Shiny - A web application framework for R. ({n. d.}). https://shiny.rstudio.com Website accessed on June 2017.
[22]
National Cancer Institute. 2017. NCI Thesaurus. (2017). http://ncit.nci.nih.gov/ Website accessed on June 2017.
[23]
NIH National Human Genome Research Institute. 2016. Breast Cancer Information Core. (2016). http://research.nhgri.nih.gov/bic/ Website accessed on Sept. 2016.
[24]
Snomed International. 2017. SNOMED - Clinical terms. (2017). http://www. ihtsdo.org/snomed-ct/ Website accessed on June 2017.
[25]
Peter B Jensen, Lars J Jensen, and Søren Brunak. 2012. Mining electronic health records: towards better research applications and clinical care. Nature Reviews Genetics 13, 6 (2012), 395--405.
[26]
Warren A Kibbe, Cesar Arze, Victor Felix, Elvira Mitraka, Evan Bolton, Gang Fu, Christopher J Mungall, Janos X Binder, James Malone, Drashtti Vasant, et al. 2014. Disease Ontology 2015 update: an expanded and updated database of human diseases for linking biomedical knowledge through disease data. Nucleic acids research (2014), gku1011.
[27]
Rajewsky Lab. 2016. MeSH (Medical Subject Headings) is the NLM controlled vocabulary thesaurus used for indexing articles for . (2016). https: //www.ncbi.nlm.nih.gov/mesh Website accessed on Sept. 2016.
[28]
Petra Leidinger, Christina Backes, Stephanie Deutscher, Katja Schmitt, Sabine C Mueller, Karen Frese, Jan Haas, Klemens Ruprecht, Friedemann Paul, Cord Stähler, et al. 2013. A blood based 12-miRNA signature of Alzheimer disease patients. Genome biology 14, 7 (2013), R78.
[29]
C. Liu et al. 2014. DiseaseConnect: a comprehensive web server for mechanismbased disease--disease connections. Nucleic acids research 42, W1 (2014), W137-- W146.
[30]
Musen MA, Noy NF, Shah NH, Whetzel PL, Chute CG, Story MA, and Smith B. 2012. NCBO team. The National Center for Biomedical Ontology. J Am Med Inform Assoc. 19, 2 (2012).
[31]
N.S.B. Miyoshi et al. 2013. Computational framework to support integration of biomolecular and clinical data within a translational approach. BMC Bioinformatics 14, 180 (2013).
[32]
J.L. Morrison et al. {n. d.}. Generank: using search engine technology for the analysis of microarray experiments. BMC Bioinformatics ({n. d.}).
[33]
U.S. National Library of Medicine. 2017. UMLS - Unified Medical Language System. (2017). http://www.nlm.nih.gov/research/umls/ Website accessed on June 2017.
[34]
OMIM. 2016. Online Mendelian Inheritance in Man. (2016). http://www.omim.org Website accessed on Sept. 2016.
[35]
World Health Organization. 2016. International Classification of Diseases. (2016). http://www.who.int/classifications/icd/en/ Website accessed on Sept. 2016.
[36]
Lawrence Page, Sergey Brin, Rajeev Motwani, and Terry Winograd. 1999. The PageRank citation ranking: Bringing order to the web. Technical Report. Stanford InfoLab.
[37]
Janet Piñero, Àlex Bravo, Núria Queralt-Rosinach, Alba Gutiérrez-Sacristán, Jordi Deu-Pons, Emilio Centeno, Javier García-García, Ferran Sanz, and Laura I Furlong. 2017. DisGeNET: a comprehensive platform integrating information on human disease-associated genes and variants. Nucleic acids research 45, D1 (2017), D833--D839.
[38]
Lynn Marie Schriml, Cesar Arze, Suvarna Nadendla, Yu-Wei Wayne Chang, Mark Mazaitis, Victor Felix, Gang Feng, and Warren Alden Kibbe. 2011. Disease Ontology: a backbone for disease semantic integration. Nucleic acids research 40, D1 (2011), D940--D946.
[39]
School of Medicine University of Washington. 2016. FMA - Foundational Model of Analysis. (2016). http://sig.biostr.washington.edu/projects/fm/AboutFM.html Website accessed on June 2017.
[40]
OBO Technical WG. 2017. OBO. (2017). http://www.obofoundry.org/ Website accessed on June 2017.
[41]
Christof Winter, Glen Kristiansen, Stephan Kersting, Janine Roy, Daniela Aust, Thomas Knösel, Petra Rümmele, Beatrix Jahnke, Vera Hentrich, Felix Rückert, et al. 2012. Google goes cancer: improving outcome prediction for cancer patients by network-based ranking of marker genes. PLoS Comput Biol 8, 5 (2012), e1002511.
[42]
Lun Yang, Langlai Xu, and Lin He. 2009. A CitationRank algorithm inheriting Google technology designed to highlight genes responsible for serious adverse drug reaction. Bioinformatics 25, 17 (2009), 2244--2250.

Index Terms

  1. Associating Genomics and Clinical Information by Means of Semantic Based Ranking

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    ACM-BCB '17: Proceedings of the 8th ACM International Conference on Bioinformatics, Computational Biology,and Health Informatics
    August 2017
    800 pages
    ISBN:9781450347228
    DOI:10.1145/3107411
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 20 August 2017

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. google page rank
    2. icd9cm
    3. mirna
    4. ngs

    Qualifiers

    • Research-article

    Conference

    BCB '17
    Sponsor:

    Acceptance Rates

    ACM-BCB '17 Paper Acceptance Rate 42 of 132 submissions, 32%;
    Overall Acceptance Rate 254 of 885 submissions, 29%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 78
      Total Downloads
    • Downloads (Last 12 months)1
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 02 Mar 2025

    Other Metrics

    Citations

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media