Abstract
Existing biomedical ontologies encode scientific knowledge that is exploited in ontology-based annotated entities, e.g., genes described using Gene Ontology (GO) annotations. Ontology-based annotations correspond to building blocks for computing relatedness between annotated entities, as well as for data mining techniques that attempt to discover domain patterns or suggest novel associations among annotated entities. However, effectiveness of these annotation-based approaches can be considerably impacted by the quality of the annotations, and models that allow for the description of the quality of the annotations are required to validate and explain the behavior of these approaches. We propose AnnEvol, a framework to describe datasets of ontology-based annotated entities in terms of evolutionary properties of the annotations of these entities over time. AnnEvol complements state-of-the-art approaches that perform an annotation-wise description of the datasets, and conducts an annotation set-wise description which characterizes the evolution of annotations into semantically similar annotations. We empirically evaluate the expressiveness power of AnnEvol in a set of proteins annotated with GO using UniProt-GOA and Swiss-Prot. Our experimental results suggest that AnnEvol captures evolutionary behavior of the studied GO annotations, and clearly differentiates patterns of annotations depending on both the annotation provider and on the model organism of the studied proteins.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsNotes
- 1.
- 2.
- 3.
- 4.
- 5.
- 6.
- 7.
- 8.
- 9.
References
Benik, J., Chang, C., Raschid, L., Vidal, M.-E., Palma, G., Thor, A.: Finding cross genome patterns in annotation graphs. In: Bodenreider, O., Rance, B. (eds.) DILS 2012. LNCS, vol. 7348, pp. 21–36. Springer, Heidelberg (2012)
Devos, D., Valencia, A.: Practical limits of function prediction. Proteins: Struct. Funct. Bioinf. 41(1), 98–107 (2000)
Ding, H., Takigawa, I., Mamitsuka, H., Zhu, S.: Similarity-based machine learning methods for predicting drug-target interactions: a brief review. Briefings Bioinform. 15(5), 734–747 (2013)
Köhler, S., et al.: The human phenotype ontology project: linking molecular biology and disease through phenotype data. Nucl. Acids Res. 42(D1), D966–D974 (2014)
Gross, A., Hartung, M., Kirsten, T., Rahm, E.: Estimating the quality of ontology-based annotations by considering evolutionary changes. In: Paton, N.W., Missier, P., Hedeler, C. (eds.) DILS 2009. LNCS, vol. 5647, pp. 71–87. Springer, Heidelberg (2009)
Palma, G., Vidal, M.-E., Haag, E., Raschid, L., Thor, A.: Determining similarity of scientific entities in annotation datasets. Database J. Biol. Databases Curation (2015). bau123. doi:10.1093/database/bau123
Palma, G., Vidal, M.-E., Raschid, L.: Drug-target interaction prediction using semantic similarity and edge partitioning. In: Mika, P., Tudorache, T., Bernstein, A., Welty, C., Knoblock, C., Vrandečić, D., Groth, P., Noy, N., Janowicz, K., Goble, C. (eds.) ISWC 2014, Part I. LNCS, vol. 8796, pp. 131–146. Springer, Heidelberg (2014)
Palma, G., Vidal, M.-E., Raschid, L., Thor, A.: Exploiting semantics from ontologies and shared annotations to find patterns in annotated linked open data. In: 3rd International Workshop on Linked Science 2013-Supporting Reproducibility, Scientific Investigations and Experiments, p. 15 (2013)
Pesquita, C., Pessoa, D., Faria, D., Couto, F.: Cessm: collaborative evaluation of semantic similarity measures. Challenges Bioinf. (JB2009) 157, 190 (2009)
Škunca, N., Altenhoff, A., Dessimoz, C.: Quality of computationally inferred gene ontology annotations. PLoS Comput. Biol. 8(5), e1002533 (2012)
Smith, T., Waterman, M.: Identification of common molecular subsequences. J. Mol. Biol. 147(1), 195–197 (1981)
Acknowledgments
This work was supported by the German Ministry of Economy and Energy within the TIGRESS project (Ref. KF2076928MS3) and the EU’s 7th Framework Programme FI.ICT-2011.1.8 (FI-STAR, Grant 604691).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Traverso-Ribón, I., Vidal, ME., Palma, G. (2015). AnnEvol: An Evolutionary Framework to Description Ontology-Based Annotations. In: Ashish, N., Ambite, JL. (eds) Data Integration in the Life Sciences. DILS 2015. Lecture Notes in Computer Science(), vol 9162. Springer, Cham. https://doi.org/10.1007/978-3-319-21843-4_7
Download citation
DOI: https://doi.org/10.1007/978-3-319-21843-4_7
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-21842-7
Online ISBN: 978-3-319-21843-4
eBook Packages: Computer ScienceComputer Science (R0)