Verification of Uncurated Protein Annotations

Verification of Uncurated Protein Annotations

Francisco M. Couto, Mário J. Silva, Vivian Lee, Emily Dimmer, Evelyn Camon, Rolf Apweiler
ISBN13: 9781605662749|ISBN10: 1605662747|ISBN13 Softcover: 9781616925284|EISBN13: 9781605662756
DOI: 10.4018/978-1-60566-274-9.ch016
Cite Chapter Cite Chapter

MLA

Couto, Francisco M., et al. "Verification of Uncurated Protein Annotations." Information Retrieval in Biomedicine: Natural Language Processing for Knowledge Integration, edited by Violaine Prince and Mathieu Roche, IGI Global, 2009, pp. 301-314. https://doi.org/10.4018/978-1-60566-274-9.ch016

APA

Couto, F. M., Silva, M. J., Lee, V., Dimmer, E., Camon, E., & Apweiler, R. (2009). Verification of Uncurated Protein Annotations. In V. Prince & M. Roche (Eds.), Information Retrieval in Biomedicine: Natural Language Processing for Knowledge Integration (pp. 301-314). IGI Global. https://doi.org/10.4018/978-1-60566-274-9.ch016

Chicago

Couto, Francisco M., et al. "Verification of Uncurated Protein Annotations." In Information Retrieval in Biomedicine: Natural Language Processing for Knowledge Integration, edited by Violaine Prince and Mathieu Roche, 301-314. Hershey, PA: IGI Global, 2009. https://doi.org/10.4018/978-1-60566-274-9.ch016

Export Reference

Mendeley
Favorite

Abstract

Molecular Biology research projects produced vast amounts of data, part of which has been preserved in a variety of public databases. However, a large portion of the data contains a significant number of errors and therefore requires careful verification by curators, a painful and costly task, before being reliable enough to derive valid conclusions from it. On the other hand, research in biomedical information retrieval and information extraction are nowadays delivering Text Mining solutions that can support curators to improve the efficiency of their work to deliver better data resources. Over the past decades, automatic text processing systems have successfully exploited biomedical scientific literature to reduce the researchers’ efforts to keep up to date, but many of these systems still rely on domain knowledge that is integrated manually leading to unnecessary overheads and restrictions in its use. A more efficient approach would acquire the domain knowledge automatically from publicly available biological sources, such as BioOntologies, rather than using manually inserted domain knowledge. An example of this approach is GOAnnotator, a tool that assists the verification of uncurated protein annotations. It provided correct evidence text at 93% precision to the curators and thus achieved promising results. GOAnnotator was implemented as a web tool that is freely available at http://xldb.di.fc.ul.pt/rebil/tools/goa/.

Request Access

You do not own this content. Please login to recommend this title to your institution's librarian or purchase it from the IGI Global bookstore.