Skip to main content

Combining Context and Existing Knowledge When Recognizing Biological Entities – Early Results

  • Conference paper
Advances in Knowledge Discovery and Data Mining (PAKDD 2008)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 5012))

Included in the following conference series:

  • 2549 Accesses

Abstract

Entity recognition has been studied for several years with good results. However, as the focus of information extraction (IE) and entity recognition (ER) has been set on biology and bioinformatics, the existing methods do not produce as good results as before. This is mainly due to the complex naming conventions of biological entities. In our information extraction system for biomedical documents called OAT (Ontology Aided Text mining system) we developed our own method for recognizing the biological entities. The difference to the existing methods, which use lexicons, rules and statistics, is that we combine the context of the entity with the existing knowledge about the relationships of the entities. This has produced encouraging preliminary results. This paper describes the approach we are using in our information extraction system for entity recognition.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. Chang, J., Schutze, H., Altman, R.: GAPSCORE: Finding Gene And Protein Names One Word at a Time. Bioinformatics 20(2), 216–225 (2004)

    Article  Google Scholar 

  2. Cohen, A., Hersh, W.: A survey of current work in biomedical text mining. Breefings in Bioinformatics 6, 57–71 (2005)

    Article  Google Scholar 

  3. Hanisch, D., Fluck, J., Mevissen, H.: Playing Biology’s Name Game: Identifying Protein Names in Scientific Text. In: Pacific Symposium on Biocomputing, vol. 8, pp. 403–414 (2003)

    Google Scholar 

  4. Tanabe, L., Wilbur, W.: Tagging Gene And Protein Names in Biomedical Text. Bioinformatics 18(8), 1124–1132 (2002)

    Article  Google Scholar 

  5. Timonen, M.: Implementation of Ontology-Based Biological Knowledge Base, Master’s Thesis, Department of Computer Science, University of Helsinki, Helsinki (2007)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Takashi Washio Einoshin Suzuki Kai Ming Ting Akihiro Inokuchi

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Timonen, M., Pesonen, A. (2008). Combining Context and Existing Knowledge When Recognizing Biological Entities – Early Results. In: Washio, T., Suzuki, E., Ting, K.M., Inokuchi, A. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2008. Lecture Notes in Computer Science(), vol 5012. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-68125-0_109

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-68125-0_109

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-68124-3

  • Online ISBN: 978-3-540-68125-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics