Abstract
Entity recognition has been studied for several years with good results. However, as the focus of information extraction (IE) and entity recognition (ER) has been set on biology and bioinformatics, the existing methods do not produce as good results as before. This is mainly due to the complex naming conventions of biological entities. In our information extraction system for biomedical documents called OAT (Ontology Aided Text mining system) we developed our own method for recognizing the biological entities. The difference to the existing methods, which use lexicons, rules and statistics, is that we combine the context of the entity with the existing knowledge about the relationships of the entities. This has produced encouraging preliminary results. This paper describes the approach we are using in our information extraction system for entity recognition.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Chang, J., Schutze, H., Altman, R.: GAPSCORE: Finding Gene And Protein Names One Word at a Time. Bioinformatics 20(2), 216–225 (2004)
Cohen, A., Hersh, W.: A survey of current work in biomedical text mining. Breefings in Bioinformatics 6, 57–71 (2005)
Hanisch, D., Fluck, J., Mevissen, H.: Playing Biology’s Name Game: Identifying Protein Names in Scientific Text. In: Pacific Symposium on Biocomputing, vol. 8, pp. 403–414 (2003)
Tanabe, L., Wilbur, W.: Tagging Gene And Protein Names in Biomedical Text. Bioinformatics 18(8), 1124–1132 (2002)
Timonen, M.: Implementation of Ontology-Based Biological Knowledge Base, Master’s Thesis, Department of Computer Science, University of Helsinki, Helsinki (2007)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Timonen, M., Pesonen, A. (2008). Combining Context and Existing Knowledge When Recognizing Biological Entities – Early Results. In: Washio, T., Suzuki, E., Ting, K.M., Inokuchi, A. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2008. Lecture Notes in Computer Science(), vol 5012. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-68125-0_109
Download citation
DOI: https://doi.org/10.1007/978-3-540-68125-0_109
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-68124-3
Online ISBN: 978-3-540-68125-0
eBook Packages: Computer ScienceComputer Science (R0)