Comparing General and Medical Texts for Information Retrieval Based on Natural Language Processing: An Inquiry into Lexical Disambiguation

Ruch, Patrick; Baud, Robert; Geissb&#252;hler, Antoine; Rassinoux, Anne-Marie

doi:10.3233/978-1-60750-928-8-261

loading subjects...

Comparing General and Medical Texts for Information Retrieval Based on Natural Language Processing: An Inquiry into Lexical Disambiguation

Authors

Patrick Ruch, Robert Baud, Antoine Geissbühler, Anne-Marie Rassinoux

Pages

261 - 265

DOI

10.3233/978-1-60750-928-8-261

Series

Studies in Health Technology and Informatics

Ebook

Volume 84: MEDINFO 2001

Abstract

In this paper we compare two types of corpus, focusing on the lexical ambiguity of each of them. The first corpus consists mainly of general newspaper articles and literature excerpts, while the second belongs to the medical domain. To conduct the study, we have used two different disambiguation tools. First, each tool was validated in its respective application area. We then use these systems in order to assess and compare both the general ambiguity rate and the particularities of each domain. Quantitative results show that medical documents are lexically less ambiguous than unrestricted documents. Our conclusions emphasize the importance of the application area in the design of NLP tools.

Contact

IOS Press Copyright 2024

Contact

IOS Press Copyright 2024

This website uses cookies

This website uses cookies