Abstract
We are interested in retrieving relevant information from biomedical documents according to healthcare professional’s information needs. It is well known that biomedical documents are indexed using conceptual descriptors issued from terminologies for a better retrieval performance. Our attempt to develop a conceptual retrieval framework relies on the hypothesis that there are several broad categories of knowledge that could be captured from different terminologies and processed by retrieval algorithms. With this in mind, we propose a multi-terminology based indexing approach for selecting the best representative concepts for each document. We instantiate this general approach on four terminologies namely MeSH (Medical Subject Headings), SNOMED (Systematized Nomenclature of Medicine), ICD-10 (International Classification of Diseases) and GO (Gene Ontology). Experimental studies were conducted on large and official document test collections of real world clinical queries and associated judgments extracted from MEDLINE scientific collections, namely TREC Genomics 2004 & 2005. The obtained results demonstrate the advantages of our multi-terminology based biomedical information retrieval approach over state-of-the art approaches.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Zhou, X., Hu, X., Zhang, X.: Topic signature language models for ad hoc retrieval. IEEE Transactions on Knowledge and Data Engineering 19(9), 1276–1287 (2007)
Krauthammer, M., Nenadic, G.: Term identification in the biomedical literature. Journal of Biomedical Informatics 37, 512–528 (2004)
Keizer, N.F., et al.: Understanding terminological systems I: Terminology and Typology. Methods of Information in Medicine, 16–21 (2000)
Cornet, R., de Keizer, N.: Forty years of SNOMED: a literature review. BMC Medical Informaticas and Decision Making, pp. 268–272 (2008)
Nyström, M., et al.: Enriching a primary health care version of ICD-10 using SNOMED CT mapping. Journal of Biomedical Semantics, 7–28 (2010)
Taboada, M., et al.: An automated approach to mapping external terminologies to the UMLS. IEEE Transactions of Biomedical Engeenering, 605–618 (2009)
Avillach, P., Joubert, M., Fieschi, M.: A Model for Indexing Medical Documents Combining Statistical and Symbolic Knowledge. In: Proc. AMIA Symp., pp. 31–35 (2007)
Pereira, S., Neveol, A., et al.: Using multi-terminology indexing for the assignment of MeSH descriptors to health resources. In: Proc. AMIA Symp., pp. 586–590 (2008)
Darmoni, S.J., Pereira, S., Sakji, S., Merabti, T., Prieur, É., Joubert, M., Thirion, B.: Multiple terminologies in a health portal: Automatic indexing and information retrieval. In: Combi, C., Shahar, Y., Abu-Hanna, A. (eds.) AIME 2009. LNCS, vol. 5651, pp. 255–259. Springer, Heidelberg (2009)
Ingwersen, P.: Cognitive perspectives of information retrieval interaction-elements of cognitive theory. Journal of Documentation 52, 3–50 (1996)
Fox, E.A., Shaw, J.A.: Combination of Multiple Searches. In: TREC 1994, pp. 243–252 (1994)
Krauthammer, M., Rzhetsky, A., et al.: Using BLAST for identifying gene and protein names in journal articles. Gene, 245–252 (2000)
Aronson, A.R., Mork, J.G., Gay, C., Humphrey, S.M., Rogers, W.J.: The NLM Indexing Initiative’s Medical Text Indexer. In: Medinfo 2004, pp. 268–272 (2004)
Ruch, P.: Automatic assignment of biomedical categories: toward a generic approach. Bioinformatics 22(6), 658–664 (2006)
Zhou, X., Zhang, X., Hu, X.: MaxMatcher: Biological Concept Extraction Using Approximate Dictionary Lookup. In: Yang, Q., Webb, G. (eds.) PRICAI 2006. LNCS (LNAI), vol. 4099, pp. 1145–1149. Springer, Heidelberg (2006)
Frantzi, K., Ananiadou, S., Mima, H.: Automatic recognition of multi-word terms: the C-value/NC-value method. Int. Journal on Digital Libraries 3, 115–130 (2000)
Hliaoutakis, A., et al.: The AMTEx approach in the medical document indexing and retrieval application. Data Knowledge Engineering, 380–392 (2009)
Robertson, S.E., Walker, S., Hancock-Beaulieu, M.: Okapi at TREC-7: Automatic Ad Hoc, Filtering, VLC and Interactive. In: TREC-7, pp. 199–210 (1998)
Dinh, D., Tamine, L.: Combining global and local semantic contexts for improving biomedical information retrieval. In: Clough, P., Foley, C., Gurrin, C., Jones, G.J.F., Kraaij, W., Lee, H., Mudoch, V. (eds.) ECIR 2011. LNCS, vol. 6611, pp. 375–386. Springer, Heidelberg (2011)
Amati, G.: Probabilistic models for Information Retrieval based on Divergence from Randomness. PhD thesis, University of Glasgow (2003)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Dinh, D., Tamine, L. (2011). Voting Techniques for a Multi-terminology Based Biomedical Information Retrieval. In: Peleg, M., Lavrač, N., Combi, C. (eds) Artificial Intelligence in Medicine. AIME 2011. Lecture Notes in Computer Science(), vol 6747. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-22218-4_23
Download citation
DOI: https://doi.org/10.1007/978-3-642-22218-4_23
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-22217-7
Online ISBN: 978-3-642-22218-4
eBook Packages: Computer ScienceComputer Science (R0)