Abstract
There is a huge growth in the volume of published biomedical research in recent years. Many medical search engines are designed and developed to address the over growing information needs of biomedical experts and curators. Significant progress has been made in utilizing the knowledge embedded in medical ontologies and controlled vocabularies to assist these engines. However, the lack of common architecture for utilized ontologies and overall retrieval process, hampers evaluating different search engines and interoperability between them under unified conditions. In this paper, a unified architecture for medical search engines is introduced. Proposed model contains standard schemas declared in semantic web languages for ontologies and documents used by search engines. Unified models for annotation and retrieval processes are other parts of introduced architecture. A sample search engine is also designed and implemented based on the proposed architecture in this paper. The search engine is evaluated using two test collections and results are reported in terms of precision vs. recall and mean average precision for different approaches used by this search engine.
Similar content being viewed by others
References
Leroy, G., and Chen, H. C., Meeting medical terminology needs: The ontology-enhanced medical concept mapper. IEEE Trans. Inf. Technol. Biomed. 5 (4)261–270, 2001.
Yoo, I., Hu, X., and Song, I. Y., Biomedical ontology improves biomedical literature clustering performance: A comparison study. Int. J. Bioinformatics Res. Appl. 3 (3)414–428, 2007.
Freitas, F., Schulz, S., and Moraes, E., Survey of current terminologies and ontologies in biology and medicine. RECIIS—Electronic Journal in Communication, Information and Innovation in Health. 3 (1)7–18, 2009.
Ashburner, M., et al., Gene ontology: Tool for the unification of biology. The gene ontology consortium. Nat. Genet. 25 (1)25–29, 2000.
Pedersen, T., Pakhomov, S. V. S., and Patwardhan, S., Measures of semantic similarity and relatedness in the biomedical domain. Journal of Biomedical Informatics. 40 (3)288–299, 2007.
Kagolovsky, Y., and Moehr, J. R., A new look at information retrieval evaluation: Proposal for solutions. J. Med. Syst. 28 (1)103–116, 2004.
Kagolovsky, Y., and Moehr, J. R., Current status of the evaluation of information retrieval. J. Med. Syst. 27 (5)409–424, 2003.
Kagolovsky, Y., and Moehr, J. R., Introducing a conceptual Information Retrieval (IR) framework. J. Med. Syst. 28 (1)89–101, 2004.
Walczak, S., and Multiagent, A., Architecture for developing medical information retrieval agents. J. Med. Syst. 27 (5)479–498, 2003.
OpenGALEN Mission Statement. http://www.opengalen.org/
Foundational model of ontology. http://sig.biostr.washington.edu/projects/fm/AboutFM.html
Rada, R., and Bicknell, E., Ranking documents with a thesaurus. J. Am. Soc. Inf. Sci. 40:304–310, 1989.
Lord, P., Stevens, R., Brass, A., and Goble, C., Investigating semantic similarity measures across the gene ontology: The relationship between sequence and annotation. Bioinformatics. 19 (10)1275–83, 2003.
Fellbaum, C., WordNet: An electronic lexical database. MIT, Cambridge, MA, 1998.
Resnik, P., Using information content to evaluate semantic similarity in a taxonomy. In: IJCAI-95—Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence, 1:448–453, 1995.
Jiang, J., and Conrath, D., Semantic similarity based on corpus statistics and lexical taxonomy. In: Proceedings of the 10th International Conference on Research in Computational Linguistics, 19–33, 1997.
Wroe, C., Stevens, R., Goble, C., and Ashburner, M., A methodology to migrate the Gene Ontology to a description logic environment using DAML+OIL. Pac. Symp. Biocomput. 8(1):624–635, 2003.
The DARPA Agent Markup Language Homepage. http://www.daml.org/
OWL Web Ontology Language. http://www.w3.org/TR/owl-features/
Kashyap, V., and Borgida, A., Representing the UMLS semantic network using OWL (or “What’s in a semantic web link?”). In: Proceedings of the Second International Semantic Web Conference, 1–16, 2003.
Mukherjea, S., Information retrieval and knowledge discovery utilising a biomedical semantic web. Brief. Bioinform. 6 (3)252–262, 2005.
Schulz, S., Stenzhorn, H., Boeker, M., and Smith, B., Strengths and limitations of formal ontologies in the biomedical domain. RECIIS—Electronic Journal in Communication, Information and Innovation in Health. 3 (1)31–45, 2009.
Protégé. http://protege.stanford.edu
Aronson, A. R., Effective mapping of biomedical text to the UMLS Metathesaurus: The MetaMap Program. In: Proceedings of the AMIA Annual Fall Symposium, 17–21, 2001.
Zou, Q., Chu, W. W., Morioka, C., Leazer, G. H., and Kangarloo, H., IndexFinder: A method of extracting key concepts from clinical texts for indexing. In: Proceedings of the AMIA Annual Symposium, 763–767, 2003.
Denny, J. C., Irani, P. R., Wehbe, F. H., Smithers, J. D., and Spickard, A., The KnowledgeMap project: Development of a concept-based medical school curriculum database. In: Proceedings of the Annual AMIA Symposium, 195–199, 2003.
Hu, X., Lin, T. Y., Song, I. Y., Yoo, I., and Song, M., A semi-supervised efficient learning approach to extract biological relationships from web-based biomedical digitallibrary. Web Intelligence and Agent Systems. 4 (3)327–339, 2006.
Jiang, J., and Zhai, C. X., An empirical study of tokenization strategies for biomedical information retrieval. Inf. Retr. 10:341–363, 2007.
Hersh, W., Buckley, C., Leone, T., and Hickam, D., OHSUMED: An interactive retrieval evaluation and new large test collection for research. In: Proceedings of the International ACM SIGIR Conference on Research and Development in Information Retrieval, 17:192–201, 1994.
Aggarwal, V., The application of the unified modeling language in object-oriented analysis of healthcare information systems. J. Med. Syst. 26 (5)383–397, 2002.
Mao, W., Chu, W., Free-text medical document retrieval via phrase-based vector space model. In: Proceedings of the AMIA Annual Symposium, 489–493, 2002.
Srinivasan, P., Query expansion and MEDLINE. Information Processing and Management. 32 (4)431–443, 1996.
Jiang, J., He, X., and Zhai, C. X., Robust pseudo feedback estimation and HMM passage extraction. In: Proceedings of the 15th Text REtrieval Conference (TREC’06), 2007.
Jalali, V., and Borujerdi, M. R. M., Concept based pseudo relevance feedback in biomedical field. Studies in Computational Intelligence. 209:69–79, 2009.
SPARQL Query Language for RDF. http://www.w3.org/TR/rdf-sparql-query/
OWL-QL Project for the Stanford Knowledge Systems Laboratory. http://ksl.stanford.edu/projects/owl-ql/
Burgun, A., Botti, G., Fieschi, M., and Beux, P. L., Issues in the design of medical ontologies used for knowledge sharing. J. Med. Syst. 25 (2)95–108, 2001.
Jalali, V., and Borujerdi, M. R. M., The effect of using domain specific ontologies in query expansion in medical field. Innovations in Information Technology. 5(1):277–281, 2008.
Lucene, http://lucene.apache.org/
Li, Y., Bandar, Z., and McLean, D., An approach for measuring semantic similarity between words using multiple information sources. IEEE Trans. Knowl. Data Eng. 15 (4)871–882, 2003.
Hersh, W., Buckley, C., Leone, T., Hickam, D., OHSUMED: An interactive retrieval evaluation and new large test collection for research. In: Proceedings of the International ACM SIGIR Conference on Research and Development in Information Retrieval, 17:192–201, 1994.
trec_eval. http://trec.nist.gov/trec_eval/
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Jalali, V., Matash Borujerdi, M.R. A Unified Architecture for Biomedical Search Engines Based on Semantic Web Technologies. J Med Syst 35, 237–249 (2011). https://doi.org/10.1007/s10916-009-9360-z
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10916-009-9360-z