Abstract
This paper describes research methodologies and experimental settings for the task of relation identification and classification between pairs of medical entities, using clinical data. The models that we use represent a combination of lexical and syntactic features, medical semantic information, terms extracted from a vector-space model created using a random projection algorithm, and additional contextual information extracted at sentence-level. The best results are obtained using an SVM classification algorithm with a combination of the above mentioned features, plus a set of additional features that capture the distributional semantic correlation between the concepts and each relation of interest.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Donaldson, I., Martin, J., de Bruijn, B., Wolting, C., Lay, V., Tuekam, B.: Prebind and textomy: Mining the biomedical literature for protein-protein interactions using a support vector machine. BMC Bioinformatics 4(11), 11–24 (2003)
Mitsumori, T., Murata, M., Fukuda, Y., Doi, K., Doi, H.: Extracting protein-protein interaction information from biomedical text with svm. IEICE Transactions on Information and Systems 89(8), 2464–2466 (2006)
Bunescu, R., Mooney, R.: shortest path dependency kernel for relation extraction. In: Proceedings of the Conference on Human Language Technology and Empirical Methods in Natural Language Processing (HLT/EMNLP), pp. 724–731 (2005)
Giuliano, C., Lavelli, A., Romano, L.: Exploiting shallow linguistic information for relation extraction from biomedical literature. In: 11th Conference of the European Chapter of the Association for Computational Linguistics, pp. 401–409 (2006)
Frunza, O., Inkpen, D., Tran, T.: A machine learning approach for identifying disease-treatment relations in short texts. IEEE Transactions on Knowledge and Data Engineering (2010) (in press)
Roberts, K., Rink, B., Harabagiu, S.: Extraction of medical concepts, assertions, and relations from discharge summaries for the fourth i2b2/va shared task (2010)
Chapman, W., Chu, D., Dowling, J.N.: Context: an algorithm for identifying contextual features from clinical text. In: ACL 2007 Workshop on Biological, Translational, and Clinical Language Processing (BioNLP 2007), pp. 81–88 (2007)
Widdows, D., Ferraro, K.: Semantic vectors: a scalable open source package and online technology management application. In: Calzolari, N., (Conference Chair), Choukri, K., Maegaard, B., Mariani, J., Odjik, J., Piperidis, S., Tapias, D. (eds.) Proceedings of the Sixth International Language Resources and Evaluation (LREC 2008). European Language Resources Association (ELRA), Marrakech (2008), http://www.lrec-conf.org/proceedings/lrec2008/
Frunza, O., Inkpen, D.: Identifying and classifying semantic relations between medical concepts in clinical data, i2b2 challenge (2010)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Frunza, O., Inkpen, D. (2011). Extracting Relations between Diseases, Treatments, and Tests from Clinical Data. In: Butz, C., Lingras, P. (eds) Advances in Artificial Intelligence. Canadian AI 2011. Lecture Notes in Computer Science(), vol 6657. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-21043-3_17
Download citation
DOI: https://doi.org/10.1007/978-3-642-21043-3_17
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-21042-6
Online ISBN: 978-3-642-21043-3
eBook Packages: Computer ScienceComputer Science (R0)