Abstract
This work proposes a method for automatically discovering incompatible medical concepts in text corpora. The approach is distantly supervised based on a seed set of incompatible concept pairs like symptoms or conditions that rule each other out. Two concepts are considered incompatible if their definitions match a template, and contain an antonym pair derived from WordNet, VerbOcean, or a hand-crafted lexicon. Our method creates templates from dependency parse trees of definitional texts, using seed pairs. The templates are applied to a text corpus, and the resulting candidate pairs are categorized and ranked by statistical measures. Since experiments show that the results face semantic ambiguity problems, we further cluster the results into different categories. We applied this approach to the concepts in Unified Medical Language System, Human Phenotype Ontology, and Mammalian Phenotype Ontology. Out of 77,496 definitions, 1,958 concept pairs were detected as incompatible with an average precision of 0.80.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Völker, J., Vrandečić, D., Sure, Y., Hotho, A.: Learning Disjointness. In: Franconi, E., Kifer, M., May, W. (eds.) ESWC 2007. LNCS, vol. 4519, pp. 175–189. Springer, Heidelberg (2007)
de Marneffe, M.C., Rafferty, A.N., Manning, C.D.: Finding contradictions in text. In: McKeown, K., Moore, J.D., Teufel, S., Allan, J., Furui, S. (eds.) ACL, The Association for Computer Linguistics, pp. 1039–1047 (2008)
Lindberg, D., Humphreys, B., McCray, A.: The unified medical language system. Methods of Information in Medicine 32(4), 281–291 (1993)
Robinson, P.N., Mundlos, S.: The Human Phenotype Ontology. Clinical Genetics 77(6), 525–534 (2010)
Smith, C., Goldsmith, C.A., Eppig, J.: The Mammalian Phenotype Ontology as a tool for annotating, analyzing and comparing phenotypic information. Genome Biology 6(1), R7+(2004)
Fellbaum, C.: WordNet: An Electronic Lexical Database (Language, Speech, and Communication). The MIT Press (May 1998)
Chklovski, T., Pantel, P.: VerbOcean: Mining the Web for Fine-Grained Semantic Verb Relations. In: Lin, D., Wu, D. (eds.) Proceedings of EMNLP, pp. 33–40 (2004)
Pado, S., de Marneffe, M.C., MacCartney, B., Rafferty, A.N., Yeh, E., Manning, C.D.: Deciding entailment and contradiction with stochastic and edit distance-based alignment. In: Proceedings of the TAC 2008 (2009)
Turney, P.D.: A uniform approach to analogies, synonyms, antonyms, and associations. In: Scott, D., Uszkoreit, H. (eds.) COLING, pp. 905–912 (2008)
Lobanova, A., Bouma, G., Sang, E.T.K.: Using a treebank for finding opposites. In: Dickinson, M., Mrisep, K., Passarotti, M. (eds.) Proceedings of TLT9, pp. 139–150 (2010)
Mohammad, S., Dorr, B.J., Hirst, G.: Computing word-pair antonymy. In: Tseng, S., Chen, T., Liu, Y. (eds.) EMNLP, pp. 982–991. ACL (2008)
Schwab, D., Lafourcade, M., Prince, V.: Antonymy and conceptual vectors. In: Tseng, S., Chen, T., Liu, Y. (eds.) Proceedings of COLING, pp. 1–7 (2002)
Sanchez-Graillet, O., Poesio, M.: Discovering contradicting protein-protein interactions in text. In: Proceedings of the Workshop on BioNLP (2007)
Aronson, A.R., Lang, F.M.: An overview of metamap: historical perspective and recent advances. JAMIA 17(3), 229–236 (2010)
Krippendorff, K.: Content Analysis: An Introduction to Its Methodology. SAGE Publications (September 1980)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Grycner, A., Ernst, P., Siu, A., Weikum, G. (2013). Knowledge Discovery on Incompatibility of Medical Concepts. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2013. Lecture Notes in Computer Science, vol 7816. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-37247-6_10
Download citation
DOI: https://doi.org/10.1007/978-3-642-37247-6_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-37246-9
Online ISBN: 978-3-642-37247-6
eBook Packages: Computer ScienceComputer Science (R0)