Abstract
We describe an ontology engineering methodology by which conceptual knowledge is extracted from an informal medical thesaurus (UMLS) and automatically converted into a formal description logics system. Our approach consists of four steps: concept definitions are automatically generated from the UMLS source, integrity checking of taxonomic and partonomic hierarchies is performed by the terminological classifier, cycles and inconsistencies are eliminated, and incremental refinement of the evolving knowledge base is performed by a domain expert. We report on experiments with a knowledge base composed of 164,000 concepts and 76,000 relations.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Alessandro Artale, Enrico Franconi, Nicola Guarino, and Luca Pazzi. Part-whole relations in object-centered systems: An overview. Data & Knowledge Engineering, 20(3):347–383, 1996.
James J. Cimino. Distributed cognition and knowledge-based controlled medical terminologies. Artificial Intelligence in Medicine, 12(1): 153–168, 1998.
James J. Cimino, Paul D. Clayton, George Hripsack, and Stephen B. Johnson. Knowledge-based approaches to the maintenance of a large controlled medical terminology. Journal of the American Medical Informatics Association, 1(1):35–50, 1994.
D. Alan Cruse. On the transitivity of the part-whole relation. Journal of Linguistics, 15:29–38, 1979.
Aldo Gangemi, Domenico M. Pisanelli, and Geri Steve. An overview of the ONION project: Applying ontologies to the integration of medical terminologies. Data & Knowledge Engineering, 31(2): 183–220, 1999.
Udo Hahn, Martin Romacker, and Stefan Schulz. Discourse structures in medical reports-watch out! The generation of referentially coherent and valid text knowledge bases in the medSynDiKATe system. International Journal of Medical Informatics, 53(1):1–28, 1999.
Udo Hahn, Martin Romacker, and Stefan Schulz. How knowledge drives understanding: Matching medical ontologies with the needs of medical language processing. Artificial Intelligence in Medicine, 15(1):25–51, 1999.
Udo Hahn, Stefan Schulz, and Martin Romacker. Part-whole reasoning: A case study in medical ontology engineering. IEEE Intelligent Systems & Their Applications, 14(5):59–67, 1999.
Ira J. Haimowitz, Ramesh S. Patil, and Peter Szolovits. Representing medical knowledge in a terminological language is difficult. In R. A. Greenes, editor, SCAMC’88-Proceedings of the 12th Annual Symposium on Computer Applications in Medical Care, pages 101–105. Washington, D.C.: IEEE Computer Society Press, 1988.
Ian Horrocks and Ulrike Sattler. A description logic with transitive and inverse roles and role hierarchies. Journal of Logic and Computation, 9(3):385–410, 1999.
Robert MacGregor and Raymond Bates. The LOOM knowledge representation language. Technical Report RS-87-188, Information Sciences Institute, University of Southern California, 1987.
Robert M. MacGregor. A description classifier for the predicate calculus. In AAAI’94-Proceedings of the 12th National Conference on Artificial Intelligence, volume 1, pages 213–220. Seattle, WA, USA, July 31–August 4, 1994. Menlo Park, CA: AAAI Press & MIT Press, 1994.
Eric Mays, Robert Weida, Robert Dionne, Meir Laker, Brian White, Chihong Liang, and Frank J. Oles. Scalable and expressive medical terminologies. In J. J. Cimino, editor, AMIA’96-Proceedings of the 1996 AMIA Annual Fall Symposium (formerly SCAMC). Beyond the Superhighway: Exploiting the Internet with Medical Informatics, pages 259–263. Washington, D.C., October 26–30, 1996. Philadelphia, PA: Hanley & Belfus, 1996.
Alexa T. McCray. The nature of lexical knowledge. Methods of Information in Medicine, 37(4/5):353–360, 1998.
Alexa T. McCray and Stuart J. Nelson. The representation of meaning in the UMLS. Methods of Information in Medicine, 34(1/2):193–201, 1995.
Domenico M. Pisanelli, Aldo Gangemi, and Geri Steve. An ontological analysis of the UMLS metathesaurus. In C. G. Chute, editor, AMIA’98-Proceedings of the 1998 AMIA Annual Fall Symposium. A Paradigm Shift in Health Care Information Systems: Clinical Infrastructures for the 21st Century, pages 810–814. Orlando, FL, November 7–11, 1998. Philadelphia, PA: Hanley & Belfus, 1998.
Alan L. Rector. Clinical terminology: Why is it so hard? Methods of Information in Medicine, 38:147–157, 1999.
Alan L. Rector. Analysis of propagation along transitive roles: Formalisation of the galen experience with medical ontologies. In I. Horrocks and Tessaris S., editors, DL02-2002 International Workshop on Description Logics, Toulouse, France, 2002. Published as CEUR Workshop Proceedings (CEUR-WS.org) via http://CEUR-WS.org/Vol-53/.
Alan L. Rector, Sean Bechhofer, Carole A. Goble, Ian Horrocks, W. Anthony Nowlan, and W. Danny Solomon. The GRAIL concept modelling language for medical terminology. Artificial Intelligence in Medicine, 9:139–171, 1997.
Jeremy E. Rogers, Colin Price, Alan Rector, W. Daniel Solomon, and Nick Smeijko. Validating clinical terminology structures: Integration and cross-validation of Read Thesaurus and Galen. In C. G. Chute, editor, AMIA’98-Proceedings of the 1998 AMIA Annual Fall Symposium. A Paradigm Shift in Health Care Information Systems: Clinical Infrastructures for the 21st Century, pages 845–849. Orlando, FL, November 7–11, 1998. Philadelphia, PA: Hanley & Belfus, 1998.
Cornelius Rosse, José Leonardo V. Mejino, Bharath R. Modayur, Rex Jakobovits, Kevin P. Hinshaw, and James F. Brinkley. Motivation and organizational principles for anatomical knowledge representation: The Digital Anatomist symbolic knowledge base. Journal of the American Medical Informatics Association, 5(1): 17–40, 1998.
James G. Schmolze and William S. Mark. The Nikl experience. Computational Intelligence, 6(1):48–69, 1991.
Rainer Schubert and Karl-Heinz Höhne. Partonomies for interactive explorable 3D-models of anatomy. In C. G. Chute, editor, AMIA’ 98-Proceedings of the 1998 AMIA Annual Fall Symposium. A Paradigm Shift in Health Care Information Systems: Clinical Infrastructures for the 21st Century, pages 433–437. Orlando, FL, November 7–11, 1998. Philadelphia, PA: Hanley & Belfus, 1998.
Erich B. Schulz, Colin Price, and Philip J. B. Brown. Symbolic anatomic knowledge representation in the Read Codes Version 3: Structure and application. Journal of the American Medical Informatics Association, 4(1):38–48, 1997.
Stefan Schulz and Udo Hahn. Mereotopological reasoning about parts and (w)holes in bio-ontologies. In Chris Welty and Barry Smith, editors, Formal Ontology in Information Systems. Collected Papers from the 2nd International Conference, pages 210–221. Ogunquit, Maine, USA, October 17–19, 2001. New York, NY: ACM Press, 2001.
Stefan Schulz and Udo Hahn. Necessary parts and wholes in bio-ontologies. In D. Fensel, F. Giunchiglia, D. McGuinness, and M.-A. Williams, editors, Principles of Knowledge Representation and Reasoning. Proceedings of the 8th International Conference-KR 2002, pages 387–394. Toulouse, France, April 22–25, 2002. San Francisco, CA: Morgan Kaufmann, 2002.
Stefan Schulz, Udo Hahn, and Martin Romacker. Modeling anatomical spatial relations with description logics. In J. M. Overhage, editor, AMIA 2000-Proceedings of the Annual Symposium of the American Medical Informatics Association. Converging Information, Technology, and Health Care, pages 779–783. Los Angeles, CA, November 4–8, 2000. Philadelphia, PA: Hanley & Belfus, 2000.
Kent A. Spackman and Keith E. Campbell. Compositional concept representation using SNOMED: Towards further convergence of clinical terminologies. In C. G. Chute, editor, AMIA’98-Proceedings of the 1998 AMIA Annual Fall Symposium. A Paradigm Shift in Health Care Information Systems: Clinical Infrastructures for the 21st Century, pages 740–744. Orlando, FL, November 7–11, 1998. Philadelphia, PA: Hanley & Belfus, 1998.
Françoise Volot, M. Joubert, and Marius Fieschi. Review of biomedical knowledge and data representation with Conceptual Graphs. Methods of Information in Medicine, 37(1):86–96, 1998.
Morton Winston, Roger Chaffin, and Douglas J. Herrmann. A taxonomy of part-whole relationships. Cognitive Science, 11:417–444, 1987.
William A. Woods and James G. Schmolze. The Kl-One family. Computers & Mathematics with Applications, 23(2/5): 133–177, 1992.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Hahn, U., Schulz, S. (2002). Turning Lead into Gold? Feeding a Formal Knowledge Base with Informal Conceptual Knowledge. In: Gómez-Pérez, A., Benjamins, V.R. (eds) Knowledge Engineering and Knowledge Management: Ontologies and the Semantic Web. EKAW 2002. Lecture Notes in Computer Science(), vol 2473. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45810-7_19
Download citation
DOI: https://doi.org/10.1007/3-540-45810-7_19
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44268-4
Online ISBN: 978-3-540-45810-4
eBook Packages: Springer Book Archive