skip to main content
10.1145/2110363.2110397acmconferencesArticle/Chapter ViewAbstractPublication PagesihiConference Proceedingsconference-collections
research-article

Exploiting semantic structure for mapping user-specified form terms to SNOMED CT concepts

Published: 28 January 2012 Publication History

Abstract

The elements of clinical databases are usually named after the clinical terms used in various design artifacts. These terms are instinctively supplied by the users, and hence, different users often use different terms to describe the same clinical concept. This term diversity makes future database integration and analysis a huge challenge. In this paper, we study the problem of standardization of the terms used in a specific kind of user-designed artifact, the encounter forms or templates, using a popular clinical terminology, the SNOMED CT.
In particular, we focus on the problem of mapping the terms on an encounter form to SNOMED CT concepts. Existing term mapping techniques are solely based on syntactic string similarity. Such techniques are unable to disambiguate among the terms that resemble one another linguistically, and yet differ semantically. To improve existing techniques, we consider the context of a term in the mapping process and propose a hybrid approach relying on linguistics as well as structural information. For a given form term, this approach (i) exploits the semantic structure of the form to derive the term's context, and (ii) maps the term to a linguistically- matching SNOMED CT concept that is compatible with the derived context. We test the approach on over 900 clinician-specified terms used in 26 forms. This method achieves 23% improvement in precision and 38% improvement in recall, over a pure linguistic-based approach. Our first contribution is that we introduce and address a new problem of mapping form terms to standard concepts. The second contribution is that the experimental evaluation confirms that structural information has a major role in improving mapping performance, and in addressing the key challenges associated with semantic mapping.

References

[1]
Dataline software ltd. http://www.dataline.co.uk.
[2]
Ihtsdo: International health terminology standards development organisation. http://www.ihtsdo.org/.
[3]
SNOMED Clinical Terms User Guide. Technical report, The International Health Terminology Standards Development Organisation, 07 2009.
[4]
Y. An, R. Khare, I.-Y. Song, and X. Hu. Automatically mapping and integrating multiple data entry forms into a database. In In the proceedings of 30th International Conference on Conceptual Modeling, 2011.
[5]
Y. An, J. Mylopoulos, and A. Borgida. Building Semantic Mappings from Databases to Ontologies. In Proceedings of American Association for Artificial Intelligence (AAAI), 2006.
[6]
O. Bodenreider. The unified medical language system (umls): Integrating biomedical terminology. Nucleic Acids Research, 32:D267--D270, 2004.
[7]
R. Elmasri and S. B. Navathe. Fundamentals of Database Systems, 3rd Ed. Addison-Wesley, 2000.
[8]
S. Gupta, G. E. Kaiser, P. Grimm, M. F. Chiang, and J. Starren. Automating content extraction of html documents. WORLD WIDE WEB - INTERNET AND INFORMATION SYSTEMS, pages 179--224, 2005.
[9]
A. Halevy. Why Your Data Won't Mix. ACM Queue, 3(8):50--58, 2005.
[10]
J. Han and M. Kamber. Data Mining: Concepts and Techniques, 2nd ed. Morgan Kaufmann, 2006.
[11]
S. B. Henry, K. E. Campbell, and W. L. Holzemer. Representation of nursing terms for the description of patient problems using snomed iii. Proceedings of the Annual Symposium on Computer Application in Medical Care, pages 700--704, 1993.
[12]
M. A. Hernández, L. Popa, H. Ho, and F. Naumann. Clio: A schema mapping tool for information integration. In Proceedings of ISPAN, 2005.
[13]
S. Hina, E. Atwell, and O. J. and. Secure information extraction from clinical documents using snomed ct gazetteer and natural language processing. In Proceedins of International Conference for Internet Technology and Secured Transactions (ICITST), pages 1--5, 2010.
[14]
S. Jean, H. Dehainsala, D. N. Xuan, G. Pierra, L. Bellatreche, and Y. Aït-Ameur. OntoDB: It is Time to Embed your Domain Ontology in your Database. Demo presentation, 2007.
[15]
E. Jiménez-ruiz, B. C. Grau, I. Horrocks, R. Berlanga, and R. Berlanga. Logic-based assessment of the compatibility of umls ontology sources. In Journal of Biomedical Semantics, pages 1--6, 2011.
[16]
R. C. B. Jr., J. J. Cimino, and P. D. Clayton. Mapping clinically useful terminology to a controlled medical vocabulary. In Proceedings of Annual Symposium of Computing Applications in Medical Care, pages 211--215, 1994.
[17]
R. Khare, Y. An, and I.-Y. Song. Understanding search interfaces: A survey. SIGMOD Record, 39(1):33--40, 2010.
[18]
R. Khare, Y. An, I.-Y. Song, and X. Hu. Can clinicians create high-quality databases? a study on a flexible electronic health record (fehr) system. In Proceedings of 1st ACM International Health Informatics Symposium (IHI), 2010.
[19]
Y. Li, S. Lipsky Gorman, and N. Elhadad. Section classification in clinical notes using supervised hidden markov model. In Proceedings of the 1st ACM International Health Informatics Symposium, IHI '10, pages 744--750, New York, NY, USA, 2010. ACM.
[20]
R. B. Llavori, E. Jiménez-Ruiz, V. Nebot, and I. Sanz. Faeton: Form analysis and extraction tool for ontology construction. IJCAT, pages 224--233, 2010.
[21]
J. Madhavan, P. A. Bernstein, and E. Rahm. Generic schema matching with cupid. In VLDB '01: Proceedings of the 27th International Conference on Very Large Data Bases, pages 49--58, San Francisco, CA, USA, 2001. Morgan Kaufmann Publishers Inc.
[22]
C. D. Manning, P. Raghavan, and H. Schütze. Introduction to Information Retrieval. Cambridge University Press, 2008.
[23]
E. Myers. Rnotes: Nurse's Clinical Pocket Guide. F. A. Davis Company, second edition, 2006.
[24]
J. Patrick, Y. Wang, and P. Budd. An automated system for conversion of clinical notes into snomed clinical terminology. In In Proc. of HKMD-07, volume 68 of CRPIT, pages 219--226, 2007.
[25]
R. Rada, B. Blum, E. Calhoun, H. Mili, H. Orthner, and S. Singer. A vocabulary for medical informatics. Comput. Biomed. Res., 20:244--263, June 1987.
[26]
A. R. Rahil Qamar and. Most: A system to semantically map clinical model data to snomed-ct. In In the proceedings of Semantic Mining Conference on SNOMED-CT, pages 38--43, 2006.
[27]
E. Rahm and P. A. Bernstein. A survey of approaches to automatic schema matching. VLDB JOURNAL, 10:2001, 2001.
[28]
J. Rogers and O. Bodenreider. Snomed ct: Browsing the browsers. In R. Cornet and K. A. Spackman, editors, Proceedings of Conference in Knowledge Representation in Medicine, 2008.
[29]
S. Russell and P. Norvig. Artificial Intelligence: A Modern Approach. Prentice-Hall, 2003.
[30]
H. Stenzhorn, E. J. Pacheco, P. Nohama, and S. Schulz. Automatic mapping of clinical documentation to snomed ct. In MIE, pages 228--232, 2009.
[31]
V. Sugumaran and V. C. Storey. An ontology-based framework for generating and improving database design. In Proceedings of the 6th International Conference on Applications of Natural Language to Information Systems-Revised Papers, NLDB '02, pages 1--12, London, UK, 2002. Springer-Verlag.
[32]
A. Tordai, J. van Ossenbruggen, G. Schreiber, and B. J. Wielinga. Aligning large skos-like vocabularies: Two case studies. In ESWC (1)'10, pages 198--212, 2010.
[33]
L. W. Wright, H. K. G. Nardini, A. R. Aronson, and T. C. Rindflesch. Hierarchical concept indexing of full-text documents. J. Am. Soc. Inf. Sci, 50:514--523, 1999.

Cited By

View all
  • (2015)Scaling drug indication curation through crowdsourcingDatabase10.1093/database/bav0162015Online publication date: 22-Mar-2015
  • (2014)LabeledInJournal of Biomedical Informatics10.1016/j.jbi.2014.08.00452:C(448-456)Online publication date: 1-Dec-2014
  • (2012)Bridging encounter forms and electronic medical record databasesProceedings of the 2012 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)10.1109/BIBM.2012.6392709(1-4)Online publication date: 4-Oct-2012

Index Terms

  1. Exploiting semantic structure for mapping user-specified form terms to SNOMED CT concepts

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      IHI '12: Proceedings of the 2nd ACM SIGHIT International Health Informatics Symposium
      January 2012
      914 pages
      ISBN:9781450307819
      DOI:10.1145/2110363
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 28 January 2012

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. forms
      2. mapping
      3. semantics
      4. snomed ct
      5. structure
      6. terms

      Qualifiers

      • Research-article

      Conference

      IHI '12
      Sponsor:
      IHI '12: ACM International Health Informatics Symposium
      January 28 - 30, 2012
      Florida, Miami, USA

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)3
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 01 Mar 2025

      Other Metrics

      Citations

      Cited By

      View all
      • (2015)Scaling drug indication curation through crowdsourcingDatabase10.1093/database/bav0162015Online publication date: 22-Mar-2015
      • (2014)LabeledInJournal of Biomedical Informatics10.1016/j.jbi.2014.08.00452:C(448-456)Online publication date: 1-Dec-2014
      • (2012)Bridging encounter forms and electronic medical record databasesProceedings of the 2012 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)10.1109/BIBM.2012.6392709(1-4)Online publication date: 4-Oct-2012

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Figures

      Tables

      Media

      Share

      Share

      Share this Publication link

      Share on social media