Skip to main content

Automatic Extraction of Structured Information from Drug Descriptions

  • Conference paper
  • First Online:
Mining Intelligence and Knowledge Exploration (MIKE 2018)

Abstract

This paper describes a Conditional Random Field (CRF) based named entity extraction model that is used for identifying relevant information from drug prescriptions. The entities that the model is able to extract are: dosage, measuring unit, to whom the treatment is directed, frequency and the total duration of treatment. A corpus with 1800 sentences has been compiled and annotated by two experts from drug prescription texts. Using the set of features identified by us, the CRF model hits around 95% F1-measure values for unit, dosage and frequency detection.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    http://transmartfoundation.org.

References

  1. Bird, S., Klein, E., Loper, E.: Natural Language Processing with Python. O’Reilly Media, Inc., Sebastopol (2009)

    MATH  Google Scholar 

  2. Okazaki, N.: CRFsuite: a fast implementation of Conditional Random Fields (CRFs) (2007)

    Google Scholar 

  3. Patrick, J., Li, M.: A cascade approach to extracting medication events. In: Proceedings of the Australasian Language Technology Association Workshop 2009, pp. 99–103 (2009)

    Google Scholar 

  4. Patrick, J., Li, M.: High accuracy information extraction of medication information from clinical notes: 2009 i2b2 medication extraction challenge. J. Am. Med. Inf. Assoc. 17(5), 524–527 (2010)

    Article  Google Scholar 

  5. Rubrichi, S., Quaglini, S.: Summary of product characteristics content extraction for a safe drugs usage. J. Biomed. Inform. 45(2), 231–239 (2012)

    Article  Google Scholar 

  6. Slavescu, R.R., Masca, C., Slavescu, K.C.: Sequence labeling for extracting relevant pieces of information from raw text medicine descriptions. In: Proceedings of the International Conference on Advancements of Medicine and Health Care through Technology, October 2018, Cluj-Napoca, Romania (2018, In press)

    Google Scholar 

  7. Sutton, C., McCallum, A.: An introduction to conditional random fields. Found. Trends Mach. Learn. 4(4), 267–373 (2012)

    Article  Google Scholar 

  8. Tao, C., Filannino, M., Uzuner, Ö.: Prescription extraction using CRFs and word embeddings. J. Biomed. Inform. 72, 60–66 (2017)

    Article  Google Scholar 

  9. Tikk, D., Solt, I.: Improving textual medication extraction using combined conditional random fields and rule-based systems. J. Am. Med. Inform. Assoc. 17(5), 540–544 (2010)

    Article  Google Scholar 

  10. Uzuner, Ö., Solti, I., Cadag, E.: Extracting medication information from clinical text. J. Am. Med. Inform. Assoc. 17(5), 514–518 (2010)

    Article  Google Scholar 

  11. Zhang, Y., Jiang, M., Wang, J., Xu, H.: Semantic role labeling of clinical text: comparing syntactic parsers and features. In: AMIA 2016, American Medical Informatics Association Annual Symposium, Chicago, IL, USA (2016)

    Google Scholar 

Download references

Acknowledgments

The work for this paper has been supported in part by the Computer Science Department of the Technical University of Cluj-Napoca, Romania.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Radu Razvan Slavescu .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Slavescu, R.R., Maşca, C., Slavescu, K.C. (2018). Automatic Extraction of Structured Information from Drug Descriptions. In: Groza, A., Prasath, R. (eds) Mining Intelligence and Knowledge Exploration. MIKE 2018. Lecture Notes in Computer Science(), vol 11308. Springer, Cham. https://doi.org/10.1007/978-3-030-05918-7_3

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-05918-7_3

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-05917-0

  • Online ISBN: 978-3-030-05918-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics