Automatic Extraction of Structured Information from Drug Descriptions

Slavescu, Radu Razvan; Maşca, Constantin; Slavescu, Kinga Cristina

doi:10.1007/978-3-030-05918-7_3

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 11308)

Included in the following conference series:

International Conference on Mining Intelligence and Knowledge Exploration

Abstract

This paper describes a named entity extraction model, based on Conditional Random Fields (CRF), for identifying relevant information in drug prescriptions. The model extracts the following entities: dosage, measuring unit, the person to whom the treatment is directed, frequency, and total duration of treatment. A corpus of 1800 sentences, drawn from drug prescription texts, has been compiled and annotated by two experts. With the feature set we identified, the CRF model achieves F1-measure values of around 95% for unit, dosage and frequency detection.
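To make the setup in the abstract more concrete, the following is a minimal sketch of how per-token features for a CRF sequence labeler might be built and how a per-label F1-measure might be computed. The feature names, the label names, the token-level scoring and the example sentence are all assumptions made for illustration; the abstract does not list the paper's actual feature set or evaluation granularity.

```python
# Hypothetical label names mirroring the entity types named in the abstract.
LABELS = ["DOSAGE", "UNIT", "WHO", "FREQ", "DURATION"]


def token_features(tokens, i):
    """Build an illustrative feature dict for tokens[i] (not the paper's features)."""
    tok = tokens[i]
    feats = {
        "lower": tok.lower(),
        "is_digit": tok.isdigit(),
        "has_digit": any(c.isdigit() for c in tok),
        "suffix2": tok[-2:].lower(),
    }
    # Context features: neighbouring tokens, or sentence-boundary markers.
    if i > 0:
        feats["prev_lower"] = tokens[i - 1].lower()
    else:
        feats["BOS"] = True
    if i < len(tokens) - 1:
        feats["next_lower"] = tokens[i + 1].lower()
    else:
        feats["EOS"] = True
    return feats


def sentence_features(tokens):
    """One feature dict per token, the shape CRF toolkits typically consume."""
    return [token_features(tokens, i) for i in range(len(tokens))]


def f1_score(gold, pred, label):
    """Token-level F1 for one label (assumed granularity, see lead-in)."""
    tp = sum(1 for g, p in zip(gold, pred) if g == label and p == label)
    fp = sum(1 for g, p in zip(gold, pred) if g != label and p == label)
    fn = sum(1 for g, p in zip(gold, pred) if g == label and p != label)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Made-up example sentence, not taken from the paper's corpus.
tokens = "Adults : 2 tablets 3 times daily for 7 days".split()
features = sentence_features(tokens)
```

A list of dicts like `features`, paired with one label per token, is the usual input to a CRF trainer such as CRFsuite, which the paper lists among its references.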

Notes

1. http://transmartfoundation.org

Acknowledgments

The work for this paper has been supported in part by the Computer Science Department of the Technical University of Cluj-Napoca, Romania.

Author information

Authors and Affiliations

Department of Computer Science, Technical University of Cluj-Napoca, Bariţiu 28, 400027, Cluj-Napoca, Romania
Radu Razvan Slavescu & Constantin Maşca
“Iuliu Hatieganu” University of Medicine and Pharmacy, Cluj-Napoca, Romania
Kinga Cristina Slavescu

Corresponding author

Correspondence to Radu Razvan Slavescu.

Editor information

Editors and Affiliations

Technical University of Cluj-Napoca, Cluj-Napoca, Romania
Adrian Groza
Indian Institute of Information Technology, Sri City, India
Rajendra Prasath

About this paper

Cite this paper

Slavescu, R.R., Maşca, C., Slavescu, K.C. (2018). Automatic Extraction of Structured Information from Drug Descriptions. In: Groza, A., Prasath, R. (eds) Mining Intelligence and Knowledge Exploration. MIKE 2018. Lecture Notes in Computer Science, vol 11308. Springer, Cham. https://doi.org/10.1007/978-3-030-05918-7_3

DOI: https://doi.org/10.1007/978-3-030-05918-7_3
Published: 18 December 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-05917-0
Online ISBN: 978-3-030-05918-7
eBook Packages: Computer Science, Computer Science (R0)
