Skip to main content

Implementation of a Medical Coding Support System by Combining Approaches: NLP and Machine Learning

  • Conference paper
  • First Online:
Heterogeneous Data Management, Polystores, and Analytics for Healthcare (DMAH 2018, Poly 2018)

Abstract

Diagnosis-Related Groups (DRG) billing for hospital stays is based on the collection of coded and standardized information constituting the Hospital Discharge Abstract (HDA). The HDA describes the pathological state of the patient and the care provided during his stay. This work aims to design and implement a coding support system for diagnoses and acts expressed in ICD-10 (ICD-10: International Classification of Disease, \(10^{th}\) version) and GNPA (GNPA: General Nomenclature of Professional Acts), respectively. The proposed solution takes a medical report as input and provides a list of recommended diagnoses and acts. It is based on the combination of two approaches, namely, NLP (NLP: Natural Language Processing) and machine learning. Firstly, the medical reports are pre-processed, via the NLP algorithms, in order to better understand the extent of the codes concerned. Secondly, the use of machine learning approaches offers the means of making the choice of codes as relevant as possible. The experiments carried out showed very satisfactory results, which are confirmed by hospital practitioners.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    http://www.atih.sante.fr/cim-10-fr-2017-usage-pmsi.

  2. 2.

    https://www.ameli.fr/medecin/exercice-liberal/facturation-remuneration/nomenclatures-codage/ngap.

References

  1. Scheurwegs, E., et al.: Selecting relevant features from the electronic health record for clinical code prediction. J. Biomed. Inform. 74, 92–103 (2017)

    Article  Google Scholar 

  2. Weathers, A.L.: Use of the electronic health record for coding in outpatient neurology. CONTINUUM: Lifelong Learn. Neurol. 23(2), e12–e16 (2017). Selected Topics in Outpatient Neurology

    Google Scholar 

  3. Duclos, C., et al.: Medical vocabulary, terminological resources and information coding in the health domain. In: Venot, A., Burgun, A., Quantin, C. (eds.) Medical Informatics, e-Health, pp. 11–41. Springer, Heidelberg (2014). https://doi.org/10.1007/978-2-8178-0478-1_2

    Chapter  Google Scholar 

  4. Pons, E., et al.: Natural language processing in radiology: a systematic review. Radiology 279(2), 329–343 (2016)

    Article  Google Scholar 

  5. Holzinger, A., Schantl, J., Schroettner, M., Seifert, C., Verspoor, K.: Biomedical text mining: state-of-the-art, open problems and future challenges. In: Holzinger, A., Jurisica, I. (eds.) Interactive Knowledge Discovery and Data Mining in Biomedical Informatics. LNCS, vol. 8401, pp. 271–300. Springer, Heidelberg (2014). https://doi.org/10.1007/978-3-662-43968-5_16

    Chapter  Google Scholar 

  6. Doan, S., et al.: Natural language processing in biomedicine: a unified system architecture overview. In: Clinical Bioinformatics, pp. 275–294. Humana Press, New York (2014)

    Google Scholar 

  7. Zaki, M.J., Meira Jr., W., Meira, W.: Data Mining and Analysis: Fundamental Concepts and Algorithms. Cambridge University Press, Cambridge (2014)

    Book  MATH  Google Scholar 

  8. Aggarwal, C.C., Zhai, C.: A survey of text classification algorithms. In: Aggarwal, C., Zhai, C. (eds.) Mining text data, pp. 163–222. Springer, Boston (2012). https://doi.org/10.1007/978-1-4614-3223-4_6

    Chapter  Google Scholar 

  9. Korde, V., Mahender, C.N.: Text classification and classifiers: a survey. Int. J. Artif. Intell. Appl. 3(2), 85 (2012)

    Google Scholar 

  10. Holzinger, A. (ed.): Machine Learning for Health Informatics: State-of-the-Art and Future Challenges, vol. 9605. Springer, Heidelberg (2016). https://doi.org/10.1007/978-3-319-50478-0

    Book  Google Scholar 

  11. Wang, Z., Xue, X.: Multi-class support vector machine. In: Ma, Y., Guo, G. (eds.) Support Vector Machines Applications, pp. 23–48. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-02300-7_2

    Chapter  Google Scholar 

  12. Sokolova, M., Lapalme, G.: A systematic analysis of performance measures for classification tasks. Inf. Process. Manag. 45(4), 427–437 (2009)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Idir Amine Amarouche , Dehbia Ahmed Zaid or Tayeb Kenaza .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Amarouche, I.A., Ahmed Zaid, D., Kenaza, T. (2019). Implementation of a Medical Coding Support System by Combining Approaches: NLP and Machine Learning. In: Gadepally, V., Mattson, T., Stonebraker, M., Wang, F., Luo, G., Teodoro, G. (eds) Heterogeneous Data Management, Polystores, and Analytics for Healthcare. DMAH Poly 2018 2018. Lecture Notes in Computer Science(), vol 11470. Springer, Cham. https://doi.org/10.1007/978-3-030-14177-6_11

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-14177-6_11

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-14176-9

  • Online ISBN: 978-3-030-14177-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics