skip to main content
10.1145/3581783.3612564acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

LGFat-RGCN: Faster Attention with Heterogeneous RGCN for Medical ICD Coding Generation

Published: 27 October 2023 Publication History

Abstract

With the increasing volume of healthcare data, automated International Classification of Diseases (ICD) has become increasingly relevant and is frequently regarded as a medical multi-label prediction problem. Current methods struggle to accurately classify medical diagnosis texts that represent deep and sparse categories. Unlike these works that model the label with code hierarchy or description for label prediction, we argue that the label generation with structural information can provide more comprehensive knowledge based on the observation that label synonyms and parent-child relationships in vary from their context in clinical contexts. In this study, we introduce \tool, a heterogeneous graph model with improved attention for automated ICD coding. Notably, our approach represents the model to consider this task as a labelled graph generation problem. Our enhanced attention mechanism boosts the model's capacity to learn from multi-relational heterogeneous graph representations. Additionally, we propose a discriminator for labelled graphs (LG) that computes the reward for each ICD code in the labelled graph generator. Our experimental findings demonstrate that our proposed model significantly outperforms all existing strong baseline methods and attains the best performance on three benchmark datasets.

References

[1]
Tal Baumel, Jumana Nassour-Kassis, Raphael Cohen, Michael Elhadad, and Noémie Elhadad. 2018. Multi-label classification of patient notes: case study on icd code assignment. In Workshops at the thirty-second AAAI conference on artificial intelligence.
[2]
Antoine Bordes, Nicolas Usunier, Alberto Garcia-Duran, JasonWeston, and Oksana Yakhnenko. 2013. Translating embeddings for modeling multi-relational data. Advances in neural information processing systems, 26.
[3]
Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao, Shengping Liu, and Weifeng Chong. 2020. HyperCore: hyperbolic and co-graph representation for automatic ICD coding. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, Online, (July 2020), 3105--3114.
[4]
Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao, Shengping Liu, and Weifeng Chong. 2020. Hypercore: hyperbolic and co-graph representation for automatic icd coding. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 3105--3114.
[5]
Niel Chah. 2017. Freebase-triples: a methodology for processing the freebase data dumps. arXiv preprint arXiv:1712.08707.
[6]
Yuanfei Dai, Shiping Wang, Neal N Xiong, and Wenzhong Guo. 2020. A survey on knowledge graph embedding: approaches, applications and benchmarks. Electronics, 9, 5, 750.
[7]
Luciano RS De Lima, Alberto HF Laender, and Berthier A Ribeiro-Neto. 1998. A hierarchical approach to the automatic categorization of medical documents. In Proceedings of the seventh international conference on Information and knowledge management, 132--139.
[8]
Matú? Falis, Maciej Pajak, Aneta Lisowska, Patrick Schrempf, Lucas Deckers, Shadia Mikhael, Sotirios Tsaftaris, and Alison O'Neil. 2019. Ontological attention ensembles for capturing semantic concepts in icd code prediction from clinical text. In Proceedings of the Tenth International Workshop on Health Text Mining and Information Analysis (LOUHI 2019), 168--177.
[9]
Shaoxiong Ji, Erik Cambria, and Pekka Marttinen. 2020. Dilated convolutional attention network for medical code assignment from clinical text. arXiv preprint arXiv:2009.14578.
[10]
Alistair EW Johnson et al. 2016. Mimic-iii, a freely accessible critical care database. Scientific data, 3, 1, 1--9.
[11]
Fei Li and Hong Yu. 2020. Icd coding from clinical text using multi-filter residual convolutional neural network. In proceedings of the AAAI conference on artificial intelligence number 05. Vol. 34, 8180--8187.
[12]
Junyu Luo, Cao Xiao, Lucas Glass, Jimeng Sun, and Fenglong Ma. 2021. Fusion: towards automated icd coding via feature compression. In Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021, 2096--2101.
[13]
Andrew Kachites McCallum, Kamal Nigam, Jason Rennie, and Kristie Seymore. 2000. Automating the construction of internet portals with machine learning. Information Retrieval, 3, 127--163.
[14]
James Mullenbach, Sarah Wiegreffe, Jon Duke, Jimeng Sun, and Jacob Eisenstein. 2018. Explainable prediction of medical codes from clinical text. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers). Association for Computational Linguistics, New Orleans, Louisiana, (June 2018), 1101--1111.
[15]
James Mullenbach, Sarah Wiegreffe, Jon Duke, Jimeng Sun, and Jacob Eisenstein. 2018. Explainable prediction of medical codes from clinical text. arXiv preprint arXiv:1802.05695.
[16]
Adler Perotte, Rimma Pivovarov, Karthik Natarajan, Nicole Weiskopf, Frank Wood, and Noémie Elhadad. 2014. Diagnosis code assignment: models and evaluation metrics. Journal of the American Medical Informatics Association, 21, 2, 231--237.
[17]
Aaditya Prakash, Siyuan Zhao, Sadid A Hasan, Vivek Datla, Kathy Lee, Ashequl Qadir, Joey Liu, and Oladimeji Farri. 2017. Condensed memory networks for clinical diagnostic inferencing. In Thirty-first AAAI conference on artificial intelligence.
[18]
Najmeh Sadoughi, Greg P Finley, James Fone, Vignesh Murali, Maxim Korenevski, Slava Baryshnikov, Nico Axtmann, Mark Miller, and David Suendermann-Oeft. 2018. Medical code prediction with multi-view convolution and description regularized label-dependent attention. arXiv preprint arXiv:1811.01468.
[19]
Michael Schlichtkrull, Thomas N Kipf, Peter Bloem, Rianne van den Berg, Ivan Titov, and Max Welling. 2018. Modeling relational data with graph convolutional networks. In European semantic web conference. Springer, 593--607.
[20]
Haoran Shi, Pengtao Xie, Zhiting Hu, Ming Zhang, and Eric P Xing. 2017. Towards automated icd coding using deep learning. arXiv preprint arXiv:1711.04075.
[21]
Aaron Sonabend, Winston Cai, Yuri Ahuja, Ashwin Ananthakrishnan, Zongqi Xia, Sheng Yu, and Chuan Hong. 2020. Automated icd coding via unsupervised knowledge integration (unite). International journal of medical informatics, 139, 104135.
[22]
Fabian Stephany and Fabian Braesemann. 2017. An exploration of wikipedia data as a measure of regional knowledge distribution. In International Conference on Social Informatics. Springer, 31--40.
[23]
Zequn Sun, ChengmingWang,Wei Hu, Muhao Chen, Jian Dai,Wei Zhang, and Yuzhong Qu. 2020. Knowledge graph alignment network with gated multi-hop neighborhood aggregation. In Proceedings of the AAAI Conference on Artificial Intelligence number 01. Vol. 34, 222--229.
[24]
Qiuling Suo, Fenglong Ma, Ye Yuan, Mengdi Huai, Weida Zhong, Jing Gao, and Aidong Zhang. 2018. Deep patient similarity learning for personalized healthcare. IEEE transactions on nanobioscience, 17, 3, 219--227.
[25]
Reed T Sutton, David Pincock, Daniel C Baumgart, Daniel C Sadowski, Richard N Fedorak, and Karen I Kroeker. 2020. An overview of clinical decision support systems: benefits, risks, and strategies for success. NPJ digital medicine, 3, 1, 17.
[26]
Thanh Vu, Dat Quoc Nguyen, and Anthony Nguyen. 2020. A label attention model for icd coding from clinical text. arXiv preprint arXiv:2007.06351.
[27]
Guoyin Wang, Chunyuan Li, Wenlin Wang, Yizhe Zhang, Dinghan Shen, Xinyuan Zhang, Ricardo Henao, and Lawrence Carin. 2018. Joint embedding of words and labels for text classification. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2321--2331.
[28]
Shanshan Wang, Pengjie Ren, Zhumin Chen, Zhaochun Ren, Jian-Yun Nie, Jun Ma, and Maarten de Rijke. 2020. Coding electronic health records with adversarial reinforcement path generation. In Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval, 801--810.
[29]
Shi Wang, Daniel Tang, and Luchen Zhang. 2021. A large-scale hierarchical structure knowledge enhanced pre-training framework for automatic ICD coding. In Neural Information Processing - 28th International Conference, ICONIP 2021, Sanur, Bali, Indonesia, December 8-12, 2021, Proceedings, Part VI (Communications in Computer and Information Science). Teddy Mantoro, Minho Lee, Media Anugerah Ayu, Kok Wai Wong, and Achmad Nizar Hidayanto, (Eds.) Vol. 1517. Springer, 494--502.
[30]
Shi Wang, Daniel Tang, Luchen Zhang, Huilin Li, and Ding Han. 2022. Hienet: bidirectional hierarchy framework for automated icd coding. In International Conference on Database Systems for Advanced Applications. Springer, 523--539.
[31]
Zhihao Wang and Xin Li. 2019. Hybrid-te: hybrid translation-based temporal knowledge graph embedding. In 2019 IEEE 31st International Conference on Tools with Artificial Intelligence (ICTAI). IEEE, 1446--1451.
[32]
Lingfei Wu, Jiliang Tang, Yinglong Xia, Jian Pei, and Xiaojie Guo. 2021. The sixth international workshop on deep learning on graphs-methods and applications (dlg-kdd'21). In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining, 4167--4168.
[33]
Zonghan Wu, Shirui Pan, Fengwen Chen, Guodong Long, Chengqi Zhang, and S Yu Philip. 2020. A comprehensive survey on graph neural networks. IEEE transactions on neural networks and learning systems, 32, 1, 4--24.
[34]
Pengtao Xie and Eric Xing. 2018. A neural architecture for automated icd coding. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 1066--1076.
[35]
Xiancheng Xie, Yun Xiong, Philip S Yu, and Yangyong Zhu. 2019. Ehr coding with multi-scale feature attention and structured knowledge graph propagation. In Proceedings of the 28th ACM international conference on information and knowledge management, 649--658.
[36]
Keyang Xu et al. 2019. Multimodal machine learning for automated icd coding. In Machine learning for healthcare conference. PMLR, 197--215.
[37]
Zheng Yuan, Chuanqi Tan, and Songfang Huang. 2022. Code synonyms do matter: multiple synonyms matching network for automatic icd coding. arXiv preprint arXiv:2203.01515.
[38]
Jie Zhou, Ganqu Cui, Shengding Hu, Zhengyan Zhang, Cheng Yang, Zhiyuan Liu, Lifeng Wang, Changcheng Li, and Maosong Sun. 2020. Graph neural networks: a review of methods and applications. AI Open, 1, 57--81.
[39]
Peng Zhou, Wei Shi, Jun Tian, Zhenyu Qi, Bingchen Li, Hongwei Hao, and Bo Xu. 2016. Attention-based bidirectional long short-term memory networks for relation classification. In Proceedings of the 54th annual meeting of the association for computational linguistics (volume 2: Short papers), 207--212.
[40]
Tong Zhou, Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao, Kun Niu, Weifeng Chong, and Shengping Liu. 2021. Automatic icd coding via interactive shared representation networks with self-distillation mechanism. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), 5948--5957.

Cited By

View all
  • (2024)AdHierNet: Enhancing Adversarial Robustness and Interpretability in Text Classification2024 6th International Conference on Natural Language Processing (ICNLP)10.1109/ICNLP60986.2024.10692972(41-45)Online publication date: 22-Mar-2024
  • (2023)AI-Driven Health Advice: Evaluating the Potential of Large Language Models as Health AssistantsJournal of Computational Methods in Engineering Applications10.62836/jcmea.v3i1.030106(1-7)Online publication date: 6-Nov-2023
  • (2023)Exploring the Factors of Supply Chain Concentration in Chinese A-Share Listed EnterprisesJournal of Computational Methods in Engineering Applications10.62836/jcmea.v3i1.030105(1-17)Online publication date: 6-Nov-2023

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
MM '23: Proceedings of the 31st ACM International Conference on Multimedia
October 2023
9913 pages
ISBN:9798400701085
DOI:10.1145/3581783
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 27 October 2023

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. gaze detection
  2. neural networks
  3. text tagging

Qualifiers

  • Research-article

Funding Sources

Conference

MM '23
Sponsor:
MM '23: The 31st ACM International Conference on Multimedia
October 29 - November 3, 2023
Ottawa ON, Canada

Acceptance Rates

Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)127
  • Downloads (Last 6 weeks)9
Reflects downloads up to 05 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2024)AdHierNet: Enhancing Adversarial Robustness and Interpretability in Text Classification2024 6th International Conference on Natural Language Processing (ICNLP)10.1109/ICNLP60986.2024.10692972(41-45)Online publication date: 22-Mar-2024
  • (2023)AI-Driven Health Advice: Evaluating the Potential of Large Language Models as Health AssistantsJournal of Computational Methods in Engineering Applications10.62836/jcmea.v3i1.030106(1-7)Online publication date: 6-Nov-2023
  • (2023)Exploring the Factors of Supply Chain Concentration in Chinese A-Share Listed EnterprisesJournal of Computational Methods in Engineering Applications10.62836/jcmea.v3i1.030105(1-17)Online publication date: 6-Nov-2023

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media