Abstract
Medication recommendation aims to provide clinicians with safe medicine combinations for the treatment of patients. Existing medication recommendation models are built based on the temporal structured code data of electronic health records (EHRs) in the medical database; nevertheless, unstructured data in EHRs, such as textual data containing rich information, are underexploited. To fill the gap, a novel multimodal interactive fusion network (MIFNet) is proposed for medication recommendation, which integrates both structured code information and unstructured text information in EHRs. Our model first extracts a series of informative feature representations to encode comprehensive patient health history and control potential drug–drug interactions (DDI), including medical code, clinical notes, and the DDI knowledge graph. Next, a novel cross-modal interaction extraction block is proposed to capture the intricate interaction information between the two modalities. Finally, a multimodal fusion block is adopted to fuse the constructed features and generate a medication combination list. Experiments are conducted on the public MIMIC-III dataset, and the results demonstrate that the proposed model outperforms the state-of-the-art medication recommendation methods on main evaluation metrics.








Similar content being viewed by others
Data availability
The dataset that supports the study is available from https://physionet.org/content/mimiciii/1.4/.
References
Al-Taie Z, Hannink M, Mitchem J, Papageorgiou C, Shyu C-R (2021a) Drug repositioning and subgroup discovery for precision medicine implementation in triple negative breast cancer. Cancers 13(24):6278. https://doi.org/10.3390/cancers13246278
Al-Taie Z, Liu D, Mitchem JB, Papageorgiou C, Kaifi JT, Warren WC, Shyu C-R (2021b) Explainable artificial intelligence in high-throughput drug repositioning for subgroup stratifications with interventionable potential. J Biomed Inf 118:103792. https://doi.org/10.1016/j.jbi.2021.103792
Almirall D, Compton SN, Gunlicks-Stoessel M, Duan N, Murphy SA (2012) Designing a pilot sequential multiple assignment randomized trial for developing an adaptive treatment strategy. Stat Med 31(17):1887–1902. https://doi.org/10.1002/sim.4512
Alsentzer E, Murphy JR, Boag W, Weng W-H, Jin D, Naumann T, McDermott M (2019) Publicly available clinical BERT embeddings. In: Proceedings of the 2nd Clinical Natural Language Processing Workshop, https://doi.org/10.18653/v1/W19-1909
An Y, Zhang H, Sheng Y, Wang J, Chen X (2021) MAIN: multimodal attention-based fusion networks for diagnosis prediction. In: 2021 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), https://doi.org/10.1109/BIBM52615.2021.9669634
An Y, Zhang L, You M, Tian X, Jin B, Wei X (2021b) Mesin: multilevel selective and interactive network for medication recommendation. Knowl-Based Syst 233:107534. https://doi.org/10.1016/j.knosys.2021.107534
Bajor JM, Lasko TA (2016) Predicting medications from diagnostic codes with recurrent neural networks. In: International Conference on Learning Representations
Baumel T, Nassour-Kassis J, Cohen R, Elhadad M, Elhadad N (2018) Multi-label classification of patient notes: case study on ICD code assignment. In: Workshops at the Thirty-Second AAAI Conference on Artificial Intelligence
Bhoi S, Lee ML, Hsu W, Fang HSA, Tan NC (2021) Personalizing medication recommendation with a graph-based approach. ACM Transact Inf Syst (TOIS) 40(3):1–23. https://doi.org/10.1145/3488668
Choi E, Bahadori MT, Sun J, Kulas J, Schuetz A, Stewart W (2016) Retain: an interpretable predictive model for healthcare using reverse time attention mechanism. Adv Neural Inf Process Syst 29
Chorowski JK, Bahdanau D, Serdyuk D, Cho K, Bengio Y (2015) Attention-based models for speech recognition. Adv Neural Inf Process Syst 28
Gong F, Wang M, Wang H, Wang S, Liu M (2021) SMR: medical knowledge graph embedding for safe medicine recommendation. Big Data Res 23:100174. https://doi.org/10.1016/j.bdr.2020.100174
Huang K, Altosaar J, Ranganath R (2019) Clinicalbert: modeling clinical notes and predicting hospital readmission. https://arxiv.org/abs/1904.05342.
Jiang Y, Sha L, Rahmaniheris M, Wan B, Hosseini M, Tan P, Berlin RB Jr (2016) Sepsis patient detection and monitor based on auto-BN. J Med Syst 40(4):111. https://doi.org/10.1007/s10916-016-0444-2
Johnson AE, Pollard TJ, Shen L, Lehman LW, Feng M, Ghassemi M, Moody B, Szolovits P, Celi LA, Mark RG (2016) MIMIC-III, a freely accessible critical care database. Sci Data 3:160035. https://doi.org/10.1038/sdata.2016.35
Kim Y (2014) Convolutional neural networks for sentence classification. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP) Doha, Qatar. https://doi.org/10.3115/v1/D14-1181
Lakkaraju H, Rudin C (2017) Learning cost-effective and interpretable treatment regimes. In: Artificial Intelligence and Statistics
Le H, Tran T, Venkatesh S (2018) Dual memory neural computer for asynchronous two-view sequential learning. In: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, https://doi.org/10.1145/3219819.3219981
Li R, Ma F, Gao J (2021) Integrating multimodal electronic health records for diagnosis prediction. In: AMIA Annual Symposium Proceedings
Li X, Zhang Y, Li X, Wei H, Lu M (2023) DGCL: distance-wise and graph contrastive learning for medication recommendation. J Biomed Inform 139:104301. https://doi.org/10.1016/j.jbi.2023.104301
Lian J, Zhou X, Zhang F, Chen Z, Xie X, Sun G (2018) xdeepfm: combining explicit and implicit feature interactions for recommender systems. In: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, https://doi.org/10.1145/3219819.3220023
Lin S, Wang M, Shi C, Xu Z, Chen L, Gao Q, Chen J (2022) MR-KPA: medication recommendation by combining knowledge-enhanced pre-training with a deep adversarial network. BMC Bioinformatics 23(1):1–19. https://doi.org/10.1186/s12859-022-05102-1
Liu J, Zhang Z, Razavian N (2018) Deep ehr: chronic disease prediction using medical notes. In: Machine Learning for Healthcare Conference
Lopez-Ubeda P, Diaz-Galiano MC, Martin-Noguerol T, Luna A, Urena-Lopez LA, Martin-Valdivia MT (2021) Automatic medical protocol classification using machine learning approaches. Comput Methods Programs Biomed 200:105939. https://doi.org/10.1016/j.cmpb.2021.105939
Luaces O, Díez J, Barranquero J, del Coz JJ, Bahamonde A (2012) Binary relevance efficacy for multilabel classification. Progr Artif Intell 1(4):303–313. https://doi.org/10.1007/s13748-012-0030-x
Lyu W, Dong X, Wong R, Zheng S, Abell-Hart K, Wang F, Chen C (2022) A multimodal transformer: fusing clinical notes with structured EHR data for interpretable in-hospital mortality prediction. In: AMIA ... Annual Symposium proceedings. AMIA Symposium, 2022, pp 719–728. Retrieved 2022, from https://europepmc.org/articles/PMC10148371
Ma L, Zhang Y (2015) Using Word2Vec to process big text data. In: 2015 IEEE International Conference on Big Data (Big Data), https://doi.org/10.1109/BigData.2015.7364114
Meng Y, Lu C, Jin M, Xu J, Zeng X, Yang J (2022) A weighted bilinear neural collaborative filtering approach for drug repositioning. Brief Bioinf 23(2):581. https://doi.org/10.1093/bib/bbab581
Meng Y, Speier W, Ong MK, Arnold CW (2021) Bidirectional representation learning from transformers using multimodal electronic health record data to predict depression. IEEE J Biomed Health Inform 25(8):3121–3129. https://doi.org/10.1109/JBHI.2021.3063721
Nemati S, Holder A, Razmi F, Stanley MD, Clifford GD, Buchman TG (2018) An interpretable machine learning model for accurate prediction of sepsis in the ICU. Crit Care Med 46(4):547–553. https://doi.org/10.1097/CCM.0000000000002936
Nguyen H, Patrick J (2016) Text mining in clinical domain: dealing with noise. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, https://doi.org/10.1145/2939672.2939720
Pushpakom S, Iorio F, Eyers PA, Escott KJ, Hopper S, Wells A, Doig A, Guilliams T, Latimer J, McNamee C, Norris A, Sanseau P, Cavalla D, Pirmohamed M (2019) Drug repurposing: progress, challenges and recommendations. Nat Rev Drug Discov 18(1):41–58. https://doi.org/10.1038/nrd.2018.168
Qiao Z, Wu X, Ge S, Fan W (2019) Mnn: multimodal attentional neural networks for diagnosis prediction. In: International Joint Conference on Artificial Intelligence, https://doi.org/10.24963/ijcai.2019/823
Rahman W, Hasan MK, Lee S, Zadeh A, Mao C, Morency L-P, Hoque E (2020) Integrating multimodal information in large pretrained transformers. In: Proceedings of the conference. Association for Computational Linguistics. Meeting, https://doi.org/10.18653/v1/2020.acl-main.214
Ramachandram D, Taylor GW (2017) Deep multimodal learning: a survey on recent advances and trends. IEEE Signal Process Mag 34(6):96–108. https://doi.org/10.1109/MSP.2017.2738401
Read J, Pfahringer B, Holmes G, Frank E (2011) Classifier chains for multi-label classification. Mach Learn 85(3):333–359. https://doi.org/10.1007/s10994-011-5256-5
Shang J, Xiao C, Ma T, Li H, Sun J (2019) Gamenet: graph augmented memory networks for recommending medication combination. In: Proceedings of the AAAI Conference on Artificial Intelligence, https://doi.org/10.1609/aaai.v33i01.33011126
Song J, Wang Y, Tang S, Zhang Y, Chen Z, Zhang Z, Zhang T, Wu F (2020) Local-global memory neural network for medication prediction. IEEE Transact Neural Netw Learn Syst 32(4):1723–1736. https://doi.org/10.1109/TNNLS.2020.2989364
Srivastava N, Salakhutdinov RR (2012) Multimodal learning with deep boltzmann machines. Adv Neural Inf Process Syst, 25
Su Y, Shi Y, Lee W, Cheng L, Guo H (2022) TAHDNet: time-aware hierarchical dependency network for medication recommendation. J Biomed Inform 129:104069. https://doi.org/10.1016/j.jbi.2022.104069
Sundararajan M, Taly A, Yan Q (2017) Axiomatic attribution for deep networks. In: International conference on machine learning
Tatonetti NP, Ye PP, Daneshjou R, Altman RB (2012) Data-driven prediction of drug effects and interactions. Sci Transl Med. https://doi.org/10.1126/scitranslmed.3003377
Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, Kaiser Ł, Polosukhin I (2017) Attention is all you need. Adv Neural Inf Process Syst 30
Wang H, Wu Y, Gao C, Deng Y, Zhang F, Huang J, Liu J (2021a) Medication combination prediction using temporal attention mechanism and simple graph convolution. IEEE J Biomed Health Inform 25(10):3995–4004. https://doi.org/10.1109/JBHI.2021.3082548
Wang L, Zhang W, He X, Zha H (2018) Personalized prescription for comorbidity. In: Database Systems for Advanced Applications (pp. 3–19). https://doi.org/10.1007/978-3-319-91458-9_1
Wang S, Ren P, Chen Z, Ren Z, Ma J, de Rijke M (2019) Order-free medicine combination prediction with graph convolutional reinforcement learning. In: Proceedings of the 28th ACM International Conference on Information and Knowledge Management, https://doi.org/10.1145/3357384.3357965
Wang Y, Chen W, Pi D, Yue L, Wang S, Xu M (2021) Self-supervised adversarial distribution regularization for medication recommendation. https://doi.org/10.24963/ijcai.2021/431
Wu R, Qiu Z, Jiang J, Qi G, Wu X (2022) Conditional generation net for medication recommendation. In: Proceedings of the ACM Web Conference 2022, https://doi.org/10.1145/3485447.3511936
Xu Z, So DR, Dai AM (2021) Mufasa: Multimodal fusion architecture search for electronic health records. In: Proceedings of the AAAI Conference on Artificial Intelligence, https://doi.org/10.1609/aaai.v35i12.17260
Yang B, Wu L (2021) How to leverage multimodal EHR data for better medical predictions? In: Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, https://doi.org/10.18653/v1/2021.emnlp-main.329
Yang C, Xiao C, Ma F, Glass L, Sun J (2021) Safedrug: dual molecular graph encoders for safe drug recommendations. In: Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, https://doi.org/10.24963/ijcai.2021/514
Zadeh A, Chen M, Poria S, Cambria E, Morency L-P (2017) Tensor fusion network for multimodal sentiment analysis.In: Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, https://doi.org/10.18653/v1/D17-1115
Zhang S, Tong H, Xu J, Maciejewski R (2019) Graph convolutional networks: a comprehensive review. Comput Soc Netw 6(1):1–23. https://doi.org/10.1186/s40649-019-0069-y
Zhang Y, Chen R, Tang J, Stewart WF, Sun J (2017) LEAP: learning to prescribe effective and safe treatment combinations for multimorbidity. In: Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, https://doi.org/10.1145/3097983.3098109
Zhao B-W, Su X-R, Hu P-W, Ma Y-P, Zhou X, Hu L (2022) A geometric deep learning framework for drug repositioning over heterogeneous information networks. Briefings Bioinf 23(6):bbac384. https://doi.org/10.1093/bib/bbac384
Acknowledgements
The authors thank the editors and the anonymous reviewers for their insightful comments and constructive suggestions throughout the review process.
Funding
The work has been supported by National Natural Science Foundation of China (No. 72171176, NO. 71771179, NO. 72021002) and China-Germany Cooperation project supported by National Natural Science Foundation of China (M-0310).
Author information
Authors and Affiliations
Contributions
ZH and YD conceived and designed this study. ZH and JH performed the experiments. ZH, MC, and YD assisted with research and writing. All authors assisted in drafting and revising the manuscript at all stages, and all have approved it in this final form.
Corresponding author
Ethics declarations
Conflict of interest
The authors have no conflicts of interest to declare.
Ethical approval
Not applicable.
Consent to participation
Not applicable.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Huo, J., Hong, Z., Chen, M. et al. MIFNet: multimodal interactive fusion network for medication recommendation. J Supercomput 80, 12313–12345 (2024). https://doi.org/10.1007/s11227-024-05908-1
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11227-024-05908-1