Predicting Outcomes for Cancer Patients with Transformer-Based Multi-task Learning

Gerrard, Leah; Peng, Xueping; Clarke, Allison; Schlegel, Clement; Jiang, Jing

doi:10.1007/978-3-030-97546-3_31

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 13151)

Included in the following conference series: Australasian Joint Conference on Artificial Intelligence

Abstract

Cancer patients often experience numerous hospital admissions as a result of their cancer and treatment, which can negatively impact treatment progress and quality of life. Accurately predicting outcomes for cancer patients is therefore crucial in providing personalised care and improving patient outcomes. Existing models leveraging deep learning with Electronic Health Record (EHR) data to predict outcomes for cancer patients are limited, despite the demonstrated success of these approaches with cancer imaging data and non-cancer EHR applications. Additionally, current methods focus on single-task predictions, and increasing evidence suggests jointly training a model on two related tasks can improve predictive performance. To address these limitations, we propose a Transformer-based Multi-Task (TransMT) model that captures relationships between diagnosis codes and sequential hospital visits to simultaneously predict related outcomes for hospitalised cancer patients. Experiments conducted on two public datasets show the proposed model outperforms both single-task and recurrent neural network approaches in predicting future diagnosis and hospital readmission, and demonstrates the benefits of using deep learning with EHR data for cancer-related research.
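
The joint training idea in the abstract can be illustrated with a minimal sketch. The abstract does not state the loss function, the task weighting, or the layer sizes, so everything below is an assumption: the `shared` vector stands in for the output of the Transformer encoder, the two heads are plain linear layers, and the hypothetical `alpha` parameter balances a multi-label diagnosis loss against a binary readmission loss. It is not the authors' implementation.

```python
import math


def sigmoid(x):
    """Logistic function mapping a logit to a probability."""
    return 1.0 / (1.0 + math.exp(-x))


def linear(weights, bias, features):
    """Apply one dense layer: one output per weight row."""
    return [sum(w * f for w, f in zip(row, features)) + b
            for row, b in zip(weights, bias)]


def binary_cross_entropy(prob, label, eps=1e-12):
    """Cross-entropy for a single binary label, clipped for stability."""
    prob = min(max(prob, eps), 1.0 - eps)
    return -(label * math.log(prob) + (1 - label) * math.log(1.0 - prob))


def joint_loss(shared, diag_head, readmit_head,
               diag_labels, readmit_label, alpha=0.5):
    """Weighted sum of two task losses computed from one shared encoding.

    shared       -- patient representation (assumed encoder output)
    diag_head    -- (weights, bias) for multi-label future diagnosis
    readmit_head -- (weights, bias) for binary readmission
    alpha        -- hypothetical task weight, not taken from the paper
    """
    diag_probs = [sigmoid(z) for z in linear(*diag_head, shared)]
    readmit_prob = sigmoid(linear(*readmit_head, shared)[0])

    # Average the per-code losses so the number of codes does not
    # dominate the readmission term.
    diag_loss = sum(binary_cross_entropy(p, y)
                    for p, y in zip(diag_probs, diag_labels)) / len(diag_labels)
    readmit_loss = binary_cross_entropy(readmit_prob, readmit_label)

    total = alpha * diag_loss + (1.0 - alpha) * readmit_loss
    return total, diag_loss, readmit_loss


# Toy example: 3-dimensional shared encoding, 2 diagnosis codes.
shared = [0.2, -0.1, 0.4]
diag_head = ([[0.0, 0.0, 0.0], [0.0, 0.0, 0.0]], [0.0, 0.0])
readmit_head = ([[0.0, 0.0, 0.0]], [0.0])
total, diag_loss, readmit_loss = joint_loss(
    shared, diag_head, readmit_head, diag_labels=[1, 0], readmit_label=1)
print(total, diag_loss, readmit_loss)
```

Because both heads read the same encoding, gradients from either task would update the shared encoder during training, which is the mechanism by which one task can help the other. With the untrained zero weights used here, every predicted probability is 0.5, so each loss equals ln 2.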

Notes

1. https://www.hcup-us.ahrq.gov/toolssoftware/ccs/CCSUsersGuide.pdf

Acknowledgements

This research is supported by an Australian Government Research Training Program Scholarship. We also thank the Australian Government Department of Health for supporting this work.

Author information

Authors and Affiliations

Faculty of Engineering and IT, Australian AI Institute, University of Technology Sydney, Sydney, Australia
Leah Gerrard, Xueping Peng & Jing Jiang
Health Economics and Research Division, Australian Government Department of Health, Canberra, Australia
Leah Gerrard, Allison Clarke & Clement Schlegel

Corresponding authors

Correspondence to Leah Gerrard or Xueping Peng.

Editor information

Editors and Affiliations

University of Technology Sydney, Sydney, NSW, Australia
Guodong Long
RMIT University, Melbourne, VIC, Australia
Xinghuo Yu
University of Queensland, Brisbane, QLD, Australia
Sen Wang

About this paper

Cite this paper

Gerrard, L., Peng, X., Clarke, A., Schlegel, C., Jiang, J. (2022). Predicting Outcomes for Cancer Patients with Transformer-Based Multi-task Learning. In: Long, G., Yu, X., Wang, S. (eds) AI 2021: Advances in Artificial Intelligence. AI 2022. Lecture Notes in Computer Science (LNAI), vol 13151. Springer, Cham. https://doi.org/10.1007/978-3-030-97546-3_31

DOI: https://doi.org/10.1007/978-3-030-97546-3_31
Published: 19 March 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-97545-6
Online ISBN: 978-3-030-97546-3
eBook Packages: Computer Science, Computer Science (R0)
