Abstract
In healthcare, the human body is a controlled input-output system, which generates different observations with the variations of external interventions. The intervention acts as the input, and the output is the phenotype observation that reflects the latent health state of the body system. The objective of healthcare is to determine effective intervention strategies that can nurse an unhealthy human body to a healthy state. With the advances of Internet-of-Things (IoT) and body sensor networks, it becomes convenient to observe the multimedia data of the human body anywhere and anytime. To aid healthcare decision making, we put forward to construct the human body simulators based on deep neural networks (DNNs) for healthcare research. At first, we formulate the model of the human body system based on DNNs. During our analysis, we realize that DNN-based models could simulate practical situations, e.g. some health states are unreachable. Then, we combine deep reinforcement learning (DRL) with conceptual embedding techniques to explore effective healthcare strategies for simulated human bodies. We implement a virtual human body simulator, which can take interventions and represent its hidden states by high-dimensional images, and a DRL-based treatment module, which can diagnose latent health state through the image observations and choose interventions to nurse the simulated body to a target state. By combining the body simulator and treatment module, we create a dynamic closed-loop for healthcare information processing. Experimental simulations are performed to validate the feasibility of the offered approach.
Similar content being viewed by others
References
Abadi M, Barham P, Chen J, Chen Z, Davis A, Dean J, Devin M, Ghemawat S, Irving G, Isard M, Kudlur M, Levenberg J, Monga R, Moore S, Murray DG, Steiner B, Tucker P, Vasudevan V, Warden P, Wicke M, Yu Y, Zheng X (2016) Tensorflow: a system for large-scale machine learning. In: Proceedings of the 12th USENIX conference on operating systems design and implementation, OSDI’16. USENIX Association, Berkeley, pp 265–283
Caggianese G, Cuomo S, Esposito M, Franceschini M, Gallo L, Infarinato F, Minutolo A, Piccialli F, Romano P (2018) Serious games and in-cloud data analytics for the virtualization and personalization of rehabilitation treatments. IEEE Trans Indust Inform 15(1):517–526
Choi E, Bahadori MT, Schuetz A, Stewart WF, Sun J (2016) Doctor ai: predicting clinical events via recurrent neural networks. In: Machine learning for healthcare conference, pp 301–318
Dai Y, Liu X, Wang G (2018) A body simulator with delayed health state transition. In: 2018 IEEE international conference on ubiquitous intelligence & computing. IEEE, pp 635–641
Dai Y, Wang G (2018) Analyzing tongue images using a conceptual alignment deep autoencoder. IEEE Access 6:5962–5972
Dai Y, Wang G (2018) A deep inference learning framework for healthcare. Pattern Recogn Lett. https://doi.org/10.1016/j.patrec.2018.02.009
Dai Y, Wang G, Chen S, Xie D, Chen S (2017) Using deep neural networks to simulate human body. In: 2017 IEEE international symposium on parallel and distributed processing with applications and 2017 IEEE international conference on ubiquitous computing and communications (ISPA/IUCC). IEEE, pp 959–966
Dai Y, Wang G, Li KC (2018) Conceptual alignment deep neural networks. J Intell Fuzzy Sys 34(3):1631–1642
Dartmann G, Song H, Schmeink A (2019) Big data analytics for cyber-physical systems: machine learning for the internet of things. Elsevier, Amsterdam
Doody RS, Pavlik V, Massman P, Rountree S, Darby E, Chan W (2010) Predicting progression of alzheimer’s disease. Alzheimers Research & Therapy 2 (2):1–9
Esteva A, Kuprel B, Novoa RA, Ko J, Swetter SM, Blau HM, Thrun S (2017) Dermatologist-level classification of skin cancer with deep neural networks. Nature 542:115–118
Esteva A, Robicquet A, Ramsundar B, Kuleshov V, DePristo M, Chou K, Cui C, Corrado G, Thrun S, Dean J (2019) A guide to deep learning in healthcare. Nature Medicine 25(1):24–29
Gu S, Holly E, Lillicrap T, Levine S (2017) Deep reinforcement learning for robotic manipulation with asynchronous off-policy updates. In: 2017 IEEE international conference on robotics and automation (ICRA). IEEE, pp 3389–3396
Hannun AY, Rajpurkar P, Haghpanahi M, Tison GH, Bourn C, Turakhia MP, Ng AY (2019) Cardiologist-level arrhythmia detection and classification in ambulatory electrocardiograms using a deep neural network. Nat Med 25(1):65
Hochreiter S, Schmidhuber J (1997) Long short-term memory. Neural Comput 9(8):1735–1780
Hornik K, Stinchcombe M, White H (1989) Multilayer feedforward networks are universal approximators. Neural Netw 2(5):359–366
Hovorka R (2011) Closed-loop insulin delivery: from bench to clinical practice. Nat Rev Endocrinol 7(7):385
Hu Y, Duan K, Zhang Y, Hossain MS, Rahman SMM, Alelaiwi A (2018) Simultaneously aided diagnosis model for outpatient departments via healthcare big data analytics. Multimed Tools Appl 77(3):3729–3743
Kaur P, Kumar R, Kumar M (2019) A healthcare monitoring system using random forest and internet of things (IoT). Multimed Tools Appl 78:19905–19916
Lecun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521(7553):436–444
Li J, Monroe W, Ritter A, Dan J, Galley M, Gao J (2016) Deep reinforcement learning for dialogue generation. In: Conference on empirical methods in natural language processing, pp 1192–1202
Lillicrap TP, Hunt JJ, Pritzel A, Heess N, Erez T, Tassa Y, Silver D, Wierstra D (2016) Continuous control with deep reinforcement learning. In: International conference on learning representations (ICLR), pp 1–10
Liu S, Bai W, Liu G, Li W, Srivastava HM (2018) Parallel fractal compression method for big video data. Complexity 2018
Liu S, Bai W, Zeng N, Wang S (2019) A fast fractal based compression for MRI images. IEEE Access 7:62412–62420
Liu S, Guo C, Al-Turjman F, Muhammad K, de Albuquerque VHC (2020) Reliability of response region: a novel mechanism in visual tracking by edge computing for IIoT environments. Mech Syst Signal Process 138:106537
Liu Y, Logan B, Liu N, Xu Z, Tang J, Wang Y (2017) Deep reinforcement learning for dynamic treatment regimes on medical registry data. In: 2017 IEEE international conference on healthcare informatics (ICHI). https://doi.org/10.1109/ICHI.2017.45, pp 380–385
Liu Y, Njilla LL, Wang J, Song H (2019) An LSTM enabled dynamic stackelberg game theoretic method for resource allocation in the cloud. In: 2019 international conference on computing, networking and communications (ICNC). IEEE, pp 797–801
Miotto R, Wang F, Wang S, Jiang X, Dudley JT (2017) Deep learning for healthcare: review, opportunities and challenges. Brief Bioinform, pp 1–11
Mnih V, Kavukcuoglu K, Silver D, Rusu AA, Veness J, Bellemare MG, Graves A, Riedmiller M, Fidjeland AK, Ostrovski G (2015) Human-level control through deep reinforcement learning. Nature 518(7540):529C533
Mould D (2012) Models for disease progression: new approaches and uses. Clinical Pharmacology & Therapeutics 92(1):125–131
Nemati S, Ghassemi MM, Clifford GD (2016) Optimal medication dosing from suboptimal clinical examples: a deep reinforcement learning approach. In: Engineering in medicine and biology society, pp 2978–2981
Nguyen P, Tran T, Wickramasinghe N, Venkatesh S (2017) Deepr: a convolutional net for medical records. IEEE J Biomed Health Inform 21(1):22–30
Orphanou K, Stassopoulou A, Keravnou E (2014) Temporal abstraction and temporal bayesian networks in clinical domains: a survey. Artif Intell Med 60(3):133–149
Pham T, Tran T, Phung D, Venkatesh S (2017) Predicting healthcare trajectories from medical records: a deep learning approach. J Biomed Inform 69:218–229
Silver D, Schrittwieser J, Simonyan K, Antonoglou I, Huang A, Guez A, Hubert T, Baker L, Lai M, Bolton A, et al. (2017) Mastering the game of go without human knowledge. Nature 550(7676):354C359
Sukkar R, Katz E, Zhang Y, Raunig D, Wyman BT (2012) Disease progression modeling using hidden markov models. In: 2012 annual international conference of the IEEE engineering in medicine and biology society. IEEE, pp 2845–2848
Wang L, Zhang W, He X, Zha H (2018) Supervised reinforcement learning with recurrent neural network for dynamic treatment recommendation. In: Proceedings of the 24th ACM SIGKDD international conference on knowledge discovery & data mining. ACM, pp 2447–2456
Wang X, Sontag D, Wang F (2014) Unsupervised learning of disease progression models. In: Proceedings of the 20th ACM SIGKDD international conference on knowledge discovery and data mining. ACM, New York, pp 85–94
Zhang Y, Sun L, Song H, Cao X (2014) Ubiquitous WSN for healthcare: recent advances and future prospects. IEEE Internet of Things Journal 1(4):311–318
Acknowledgements
The authors thank the editor and anonymous reviewers for their helpful comments and valuable suggestions.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
This work is supported in part by the National Natural Science Foundation of China under Grant Numbers 61632009, the Guangdong Provincial Natural Science Foundation under Grant Number 2017A030308006, the High Level Talents Program of Higher Education in Guangdong Province under Funding Support Number 2016ZJ01, Hunan Provincial Science and Technology Project Foundation under Grant Numbers 2018TP1018 and 2018RS3065.
Appendix
Appendix
1.1 The generated tongue images of decoding network
Given a 9-dimensional health state vector, the decoding network will generate a 32 × 32 × 3 tongue image with corresponding features. The typical tongue images of nine BC types are illustrated in Fig. 8. The representation space of the generated tongue images by the trained decoding network are illustrated in Fig. 9.
1.2 Different scales of regulating network
The scale of hidden layer in the regulating network can affect the complexity of the treatment. The intervention dimension also can affect the probability of finding a better healthcare strategy. We generate 5 random regulating networks for different scales respectively, and use the conceptual alignment DDPG to treat the same model five times. The best converged results are illustrated in Tables 9, 10 and 11.
Rights and permissions
About this article
Cite this article
Dai, Y., Wang, G., Muhammad, K. et al. A closed-loop healthcare processing approach based on deep reinforcement learning. Multimed Tools Appl 81, 3107–3129 (2022). https://doi.org/10.1007/s11042-020-08896-5
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-020-08896-5