skip to main content
10.1145/3607720.3607746acmotherconferencesArticle/Chapter ViewAbstractPublication PagesnissConference Proceedingsconference-collections
research-article

Patient smart home monitoring using vision neural network transformers

Published:13 November 2023Publication History

ABSTRACT

Image captioning is a task that involves generating natural language descriptions of the content of an image, and has the potential to support healthcare providers in monitoring patient conditions and routines at home. The ability to remotely monitor patients can provide valuable information to healthcare providers, allowing them to identify changes in patient behavior and facilitate timely interventions. In this study, we examine the usability of transformer neural networks for image caption generation from surveillance camera footage taken at regular intervals of one minute. Our objective is to develop and evaluate a transformer neural network model for generating captions of patient behavior, trained and evaluated on the Common Objects in Context (COCO) dataset. Our study provides a proof-of-concept for the potential of transformer neural networks in image captioning for remote monitoring of patient behavior. By generating natural language descriptions of patient behavior, healthcare providers can obtain valuable insights into patient routines and conditions, allowing them to monitor patients remotely and identify changes in behavior that may require intervention. Furthermore, our study highlights the potential for transformer neural networks to support healthcare providers in identifying patterns and trends in patient behavior over time.

References

  1. Haleem, A., Javaid, M., Singh, R. P., & Suman, R. (2021). Telemedicine for healthcare: Capabilities, features, barriers, and applications. Sensors international, 2, 100117. https://doi.org/10.1016/j.sintl.2021.100117Google ScholarGoogle ScholarCross RefCross Ref
  2. I. Hrga, M. Ivašić-Kos (2019). Deep Image Captioning: An Overview. In the proceedings the 42nd International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO).Google ScholarGoogle Scholar
  3. de Miguel, K., Brunete, A., Hernando, M., & Gambao, E. (2017). Home Camera-Based Fall Detection System for the Elderly. Sensors (Basel, Switzerland), 17(12), 2864. https://doi.org/10.3390/s17122864Google ScholarGoogle ScholarCross RefCross Ref
  4. Issam B., Xiaojun Z. (2022). Wearable sensors and machine learning in post-stroke rehabilitation assessment: A systematic review. Biomedical Signal Processing and Control, Volume 71, Part B, 103197. https://doi.org/10.1016/j.bspc.2021.103197Google ScholarGoogle ScholarCross RefCross Ref
  5. Alugubelli, N., Abuissa, H., & Roka, A. (2022). Wearable Devices for Remote Monitoring of Heart Rate and Heart Rate Variability—What We Know and What Is Coming. Sensors, 22(22), 8903. MDPI AG. Retrieved from http://dx.doi.org/10.3390/s22228903Google ScholarGoogle ScholarCross RefCross Ref
  6. Vandermi S., Vinicius S. Souzaa, Robson G. da Cruzb, (2018). MobiHealth: a System to Improve Medication Adherence in Hypertensive Patients. In the proceedings of the 8th International Conference on Current and Future Trends of Information and Communication Technologies in Healthcare (ICTH2018). Procedia Computer Science 141 (2018) 366–373Google ScholarGoogle Scholar
  7. Javanmardi, S., Latif, A., Sadeghi, M., Jahanbanifard, M., Bonsangue, M., & Verbeek, F. (2022). Caps Captioning: A Modern Image Captioning Approach Based on Improved Capsule Network. Sensors, 22(21), 8376. MDPI AG. Retrieved from http://dx.doi.org/10.3390/s22218376Google ScholarGoogle ScholarCross RefCross Ref
  8. Iwamura, K., Louhi Kasahara, J. Y., Moro, A., Yamashita, A., & Asama, H. (2021). Image Captioning Using Motion-CNN with Object Detection. Sensors, 21(4), 1270. MDPI AG. Retrieved from http://dx.doi.org/10.3390/s21041270Google ScholarGoogle ScholarCross RefCross Ref
  9. Lu, K.-L., & Chu, E. (2018). An Image-Based Fall Detection System for the Elderly. Applied Sciences, 8(10), 1995. https://doi.org/10.3390/app8101995Google ScholarGoogle ScholarCross RefCross Ref
  10. Beddiar, D. R., Oussalah, M., & Seppänen, T. (2022). Automatic captioning for medical imaging (MIC): a rapid review of literature. Artificial intelligence review, 1–58. Advance online publication. https://doi.org/10.1007/s10462-022-10270-wGoogle ScholarGoogle ScholarDigital LibraryDigital Library
  11. Marefat, A., Marefat, M., Hassannataj Joloudari, J., Nematollahi, M. A., & Lashgari, R. (2023). CCTCOVID: COVID-19 detection from chest X-ray images using Compact Convolutional Transformers. Frontiers in public health, 11, 1025746. https://doi.org/10.3389/fpubh.2023.1025746Google ScholarGoogle ScholarCross RefCross Ref
  12. Kelei He, Chen Gan, Zhuoyuan Li, Islem Rekik, Zihao Yin, Wen Ji, Yang Gao, Qian Wang, Junfeng Zhang, Dinggang Shen (2023). Transformers in medical image analysis, Intelligent Medicine, Volume 3, Issue 1, Pages 59-78, ISSN 2667-1026, https://doi.org/10.1016/j.imed.2022.07.002.Google ScholarGoogle ScholarCross RefCross Ref
  13. Kiran M., Surajit M., Bhushankumar N. (2022). A review: Data pre-processing and data augmentation techniques. Global Transitions Proceedings. https://doi.org/10.1016/j.gltp.2022.04.020Google ScholarGoogle ScholarCross RefCross Ref
  14. Yamashita, R., Nishio, M., Do, R.K.G. et a. Convolutional neural networks: an overview and application in radiology. Insights Imaging 9, 611–629 (2018). https://doi.org/10.1007/s13244-018-0639-9Google ScholarGoogle ScholarCross RefCross Ref
  15. Rashid khana , M Shujah Islama , Khadija Kanwal, (). A Deep Neural Framework for Image Caption Generation Using GRU-Based Attention Mechanism. https://arxiv.org/ftp/arxiv/papers/2203/2203.01594.pdfGoogle ScholarGoogle Scholar
  16. Tsung-Yi Lin, Michael Maire, Serge Belongie, Lubomir Bourdev, Ross Girshick, James Hays, Pietro Perona, Deva Ramanan, C. Lawrence Zitnick, Piotr Dollár (2014). Microsoft COCO: Common Objects in Context. arXiv:1405.0312. https://doi.org/10.48550/arXiv.1405.0312Google ScholarGoogle ScholarCross RefCross Ref
  17. Ashish Vaswani, et. al. Attention Is All You Need. arXiv:1706.03762 [cs.CL], 2017, https://doi.org/10.48550/arXiv.1706.03762Google ScholarGoogle ScholarCross RefCross Ref
  18. Alexey Dosovitskiy, An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale. 2021, arXiv:2010.11929 [cs.CV], https://doi.org/10.48550/arXiv.2010.11929Google ScholarGoogle ScholarCross RefCross Ref
  19. Jason Brownlee (2017). A Gentle Introduction to Calculating the BLEU Score for Text in Python. Deep Learning for Natural Language Processing.Google ScholarGoogle Scholar
  20. Satanjeev Banerjee, Alon Lavie (2005). METEOR: An Automatic Metric for MT Evaluation with Improved Correlation with Human Judgments. Carnegie Mellon University, School of Computer Science Journal.Google ScholarGoogle Scholar

Index Terms

  1. Patient smart home monitoring using vision neural network transformers
          Index terms have been assigned to the content through auto-classification.

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in
          • Published in

            cover image ACM Other conferences
            NISS '23: Proceedings of the 6th International Conference on Networking, Intelligent Systems & Security
            May 2023
            451 pages
            ISBN:9798400700194
            DOI:10.1145/3607720

            Copyright © 2023 ACM

            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 13 November 2023

            Permissions

            Request permissions about this article.

            Request Permissions

            Check for updates

            Qualifiers

            • research-article
            • Research
            • Refereed limited
          • Article Metrics

            • Downloads (Last 12 months)31
            • Downloads (Last 6 weeks)8

            Other Metrics

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader

          HTML Format

          View this article in HTML Format .

          View HTML Format