Accurate Remote PPG Waveform Recovery from Video Using a Multi-task Learning Temporal Model

  • Conference paper
Advances in Visual Computing (ISVC 2024)

Abstract

Remote photoplethysmography (rPPG) offers a convenient, non-contact method for extracting cardiac-related signals from video. Despite its significant potential for comprehensive cardiac health monitoring, existing methods are limited to extracting heart rate because they can only recover heart-rate-correlated periodic patterns rather than the complete and precise PPG waveform needed for thorough biometric analysis. To address this issue, we designed a multi-loss model aimed at accurately restoring rPPG waveforms, focusing on capturing critical fiducial points and pulse contours. Our model employs a multi-task learning architecture that integrates primary rPPG signal reconstruction mean squared error (MSE) loss, peak loss, trough loss, and signal-to-noise ratio (SNR) loss to enhance signal recovery. Additionally, we incorporated Temporal Shift Modules (TSM) and Long Short-Term Memory (LSTM) networks to capture both short-term and long-term temporal dependencies, effectively handling low-quality or cross-dataset training data. The experimental results show that our model significantly improves rPPG signal restoration on the PURE and UBFC-rPPG datasets, outperforming two representative models, DeepPhys and TS-CAN, by reducing systolic peak and foot/onset estimation errors by over 30%, accurately capturing diastolic peaks and dicrotic notches, and achieving a DTW distance of 6.54, indicating enhanced waveform contour recovery.
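The abstract names four loss terms (MSE, peak, trough, SNR) and a DTW distance for evaluation, but this page does not give their formulas. The following is a minimal NumPy sketch of how such terms could be combined, not the authors' implementation. The fiducial-point loss (MSE restricted to the reference signal's local extrema), the spectral SNR definition, the 0.7 to 4 Hz pulse band, the `weights` values, and every function name are assumptions made for illustration. The DTW routine is the classic dynamic-programming form with absolute-difference cost; the normalization behind the reported 6.54 is not stated here.

```python
import numpy as np

def local_extrema(x, kind="peak"):
    """Indices of strict local maxima ("peak") or minima ("trough")."""
    s = x if kind == "peak" else -x
    mid = s[1:-1]
    return np.where((mid > s[:-2]) & (mid > s[2:]))[0] + 1

def mse_loss(pred, target):
    """Mean squared error over the whole waveform."""
    return float(np.mean((pred - target) ** 2))

def fiducial_loss(pred, target, kind):
    """Assumed form: MSE evaluated only at the reference peaks or troughs."""
    idx = local_extrema(target, kind)
    if idx.size == 0:
        return 0.0
    return float(np.mean((pred[idx] - target[idx]) ** 2))

def snr_loss(pred, target, fs, half_width_hz=0.1):
    """Assumed form: negative SNR (dB) of pred around the reference pulse frequency."""
    freqs = np.fft.rfftfreq(len(pred), d=1.0 / fs)
    power = np.abs(np.fft.rfft(pred - pred.mean())) ** 2
    ref_power = np.abs(np.fft.rfft(target - target.mean())) ** 2
    band = (freqs >= 0.7) & (freqs <= 4.0)          # assumed pulse band
    f0 = freqs[band][np.argmax(ref_power[band])]    # dominant reference frequency
    signal = band & (np.abs(freqs - f0) <= half_width_hz)
    noise = band & ~signal
    eps = 1e-12                                     # avoids division by zero
    ratio = (power[signal].sum() + eps) / (power[noise].sum() + eps)
    return float(-10.0 * np.log10(ratio))

def multi_task_loss(pred, target, fs, weights=(1.0, 0.5, 0.5, 0.01)):
    """Weighted sum of the four terms; the weights are hypothetical."""
    w_mse, w_peak, w_trough, w_snr = weights
    return (w_mse * mse_loss(pred, target)
            + w_peak * fiducial_loss(pred, target, "peak")
            + w_trough * fiducial_loss(pred, target, "trough")
            + w_snr * snr_loss(pred, target, fs))

def dtw_distance(a, b):
    """Classic O(n*m) dynamic time warping with absolute-difference cost."""
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)
    n, m = len(a), len(b)
    D = np.full((n + 1, m + 1), np.inf)
    D[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            cost = abs(a[i - 1] - b[j - 1])
            D[i, j] = cost + min(D[i - 1, j], D[i, j - 1], D[i - 1, j - 1])
    return float(D[n, m])
```

In a training setup these terms would be written in a differentiable framework so gradients can flow through them; the NumPy version only illustrates how the quantities relate. A lower `multi_task_loss` and a lower `dtw_distance` both indicate a prediction closer to the reference waveform.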

Author information

Corresponding author

Correspondence to Tianming Zhao.

Copyright information

© 2025 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Cite this paper

Zhou, F., Zhao, T., Holsinger, A., Yao, Z. (2025). Accurate Remote PPG Waveform Recovery from Video Using a Multi-task Learning Temporal Model. In: Bebis, G., et al. Advances in Visual Computing. ISVC 2024. Lecture Notes in Computer Science, vol 15046. Springer, Cham. https://doi.org/10.1007/978-3-031-77392-1_37

  • DOI: https://doi.org/10.1007/978-3-031-77392-1_37

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-77391-4

  • Online ISBN: 978-3-031-77392-1

  • eBook Packages: Computer Science, Computer Science (R0)
