Abstract
Remote photoplethysmography (rPPG) offers a convenient, non-contact method for extracting cardiac-related signals from video. Despite its significant potential for comprehensive cardiac health monitoring, existing methods are limited to extracting heart rate because they can only recover heart-rate-correlated periodic patterns rather than the complete and precise PPG waveform needed for thorough biometric analysis. To address this issue, we designed a multi-loss model aimed at accurately restoring rPPG waveforms, focusing on capturing critical fiducial points and pulse contours. Our model employs a multi-task learning architecture that integrates primary rPPG signal reconstruction mean squared error (MSE) loss, peak loss, trough loss, and signal-to-noise ratio (SNR) loss to enhance signal recovery. Additionally, we incorporated Temporal Shift Modules (TSM) and Long Short-Term Memory (LSTM) networks to capture both short-term and long-term temporal dependencies, effectively handling low-quality or cross-dataset training data. The experimental results show that our model significantly improves rPPG signal restoration on the PURE and UBFC-rPPG datasets, outperforming two representative models, DeepPhys and TS-CAN, by reducing systolic peak and foot/onset estimation errors by over 30%, accurately capturing diastolic peaks and dicrotic notches, and achieving a DTW distance of 6.54, indicating enhanced waveform contour recovery.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
How medical device companies are transforming cardiac care with remote monitoring. https://empeek.com/insights/how-medical-device-companies-are-transforming-cardiac-care-with-remote-monitoring/. Accessed 13 Jan 2023
Bobbia, S., Macwan, R., Benezeth, Y., Mansouri, A., Dubois, J.: Unsupervised skin tissue segmentation for remote photoplethysmography. Pattern Recogn. Lett. 124, 82–90 (2019)
Chen, W., McDuff, D.: DeepPhys: video-based physiological measurement using convolutional attention networks. In: Proceedings of the European Conference on Computer Vision (ECCV), pp. 349–365 (2018)
Dall’Olio, L., et al.: Prediction of vascular aging based on smartphone acquired PPG signals. Sci. Rep. 10(1), 19756 (2020)
Dasari, A., Prakash, S.K.A., Jeni, L.A., Tucker, C.S.: Evaluation of biases in remote photoplethysmography methods. NPJ Dig. Med. 4(1), 91 (2021)
Güler, R.A., Neverova, N., Kokkinos, I.: DensePose: dense human pose estimation in the wild. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7297–7306 (2018)
Kendall, A., Gal, Y., Cipolla, R.: Multi-task learning using uncertainty to weigh losses for scene geometry and semantics. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7482–7491 (2018)
Liu, X., Fromm, J., Patel, S., McDuff, D.: Multi-task temporal shift attention networks for on-device contactless vitals measurement. Adv. Neural. Inf. Process. Syst. 33, 19400–19411 (2020)
Liu, X., et al.: rPPG-toolbox: deep remote PPG toolbox. Adv. Neural Inf. Process. Syst. 36 (2024)
Lu, H., Han, H., Zhou, S.K.: Dual-GAN: joint BVP and noise modeling for remote physiological measurement. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 12404–12413 (2021)
Mohan, P.M., Nisha, A.A., Nagarajan, V., Jothi, E.S.J.: Measurement of arterial oxygen saturation (SpO 2) using PPG optical sensor. In: 2016 International Conference on Communication and Signal Processing (ICCSP), pp. 1136–1140. IEEE (2016)
Narayanswamy, G., et al.: BigSmall: efficient multi-task learning for disparate spatial and temporal physiological measurements. In: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, pp. 7914–7924 (2024)
Ouyang, V., et al.: The use of multi-site photoplethysmography (PPG) as a screening tool for coronary arterial disease and atherosclerosis. Physiol. Meas. 42(6), 064006 (2021)
Slapničar, G., Luštrek, M., Marinko, M.: Continuous blood pressure estimation from PPG signal. Informatica 42(1) (2018)
Stricker, R., Müller, S., Gross, H.M.: Non-contact video-based pulse rate measurement on a mobile service robot. In: The 23rd IEEE International Symposium on Robot and Human Interactive Communication, pp. 1056–1062. IEEE (2014)
Yu, Z., Li, X., Zhao, G.: Remote photoplethysmograph signal measurement from facial videos using spatio-temporal networks. arXiv preprint arXiv:1905.02419 (2019)
Yu, Z., Shen, Y., Shi, J., Zhao, H., Torr, P.H., Zhao, G.: PhysFormer: facial video-based physiological measurement with temporal difference transformer. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 4186–4196 (2022)
Zhang, Y., Feng, Z.: A SVM method for continuous blood pressure estimation from a PPG signal. In: Proceedings of the 9th International Conference on Machine Learning and Computing, pp. 128–132 (2017)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2025 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Zhou, F., Zhao, T., Holsinger, A., Yao, Z. (2025). Accurate Remote PPG Waveform Recovery from Video Using a Multi-task Learning Temporal Model. In: Bebis, G., et al. Advances in Visual Computing. ISVC 2024. Lecture Notes in Computer Science, vol 15046. Springer, Cham. https://doi.org/10.1007/978-3-031-77392-1_37
Download citation
DOI: https://doi.org/10.1007/978-3-031-77392-1_37
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-77391-4
Online ISBN: 978-3-031-77392-1
eBook Packages: Computer ScienceComputer Science (R0)