Skip to main content
Log in

Tamil and English speech database for heartbeat estimation

  • Published:
International Journal of Speech Technology Aims and scope Submit manuscript

Abstract

The aim of this research work is to provide an open source database containing speech signals and the corresponding heartbeat rates, so as to further widen the area of research in speech signal processing, especially estimation of heartbeat rate from speech. Tamil and English Speech Database for Heartbeat Estimation consists of 10,040 speech recordings. The speech signals were recorded from 109 persons, 52 females and 57 males with an average age of 25 years and 6 months. The informed consented volunteers were asked to perform three tasks; like answering and reading in rest state; answering and reading after physical exercise and answering after watching video clips. 24-th and 72-nd order Mel-Frequency Cepstral Coefficients and 14-th and 52-nd order Auto Regressive Reflection Coefficients are extracted from the speech signal. Prediction of heartbeat is done by linear regression using support vector machine. The statistical significance of the heartbeat prediction results are improved by 10-fold speaker-independent cross validation scheme. Experimental results show a minimum average estimation error of ± 13.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2

Similar content being viewed by others

References

  • Barros, A. K., & Ohnishi, N. (2001). Heart instantaneous frequency (HIF): An alternative approach to extract heart rate variability. IEEE Transactions on Biomedical Engineering, 48(8), 850–855.

    Article  Google Scholar 

  • Bernardi, L., Wdowczyk-Szulc, J., Valenti, C., Castoldi, S., Passino, C., Spadacini, G., G., & Sleight, P. (2000). Effects of controlled breathing, mental activity and mental stress with or without verbalization on heart rate variability. Journal of the American College of Cardiology, 35(6), 1462–1469.

    Article  Google Scholar 

  • Hayre, H. S., & Holland, J. C. (1980). Cross-correlation of voice and heart rate as stress measures. Applied Acoustics, 13(1), 57–62.

    Article  Google Scholar 

  • James, A. P. (2015). Heart rate monitoring using human speech spectral features. Human-Centric Computing and Information Sciences, 5, 33. https://doi.org/10.1186/s13673-015-0052-z.

    Article  Google Scholar 

  • Johnson, H. J., & Campos, J. J. (1967). The effect of cognitive tasks and verbalization instructions on heart rate and skin conductance. Psychophysiology, 4(2), 143–150.

    Article  Google Scholar 

  • Kathol, A., & Shriberg, E. (2015). The SRI biofrustration corpus: audio, video, and physiological signals for continuous user modelling. In Proceedings of AAAI Spring Symposium Series 2015, (pp. 96–99) Palo Alto, California.

  • Makhoul, J. (1975). Linear prediction: a tutorial review. Proceedings of the IEEE, 63(4), 561–580.

    Article  Google Scholar 

  • Mesleh, A., Skopin, D., Baglikov, S., & Quteishat, A. (2012). Heart rate extraction from vowel speech signals. Journal of computer science and technology, 27(6), 1243–1251.

    Article  Google Scholar 

  • Milton, A. (2015). Automatic recognition of speech emotions using class-specific multiple classifier scheme. Ph.D. Thesis, Anna University, Chennai, India.

  • Rabiner, L. R., & Schafer, R. W. (2004s). Digital processing of speech signals. Delhi: Pearson Education (Singapore) Pte.Ltd.

    Google Scholar 

  • Ryskaliyev, A., Askaruly, S., & James, A. (2016). Speech signal analysis for the estimation of heart rates under different emotional states. In Proceedings of IEEE International Conference on Advances in Computing, Communications and Informatics, (pp. 1160–1165) Jaipur, India.

  • Schnell, I., Potchter, O., Epstein, Y., Yaakov, Y., Hermesh, H., Brenner, S., & Tirosh, E. (2013). The effects of exposure to environmental factors on heart rate variability: An ecological perspective. Environmental Pollution, 183, 7–13.

    Article  Google Scholar 

  • Schuller, B., Friedmann, F., & Eyben, F. (2013). Automatic recognition of physiological parameters in the human voice: heart rate and skin conductance. In Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing, (pp. 7219–7223) Vancouver, BC, Canada.

  • Schuller, B., Friedmann, F., & Eyben, F. (2014). The Munich biovoice corpus: effects of physical exercising, heart rate and skin conductance on human speech production. In Proceedings of the Ninth International Conference on Language Resources and Evaluation, (pp. 1506–1510) Reykjavik, Iceland.

  • Seraganian, P., Szabob, A., & Brown, T. G. (1997). The effect of vocalization on the heart rate response to mental arithmetic. Physiology & Behavior, 62(2), 221–224.

    Article  Google Scholar 

  • Smith, J., Tsiartas, A., Shriberg, E., Kathol, A., Willoughby, A., & Zambotti, M. D. (2017). Analysis and prediction of heart rate using speech features from natural speech. In IEEE International Conference in Acoustics, Speech and Signal Processing, (pp. 989–993) New Orleans, LA, USA.

  • Tsiartas, A., Kathol, A., Shriberg, E., Zambotti, M. D., & Willoughby, A. (2015). Prediction of heart rate changes from speech features during interaction with a misbehaving dialog system. In Proceedings of Interspeech 2015, (pp. 3175–3179) Dresden, Germany.

  • Vapnik, V. N. (1998). Statistical learning theory. New York: Wiley.

    MATH  Google Scholar 

Download references

Acknowledgements

We sincerely thank the Management, Principal, Students and Staff Members of St. Xavier’s Catholic College of Engineering, Nagercoil, for their valuable participation and support during the TESDHE database recording process.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to A. Milton.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Milton, A., Monsely, K.A. Tamil and English speech database for heartbeat estimation. Int J Speech Technol 21, 967–973 (2018). https://doi.org/10.1007/s10772-018-9557-y

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10772-018-9557-y

Keywords

Navigation