Abstract
The aim of this research work is to provide an open source database containing speech signals and the corresponding heartbeat rates, so as to further widen the area of research in speech signal processing, especially estimation of heartbeat rate from speech. Tamil and English Speech Database for Heartbeat Estimation consists of 10,040 speech recordings. The speech signals were recorded from 109 persons, 52 females and 57 males with an average age of 25 years and 6 months. The informed consented volunteers were asked to perform three tasks; like answering and reading in rest state; answering and reading after physical exercise and answering after watching video clips. 24-th and 72-nd order Mel-Frequency Cepstral Coefficients and 14-th and 52-nd order Auto Regressive Reflection Coefficients are extracted from the speech signal. Prediction of heartbeat is done by linear regression using support vector machine. The statistical significance of the heartbeat prediction results are improved by 10-fold speaker-independent cross validation scheme. Experimental results show a minimum average estimation error of ± 13.
Similar content being viewed by others
References
Barros, A. K., & Ohnishi, N. (2001). Heart instantaneous frequency (HIF): An alternative approach to extract heart rate variability. IEEE Transactions on Biomedical Engineering, 48(8), 850–855.
Bernardi, L., Wdowczyk-Szulc, J., Valenti, C., Castoldi, S., Passino, C., Spadacini, G., G., & Sleight, P. (2000). Effects of controlled breathing, mental activity and mental stress with or without verbalization on heart rate variability. Journal of the American College of Cardiology, 35(6), 1462–1469.
Hayre, H. S., & Holland, J. C. (1980). Cross-correlation of voice and heart rate as stress measures. Applied Acoustics, 13(1), 57–62.
James, A. P. (2015). Heart rate monitoring using human speech spectral features. Human-Centric Computing and Information Sciences, 5, 33. https://doi.org/10.1186/s13673-015-0052-z.
Johnson, H. J., & Campos, J. J. (1967). The effect of cognitive tasks and verbalization instructions on heart rate and skin conductance. Psychophysiology, 4(2), 143–150.
Kathol, A., & Shriberg, E. (2015). The SRI biofrustration corpus: audio, video, and physiological signals for continuous user modelling. In Proceedings of AAAI Spring Symposium Series 2015, (pp. 96–99) Palo Alto, California.
Makhoul, J. (1975). Linear prediction: a tutorial review. Proceedings of the IEEE, 63(4), 561–580.
Mesleh, A., Skopin, D., Baglikov, S., & Quteishat, A. (2012). Heart rate extraction from vowel speech signals. Journal of computer science and technology, 27(6), 1243–1251.
Milton, A. (2015). Automatic recognition of speech emotions using class-specific multiple classifier scheme. Ph.D. Thesis, Anna University, Chennai, India.
Rabiner, L. R., & Schafer, R. W. (2004s). Digital processing of speech signals. Delhi: Pearson Education (Singapore) Pte.Ltd.
Ryskaliyev, A., Askaruly, S., & James, A. (2016). Speech signal analysis for the estimation of heart rates under different emotional states. In Proceedings of IEEE International Conference on Advances in Computing, Communications and Informatics, (pp. 1160–1165) Jaipur, India.
Schnell, I., Potchter, O., Epstein, Y., Yaakov, Y., Hermesh, H., Brenner, S., & Tirosh, E. (2013). The effects of exposure to environmental factors on heart rate variability: An ecological perspective. Environmental Pollution, 183, 7–13.
Schuller, B., Friedmann, F., & Eyben, F. (2013). Automatic recognition of physiological parameters in the human voice: heart rate and skin conductance. In Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing, (pp. 7219–7223) Vancouver, BC, Canada.
Schuller, B., Friedmann, F., & Eyben, F. (2014). The Munich biovoice corpus: effects of physical exercising, heart rate and skin conductance on human speech production. In Proceedings of the Ninth International Conference on Language Resources and Evaluation, (pp. 1506–1510) Reykjavik, Iceland.
Seraganian, P., Szabob, A., & Brown, T. G. (1997). The effect of vocalization on the heart rate response to mental arithmetic. Physiology & Behavior, 62(2), 221–224.
Smith, J., Tsiartas, A., Shriberg, E., Kathol, A., Willoughby, A., & Zambotti, M. D. (2017). Analysis and prediction of heart rate using speech features from natural speech. In IEEE International Conference in Acoustics, Speech and Signal Processing, (pp. 989–993) New Orleans, LA, USA.
Tsiartas, A., Kathol, A., Shriberg, E., Zambotti, M. D., & Willoughby, A. (2015). Prediction of heart rate changes from speech features during interaction with a misbehaving dialog system. In Proceedings of Interspeech 2015, (pp. 3175–3179) Dresden, Germany.
Vapnik, V. N. (1998). Statistical learning theory. New York: Wiley.
Acknowledgements
We sincerely thank the Management, Principal, Students and Staff Members of St. Xavier’s Catholic College of Engineering, Nagercoil, for their valuable participation and support during the TESDHE database recording process.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Milton, A., Monsely, K.A. Tamil and English speech database for heartbeat estimation. Int J Speech Technol 21, 967–973 (2018). https://doi.org/10.1007/s10772-018-9557-y
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10772-018-9557-y