Significance of Duration Modification in Reducing Listening Effort of Slurred Speech from Patients with Traumatic Brain Injury

Banerjee, Oindrila; Govind, D.; Gangashetty, Suryakanth V.; Dubey, Akhilesh Kumar; Aravindakshan, Rajeev; Panicker, Sasikumar; Reshma, K.

doi:10.1007/978-3-031-48309-7_47

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 14338)

Included in the following conference series:

International Conference on Speech and Computer

Abstract

The objective of this work is to assess the significance of duration modification for improving the speech intelligibility of patients with slurred speech caused by traumatic brain injury (TBI). A slow speaking rate was observed in the speech utterances of a patient with diffuse axonal injury, a type of TBI. To compensate for the slow speaking rate, the utterances are subjected to duration modification at various scaling factors. Subjective listening tests are then conducted with a group of medical and non-medical listeners to assess the effort required to understand the spoken utterances. The improved mean opinion scores (MOS) confirm that duration modification does reduce the listening effort required to perceive the slurred speech utterances. In the listening tests, a speaker-dependent duration modification factor of 0.75 provided the best enhancement of the slurred speech, with improved intelligibility.
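To make the idea of a duration scaling factor concrete, the following is a minimal sketch of time-scale modification by plain windowed overlap-add. It is not the method used in the paper, which the abstract does not specify; the function names `hann` and `modify_duration`, the 400-sample frame length, and the 16 kHz test tone are all assumptions chosen for illustration. Plain overlap-add ignores pitch periods and can introduce audible artifacts on real speech, so it only shows how a factor such as 0.75 shortens a signal to roughly 75% of its original duration.

```python
import math


def hann(n):
    """Periodic Hann window of length n."""
    return [0.5 - 0.5 * math.cos(2.0 * math.pi * i / n) for i in range(n)]


def modify_duration(samples, factor, frame_len=400):
    """Scale the duration of `samples` by `factor` using plain overlap-add.

    factor < 1 shortens the signal (faster speech); factor > 1 lengthens it.
    Hypothetical helper for illustration; not the paper's algorithm.
    """
    if factor <= 0:
        raise ValueError("factor must be positive")
    if len(samples) < frame_len:
        raise ValueError("signal is shorter than one frame")

    synth_hop = frame_len // 2            # frame spacing in the output
    analysis_hop = synth_hop / factor     # frame spacing in the input
    window = hann(frame_len)

    n_frames = int((len(samples) - frame_len) / analysis_hop) + 1
    out_len = (n_frames - 1) * synth_hop + frame_len
    out = [0.0] * out_len
    norm = [0.0] * out_len                # running sum of window weights

    for m in range(n_frames):
        a = int(round(m * analysis_hop))  # where the frame is read
        s = m * synth_hop                 # where the frame is written
        for i in range(frame_len):
            out[s + i] += samples[a + i] * window[i]
            norm[s + i] += window[i]

    # Divide by the accumulated window weight to keep the amplitude level.
    return [o / w if w > 1e-8 else 0.0 for o, w in zip(out, norm)]


if __name__ == "__main__":
    rate = 16000  # assumed sampling rate in Hz
    tone = [math.sin(2 * math.pi * 200 * n / rate) for n in range(rate)]
    faster = modify_duration(tone, 0.75)
    print(len(tone), len(faster))
```

With a factor of 1.0 the input and output frame spacings coincide and the signal is returned essentially unchanged, which is a convenient sanity check before trying other factors.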

Acknowledgements

The authors would like to convey their sincere gratitude to all the people who participated in the listening test. The paper would not have been possible without the time spent by the doctors of All India Institute of Medical Sciences (AIIMS) Mangalagiri, who have prior experience interacting with stroke and TBI patients. Further, the authors thank the hospital management of Kumar Centre for Stroke and Neuro Rehabilitation for helping to collect the data and for providing the ethical clearance to use the data for academic research.

The funding for this paper is from the National Language Translation Mission (NLTM) sub consortium of the project titled “Speech Technologies in Indian Languages”, MEITY, Govt. of India.

Author information

Authors and Affiliations

Department of Computer Science and Engineering, Koneru Lakshmaiah Education Foundation, Greenfields, Vaddeswaram, 500302, Andhra Pradesh, India
Oindrila Banerjee, D. Govind, Suryakanth V. Gangashetty & Akhilesh Kumar Dubey
All India Institute of Medical Sciences (AIIMS) Mangalagiri, Vaddeswaram, 500302, Andhra Pradesh, India
Rajeev Aravindakshan
Kumar Centre for Stroke and Neuro Rehabilitation, Vaduthala, Kochi, 682023, India
Sasikumar Panicker & K. Reshma

Corresponding author

Correspondence to D. Govind.

Editor information

Editors and Affiliations

St. Petersburg Federal Research Center of the Russian Academy of Sciences, St. Petersburg, Russia
Alexey Karpov
Koneru Lakshmaiah Education Foundation, Vaddeswaram, India
K. Samudravijaya
Indian Institute of Information Technology Dharwad, Dharwad, India
K. T. Deepak
Indian Institute of Technology Dharwad, Dharwad, India
Rajesh M. Hegde
KIIT Group of Colleges, Gurugram, India
Shyam S. Agrawal
Indian Institute of Technology Dharwad, Dharwad, India
S. R. Mahadeva Prasanna

Cite this paper

Banerjee, O. et al. (2023). Significance of Duration Modification in Reducing Listening Effort of Slurred Speech from Patients with Traumatic Brain Injury. In: Karpov, A., Samudravijaya, K., Deepak, K.T., Hegde, R.M., Agrawal, S.S., Prasanna, S.R.M. (eds) Speech and Computer. SPECOM 2023. Lecture Notes in Computer Science, vol 14338. Springer, Cham. https://doi.org/10.1007/978-3-031-48309-7_47

DOI: https://doi.org/10.1007/978-3-031-48309-7_47
Published: 22 November 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-48308-0
Online ISBN: 978-3-031-48309-7
eBook Packages: Computer Science, Computer Science (R0)
