A Subspace Projection Approach for Analysis of Speech Under Stressed Condition

Shukla, Sumitra; Dandapat, S.; Prasanna, S. R. Mahadeva

doi:10.1007/s00034-016-0284-9

Published: 07 March 2016

Volume 35, pages 4486–4500 (2016)

Circuits, Systems, and Signal Processing

Abstract

In this paper, a novel subspace projection approach is proposed for the analysis of speech signals under stressed conditions. The method assumes that the speech subspace and the stress subspace are orthogonal, with the speech subspace containing speech-specific information and the stress subspace containing stress-specific information. Projecting stressed speech vectors onto the speech subspace therefore separates the speech-specific information. In this work, the speech subspace consists of neutral speech vectors. Speech and stress recognition techniques are used to verify the orthogonal relation between the speech and stress subspaces. The evaluation database consists of a 119-word vocabulary under neutral, angry, sad and Lombard conditions. Hidden Markov models with mel-frequency cepstral coefficient features are used for speech and stress recognition to evaluate the estimated speech and stress information.
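The projection step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation, which is not given in this preview; it only shows a generic orthogonal projection onto the span of a set of neutral vectors, with the residual taken as the component orthogonal to that span. The function names, the 13-dimensional feature size, the use of an SVD to build the basis, and the random toy data are all assumptions made for illustration.

```python
import numpy as np

def speech_subspace_basis(neutral_vectors, tol=1e-10):
    """Return an orthonormal basis for the span of the neutral vectors.

    neutral_vectors: array of shape (dim, n), one neutral vector per column.
    The SVD-based construction is an assumption, not the paper's method.
    """
    U, s, _ = np.linalg.svd(neutral_vectors, full_matrices=False)
    rank = int(np.sum(s > tol * s[0]))  # drop numerically null directions
    return U[:, :rank]

def split_components(x, basis):
    """Split x into its projection onto the subspace and the residual."""
    speech_part = basis @ (basis.T @ x)  # orthogonal projection onto span(basis)
    stress_part = x - speech_part        # component orthogonal to the subspace
    return speech_part, stress_part

# Toy data (hypothetical): 4 neutral vectors and 1 stressed vector, 13-dim each.
rng = np.random.default_rng(0)
neutral = rng.standard_normal((13, 4))
stressed = rng.standard_normal(13)

B = speech_subspace_basis(neutral)
speech_part, stress_part = split_components(stressed, B)

print("speech-part norm:", np.linalg.norm(speech_part))
print("stress-part norm:", np.linalg.norm(stress_part))
print("inner product   :", float(speech_part @ stress_part))
```

By construction the two components sum to the original vector and are mutually orthogonal, which is the property the abstract's orthogonality assumption relies on. Whether the residual actually carries stress information is the empirical question the paper addresses with its recognition experiments.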

Author information

Authors and Affiliations

Department of Electronics and Electrical Engineering, Indian Institute of Technology Guwahati, Guwahati, 781039, India
Sumitra Shukla, S. Dandapat & S. R. Mahadeva Prasanna

Corresponding author

Correspondence to Sumitra Shukla.

About this article

Cite this article

Shukla, S., Dandapat, S. & Prasanna, S.R.M. A Subspace Projection Approach for Analysis of Speech Under Stressed Condition. Circuits Syst Signal Process 35, 4486–4500 (2016). https://doi.org/10.1007/s00034-016-0284-9

Received: 07 August 2015
Revised: 16 February 2016
Accepted: 17 February 2016
Published: 07 March 2016
Issue Date: December 2016
DOI: https://doi.org/10.1007/s00034-016-0284-9
