The effect of different acoustic noise on speech signal formant frequency location

Sadeghi, Mohsen; Marvi, Hossein; Ali, Maaruf

doi:10.1007/s10772-018-9540-7

Published: 06 August 2018

Volume 21, pages 741–752, (2018)

International Journal of Speech Technology

Abstract

The presence of noise is one of the major challenges in speech recognition systems. Different kinds of noise (pink, white and leopard, among others) can degrade a speech signal in different ways and to different degrees. This study measures how resistant a speech signal’s formants are to different conventional noises, that is, how far the formants are displaced when noise is applied. The methodology was to add each noise to the original voice signal and then measure the resulting displacement of the formant locations. The paper introduces the mean square movement (MSM) parameter, which represents the amount by which the formant frequencies deviate when the various noises are applied. All investigations were conducted under three signal-to-noise ratio (SNR) conditions (5, 10 and 15 dB), which allowed the influence of the SNR on the MSM parameter and on the extent of formant displacement to be assessed. The results indicate that, at these three SNR levels, the formant frequencies were resistant to the machine gun type of noise, whilst white noise caused the most measurable displacement of the formant frequencies.
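The procedure described in the abstract (mix noise into speech at a target SNR, then quantify how far the formants move) can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not give the exact MSM formula, so `mean_square_movement` below assumes a plain mean of squared formant-frequency differences, and the function names, the synthetic signals and the example formant values are all hypothetical. Formant estimation itself is not shown.

```python
import math
import random


def power(signal):
    """Average power (mean of squared samples)."""
    return sum(x * x for x in signal) / len(signal)


def snr_db(speech, noise):
    """Signal-to-noise ratio in dB between a speech and a noise sequence."""
    return 10 * math.log10(power(speech) / power(noise))


def scale_noise_to_snr(speech, noise, target_snr_db):
    """Scale the noise so that speech/noise power matches target_snr_db."""
    target_noise_power = power(speech) / (10 ** (target_snr_db / 10))
    gain = math.sqrt(target_noise_power / power(noise))
    return [gain * n for n in noise]


def mean_square_movement(clean_formants_hz, noisy_formants_hz):
    """ASSUMED definition: mean squared displacement (Hz^2) between
    formant frequencies estimated from the clean and the noisy signal."""
    squared = [(c - n) ** 2 for c, n in zip(clean_formants_hz, noisy_formants_hz)]
    return sum(squared) / len(squared)


# Synthetic stand-ins for a speech recording and a white-noise recording.
random.seed(0)
speech = [math.sin(2 * math.pi * 200 * t / 8000) for t in range(8000)]
white = [random.gauss(0.0, 1.0) for _ in range(8000)]

# The three SNR conditions named in the abstract.
for target in (5, 10, 15):
    scaled = scale_noise_to_snr(speech, white, target)
    noisy = [s + n for s, n in zip(speech, scaled)]
    print(target, "dB requested, achieved:", round(snr_db(speech, scaled), 6))

# Hypothetical formant estimates (F1, F2, F3) before and after adding noise.
print(mean_square_movement([500.0, 1500.0, 2500.0], [510.0, 1480.0, 2500.0]))
```

In a real experiment, the formant values passed to `mean_square_movement` would come from a formant estimator applied to the clean and the noisy signals at each SNR level.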

Author information

Authors and Affiliations

Faculty of Electrical and Computer Engineering, Shahrood University of Technology, Shahrood, Iran
Mohsen Sadeghi & Hossein Marvi
International Association of Educators and Researchers (IAER), Kemp House, 160 City Road, London, EC1V 2NX, UK
Maaruf Ali

Corresponding author

Correspondence to Mohsen Sadeghi.

About this article

Cite this article

Sadeghi, M., Marvi, H. & Ali, M. The effect of different acoustic noise on speech signal formant frequency location. Int J Speech Technol 21, 741–752 (2018). https://doi.org/10.1007/s10772-018-9540-7

Received: 08 July 2017
Accepted: 23 July 2018
Issue Date: September 2018
