A Speech Stream Detection in Adverse Acoustic Environments Based on Cross Correlation Technique

Zhang, Ru-bo; Wu, Tian; Li, Xue-yao; Xu, Dong

doi:10.1007/11881223_82

Ru-bo Zhang²¹,
Tian Wu²¹,
Xue-yao Li²¹ &
…
Dong Xu²¹

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 4222))

Included in the following conference series:

International Conference on Natural Computation

715 Accesses

Abstract

Speech signal detection is very important in many areas of speech signal process technology. In real environments, speech signal is usually corrupted by background noise, which greatly affects the performance of speech signal detection system. Correlation analysis is a waveform analysis method which is commonly used in time domain, and the similarity of two signals can be measured by using of the correlation function. This paper presents a new approach based on waveform track from cross correlation coefficients to detect speech signal in adverse acoustic environments. This approach firstly removes irrelevant signal so as to decrease the interference from noise by making use of computing cross correlation coefficients, and then decides whether contains speech signal or not according to the waveform track. Moreover, the performance of the algorithm is compared to the approach based on short-term energy and the approach based on spectrum-entropy in various noise conditions, and algorithm is quantified by using the probability of correct classification. The experiments show that the waveform from cross correlation coefficients is powerful in anti-interference, especially being robust to colored noise such as babble.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Jia, C., Xu, B.: An Improved Entropy–based Endpoint Detection Algorithm. In: International Conference on Spoken Language Processing (ICSLP 2002), Taipei, pp. 285–288 (2002)
Google Scholar
Rabiner, L., Juang, B.H.: Fundamentals of Speech Recognition. Prentice Hall PTR, New Jersey (1993)
Google Scholar
Bullington, K., Fraser, I.M.: Engineering aspects of TASI. Bell System Technical Journal 38, 353–364 (1959)
Google Scholar
Zhu, S.Q., Qiu, X.H.: Research on Endpoint Detection of Speech Signals. Computer Simulation 22, 214–216 (2005)
Google Scholar
Chen, L., Zhang, X.W.: New Methods of Speech Segmentation and Enhancement Based on Fractal Dimension. Signal Processing Proceedings, 281–284 (2000)
Google Scholar
Julien, P., Jean-Luc, R., Régine, A.O.: Robust speech / music classification in audio documents. In: Dans: International Conference on Spoken Language Processing (ICSLP 2002). Denver vol. 3, pp. 2005–2008 (2002)
Google Scholar
Varga, A.P., Steeneken, H.J.M., Tomlinson, M., Jones, D.: The NOISEX-92 Study on the Effect of Additive Noise on Automatic Speech Recognition. DRA Speech Research Unit Technical Report (1992)
Google Scholar

Download references

Author information

Authors and Affiliations

College of Computer Science and Technology, Harbin Engineering University, Harbin, Heilongjiang, 150001, China
Ru-bo Zhang, Tian Wu, Xue-yao Li & Dong Xu

Authors

Ru-bo Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Tian Wu
View author publications
You can also search for this author in PubMed Google Scholar
Xue-yao Li
View author publications
You can also search for this author in PubMed Google Scholar
Dong Xu
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Life Science Research Center, School of Electronic Engineering, Xidian University, 710071, Xi’an, Shaanxi, China
Licheng Jiao
School of Electrical and Electronic Engineering, Nanyang Technological University, Block S1, Nanyang Avenue, 639798, Singapore
Lipo Wang
School of Electronic Engineering, Xidian Univ., P.O. Box, 710071, Xi’an, P.R. China
Xinbo Gao
College of Mathematics and Information Science, Hebei Normal University, 050016, Shijiazhuang, Hebei, P.R. China
Jing Liu
Multi-Agent Systems Lab,Department of Computer Science, University of Science and Technology of China, 230026, Hefei, China
Feng Wu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Zhang, Rb., Wu, T., Li, Xy., Xu, D. (2006). A Speech Stream Detection in Adverse Acoustic Environments Based on Cross Correlation Technique. In: Jiao, L., Wang, L., Gao, X., Liu, J., Wu, F. (eds) Advances in Natural Computation. ICNC 2006. Lecture Notes in Computer Science, vol 4222. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11881223_82

Download citation

DOI: https://doi.org/10.1007/11881223_82
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-45907-1
Online ISBN: 978-3-540-45909-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics