Abstract
Speech signal detection is very important in many areas of speech signal process technology. In real environments, speech signal is usually corrupted by background noise, which greatly affects the performance of speech signal detection system. Correlation analysis is a waveform analysis method which is commonly used in time domain, and the similarity of two signals can be measured by using of the correlation function. This paper presents a new approach based on waveform track from cross correlation coefficients to detect speech signal in adverse acoustic environments. This approach firstly removes irrelevant signal so as to decrease the interference from noise by making use of computing cross correlation coefficients, and then decides whether contains speech signal or not according to the waveform track. Moreover, the performance of the algorithm is compared to the approach based on short-term energy and the approach based on spectrum-entropy in various noise conditions, and algorithm is quantified by using the probability of correct classification. The experiments show that the waveform from cross correlation coefficients is powerful in anti-interference, especially being robust to colored noise such as babble.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Jia, C., Xu, B.: An Improved Entropy–based Endpoint Detection Algorithm. In: International Conference on Spoken Language Processing (ICSLP 2002), Taipei, pp. 285–288 (2002)
Rabiner, L., Juang, B.H.: Fundamentals of Speech Recognition. Prentice Hall PTR, New Jersey (1993)
Bullington, K., Fraser, I.M.: Engineering aspects of TASI. Bell System Technical Journal 38, 353–364 (1959)
Zhu, S.Q., Qiu, X.H.: Research on Endpoint Detection of Speech Signals. Computer Simulation 22, 214–216 (2005)
Chen, L., Zhang, X.W.: New Methods of Speech Segmentation and Enhancement Based on Fractal Dimension. Signal Processing Proceedings, 281–284 (2000)
Julien, P., Jean-Luc, R., Régine, A.O.: Robust speech / music classification in audio documents. In: Dans: International Conference on Spoken Language Processing (ICSLP 2002). Denver vol. 3, pp. 2005–2008 (2002)
Varga, A.P., Steeneken, H.J.M., Tomlinson, M., Jones, D.: The NOISEX-92 Study on the Effect of Additive Noise on Automatic Speech Recognition. DRA Speech Research Unit Technical Report (1992)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zhang, Rb., Wu, T., Li, Xy., Xu, D. (2006). A Speech Stream Detection in Adverse Acoustic Environments Based on Cross Correlation Technique. In: Jiao, L., Wang, L., Gao, X., Liu, J., Wu, F. (eds) Advances in Natural Computation. ICNC 2006. Lecture Notes in Computer Science, vol 4222. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11881223_82
Download citation
DOI: https://doi.org/10.1007/11881223_82
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-45907-1
Online ISBN: 978-3-540-45909-5
eBook Packages: Computer ScienceComputer Science (R0)