A Speech Stream Detection in Adverse Acoustic Environments Based on Cross Correlation Technique

  • Conference paper
Advances in Natural Computation (ICNC 2006)

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 4222)

Abstract

Speech signal detection is important in many areas of speech signal processing. In real environments, the speech signal is usually corrupted by background noise, which greatly degrades the performance of speech detection systems. Correlation analysis is a waveform analysis method commonly used in the time domain, in which the similarity of two signals is measured by their correlation function. This paper presents a new approach that detects speech in adverse acoustic environments from the waveform track of the cross-correlation coefficients. The approach first computes cross-correlation coefficients to remove irrelevant signal components and thereby reduce interference from noise, and then decides whether speech is present according to the waveform track. The algorithm is compared with approaches based on short-term energy and on spectral entropy under various noise conditions, and its performance is quantified by the probability of correct classification. The experiments show that the waveform of the cross-correlation coefficients is strongly resistant to interference and is especially robust to colored noise such as babble.
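The abstract does not spell out how the cross-correlation coefficients are computed, so the following is only a minimal sketch of the general idea: measuring the similarity of two signals with a normalized cross-correlation function. It assumes that adjacent frames of the same signal are compared; the function names, the frame length, and the test signals are all hypothetical and are not taken from the paper.

```python
import numpy as np


def normalized_cross_correlation(x, y):
    """Normalized cross-correlation of two frames over all lags.

    Values lie in [-1, 1]. This is a generic textbook definition,
    not necessarily the one used in the paper.
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    x = x - x.mean()
    y = y - y.mean()
    denom = np.sqrt(np.sum(x ** 2) * np.sum(y ** 2))
    if denom == 0.0:
        # A constant (e.g. silent) frame has no defined correlation.
        return np.zeros(len(x) + len(y) - 1)
    return np.correlate(x, y, mode="full") / denom


def adjacent_frame_peaks(signal, frame_len):
    """Peak absolute correlation between each pair of adjacent frames.

    Comparing adjacent frames is an assumption made for illustration.
    """
    signal = np.asarray(signal, dtype=float)
    n_frames = len(signal) // frame_len
    frames = signal[: n_frames * frame_len].reshape(n_frames, frame_len)
    return np.array([
        np.max(np.abs(normalized_cross_correlation(frames[i], frames[i + 1])))
        for i in range(n_frames - 1)
    ])


# Hypothetical test signals: a periodic tone standing in for voiced
# speech, and white noise standing in for an uncorrelated background.
fs = 8000                      # assumed sampling rate in Hz
frame_len = 240                # assumed 30 ms frames
n = np.arange(10 * frame_len)
tone = np.sin(2 * np.pi * 200 * n / fs)
noise = np.random.default_rng(0).standard_normal(len(n))

tone_peaks = adjacent_frame_peaks(tone, frame_len)
noise_peaks = adjacent_frame_peaks(noise, frame_len)
```

In this sketch the periodic signal yields peak correlations close to 1 between adjacent frames, while white noise yields much lower peaks, which illustrates why a correlation-based measure can separate structured signals from uncorrelated noise. How the paper turns such coefficients into a waveform track and a speech/non-speech decision is not described in the abstract and is not reproduced here.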

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Zhang, Rb., Wu, T., Li, Xy., Xu, D. (2006). A Speech Stream Detection in Adverse Acoustic Environments Based on Cross Correlation Technique. In: Jiao, L., Wang, L., Gao, X., Liu, J., Wu, F. (eds) Advances in Natural Computation. ICNC 2006. Lecture Notes in Computer Science, vol 4222. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11881223_82

  • DOI: https://doi.org/10.1007/11881223_82

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-45907-1

  • Online ISBN: 978-3-540-45909-5

  • eBook Packages: Computer Science, Computer Science (R0)
