Skip to main content

The Merging Algorithm for an Extraction of Valid Speech-Sounds

  • Conference paper
  • First Online:
  • 664 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2668))

Abstract

In general, high frequency noises included in a normal speech stream are difficult to remove from the speech stream. Because an unvoiced phoneme seems like a high frequency noise, it may be removed during denoising. A low frequency noise (hum noise), on the other hand, may come from a circuitry imbalance, a wrongly designed ground point in PCB, or imbalance among the parts mounted on a board. This experiment results show that the merging algorithm is very robust against external effects. The merging algorithm is proposed to extract valid speech-sounds in terms of position and frequency range. It needs some numerical methods for an adaptive DWT implementation and performs unvoiced/voiced classification and denoising. Since the merging algorithm can decide the processing parameters relating to voices only and is independent of system noises, it is useful for extracting valid speechsounds. The merging algorithm has an adaptive feature for arbitrary system noises and an excellent denoising SNR (signal-to-noise ratio). Its extraction shows that the denoising of compounded noise and the improved extraction and the merging algorithm can not be disturbed by an unexpected system interference.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Goldberg, R., Riek, L.: A Practical Handbook of Speech Coders. CRC Press, Boca Raton, FL (2000)

    MATH  Google Scholar 

  2. Goswami, J.C., Chan, A.K.: Fundamentals of Wavelets: Theory, Algorithms and Applications. John Wiley & Sons, New York (1999)

    Google Scholar 

  3. Teolis, A.: Computational Signal Processing with Wavelets. Springer Verlag, New York (1998)

    MATH  Google Scholar 

  4. Burrus, C.S., Gopinath, R.A., Guo, H.: Introduction to Wavelets and Wavelet Transforms: A Primer. Prentice Hall, New Jersey (1997)

    Google Scholar 

  5. Marzetta, T.L.: A new interpretation for capon’s maximum likelihood method of frequency-wavenumber spectral estimation. IEEE Trans. Acoustics, Speech, and Signal Processing 31 (1983)

    Google Scholar 

  6. Deller, J.R., Hansen, J.H.L., Proakis, J.G.: Discrete-Time Processing of Speech Signals. IEEE Press, New York (2000)

    Google Scholar 

  7. Donoho, D.L.: Denoising by soft-thresholding. IEEE Trans. Information Theory 41 (1995)

    Google Scholar 

  8. Parsons, T.W.: Voice and Speech Processing. McGraw-Hill, New York (1986)

    Google Scholar 

  9. Jurasfky, D., Martin, J.H., Linden, K.V., Jurafsky, D.: Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics and Speech Recognition. Prentice Hall, New Jersey (2000)

    Google Scholar 

  10. Morgan, N., Gold, B.: Speech and Audio Signal Processing: Processing and Perception of Speech and Music. John Wiley & Sons, New York (1999)

    Google Scholar 

  11. Rabiner, L., Juang, B.H., Juang, B.H.: Fundamentals of Speech Recognition. Prentice Hall, New Jersey (1993)

    Google Scholar 

  12. Huang, X., Acero, A., Hon, H.W., Reddy, R.: Spoken Language Processing. Prentice Hall, New Jersey (2001)

    Google Scholar 

  13. Quatieri, T.F.: Discrete-Time Speech Signal Processing. Prentice Hall, New Jersey (2001)

    Google Scholar 

  14. Mitra, S.K.: Digital Signal Processing: A Compter-Based Approach. 2nd edn. McGraw-Hill, New York (2000)

    Google Scholar 

  15. Ogden, R.T.: Essential Wavelets for Statistical Applications and Data Analysis. Springer Verlag, New York (1996)

    Google Scholar 

  16. Furui, S.: Digital Speech Processing, Synthesis and Recognition. 2nd edn. Marcel Dekker, New York (2001)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2003 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Kim, J.O., Paek, H.W., Chung, C.H., Yim, W.Y., Lee, S.H. (2003). The Merging Algorithm for an Extraction of Valid Speech-Sounds. In: Kumar, V., Gavrilova, M.L., Tan, C.J.K., L’Ecuyer, P. (eds) Computational Science and Its Applications — ICCSA 2003. ICCSA 2003. Lecture Notes in Computer Science, vol 2668. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44843-8_65

Download citation

  • DOI: https://doi.org/10.1007/3-540-44843-8_65

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-40161-2

  • Online ISBN: 978-3-540-44843-3

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics