Abstract
In general, high frequency noises included in a normal speech stream are difficult to remove from the speech stream. Because an unvoiced phoneme seems like a high frequency noise, it may be removed during denoising. A low frequency noise (hum noise), on the other hand, may come from a circuitry imbalance, a wrongly designed ground point in PCB, or imbalance among the parts mounted on a board. This experiment results show that the merging algorithm is very robust against external effects. The merging algorithm is proposed to extract valid speech-sounds in terms of position and frequency range. It needs some numerical methods for an adaptive DWT implementation and performs unvoiced/voiced classification and denoising. Since the merging algorithm can decide the processing parameters relating to voices only and is independent of system noises, it is useful for extracting valid speechsounds. The merging algorithm has an adaptive feature for arbitrary system noises and an excellent denoising SNR (signal-to-noise ratio). Its extraction shows that the denoising of compounded noise and the improved extraction and the merging algorithm can not be disturbed by an unexpected system interference.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Goldberg, R., Riek, L.: A Practical Handbook of Speech Coders. CRC Press, Boca Raton, FL (2000)
Goswami, J.C., Chan, A.K.: Fundamentals of Wavelets: Theory, Algorithms and Applications. John Wiley & Sons, New York (1999)
Teolis, A.: Computational Signal Processing with Wavelets. Springer Verlag, New York (1998)
Burrus, C.S., Gopinath, R.A., Guo, H.: Introduction to Wavelets and Wavelet Transforms: A Primer. Prentice Hall, New Jersey (1997)
Marzetta, T.L.: A new interpretation for capon’s maximum likelihood method of frequency-wavenumber spectral estimation. IEEE Trans. Acoustics, Speech, and Signal Processing 31 (1983)
Deller, J.R., Hansen, J.H.L., Proakis, J.G.: Discrete-Time Processing of Speech Signals. IEEE Press, New York (2000)
Donoho, D.L.: Denoising by soft-thresholding. IEEE Trans. Information Theory 41 (1995)
Parsons, T.W.: Voice and Speech Processing. McGraw-Hill, New York (1986)
Jurasfky, D., Martin, J.H., Linden, K.V., Jurafsky, D.: Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics and Speech Recognition. Prentice Hall, New Jersey (2000)
Morgan, N., Gold, B.: Speech and Audio Signal Processing: Processing and Perception of Speech and Music. John Wiley & Sons, New York (1999)
Rabiner, L., Juang, B.H., Juang, B.H.: Fundamentals of Speech Recognition. Prentice Hall, New Jersey (1993)
Huang, X., Acero, A., Hon, H.W., Reddy, R.: Spoken Language Processing. Prentice Hall, New Jersey (2001)
Quatieri, T.F.: Discrete-Time Speech Signal Processing. Prentice Hall, New Jersey (2001)
Mitra, S.K.: Digital Signal Processing: A Compter-Based Approach. 2nd edn. McGraw-Hill, New York (2000)
Ogden, R.T.: Essential Wavelets for Statistical Applications and Data Analysis. Springer Verlag, New York (1996)
Furui, S.: Digital Speech Processing, Synthesis and Recognition. 2nd edn. Marcel Dekker, New York (2001)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kim, J.O., Paek, H.W., Chung, C.H., Yim, W.Y., Lee, S.H. (2003). The Merging Algorithm for an Extraction of Valid Speech-Sounds. In: Kumar, V., Gavrilova, M.L., Tan, C.J.K., L’Ecuyer, P. (eds) Computational Science and Its Applications — ICCSA 2003. ICCSA 2003. Lecture Notes in Computer Science, vol 2668. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44843-8_65
Download citation
DOI: https://doi.org/10.1007/3-540-44843-8_65
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40161-2
Online ISBN: 978-3-540-44843-3
eBook Packages: Springer Book Archive