The Merging Algorithm for an Extraction of Valid Speech-Sounds

Kim, Jin Ok; Paek, Han Wook; Chung, Chin Hyun; Yim, Wha Young; Lee, Sang Hyo

doi:10.1007/3-540-44843-8_65

The Merging Algorithm for an Extraction of Valid Speech-Sounds

Jin Ok Kim¹⁰,
Han Wook Paek¹¹,
Chin Hyun Chung¹¹,
Wha Young Yim¹¹ &
…
Sang Hyo Lee¹¹

Conference paper
First Online: 01 January 2003

664 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2668))

Abstract

In general, high frequency noises included in a normal speech stream are difficult to remove from the speech stream. Because an unvoiced phoneme seems like a high frequency noise, it may be removed during denoising. A low frequency noise (hum noise), on the other hand, may come from a circuitry imbalance, a wrongly designed ground point in PCB, or imbalance among the parts mounted on a board. This experiment results show that the merging algorithm is very robust against external effects. The merging algorithm is proposed to extract valid speech-sounds in terms of position and frequency range. It needs some numerical methods for an adaptive DWT implementation and performs unvoiced/voiced classification and denoising. Since the merging algorithm can decide the processing parameters relating to voices only and is independent of system noises, it is useful for extracting valid speechsounds. The merging algorithm has an adaptive feature for arbitrary system noises and an excellent denoising SNR (signal-to-noise ratio). Its extraction shows that the denoising of compounded noise and the improved extraction and the merging algorithm can not be disturbed by an unexpected system interference.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Goldberg, R., Riek, L.: A Practical Handbook of Speech Coders. CRC Press, Boca Raton, FL (2000)
MATH Google Scholar
Goswami, J.C., Chan, A.K.: Fundamentals of Wavelets: Theory, Algorithms and Applications. John Wiley & Sons, New York (1999)
Google Scholar
Teolis, A.: Computational Signal Processing with Wavelets. Springer Verlag, New York (1998)
MATH Google Scholar
Burrus, C.S., Gopinath, R.A., Guo, H.: Introduction to Wavelets and Wavelet Transforms: A Primer. Prentice Hall, New Jersey (1997)
Google Scholar
Marzetta, T.L.: A new interpretation for capon’s maximum likelihood method of frequency-wavenumber spectral estimation. IEEE Trans. Acoustics, Speech, and Signal Processing 31 (1983)
Google Scholar
Deller, J.R., Hansen, J.H.L., Proakis, J.G.: Discrete-Time Processing of Speech Signals. IEEE Press, New York (2000)
Google Scholar
Donoho, D.L.: Denoising by soft-thresholding. IEEE Trans. Information Theory 41 (1995)
Google Scholar
Parsons, T.W.: Voice and Speech Processing. McGraw-Hill, New York (1986)
Google Scholar
Jurasfky, D., Martin, J.H., Linden, K.V., Jurafsky, D.: Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics and Speech Recognition. Prentice Hall, New Jersey (2000)
Google Scholar
Morgan, N., Gold, B.: Speech and Audio Signal Processing: Processing and Perception of Speech and Music. John Wiley & Sons, New York (1999)
Google Scholar
Rabiner, L., Juang, B.H., Juang, B.H.: Fundamentals of Speech Recognition. Prentice Hall, New Jersey (1993)
Google Scholar
Huang, X., Acero, A., Hon, H.W., Reddy, R.: Spoken Language Processing. Prentice Hall, New Jersey (2001)
Google Scholar
Quatieri, T.F.: Discrete-Time Speech Signal Processing. Prentice Hall, New Jersey (2001)
Google Scholar
Mitra, S.K.: Digital Signal Processing: A Compter-Based Approach. 2nd edn. McGraw-Hill, New York (2000)
Google Scholar
Ogden, R.T.: Essential Wavelets for Statistical Applications and Data Analysis. Springer Verlag, New York (1996)
Google Scholar
Furui, S.: Digital Speech Processing, Synthesis and Recognition. 2nd edn. Marcel Dekker, New York (2001)
Google Scholar

Download references

Author information

Authors and Affiliations

School of Information and Communication Engineering, Sungkyunkwan University, 300, Chunchun-dong, Jangan-gu, Suwon, Kyunggi-do, 440-746, KOREA
Jin Ok Kim
Department of Information and Control Engineering, Kwangwoon University, 447-1, Wolgye-dong, Nowon-gu, Seoul, 139-701, KOREA
Han Wook Paek, Chin Hyun Chung, Wha Young Yim & Sang Hyo Lee

Authors

Jin Ok Kim
View author publications
You can also search for this author in PubMed Google Scholar
Han Wook Paek
View author publications
You can also search for this author in PubMed Google Scholar
Chin Hyun Chung
View author publications
You can also search for this author in PubMed Google Scholar
Wha Young Yim
View author publications
You can also search for this author in PubMed Google Scholar
Sang Hyo Lee
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Army High Performance Computing Research Center, USA
Vipin Kumar
Department of Computer Science and Engineering, University of Minessota, MN, 55455, USA
Vipin Kumar
Department of Computer Science, University of Calgary, Calgary, AB, T2N1N4, Canada
Marina L. Gavrilova
Heuchera Technologies Inc., 122 9251-8 Yonge Street, Richmond Hill, ON, Canada, L4C 9T3
Chih Jeng Kenneth Tan
School of Computer Science, The Queen’s University of Belfast, Belfast, BT7 1NN, Northern Ireland, UK
Chih Jeng Kenneth Tan
Département d’informatique et de recherche opérationelle, Université de Montréal, Montréal, Québec, H3C 3J7, Canada
Pierre L’Ecuyer

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kim, J.O., Paek, H.W., Chung, C.H., Yim, W.Y., Lee, S.H. (2003). The Merging Algorithm for an Extraction of Valid Speech-Sounds. In: Kumar, V., Gavrilova, M.L., Tan, C.J.K., L’Ecuyer, P. (eds) Computational Science and Its Applications — ICCSA 2003. ICCSA 2003. Lecture Notes in Computer Science, vol 2668. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44843-8_65

Download citation

DOI: https://doi.org/10.1007/3-540-44843-8_65
Published: 18 June 2003
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40161-2
Online ISBN: 978-3-540-44843-3
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics