Bandwidth Extension of a Narrowband Speech Coder for Music Delivery over IP

Lee, Young Han; Kim, Hong Kook; Lee, Mi Suk; Kim, Do Young

doi:10.1007/978-3-540-77368-9_20

Conference paper

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 4413)

Abstract

In this paper, we propose a bandwidth extension (BWE) algorithm for a narrowband speech coder, aimed at music delivery services over IP networks. The proposed BWE algorithm uses an embedded structure in which a baseline coder is followed by an enhancement layer. To minimize the bit-rate increase introduced by the enhancement layer, the algorithm shares spectral envelope and excitation parameters between the baseline coder and the enhancement layer. We choose the iLBC as the baseline coder and use mel-frequency cepstral coefficients (MFCCs) to reconstruct the higher-frequency components at the enhancement layer. As a result, the bit rate of the proposed BWE coder is 15.45 kbit/s, only 0.25 kbit/s higher than that of the iLBC. We compare the quality of the proposed BWE coder with that of the iLBC, and an informal listening test shows that the proposed BWE coder provides significantly better quality than the iLBC for all four music genres tested: pop, classical, jazz, and rock.
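The bit-rate figures in the abstract can be checked with a little arithmetic. The sketch below is illustrative only and rests on two assumptions that the abstract does not state: that the baseline is the iLBC 20 ms mode (304 bits per frame, i.e. 15.2 kbit/s, as specified in RFC 3951), and that the enhancement layer is sent once per baseline frame. The names `bitrate_kbps` and `bits_per_frame` are hypothetical helpers, not part of any codec API.

```python
# Assumption: iLBC 20 ms mode from RFC 3951 (304 bits per 20 ms frame).
ILBC_FRAME_MS = 20
ILBC_BITS_PER_FRAME = 304

# Figure taken from the abstract: extra bit rate of the enhancement layer.
ENHANCEMENT_KBPS = 0.25


def bitrate_kbps(bits, frame_ms):
    """Bits per millisecond is numerically equal to kbit per second."""
    return bits / frame_ms


def bits_per_frame(rate_kbps, frame_ms):
    """Inverse of bitrate_kbps: bits carried by one frame at a given rate."""
    return rate_kbps * frame_ms


baseline_kbps = bitrate_kbps(ILBC_BITS_PER_FRAME, ILBC_FRAME_MS)
total_kbps = baseline_kbps + ENHANCEMENT_KBPS

# Assumption: the enhancement layer uses the same 20 ms framing.
extra_bits = bits_per_frame(ENHANCEMENT_KBPS, ILBC_FRAME_MS)
relative_overhead = ENHANCEMENT_KBPS / baseline_kbps

print(f"baseline: {baseline_kbps:.2f} kbit/s")
print(f"total:    {total_kbps:.2f} kbit/s")
print(f"extra bits per frame: {extra_bits:.0f}")
print(f"relative overhead:    {relative_overhead:.2%}")
```

Under these assumptions the enhancement layer costs about 5 bits per 20 ms frame, a bit-rate increase of under 2 percent. A budget this small is consistent with the abstract's point that the spectral envelope and excitation parameters are shared with the baseline coder rather than transmitted again.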

Author information

Authors and Affiliations

Dept. of Information and Communications, Gwangju Institute of Science and Technology (GIST), Gwangju 500-712, Korea
Young Han Lee & Hong Kook Kim
BcN Service Research Group, BcN Research Division ETRI, 161 Gajeong-dong, Yuseong-gu, Daejeon, 305-350, Korea
Mi Suk Lee & Do Young Kim

Editor information

Marcin S. Szczuka, Daniel Howard, Dominik Ślęzak, Haeng-kon Kim, Tai-hoon Kim, Il-seok Ko, Geuk Lee, Peter M. A. Sloot

Cite this paper

Lee, Y.H., Kim, H.K., Lee, M.S., Kim, D.Y. (2007). Bandwidth Extension of a Narrowband Speech Coder for Music Delivery over IP. In: Szczuka, M.S., et al. Advances in Hybrid Information Technology. ICHIT 2006. Lecture Notes in Computer Science, vol 4413. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-77368-9_20

DOI: https://doi.org/10.1007/978-3-540-77368-9_20
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-77367-2
Online ISBN: 978-3-540-77368-9
eBook Packages: Computer Science, Computer Science (R0)
