Abstract
The concepts and experiments presented are focused on modifications of an existing parametric speech coding algorithm (CELP) introduced in order to improve subjective speech quality in telephone connections. The perceptual coding to bit rate limiting was added and algorithms qualifying speech components to the categories of ”voiced”, ”unvoiced”, ”transients” using rough sets were studied. The speech signal quality achieved with the proposed hybrid codec was compared to the quality offered by some standard speech codecs.
Keywords
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Pawlak, Z.: A Treatise on Rough Sets. In: Peters, J.F., Skowron, A. (eds.) Transactions on Rough Sets IV, pp. 1–17. Springer, Berlin (2005)
Kulesza, M., Szwoch, G., Czyzewski, A.: Improving signal quality in speech codec using a hybrid perceptual-parametric algorithm. In: Multimedia and Network Information Systems’ (MISSI), Wroclaw, Poland, 21-22 Sept., 2006, pp. 181–192 (2006)
Ritz, C.H.: Lossless wideband speech coding. In: 10th International Conference on Speech Science and Technology, Sydney, Australia (Dec. 2004)
Czyzewski, A.: Applications of Neural Networks and Perceptual Masking to Audio Restoration. Journal of New Music Research 22(5), 339–349 (2001)
Verma, T.S., Levine, S.N., Meng, T.H.: Transient Modeling Synthesis: a flexible analysis/synthesis tool for transient signals. In: International Computer Music Conference, Greece (1997)
Chu, W.C.: Speech Coding Algorithms. Foundation and Evolution of Standardized Coders. John Wiley & Sons, Hoboken (2003)
Goldberg, R., Riek, L.: A Practical Handbook of Speech Coders. CRC Press, Boca Raton (2000)
Kliewer, J., Mertins, A.: Audio subband coding with improved representation of transient signal segments. In: Proc 9th European Signal Processing Conference (EUSICPO-98), Rhodes, Greece, September 1998, pp. 1245–1248 (1998)
Babu, V.S., et al.: Transient Detection for Transform Domain Coders. In: AES 116th Convention, Berlin (2004)
ISO/IEC 14496-3:2001 Information technology - Generic coding of moving pictures and associated audio information: Part 3: Advanced Audio Coding (AAC) (2001)
OGG Vorbis Specification: http://xiph.org/vorbis/
Painter, T., Spanias, A.: Perceptual Coding of Digital Audio. Proceedings of IEEE 88, 451–513 (2000)
Opticom, Opera your digital ear. User manual, version 3.5 (2002)
Czyzewski, A., et al.: Intelligent Algorithms for Movie Sound Tracks Restoration. In: Peters, J.F., Skowron, A. (eds.) Transactions on Rough Sets V (2006)
ITU-T Recommendation P.862, Perceptual evaluation of speech quality (PESQ): An objective method for end-to-end speech quality assessment of narrow-band telephone networks and speech codecs (2003)
Kulesza, M., Szwoch, G., Czyzewski, A.: High quality speech coding using combined parametric and perceptual modules. In: 13th World Enformatika Conference Proc., Budapest, Hungary, 26–28 May, 2006, pp. 244–249 (2006)
Czyzewski, A., Królikowski, R.: Neuro-Rough Control of Masking Thresholds for Audio Signal Enhancement. Journal of Neurocomputing 36, 5–27 (2001)
Annadana, R., Ferreira, A., Sinha, D.: A new low bit rate speech coding scheme for mixed content. In: 120th AES Convention, Paris, France (May 2006)
Ahmadi, S., Jelinek, M.: n the architecture, operation, and applications of VMRWB: The new cdma2000 wideband speech coding standard. IEEE Communication Magazine 44(5), 74–81 (2006)
Chazan, D., et al.: High quality sinusoidal modeling of wideband speech for the purposes of speech synthesis and modification. In: IEEE International Conference on Acoustic, Speech, and Signal Processing - ICASSP, Toulouse, May 2006, IEEE, Los Alamitos (2006)
Fuemmeler, J., Hardie, R., Gardner, W.: Techniques for the regeneration of wideband speech form narrow band speech. EURASIP Journal on Applied Signal Processing 2001(4), 266–274 (2001)
Levine, S., Smith III., J.: Improvements to the Switched Parametric & Transform Audio Coder. In: Proc. 1999 IEEE Workshop on Application of Signal Processing to Audio and Acoustics, New York, Oct. 1999, IEEE Computer Society Press, Los Alamitos (1999)
Najafzadeh-Azghandi, H., Kabal, P.: Perceptual coding of narrowband audio signals at 8 kbit/s. In: Proc. IEEE Workshop Speech Coding, Pocono Manor, IEEE Computer Society Press, Los Alamitos (1997)
Ojala, P., et al.: The adaptive multirate wideband speech codec: system characteristics, quality advances, and deployment strategies. IEEE Communication Magazine 44(5), 59–65 (2006)
Kulesza, M., et al.: High Quality Speech Codec Employing Sines+Noise+Transients Model. In: 53rd Open Seminar on Acoustics, Zakopane, Poland, 11–15 Sept. (2006)
Yang, M.: Low bit rate speech coding. IEEE Potentials 23(4), 32–36 (2004)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer Berlin Heidelberg
About this chapter
Cite this chapter
Czyzewski, A. (2007). Speech Coding Employing Intelligent Signal Processing Techniques. In: Peters, J.F., Skowron, A., Marek, V.W., Orłowska, E., Słowiński, R., Ziarko, W. (eds) Transactions on Rough Sets VII. Lecture Notes in Computer Science, vol 4400. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-71663-1_1
Download citation
DOI: https://doi.org/10.1007/978-3-540-71663-1_1
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-71662-4
Online ISBN: 978-3-540-71663-1
eBook Packages: Computer ScienceComputer Science (R0)