Skip to main content
Log in

Overall performance evaluation of adaptive multi rate 06.90 speech codec based on code excited linear prediction algorithm using MATLAB

  • Published:
International Journal of Speech Technology Aims and scope Submit manuscript

Abstract

Today, the primary constrain in wireless communication system is limited bandwidth and power. Wireless systems involved in transmission of speech envisage that efficient and effective methods need to be developed for maintaining quality-of-speech, especially at the receiving end, with maximum saving of bandwidth and power. Amongst all elements of the communication system (transmitter, channel and receiver), transmission channel (carrier of information/data, also called the medium) is the most critical and plays a key role in the transmission and reception of information/data. Channel conditions decide the quality of speech at receiver. Modeling a channel is a complex task. Many techniques are adopted to mitigate the effect of the channel. AMR (Adaptive Multi Rate) is one such technique that counteracts the deleterious effect of the channel on speech. This technique employs variable bit rate that dynamically switches to specific modes of operation (switching bit rates—called modes of operation) depending upon the channel conditions.

In this paper, the application of Code Excited Linear Prediction (CELP) source coder on speech followed by AMR codec is investigated and studied. An e-test bench using MATLAB is created to implement the CELP based AMR Codec scheme, and the same studied and investigated through a series of simulation. Here, both subjective and objective evaluations are carried out. Objective evaluations are categorized into waveform based, spectral based and perceptual based analysis. The results of the simulations are recorded and compared in various graphs and tables, which include calculation of various parameters like Absolute Error (ABS), Mean Square Error (MSE), Root Mean Square Error (RMSE), Signal to Noise Ratio (SNR), segmental SNR (segSNR) (Y. Hu and P. Loizou in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., vol. 1, pp. 153–156, 2006a; Proc. Interspeech, pp. 1447–1450, 2006b), Weighted-Slope Spectral distance (WSS) (Y. Hu and P. Loizou in Speech Commun. 49, 588–601, 2007), Perceptual Evaluation of Speech Quality (PESQ) (ITU-T rec. P.862, 2000), Log-Likelihood Ratio (LLR), Itakura-Saito Distance measure (ISD), Cepstrum Distance Measures (CEP) (V. Turbin and N. Faucheur in Proc. Online Workshop Meas. Speech Audio Quality Netw., pp. 81–84, 2005), Frequency Weighted Segmental SNR (fwSNRseg), Predicted rating of overall Quality (Covl), Rating of Speech Distortion (Csig), Rating of Background Distortion (Cbak) (ITU-T rec. P.835, 2003) and MeanOpinion Score (MOS). Simulation results clearly advocate that, it is possible to producevariable bitrates (tuning to channel conditions) in CELP coder by affecting coefficients of the coder while still maintaining a good quality of speech. Further, higher the bit-rate used, the better is the quality of speech (which can be verified from the results obtained with PESQ and MOS analysis) and at the same time offered simulation delay time also increases.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

References

  • Bhatt, N., et al. (2010). Implementation & performance analysis of CELP based GSM AMR Codec using MATLAB. In IEEE conference, CNC 2010, Calicut, India.

    Google Scholar 

  • Falk, T. H., & Chan, W. (2006). Single-ended speech quality measurement using machine learning methods. IEEE Transactions on Audio, Speech, and Language Processing, 14(6), 1935–1947.

    Article  Google Scholar 

  • Grundlehner, B., Lecocq, J., Balan, R., & Rosca, J. (2005). Performance assessment method for speech enhancement systems. In Proc. 1st annu. IEEE BENELUX/DSP valley signal process. symp.

    Google Scholar 

  • Hu, Y., & Loizou, P. (2006a). Subjective comparison of speech enhancement algorithms. In Proc. IEEE int. conf. acoust., speech, signal process. (Vol. 1, pp. 153–156).

    Google Scholar 

  • Hu, Y., & Loizou, P. (2006b). Evaluation of objective measures for speech enhancement. In Proc. inter speech (pp. 1447–1450).

    Google Scholar 

  • Hu, Y., & Loizou, P. (2007). Subjective comparison and evaluation of speech enhancement algorithms. Speech Communication, 49, 588–601.

    Article  Google Scholar 

  • Hu, Y., & Loizou, P. C. (2008). Evaluation of objective quality measures for speech enhancement. IEEE Transactions on Audio, Speech, and Language Processing, 16(1).

  • ITU-T (2000). Perceptual evaluation of speech quality (PESQ), and objective method for end-to-end speech quality assessment of narrowband telephone networks and speech codecs (ITU-T Rec. P.862, 2001).

  • ITU-T P.835 (2003). Subjective test methodology for evaluating speech communication systems that include noise suppression algorithm (ITU-T Rec. P. 835).

  • Jarvinen, K. (2000). Standardisation of the adaptive multirate codec. In IEEE conference.

    Google Scholar 

  • Salmela, J., & Mattila, V. (2004). New intrusive method for the objective quality evaluation of acoustic noise suppression in mobile communications. In Proc. 116th audio eng. soc. conv., Preprint 6145.

    Google Scholar 

  • The NOIZEUS database (2009). Available: http://www.utdallas.edu/~loizou/speech/noize.

  • Turbin, V., & Faucheur, N. (2005). A perceptual objective measure for noise reduction systems. In Proc. online workshop meas. speech audio quality netw (pp. 81–84).

    Google Scholar 

  • Xiao, J., Li, X., & Wan, L. (1998). Software simulation in GSM environment of federal standard 1016 CELP vocoder. In International conference on communication technology, Oct. 22–24, Beijing, China.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ninad Bhatt.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Bhatt, N., Kosta, Y. Overall performance evaluation of adaptive multi rate 06.90 speech codec based on code excited linear prediction algorithm using MATLAB. Int J Speech Technol 15, 119–129 (2012). https://doi.org/10.1007/s10772-011-9126-0

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10772-011-9126-0

Keywords

Navigation