Skip to main content

Speech Communication

  • Chapter
  • First Online:
Quality of Experience

Part of the book series: T-Labs Series in Telecommunication Services ((TLABS))

Abstract

The goal of any speech service is the transmission and/or processing of speech signals. In this chapter we discuss the Quality of Experience (QoE) of speech communication systems, including networks, speech processing applications and terminals. We then give an overview of the methods employed to quantify and further estimate the QoE of speech communication services with a focus on diagnostic instrumental models. Such models provide indications on either the technical causes of degradations or the quality features impacted by a component in the speech communication system.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    In the literature, the terms “voice service” and “speech service” are mostly used interchangeably. Here, we will refer to “voice” when the characteristics of the human voice are addressed, and to “speech” when both the signal carrier and the referred content are of interest.

References

  1. Chen K, Huang C, Huang P, Lei C (2006) Quantifying skype user satisfaction. In: Proceedings of the conference on applications, technologies, architectures, and protocols for computer communications (SIGCOMM), pp 399–410. Pisa

    Google Scholar 

  2. Côté N (2011) Integral and diagnostic intrusive prediction of speech quality. Springer, Berlin

    Book  Google Scholar 

  3. Côté N, Gautier-Turbin V, Möller S (2007) Influence of loudness level on the overall quality of transmitted speech. In: Proceedings of the 123rd AES convention, 7175, New York

    Google Scholar 

  4. Fastl H, Zwicker E (2007) Psychoacoustics: facts and models, 3rd edn. Springer, Berlin

    Book  Google Scholar 

  5. Fletcher H, Galt RH (1950) The perception of speech and its relation to telephony. J Acoust Soc Am 22(2):89–151

    Article  Google Scholar 

  6. Guéguin M, Le Bouquin-Jeannes R, Gautier-Turbin V, Faucon G, Barriac V (2008) On the evaluation of the conversational speech quality in telecommunications. EURASIP J Adv Signal Process. Article ID 185248

    Google Scholar 

  7. Hardy W (2003) VoIP service quality: measuring and evaluating packet-switched voice. McGraw-Hill, New York

    Google Scholar 

  8. Huo L, Wältermann M, Heute U, Möller S (2008) Estimation of the speech quality dimension “discontinuity”. In: Proceedings of the 8th ITG-Fachbericht-Sprachkommunikation, Aachen

    Google Scholar 

  9. IEEE Standards Publication 297 (1969) Recommended practice for speech quality measurements. Institute of Electrical and Electronics Engineers, New York

    Google Scholar 

  10. ITU-R Recommendation BS.1116-1 (1997) Methods for the subjective assessment of small impairments in audio systems including multichannel sound systems. International Telecommunication Union, Geneva

    Google Scholar 

  11. ITU-R Recommendation BS.1284-1 (2003) General methods for the subjective assessment of sound quality. International Telecommunication Union, Geneva

    Google Scholar 

  12. ITU-T Handbook on Telephonometry (1992) International Telecommunication Union, Geneva

    Google Scholar 

  13. ITU-T Recommendation G.107 (2011) The e-model, a computational model for use in transmission planning. International Telecommunication Union, Geneva

    Google Scholar 

  14. ITU-T Recommendation G.107.1 (2011) Wideband e-model. International Telecommunication Union, Geneva

    Google Scholar 

  15. ITU-T Recommendation G.113 (2007) Transmission impairments due to speech processing. International Telecommunication Union, Geneva

    Google Scholar 

  16. ITU-T Recommendation G.718 (2008) Frame error robust narrow-band and wideband embedded variable bit-rate coding of speech and audio from 8–32 kbit/s. International Telecommunication Union, Geneva

    Google Scholar 

  17. ITU-T Recommendation G.722.1 (2005) Low-complexity coding at 24 and 32 kbit/s for hands-free operation in systems with low frame loss. International Telecommunication Union, Geneva

    Google Scholar 

  18. ITU-T Recommendation G.729.1 (2006) Based embedded variable bit-rate coder: an 8–32 kbit/s scalable wideband coder bitstream interoperable with G.729. International Telecommunication Union, Geneva

    Google Scholar 

  19. ITU-T Recommendation P.501 (2012) Test signals for use in telephonometry. International Telecommunication Union, Geneva

    Google Scholar 

  20. ITU-T Recommendation P.502 (2000) Objective test methods for speech communication systems using complex test signals. International Telecommunication Union, Geneva

    Google Scholar 

  21. ITU-T Recommendation P.564 (2007) Conformance testing for voice over IP transmission quality assessment models. International Telecommunication Union, Geneva

    Google Scholar 

  22. ITU-T Recommendation P.800 (1996) Methods for subjective determination of transmission quality. International Telecommunication Union, Geneva

    Google Scholar 

  23. ITU-T Recommendation P.863 (2011) Perceptual objective listening quality assessment. International Telecommunication Union, Geneva

    Google Scholar 

  24. Jung O (2012) Assessment of conversational speech quality inside vehicles, concerning influences of room acoustics and driving noises. Acta Acustica Acustica 98(3):461–474

    Article  Google Scholar 

  25. Möller S, Berger J, Raake A, Wältermann M, Weiss B (2011) A new dimension-based framewrok model for the quality of speech communication services. In: Third international workshop on quality of multimedia experience (QoMEX), pp 107–112

    Google Scholar 

  26. Möller S, Kettler F, Gierlich HW, Poschen S, Côté N, Raake A, Wältermann M (2012) Extending the e-model for capturing noise reduction and echo canceller impairments. J Audio Eng Soc 60(3):165–175

    Google Scholar 

  27. Möller S, Raake A, Kitawaki N, Takahashi A, Wältermann M (2006) Impairment factor framework for wideband speech codecs. IEEE Trans Audio Speech Lang Process 14(6):1969–1976

    Article  Google Scholar 

  28. Quackenbush S, Barnwell T, Clements M (1988) Objective measures of speech quality. Prentice Hall, Englewood Cliffs

    Google Scholar 

  29. Raake A (2006) Speech quality of VoIP—Assessment and prediction. Wiley, Chichester

    Book  Google Scholar 

  30. Rabiner L (1995) The impact of voice processing on modern telecommunications. Speech Commun 17(3–4):217–226

    Article  Google Scholar 

  31. Richters JS, Dvorak CA (1988) A framework for defining the quality of communications services. IEEE Commun Mag 26(10):17–23

    Article  Google Scholar 

  32. Rix A, Hollier M, Hekstra A, Beerends J (2002) Perceptual evaluation of speech quality (PESQ), the new ITU standard for end-to-end speech quality assessment, part i-time alignment. J Audio Eng Soc 50(10):755

    Google Scholar 

  33. Scalart P, Filho J (1996) Speech enhancement based on a priori signal to noise estimation. In: Proceedings of the IEEE international conference on acoustics, speech, and signal processing (ICASSP-96), vol 2, pp 629–632

    Google Scholar 

  34. Scholz K, Wältermann M, Huo L, Raake A, Möller S, Heute U (2006) Estimation of the quality dimension “directness/frequency content” for the instrumental assessment of speech quality. In: Proceedings of the 9th international conference on spoken language processing (ICSLP), Pittsburgh, pp 1523–1526

    Google Scholar 

  35. Sen D (2004) Predicting foreground SH, SL and BNH DAM scores for multidimensional objective measure of speech quality. In: IEEE international conference on acoustics, speech, and signal processing (ICASSP’04), vol 1, pp 493–496

    Google Scholar 

  36. Sen D, Lu W (2012) Objective evaluation of speech signal quality by the prediction of multiple foreground diagnostic acceptability measure attributes. J Acoust Soc Am 131(5):4087–4103

    Article  Google Scholar 

  37. Takahashi A, Yoshino H, Kitawaki N (2004) Perceptual QoS assessment technologies for VoIP. IEEE Commun Mag 42(7):28–34

    Article  Google Scholar 

  38. Thiede T, Treurniet W, Bitto R, Schmidmer C, Sporer T, Beerends J, Colomes C (2000) PEAQ—The ITU standard for objective measurement of perceived audio quality. J Audio Eng Soc 48(1/2):3–29

    Google Scholar 

  39. Voiers WD (1977) Diagnostic acceptability measure for speech communication systems. In: IEEE international conference on acoustics, speech, and signal processing (ICASSP’77), Hartford, pp 204–207

    Google Scholar 

  40. Wältermann M (2013) Dimension-based quality modeling of transmitted speech. Springer, Berlin

    Book  Google Scholar 

  41. Wältermann M, Raake A, Möller S (2010) Quality dimensions of narrowband and wideband speech transmission. Acta Acustica Acustica 96(6):1090–1103

    Article  Google Scholar 

  42. Weiss B, Möller S, Raake A, Berger J, Ullmann R (2009) Modeling call quality for time-varying transmission characteristics using simulated conversational structures. Acta Acustica Acustica 95(12):1140–1151

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Nicolas Côté .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this chapter

Cite this chapter

Côté, N., Berger, J. (2014). Speech Communication. In: Möller, S., Raake, A. (eds) Quality of Experience. T-Labs Series in Telecommunication Services. Springer, Cham. https://doi.org/10.1007/978-3-319-02681-7_12

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-02681-7_12

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-02680-0

  • Online ISBN: 978-3-319-02681-7

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics