Abstract
Assessing the perceptual quality of speech and audio signals is an important consideration in multimedia networks and devices. This paper introduces the speech and audio quality assessment methods standardized in ITU recommendations and by other international organizations. It gives a brief survey of subjective and objective quality evaluation methods for speech and audio, and outlines recent developments and topics for future work.
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Oh, W., Lee, SK. (2012). Quality Assessment of Sound Signals in Multimedia and Communication Systems. In: Kim, Th., Kang, JJ., Grosky, W.I., Arslan, T., Pissinou, N. (eds) Computer Applications for Bio-technology, Multimedia, and Ubiquitous City. BSBT 2012, MulGraB 2012, IUrC 2012. Communications in Computer and Information Science, vol 353. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35521-9_8
DOI: https://doi.org/10.1007/978-3-642-35521-9_8
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-35520-2
Online ISBN: 978-3-642-35521-9
eBook Packages: Computer Science, Computer Science (R0)