Abstract
Bandwidth efficiency and error robustness are essential for multimedia streaming applications. This paper presents strategies for high-quality audio streaming that fragment perceptually coded audio frames and shuffle the resulting data components among multiple packets for transport, which increases robustness against packet loss. We also address the delivery of audio data whose components have different relative priorities. We validate the approach with streaming tests using the MPEG AAC audio codec in a simulated network environment and with formal listening tests that evaluate the resulting audio output. According to the results, the proposed schemes improve audio quality significantly, with a reasonable increase in network resource utilization, compared to traditional error robustness measures.
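The general idea of fragmenting frames and shuffling the fragments among packets can be illustrated with a minimal sketch. This is not the scheme evaluated in the paper: the abstract does not say how frames are split or how fragments are assigned to packets, so the byte-slice fragmentation, the rotation rule, and the names `fragment`, `shuffle_into_packets`, and `reassemble` are all assumptions made for illustration.

```python
def fragment(frame, n):
    """Split a frame into n contiguous pieces of near-equal size (assumed rule)."""
    size, extra = divmod(len(frame), n)
    pieces, start = [], 0
    for j in range(n):
        end = start + size + (1 if j < extra else 0)
        pieces.append(frame[start:end])
        start = end
    return pieces


def shuffle_into_packets(frames, n):
    """For a group of n frames, packet i carries fragment j of frame (i + j) % n."""
    assert len(frames) == n
    frags = [fragment(f, n) for f in frames]
    packets = []
    for i in range(n):
        # Each entry is (frame index, fragment index, payload bytes).
        packets.append([((i + j) % n, j, frags[(i + j) % n][j]) for j in range(n)])
    return packets


def reassemble(packets, n, lost=frozenset()):
    """Rebuild the fragment table; fragments carried by lost packets stay None."""
    frames = [[None] * n for _ in range(n)]
    for i, packet in enumerate(packets):
        if i in lost:
            continue
        for frame_idx, frag_idx, data in packet:
            frames[frame_idx][frag_idx] = data
    return frames


if __name__ == "__main__":
    frames = [bytes([k]) * 8 for k in range(4)]  # four dummy 8-byte frames
    packets = shuffle_into_packets(frames, 4)
    rebuilt = reassemble(packets, 4, lost={1})
    for idx, frags in enumerate(rebuilt):
        missing = [j for j, x in enumerate(frags) if x is None]
        print(f"frame {idx}: missing fragments {missing}")
```

With this assignment, losing one packet removes a single fragment from each frame in the group rather than an entire frame, which leaves the decoder something to conceal from in every frame.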
Korhonen, J., Wang, Y. & Isherwood, D. Toward bandwidth-efficient and error-robust audio streaming over lossy packet networks. Multimedia Systems 10, 402–412 (2005). https://doi.org/10.1007/s00530-005-0169-4