Toward bandwidth-efficient and error-robust audio streaming over lossy packet networks

Korhonen, Jari; Wang, Ye; Isherwood, David

doi:10.1007/s00530-005-0169-4

Toward bandwidth-efficient and error-robust audio streaming over lossy packet networks

Research Article
Published: 07 July 2005

Volume 10, pages 402–412, (2005)
Cite this article

Multimedia Systems Aims and scope Submit manuscript

Jari Korhonen¹,
Ye Wang² &
David Isherwood³

76 Accesses
3 Citations
6 Altmetric
Explore all metrics

Abstract

Bandwidth efficiency and error robustness are essential issues for different multimedia streaming applications. This paper presents strategies for high-quality audio streaming based on fragmenting perceptually coded audio frames and shuffling the data components among multiple packets for transportation. This is done to increase robustness against packet loss. We also address the delivery of audio data consisting of components with different proportional priorities. Our approach is rationalized with streaming tests using the MPEG AAC audio codec in a simulated network environment and formal listening tests to evaluate the resulting audio output. According to the results, the proposed schemes improve audio quality significantly with reasonable increase to network resource utilization compared to traditional error robustness measures.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Improved audio compression through advanced adaptive data processing and distribution in an ANN framework

Article 06 February 2025

Audiovisual quality of live music streaming over mobile networks using MPEG-DASH

Article 23 June 2020

Compressive sampling and adaptive dictionary learning for the packet loss recovery in audio multimedia streaming

Article 07 November 2015

Discover the latest articles and news from researchers in related subjects, suggested using machine learning.

References

Perkins, C., Kouvelas, I., Hodson, O., Hardman, V., Handley, M., Bolot, J.-C., Vega-Garcia, A., Fosse-Parisis, S.: RTP payload for redundant audio data. IETF RFC 2198 (1997)
Wah, B.W., Su, X., Lin, D.: A survey of error-concealment schemes for real-time audio and video transmissions over the Internet. In: Proceedings of the IEEE International Symposium on Multimedia Software Engineering (MSE '00), pp. 17–24. Taipei, Taiwan (2000)
Lauber, P., Sperschneider, R.: Error concealment for compressed digital audio. Convention Paper 5460 at the 111th AES Convention, New York (2001)
Miao, L., Lu, J., Gu, J.: An improved error resilience scheme for transmission of MPEG-4 audio over EGPRS. In: Proceedings of the IEEE Vehicular Technology Conference (VTC Fall '01), pp. 414–417. Atlantic City, NJ (2001)
Sperschneider, R., Homm, D., Chambat, L.-H.: Error resilient source coding with variable-length codes and its application to MPEG advanced audio coding. Audio Engineering Society Convention Paper 5271 at the 109th AES Convention. Los Angeles (2000)
Sperschneider, R., Homm, D., Chambat, L.-H.: Error resilient source coding with differential variable-length codes and its application to MPEG advanced audio coding. Audio Engineering Society Convention Paper 5555 at the 112th AES Convention. Munich, Germany (2002)
Chawla, K., Driessen, P., Qiu, X.: Transmission of streaming data over an EGPRS wireless network. In: Proceedings of IEEE Vehicular Technology Conference (VTC '00), vol. 1, pp. 118–122. Tokyo (2000)
Transparent End-to-End Packet Switched Streaming Service (PSS): RTP usage model. 3rd Generation Partnership Project TR 26.937 V6.0.0 (2004)
Rosenberg, J., Schultzrinne, H.: An RTP payload format for generic forward error correction. IETF RFC 2733 (1999)
Finlayson, R.: A more loss-tolerant RTP payload format for MP3 audio. IETF RFC 3119 (2001)
van der Meer, J., Mackie, D., Swaminathan, V., Singer, D., Singer, P.: RTP payload format for transport of MPEG-4 elementary streams. IETF RFC 3640 (2003)
Stockhammer, T., Viegand, T., Oelbaum, T., Obermeier, F.: Video coding and transport layer techniques for H.264/AVC-based transmission over packet-lossy networks. In: Proceedings of the International Conference on Image Processing (ICIP '03), pp. 481–484. Barcelona, Spain (2003)
Korhonen, J.: Error robustness scheme for perceptually coded audio based on interframe shuffling of samples. In: Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP '02), pp. 2053–2056. Orlando, FL (2002)
Korhonen, J.: Robust audio streaming over lossy packet-switched networks. In: Proceedings of International Conference on Information Networking (ICOIN '03), pp. 1343–1352. Jeju Island, South Korea, (2003)
Korhonen, J., Wang, Y.: Schemes for error resilient streaming of perceptually coded audio. In: Proceedings of International Conference on Acoustics, Speech and Signal Processing (ICASSP '02), vol. 5, pp. 740–743. Hong Kong (2003)
Painter, T., Spanias, A.: Perceptual coding of digital audio. Proc. IEEE 88(4), 451–515 (2000)
Article Google Scholar
Coding of Audio-Visual Objects – Part 3: Audio. ISO/IEC International Standard 14496-3 (2001)
Wang, Y., Vilermo, M.: Modified discrete cosine transform: its implications for audio coding and error concealment. J. Audio Eng. Soc. 51(1/2) (2003)
Herre, J., Eberlein, E.: Error concealment in the spectral domain. Convention Paper 3364 at the 93rd AES Convention. San Francisco, USA (1992)
Quackenbush, S., Driessen, P.: Error mitigation in MPEG-4 audio packet communication systems. Audio Engineering Society Convention Paper 5981 at the 109th AES Convention. New York, USA (2003)
Wang, Y., Streich, S.: A drumbeat-pattern based error concealment method for music streaming applications. In: Proceedings of International Conference on Acoustics, Speech and Signal Processing (ICASSP '02), pp. 2817–2820. Orlando, FL (2002)
Kauppinen, I., Roth, K.: Audio signal extrapolation—theory and applications. In: Proceedings of the 5th Conference on Digital Audio Effects, pp. 105–110. Hamburg, Germany (2002)
Schultzrinne, H., Casner, S., Frederick, R., Jacobson, V.: A transport protocol for real-time applications. IETF RFC 3550 (2003)
Li, V., Zaichen, Z.: Internet multicast routing and transport control protocols. Proc. IEEE 90(3), 360–391 (2002)
Article Google Scholar
Rey, L., Leon, D., Miyazaki, A., Varsa, V., Hakenberg, R.: RTP retransmission payload format. Internet draft, March 2004 (work in progress)
Hynninen, J., Zacharov, N.: Guineapig—a generic subjective test system for multichannel audio. Audio Engineering Society Convention Paper 4871 at the 106th AES Convention. Munich, Germany (1999)
Kylliäinen, M., Helimäki, H., Zacharov, N., Cozens, J.: Compact high performance listening spaces. In: Proceedings of Euronoise. Naples, Italy (2003)
Moore, B.C.J., Glasberg, B.R., Baer, T.: A model for the prediction of thresholds, loudness and partial loudness. J. Audio Eng. Soc. 45(4), 224–240 (1997)
Google Scholar

Download references

Author information

Authors and Affiliations

Nokia Research Center, Tampere, Finland
Jari Korhonen
National University of Singapore, Singapore
Ye Wang
Nokia Corp., Tampere, Finland
David Isherwood

Authors

Jari Korhonen
View author publications
Search author on:PubMed Google Scholar
Ye Wang
View author publications
Search author on:PubMed Google Scholar
David Isherwood
View author publications
Search author on:PubMed Google Scholar

Corresponding author

Correspondence to Jari Korhonen.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Korhonen, J., Wang, Y. & Isherwood, D. Toward bandwidth-efficient and error-robust audio streaming over lossy packet networks. Multimedia Systems 10, 402–412 (2005). https://doi.org/10.1007/s00530-005-0169-4

Download citation

Published: 07 July 2005
Issue Date: August 2005
DOI: https://doi.org/10.1007/s00530-005-0169-4

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Toward bandwidth-efficient and error-robust audio streaming over lossy packet networks

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Improved audio compression through advanced adaptive data processing and distribution in an ANN framework

Audiovisual quality of live music streaming over mobile networks using MPEG-DASH

Compressive sampling and adaptive dictionary learning for the packet loss recovery in audio multimedia streaming

Explore related subjects

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now