GOP-level transmission distortion modeling for mobile streaming video

https://doi.org/10.1016/j.image.2007.12.002Get rights and content

Abstract

Unequal loss protection is an effective tool in delivering compressed video streaming over packet-switched networks robustly. A critical component in any unequal-loss-protection scheme is a metric for evaluating the importance of different frames in a Group-Of-Pictures (GOP). In the case of video streaming over 3G mobile networks, packet loss usually corresponds to whole-frame loss due to low bandwidth and small picture size, which results in high error rates and thus most of the existing low-complexity transmission-distortion-estimate models may be ineffective. In this paper, we firstly develop a recursive algorithm to compute the GOP-level transmission distortion at pixel-level precision using pre-computed video information. Based on the study on the propagating behavior of the whole-frame-loss transmission distortion, we then propose a piecewise linear-fitting approach to achieve low-complexity transmission distortion modeling. The simulation results demonstrate that the proposed two models are accurate and robust. The proposed transmission distortion models are fast and accurate importance assessment tools in allocating limited channel resources optimally for the mobile streaming video.

Introduction

Nowadays, the expanded bandwidth for the air interfaces has made a solid ground for streaming media applications on 3G mobile network. With the advantages of wireless system in time and place, mobile streaming media service is very attractive. Due to different kinds of fading and multipath interference for the wireless channels, mobile video communication over 3G networks experiences burst packet losses. Moreover, the compressed video signal is extremely vulnerable against transmission errors, since low bit-rate video coding schemes rely on interframe coding for high coding efficiency. The coding structure of motion compensated interframe prediction creates strong spatio-temporal dependency in video frames [1], [2]. Consequently, the unavoidable packet losses during wireless transmission may result in error propagation of reconstructed video and thus induce severe quality degradation. This type of picture distortion is called transmission distortion.

In order to combat the effect of losses, many error control techniques are proposed for the packet video transmission [2], [3], [4], [5]. In Ref. [4], the error-control mechanisms are classified into four categories: forward error correction (FEC), retransmission, error resilience, and error concealment. For the streaming media applications, retransmission-based schemes are not allowed usually for real-time applications because of long time delay and hundreds of multicast members. Error-resilient schemes deal with packet loss on the compression layer, and most of them (e.g., resynchronization marking, data partitioning, and data recovery, etc.), are targeted to recover the decoding from bit errors. For packet video, transmission error is usually the entire packet loss and these bit-recovery based error-control mechanisms may cease to be effective [4]. Since complex error-concealment mechanisms have to be restricted by both the limited processing ability and power consumption of the handheld devices, the simplest and most common approach, previous frame repetition, is adopted in the mobile streaming media applications usually. FEC-based unequal loss protection (ULP) is one of the efficient error-control schemes used for the transmission of compressed video streaming over packet-loss networks [6], [7], by which limited channel resources are allocated efficiently to achieve increased rate-distortion (R-D) performance.

One of the most critical aspects of efficient resource allocation is accurate evaluation of the end-to-end video quality [8]. In the accurate transmission-distortion-estimate (TDE) schemes with moderate complexity, such as ROPE algorithm [9] and the statistics-based analytic model [10], the overall distortion accumulated from previous frames is computed to determine the coding mode for current macroblock (MB), where the total transmission distortion include distortion in current MB and its propagation in subsequent frames cannot be obtained. The low-complexity models include the method considering the intra refreshing and spatial loop filtering [11] and the method using the basic concepts of control systems [12]. It is worth noting that the low-complexity estimate models above are applicable for the low error-rates applications. For example, the error rate in Ref. [11] is less than 6% and that in Ref. [12] is about 8%. In the mobile video applications, packet loss usually corresponds to whole-frame loss due to low bandwidth and small picture size [13], which results in high error rates (an amount of blocks with nonzero motion vectors (MVs) are corrupted due to previous frame concealment) and thus the above low-complexity schemes are ineffective [11]. The simplified distortion-estimate model without considering the picture complexity, such as ELEP in Ref. [6] where only temporal error propagation is taken into account, are usually not accurate enough and thus optimal R-D performance cannot be achieved. Based on statistics of error propagation, a method for lightweight prediction of video distortion is proposed in Ref. [14].

In this paper, by exploiting the pre-computed information (such as MVs) of stored video signals, we first present a recursive TDE algorithm in this paper to compute the GOP-level transmission distortion for whole-frame losses. We then develop a low-complexity TDE model using piecewise linear-fitting approach based on the study on the error propagation behaviors of whole-frame losses. The experimental results demonstrate that the two proposed TDE models are accurate and robust. The proposed transmission distortion models provide a type of fast and accurate importance-assessment tools in allocating limited channel resources effectively.

The rest of this paper is organized as follows. In Section 2, we analyze the error propagation characteristics and unequal importance for different frames in a Group-Of-Pictures (GOP). Section 3 presents the recursive TDE algorithm for the stored video streaming, and Section 4 gives a piecewise linear-fitting approach based low-complexity estimate model. Simulation results are shown in Section 5. Finally, conclusions are drawn in Section 6.

Section snippets

Interframe error propagation

The common video coding scheme employs interframe prediction to remove temporal redundancies. Although interframe coding generally achieves higher compression efficiency, it is more sensitive to channel packet losses since each interframe prediction depends on its predecessor and any packet loss may break the prediction chain and affect all subsequent inter-predicted frames.

Let a packet containing data from the current frame be lost in the channel, and let the decoder perform

Recursive computation of the GOP-level transmission distortion

Unfortunately, the computational complexity is very high when Eq. (3) is used to calculate the GOP-level transmission distortion, and the low-complexity TDE models will be badly suited to the high error rates (induced by whole-frame losses) wireless video. Thus, for the mobile streaming media with whole-frame losses, the theoretical or approximate model with modest complexity is needed to compute the GOP-level transmission distortion with reasonable accuracy. Considering the characteristic that

The behavior of the whole-frame-loss error propagation

For typical mobile video, the compressed size of one video frame can become fairly small (800 bytes on average for 10 frames per second of QCIF video over 64 Kbit/s wireless channel [13]), and a single packet per video frame is often adopted to ensure efficient packet header overhead. Thus, a packet loss corresponds to one whole-frame-loss [13], [19].

The authors in Ref. [10] have demonstrated that the impulse transmission distortion has an exponential fading behavior. However, the fading behavior

Simulation results

Simulations have been carried out to evaluate the performance of the proposed GOP-level TDE models. We used H.264/AVC reference software JM11.0 [22] to encode six typical test videos without considering any channel losses. All of them are coded with a GOP size of 30 frames (one previous frame is used as reference) and the frame rate is set to 15 frames per second (fps). The proposed two TDE models are simulated separately to get the GOP-level transmission distortion for each frame in a GOP.

The

Conclusion

In this paper, we have firstly proposed a recursive estimation algorithm to compute the GOP-level transmission distortion induced by whole-frame losses. Based on the study on the propagation behavior of the whole-frame-loss transmission distortion for stored video streaming, we have then developed a low-complexity model to estimate the GOP-level transmission distortion accurately and robustly. With the estimation, the transmitter can assess the importance for each frame in a GOP effectively.

Acknowledgments

The authors would like to acknowledge the financial support of National Science Foundation of China (NSFC) under grants nos. 60502034 and 60625103. We also would like to thank the anonymous reviewers for their valuable suggestions that greatly improved the presentation of this paper.

References (22)

  • X.K. Yang et al.

    Unequal loss protection for robust transmission of motion compensated video over the Internet

    Signal Process.: Image Commun.

    (2003)
  • M. Murroni

    A power-based unequal error protection system for digital cinema broadcasting over wireless channels

    Signal Process.: Image Commun.

    (2007)
  • Q. Zhang et al.

    End-to-end QoS for video delivery over wireless Internet

    Proc. IEEE

    (January 2005)
  • B. Girod et al.

    Feedback-based error control for mobile video transmission

    Proc. IEEE

    (October 1999)
  • Y. Wang et al.

    Error control and concealment for video communication: a review

    Proc. IEEE

    (May 1998)
  • D. Wu et al.

    Transporting real-time video over the Internet: challenges and approaches

    Proc. IEEE

    (December 2000)
  • Z.G. Li et al.

    A unified architecture for real time video coding systems

    IEEE Trans. Circuits Syst. Video Technol.

    (2003)
  • A.K. Katsaggelos et al.

    Advances in efficient resource allocation for packet-based real-time video transmission

    Proc. IEEE

    (January 2005)
  • R. Zhang et al.

    Video coding with optimal inter/intra-mode switching for packet loss resilience

    IEEE J. Selected Areas Commun.

    (June 2000)
  • Z.H. He et al.

    Joint source channel rate-distortion analysis for adaptive mode selection and rate control in wireless video coding

    IEEE Transact. Circuits Syst. Video Technol., Special Issue on Wireless Video

    (June 2002)
  • K. Stuhlmuller et al.

    Analysis of video transmission over lossy channels

    IEEE J. Selected Areas Commun.

    (June 2000)
  • Cited by (15)

    • Estimation of accurate effective loss rate for FEC video transmission

      2014, Signal Processing: Image Communication
      Citation Excerpt :

      So far, many distortion models for video transmission over lossy channels have been proposed in the literature. Zhang et al. [18] developed a low-complexity transmission distortion model using a piece-wise linear fitting approach for the whole-frame loss. Sabir et al. [19] presented a statistical distortion model for MPEG-4 coded video streams transmitted over noisy/fading channels with error-resilient tools.

    • Wireless multi-view video streaming with subcarrier allocation

      2016, IEICE Transactions on Communications
    • A tutorial and review on inter-layer fec coded layered video streaming

      2015, IEEE Communications Surveys and Tutorials
    • Impact of random and burst packet losses on H.264 scalable video coding

      2013, 2013 IEEE Information Theory Workshop, ITW 2013
    View all citing articles on Scopus
    View full text