Low-complexity rate-distortion optimal transcoding of MPEG I-frames

https://doi.org/10.1016/S0923-5965(99)00011-9Get rights and content

Abstract

The transcoding of MPEG video streams is important in a variety of situations such as in rate reduction for small bandwidth downstream channels and in extracting a fast visual playback stream for digital video cassette recorders. This paper addresses transcoding of MPEG intra-coded frames by selectively copying DCT AC coefficients from the primary MPEG stream into the secondary MPEG stream. A rate-distortion optimal transcoding approach, as well as several computationally more attractive suboptimal approaches, is presented. The proposed solutions outperform the more traditional codeword extraction methods from literature.

Introduction

A typical MPEG encoder that compresses digital video for broadcasting purposes has a target bit-rate approximately between 4 and 10 Mbit/s. The choice of this rate is primarily determined by the available or allocated capacity of the terrestrial or satellite transmission link. In some cases these primary wide-band channels feed into secondary downstream channels of lower capacity. For instance, in a cable head-end a network operator may decide to allocate a smaller bandwidth to a specific video service than the bandwidth the service occupies on reception. Clearly this is at the advantage of providing more different services on the cable network however at the cost of a lower quality per channel. Another example is the provision of fast visual playback facilities in digital video cassette recorders. Fast visual playback, i.e., a fast forward or reverse playback mode which displays video pictures, is particularly complicated for MPEG compressed digital video stored on a helical scan video cassette recorder [2], [4], [5]. One solution to fast visual playback is to store (and display on fast forward or reverse) a low bit-rate stream on the digital cassette in addition to the original MPEG bit stream. This requires the real-time extraction of a low bit-rate MPEG stream when recording the primary MPEG video bit stream.

The problem of converting a compressed video stream such as MPEG into another (MPEG) compressed video stream of lower bit-rate is known as transcoding [7], [9]. Though with transcoding a wide class of format conversions for compressed video is meant, we confine ourselves in this paper to the efficient bit-rate reduction of the intra-coded frames (I-frames) of a given MPEG-2 video bit stream, without fully decoding and re-encoding (or re-quantization) the video signal. This last requirement is particularly important for cost-effective real-time systems. A typical configuration under consideration in this paper is the real-time conversion of a 10 Mbit/s primary MPEG-2 video stream into a 1 Mbit/s secondary I-frame-only MPEG-2 stream.

The solution strategy that we propose in this paper is that of selectively copying encoded non-zero AC coefficients or codewords from the primary stream into the secondary stream. Since no re-quantization nor inverse DCT operations are necessary, this method is inherently fast. Similar approaches have been suggested in [8], [10], [13]. The transcoding methods that we propose aim at rate-distortion optimality, and outperform earlier proposed techniques without increasing the complexity of the transcoder significantly.

In Section 2 various approaches to transcoding are discussed. Section 3 concentrates on rate-distortion optimal methods for coefficient or AC codeword selection, while Section 4 proposes a control strategy that governs the selection mechanism. In Section 5 an experimental evaluation is given for the transcoding methods described in this paper. The paper concludes with a discussion in Section 6.

Section snippets

Low complexity approaches to transcoding

In extracting a secondary stream only intra-coded information (I-frame) of the primary stream is used. The reason for this is that we focus on fast and low-complexity transcoding. Transcoding of predicted video frames (P-frames) would require motion compensation of the additional distortion introduced by transcoding of previous I- or P-frames [7], [9]. The complexity of this operation is considered too high in the context of this paper.

Fig. 1 gives two examples of how intra-frames from the

Coefficient/codeword selection mechanism

For the coefficient selection and the codeword selection approaches a mechanism is required to determine the set of optimal non-zero AC coefficient and the cut-off level, respectively, per DCT block. We assume that the target bit-rate for the frame to be transcoded is known. Clearly then the selection mechanism needs to deliver the maximum quality of the decoded picture, given this bit-rate constraint. This section addresses how to solve this rate-distortion optimization problem.

Control of the Lagrange parameter λ

As indicated in Section 2, all selection methods require control of the Lagrange parameter λ. For CWS controlling λ implicitly means controlling the cut-off level c. For ZCS only the cut-off level c is controlled without rate-distortion considerations. In all cases the objective is to ensure that the target bit-rate (4) for a given frame is reached. One possibility is to find λ by an intelligent iterative search technique. Fig. 6(a) shows the block diagram of the resulting transcoder. Observe

Performance comparison

This section presents a performance comparison from two points of view. In the first place we evaluate the most promising transcoding methods experimentally. In this way we can compare the numerical performance differences between the various proposed techniques and their simplifications, and the differences with the known methods “full transcoding” and “zonal codeword selection”. Secondly, we compare the transcoding techniques with regard to their complexity. Though ideally complexity is

Discussion

This paper has proposed several transcoding methods that are based on copying selected AC DCT coefficients or codewords from a primary MPEG stream into a secondary MPEG stream. We have shown that the methods proposed outperform the more traditional codeword extraction methods from literature with various degrees of increase in computational complexity. The codeword selection method in combination with a (simplified) Lagrange feed-back control has a low complexity and qualifies for application

References (13)

  • G. Keesman et al.

    Transcoding of MPEG bitstreams

    Signal Processing: Image Communication

    (1996)
  • T. Berger

    Rate Distortion TheoryA Mathematical Basis for Data Compression

    (1971)
  • P.H.N. de With, A.M.A. Rijckaert, H.-W. Keesen, J. Kaaden, C. Opelt, An experimental digital consumer HDTV recorder...
  • H. Everett III

    Generalized Lagrange multiplier method for solving problems of optimum allocation of resources

    Oper. Res.

    (1963)
  • E.D. Frimout, Fast playback of helical-scan recorded MPEG video, Ph.D. Thesis, Delft University of Technology,...
  • E.D. Frimout, R.L. Lagendijk, J. Biemond, Extraction of a dedicated fast playback MPEG bit stream, in: Proceedings of...
There are more references available in the full text version of this article.

Cited by (5)

View full text