Journal of Visual Communication and Image Representation
Study on the distribution of DCT residues and its application to R-D analysis of video coding
Introduction
Block discrete-cosine-transform (DCT) is widely employed in a serial of digital video compression standards [13], [8], [9], [14], [10]. The dominant design of a video encoder is a three-stage structure, as shown in Fig. 1. Frames of raw video data are divided into non-overlapped blocks and then transformed to coefficients in the frequency domain by DCT. In the second stage, quantization is employed to considerably reduce the magnitude of these DCT coefficients. Lastly, techniques of entropy coding are used to further increase the compression ratio. Such a hybrid system can compress raw video data efficiently while suffering from distortion to some extent.
Quantization errors are introduced in the quantization of DCT coefficients. Those quantization errors in the base-layer/non-scalable video coding are usually known as DCT residues [20]. Their statistics implicitly define visual quality of compressed video. In a signal-to-noise ratio scalable coder [3], [20], they are the source of the enhancement-layer. Therefore, knowledge on the distribution of DCT residues is significant in understanding and analyzing the performance of a DCT-based video compression system. It also makes sense in applications regarding postprocessing of coded video [27] and so on.
This paper investigates the distribution of DCT residues and highlights its application to rate-distortion (R-D) modeling in generic video coding. Individual frequency components of DCT residues are studied on the basis of quantization theory and well-accepted statistical models for source DCT coefficients. The probability density functions (PDFs) of individual frequency components of DCT residues, as well as the overall PDF of DCT residues, are presented. The first major advantage of the proposed study is that it not only provides mathematical justification on the distribution of DCT residues but also quantifies its relationship to video source and the quantization strategy. Another important advantage is it can accurately predict the possible distribution of DCT residues under a certain quantization strategy prior to actual video coding. Hence, it is useful in the analysis of one-pass video coding. In order to show such advantages, the proposed distribution model is applied to develop an R-D model. Based on the R-D model, a rate control algorithm is developed to control an MPEG-4 video coder.
The rest of the paper is organized as follows. Section 2 discusses some limitations of existing modeling work for the distribution of DCT residues. Section 3 presents the proposed distribution model. Section 4 compares the goodness-of-fit of the proposed distribution model to the experimental distribution. Section 5 applies the proposed distribution model to R-D modeling for video coding. Section 6 shows experimental results of the proposed distortion model and the proposed rate control algorithm. A conclusion is given in Section 7. Supporting experimental results are shown in corresponding sections.
Section snippets
Background and problem description
Existing work on the distribution of DCT residues is reviewed first. Then, distortion models in typical R-D analysis work are presented by the analysis of their underlying idea dealing with the distribution of DCT residues.
Modeling the distribution of DCT residues
DCT residues are quantization errors in essence. Their distribution should depend on both the distribution of source DCT coefficients and the quantizer characteristics. This section models their relationship by studying the compression of individual frequency components of source DCT coefficients.
Goodness-of-fit test
This section investigates the goodness-of-fit of the proposed distribution model to the actual DCT residues by the KS test [4]. The KS test justifies whether or not the sample of data are consistent with a specified distribution model. The test statistic (tks) is defined by the maximal difference between the empirical distribution G(zi) and the specified distribution F(zi) at samples zi. A small value of tks indicates a good fit.
In the KS test, the empirical distribution function G(zi) is
Application to R-D modeling
In this section, a novel distortion model is proposed based on the proposed PDF for DCT residues. Subsequently, the distortion model is combined with the classic R-D function to obtain a rate model.
Experimental results of R-D modeling
In this section, the performance of the proposed distortion model at different QP is examined. Subsequently, the proposed rate model is applied to develop a rate control algorithm.
Conclusion
In this paper, the distribution of DCT residues is investigated. The PDF of DCT residues in individual frequencies, as well as the overall PDF of all DCT components, are modeled based on studying the quantization of individual DCT components. Goodness-of-fit test suggests the proposed PDF is more accurate in comparison to Laplacian and Gaussian distributions. Subsequently, the proposed PDF is applied to R-D modeling for generic DCT video coding. After a new R-D model is presented, the
References (34)
- et al.
A new rate control scheme using quadratic rate distortion model
IEEE Trans. Circuits Syst. Video Technol.
(1997) - M. Dai, D. Loguinov, H. Radha, Statistical analysis and distortion modeling of MPEG-4 FGS, in: International Conference...
- et al.
Efficient H.261-based two-layer video codecs for ATM networks
IEEE Trans. Circuits Syst. Video Technol.
(1995) - et al.
Nonparametric Statistical Inference
(2003) - et al.
Source model for transform video coder and its application: I. Fundamental theory
IEEE Trans. Circuits Syst. Video Technol.
(1997) - et al.
Low-delay rate control for DCT video coding via ρ-domain source modeling
IEEE Trans. Circuits Syst. Video Technol.
(2001) - et al.
Optimum bit allocation and accurate rate control for video coding via ρ-domain source modeling
IEEE Trans. Circuits Syst. Video Technol.
(2002) - Information Technology, ISO/IEC 11172-2, Coding of moving pictures and associated audio for digital storage media at up...
- Information Technology, ISO/IEC 13818-2, Generic coding of moving pictures and associated audio information: video,...
- ISO/IEC 14496-2 MPEG-4 Visual Version 1, Coding of audio-visual objects – part 2: visual, April...
Digital Coding of Waveforms Principles and Applications to Speech and Video
Almost uniformity of quantization errors
IEEE Trans. Instrum. Meas.
Rate control for H.264 video with enhanced rate and distortion models
IEEE Trans. Circuits Syst. Video Technol.
Cited by (11)
A study on quantization effects of DCT based compression
2017, Proceedings - International Conference on Image Processing, ICIPNew R-D Optimization Criterion for Fast Mode Decision Algorithms in Video Coding and Transrating
2016, IEEE Transactions on Circuits and Systems for Video TechnologyEarly Skip Mode Decision for HEVC Encoder with Emphasis on Coding Quality
2015, IEEE Transactions on BroadcastingAn R-D optimized transcoding resilient motion vector selection
2014, Eurasip Journal on Advances in Signal ProcessingA fast inter residual quad-tree construction method in HEVC
2014, Digest of Technical Papers - IEEE International Conference on Consumer Electronics