Short Communication
A pixel-wise directional intra prediction method

https://doi.org/10.1016/j.jvcir.2012.02.004Get rights and content

Abstract

In a decade where the prominence of video applications has become increasingly prevalent, the optimization of video coding processes remains as important as ever. The directional intra prediction has been proved as an effective tool for intra picture coding. However, in a directional intra prediction based coding structure, there is a considerable gap between the transform efficiency of DCT and the optimal transform KLT. In this paper, we analyze these problems and propose a pixel-wise directional intra prediction (PDIP) method to solve this problem. This method exploits the reconstruction value of the adjacent pixels to predict the current pixel, and keeps the block-based lossy coding structure of H.264/AVC. The proposed method can significantly improve the coding efficiency with little decoding complexity increment.

Introduction

The digital video compression techniques can be traced to the principle that some accuracy of pixels composing a static image can be sacrificed without notable deterioration in the image’s viewing quality to the human eyes. Video compression takes advantage of the reasonable level of error allowed for the pixels of an image in order to analytically predict a majority of the image’s pixels based on knowledge of only a few reference pixels [1].

Techniques for video compression prediction can be classified into inter prediction coding and intra prediction coding. Inter prediction coding predicts pixels using the relationship between temporal adjacent frames for temporal estimation. Intra prediction coding removes the spatial redundancy in each separate frame to compress videos. Since the intra coding is applied within one picture, it is essentially the same as the static image compression. There are increasing requirements for intra only video compression, such as film-making, digital cameras and video editing applications. Efficient intra coding is also important for random access applications. In this paper, we will study the intra coding of current standards, and improve the intra coding method to introduce significant coding gain, with the tradeoff being only a relatively small increment in complexity.

The state of the art video coding standard H.264/AVC [2], [3] developed by VCEG (ITU-T SG16 Q.6) and MPEG (ISO/IEC JTC 1/SC 29/WG 11) has been well-known for achieving a much higher coding efficiency than the last generation video coding standards [4]. For example, it saves more than 50% bit-rate compared to MPEG-2 [5] while preserving almost the same visual quality [3].

The most highlighted intra picture coding technique of H.264/AVC is the directional spatial prediction [2]. In this coding structure, the reconstructed pixels from the adjacent blocks are used to predict the current coding block and generate the residue block (The detailed procedure will be illustrated in Section 2.1). The residue block is then transformed, and the coefficients are then scanned, quantized and entropy coded sequentially, as illustrated in Fig. 1.

For a classic intra coding structure like MPEG-2, DCT has been proved to be able to archive very close coding efficiency to KLT [6]. However, [7] proved that the transform efficiency of DCT is far less than KLT while directional intra prediction is employed, and the coding efficiency can be further improved by more than 4% while a separable mode dependent transform derived from KLT is used for each directional intra prediction mode [8]. The defect of the mode dependent KLT is that we should design one transform matrix for each mode. Employing several transformations will significantly improve the implementation complexity of both the encoder and decoder, especially for hardware implementation. Moreover, a KLT is normally not mathematically separable, also does not have fast algorithms. There were some other researches focused on further simplifying the KLT for directional intra prediction, and achieved considerable coding gain [9].

This paper proposes a much simpler method, which improves the prediction process and keeps the transform process unchanged. Test results show a performance improvement of about 2.5% average bit-rate savings or equivalently 0.2 dB PSNR gain for intra coding.

The rest of this paper is organized as follows. The directional prediction in H.264/AVC is briefly introduced in Section 2; the problem brought by directional prediction is also discussed. The proposed prediction method which solves the problem is introduced in Section 3. Section 4 gives the experimental results, and the conclusion is drawn in Section 5.

Section snippets

The directional prediction in H.264/AVC

The directional prediction of H.264/AVC supports the coding block-size of 4 × 4,8 × 8 and 16 × 16. As an example, the directional prediction of 4 × 4 is illustrated below.

As it shows in Fig. 2, the prediction value of the current block (pij) can be obtained from the reconstructed value of the left pixels (ai0) and the upper pixels (a0j) if they are available. These pixels are called reference pixels. There are eight prediction directions that can be chosen. As an example, the horizontal prediction is

The proposed method

In this section, we still take the 4 × 4 horizontal prediction mode as an example. While the proposed method is used, the prediction value equals the reconstruction value of the adjacent pixel to the left of the current pixel.

Experimental results

The proposed method has been implemented on the H.264/AVC test model JM17.2 [15]. The new intra prediction method is enabled for 4 × 4 horizontal mode and vertical mode, replacing the original method. No change is made for the other intra prediction modes.

The experiment is based on H.264/AVC main profile. QP is set to {22, 27, 32, 37}. Six sequences with different sizes are tested. All the frames are coded as I slice. CABAC is used as entropy coding, and RDOQ is enabled. The same RDOQ method is used

Conclusion

Directional prediction has been proved as an efficient tool for intra picture video coding. However, since DCT-based transform is designed to be a good approximation to KLT only when the original signal is to be transformed, and does not work efficiently while the residue signal is to be transformed. Recent research focus on designing mode dependent transforms which will significantly increase the complexity of both the encoder and decoder. This paper analyzes this problem and provides an

Acknowledgment

The authors would like to thank Prof. Jiangtao Wen for his discussions on the topic.

References (16)

  • Soon-kak Kwon et al.

    Overview of H.264/MPEG-4 part 10

    J. Visual Commun. Image Represent.

    (2006)
  • C.-C. Weisi Lin et al.

    Perceptual visual quality metrics: a survey

    J. Visual Commun. Image Represent.

    (2011)
  • ITU-T Recommendation H.264 and ISO/IEC 14496-10, Advanced Video Coding for Generic Audiovisual Services, May...
  • T. Wiegand et al.

    Rate-constrained coder control and comparison of video coding standards

    IEEE Trans. Circuits Syst. Video Technol

    (2003)
  • ITU-T Recommendation H.262 and ISO/IEC 13818-2, Information Technology-Generic Coding of Moving Pictures and Associated...
  • N. Ahmed et al.

    Discrete Cosine Transfom

    IEEE Trans. Computers

    (1974)
  • M. Karczewicz, Improved Intra Coding, ITU-T Q.6/SG16 VCEG document, VCEG-AF15, San Jose, USA, April...
  • Y. Ye, M. Karczewicz, Improved H.264 intra coding based onbi-directional intraprediction, directional transform, and...
There are more references available in the full text version of this article.

Cited by (2)

  • Direction-adaptive fixed length discrete cosine transform framework for efficient H.264/AVC video coding

    2016, Signal Processing: Image Communication
    Citation Excerpt :

    A further alternative scheme such as pixel-wise directional intra prediction (PDIP) method, utilizes the adjacent reconstructed pixels of different orientation to predict the current pixels but keeps the transform module unaltered. It has shown the improvement in bit-rate by 2.5% for intra coding [25]. The MC-residuals have different characteristics than image/intra residuals blocks, and applying the same transform to such blocks leads to inefficient compression performance.

  • Adaptive intra prediction mode signaling approach in H.264/AVC

    2014, IEICE Transactions on Communications

This work was supported in part by 973-2009CB320903 and 2010ZX03004-003.

View full text