Fast Depth Intra Mode Decision Based on DCT in 3D-HEVC

Yang, Renbin; Dai, Guojun; Zhang, Hua; Zhou, Wenhui; Yu, Shifang; Feng, Jie

doi:10.1007/978-3-030-03398-9_20

Renbin Yang²⁰,
Guojun Dai²⁰,
Hua Zhang^20,21,
Wenhui Zhou²⁰,
Shifang Yu²⁰ &
…
Jie Feng²²

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 11256))

Included in the following conference series:

Chinese Conference on Pattern Recognition and Computer Vision (PRCV)

2296 Accesses

Abstract

The state-of-the-art 3D High Efficiency Video Coding (3D-HEVC) is an extension of the High Efficiency Video Coding (HEVC) standard dealing with the multi-view texture videos plus depth map format. But current 3D-HEVC with all intra mode prediction leads to extremely high computational complexity. In this paper, we propose two techniques to speed up the encoding of depth video, including DCT decision and fast CU split decision. For DCT decision, early determination of Depth Modeling Modes (DMMs) is performed if the DCT coefficients in the lower right part of the current Coding Unit (CU) are completely zero. For fast CU split decision, current CU is split when the variance of CU is bigger than threshold. Experimental results demonstrate that the proposed decision can reduce 52.45% coding runtime on average while maintaining considerable rate-distortion (RD) performance as the original 3D-HEVC encoder.

You have full access to this open access chapter, Download conference paper PDF

An Efficient Fast CU Depth and PU Mode Decision Algorithm for HEVC

Fast intra mode decision for depth coding in 3D-HEVC

Article 29 February 2016

Fast Coding Unit (CU) Depth Decision Algorithm for High Efficiency Video Coding (HEVC)

Keywords

1 Introduction

With the rapid development of 3D video services, the efficient compression of 3D video data has become a popular research topic over the past few years. 3D-HEVC is an extension of the well-known video coding standard High Efficiency Video Coding (HEVC), and has a more complex and complete structure compared with HEVC and MV-HEVC. The MV-HEVC and 3D-HEVC both use the multi-viewpoint coding structure, while only 3D-HEVC encodes the depth sequences in term of corresponding viewpoints.

Conventional HEVC intra prediction modes were applied in almost smooth depth maps very well, but they will produce ringing effect in the sharp edge, resulting in that the intermediate synthesis view can not meet the expectations of the quality of the video. JCT-3V developed two kinds of intra partition modes for depth maps named DMM1 (Wedgelets) and DMM4 (Contour) [1]. In Wedgelets, the PB (prediction block) is divided into two SBP (sub-block partition) by a straight line. And in Contour, the separation line between the two regions cannot be easily described by a geometrical function.

However, DMMs in the 3D-HEVC mode decision process introduce a huge computational load. There has been many previous works in intra depth of 3D-HEVC [2,3,4,5,6,7,8,9,10]. Gu et al. [2, 3] terminated the unnecessary prediction modes by full RD cost calculation in 3D-HEVC. Park et al. [4] omitted unnecessary DMMs in the mode decision process based on the edge classification results. Peng [5] proposed two techniques including fast intra mode decision and fast Coding Unit (CU) size decision to speed up the encoding of depth video. In [6], Sanchez et al. applied a filter to the borders of the encoded block and determined the best positions to evaluate the DMM 1, reducing the computational effort of DMM 1 process. Zhang et al. [7] simplified the intra mode decision in 3D-HEVC depth map coding based on the way of obtaining the picture texture from the mode with Sum of Absolute Transform Difference (SATD) in rough mode decision. Ruhan [8] put forward a novel early Skip/DIS mode decision for 3D-HEVC depth encoding which aims at reducing the complexity effort of this process. The proposed solution is based on an adaptive threshold model, which takes into consideration the occurrence rate of both Skip and DIS modes. Zhang [9] applied a method for early determination of segment-wise DC coding (SDC) decision based on the hierarchical coding structure. In [10], the proposed algorithm exploits the edge orientation of the depth blocks to reduce the number of modes to be evaluated in the intra mode decision. In addition, the correlation between the Planar mode choice and the most probable modes (MPMs) selected is also exploited, to accelerate the depth intra coding.

This paper proposes propose two techniques to speed up the encoding of depth video, including DCT decision and fast CU split decision. Based on the result of analysis that the CU blocks in the smooth region usually do not perform the DMM mode, we determine DMMs are not added into the candidate modes list if the DCT coefficients in the lower right part of the current CU are completely zero. The experimental results show that the proposed decision reduces 52.45% computational runtime on average while maintaining almost the same coding performance as the original 3D-HEVC encoder.

2 DCT in Depth

Depth maps contain the information of distance. Most depth maps are composed of large nearly constant areas or slowly varying sample values (which represent object areas) and sharp edges (which represent object borders). Thus, the depth map differs from the texture map is that the depth map is composed of large smooth areas and sharp edges. For depth map coding in each CU, there are 37 intra prediction modes, including 35 conventional intra prediction modes and 2 DMMs. And in the DMMs, there are two different types of partition patterns called Wedgelets and Contour. Table 1 represents that the optimal intra prediction modes of CUs. It contains 98.21% conventional modes and 1.79% DMMs on average. It means that most of DMMs are unnecessary for depth coding [1]. As we known, Wedgelets and Contour are always performed in sharp edges. If CUs contain edges can be identified in advance, the DMMs can be decided that whether to add into the candidate modes list. It will significantly reduce the computational time.

Table 1. The optimal intra prediction modes of CUs

Full size table

DCT is a transformation associated with Fast Fourier Transform (FFT). 2D DCT is usually used in signal and image processing, especially lossy compression, which has a strong concentration of energy distribution. And DCT is usually used to distinguish smooth region from maps.

As shown in Fig. 1, Fig. 1(a)–(c) is depth maps (4 $\times $ 4), and Fig. 1(d)–(f) is DCT coefficient matrixes. We use $DCT_{lowerright}$ to represent the numbers in the lower right part of the matrix which marked in red triangle. In Fig. 1(d), $DCT_{lowerright}$ are all zero while the depth map in Fig. 1(a) is smooth. The depth map in Fig. 1(b) changes slowly and $DCT_{lowerright}$ in Fig. 1(e) are nearly zero. And in Fig. 1(f), $DCT_{lowerright}$ are not zero because there is an obvious sharp edge in depth map Fig. 1(c). It can be analyzed that for CUs with a slow gray value variation, most energy after DCT is in the upper left part which called low-frequency region. Conversely, if the CUs contain more detail texture information, more energy is scattered in the lower right part, which called high frequency region.

Based on Table 1 and the analysis that only few CUs with edges in depth maps select the best modes as DMMs for intra mode prediction, we conjecture that the $DCT_{lowerright}$, which are all zero, can be used as the basis for judging smooth region. More than 34 hundred million CUs from eight depth sequences released by JCT-3V Group are statisticed, and the results is shown in Table 2. It presents the hit rate of that depth CU chooses conventional HEVC intra mode as the best prediction mode while $DCT_{lowerright}$ are completely zero. It means that about 99% CUs select conventional modes and only less than 1% select DMMs as best intra mode while $DCT_{lowerright}$ are all zero. Thus, DCT can be used to distinguish between smooth regions and sharp edges, which decides DMMs whether to add into the candidate modes list. The current CU only calculate conventional modes with SATD and don’t add DMMs into the candidate modes list when $DCT_{lowerright}$ are all zero.

Table 2. Statistical analysis for conventional modes hit rate in 3D-HEVC intra coding

Full size table

3 Proposed Decision

Based on the observation in Sect. 2, we propose two fast coding techniques and describe them in detail in the following.

3.1 DCT Decision

We compute the DCT coefficient matrix of current CU and calculate the $DCT_{lowerright}$. If they are not zero, we believe that current CU has sharp edges and DMMs should be added into the candidate modes list for intra mode prediction.

The flowchart of the proposed DCT decision is shown in Fig. 2. If $DCT_{lowerright}$ are all zero, DMMs will not be added into the candidate modes list. Otherwise, all modes in the candidate modes list will be coded. Because of high computational complexity of traditional DCT, we use integer DCT technology of H.265/HEVC, which adopts a fast butterfly-shaped algorithm [11].

Table 3. The proportion of all zero blocks (QP42)

Full size table

However, as shown in Table 3, with the size of CUs increasing, the proportion of the blocks whose $DCT_{lowerright}$ are all zero is decreased. Balloons and Kendo reach 69.76% and 75.44% on average. Big CUs (16 $\times $ 16, 32 $\times $ 32) of GTFly achieves to 30.97% and 17.33%, and PoznanStreet even only achieves up to 14.58% and 5.77%. Small CUs (4 $\times $ 4, 8 $\times $ 8) of GTFly achieves to 86.88% and 60.05%, and PoznanStreet achieves to 64.70% and 34.67%. And the number of small CUs whose $DCT_{lowerright}$ are all zero is greatly larger than big CUs.

Meanwhile, computational complexity of big CUs is higher than the small and it’s wasteful to compute the DCT coefficient matrixes whose $DCT_{lowerright}$ are not all zero. Based on the analysis, we believe that it’s expensive to compute DCT coefficient matrixes of big CUs.

3.2 Fast CU Split Decision

Depth maps have large smooth and uniform areas. Hence, in current CU split decisions, the runtime of RD-Cost computation can be reduced and the sharp areas should be divided more carefully. Since the DCT decision is not suitable for big CUs, an early CU splitting termination algorithm is proposed.

In 2014, the variance of CU and threshold was firstly used to describe whether the CU is smooth [3]. The algorithm of Park [4] and Peng [5] also use variance as a condition, but Park modified the threshold which determines whether DMMs should be added into the candidate modes list and performed better than Gu. Peng applied threshold and variance in CU split, which shows that the variance and threshold decision is a good method to judge whether the depth map is smooth.

Above all, we choose variance and threshold decision as our fast CU split decision, as is shown in Fig. 3, $Th_{CU}=\{(max(QP\gg 3-1,3))^2-8\}\ll 2$. If $Var_{CU}$ is bigger than $Th_{CU}$, current CU should be divided into four partition CUs. Otherwise, it shows that intra Prediction of current CU performs better than partition CUs.

4 Experimental Results

In the experiments, we test eight sequences to verify the coding efficiency of the proposed decision and 300 frames are tested. All the experiments are implemented on the 3D-HEVC Test Model (HTM13.0) under all intra configuration. The encoder configuration is as follows: 3 view case, the coding treeblock has a fixed size of 64 $\times $ 64 pixels and depth range is from 0 to 3. The texture maps use the QPs at 25, 30, 35, 40 and the depth maps use 34, 39, 42, 45. The proposed algorithm is evaluated with Bjontegaard Delta bitrate (BD-rate) and Bjontegarrd Delta bitrate (BD-PSNR) [12] under all-intra configuration. BD-rate represents the total bitrates differences, BD-PSNR represents rendered PSNR change. We define Time Saving (TS) in Eq. (1), which represents reduction of total encoding time, including texture video coding and depth video coding under the all intra configuration.

$$\begin{aligned} Time\ Saving = 1-\frac{runtime\ of\ proposed\ algorithm}{runtime\ of\ orignal\ encoder\; (HTM 13.0)} \end{aligned}$$

(1)

Performance of DCT decision compared with encoder (HTM 13.0) is shown in Table 4, four sequences are tested. DCT decision only reduce 5.9% computational complexity on average while achieving 1.0 BD-rate increasing in depth coding. Not surprising, it’s a waste of time by computing DCT coefficient matrix of big CUs whose $DCT_{lowerright}$ are all zero.

Table 4. Performance of DCT decision

Full size table

Table 5 shows the performance of fast CU split decision under four video sequences. Up to 40.2% time saving is achieved. On average, the time saving is 29.9% at a cost of 0.5% bitrate increasing.

Table 5. Performance of fast CU split decision

Full size table

Table 6 presents the detail of time saving of proposed decision under different QPs for four sequences. The proposed decision combines DCT decision and Fast CU Split decision. It can be observed from Table 6 that time saving on average of proposed decision when QP is 25 are almost the same as fast CU split decision. As the QP increases, proposed decision achieves more complexity reduction of coding on average.

Table 6. The detail of Time Saving (%) of proposed decision under different QPs for four sequences

Full size table

Table 7 shows the experimental results of the coding performance and complexity reduction compared with HTM13.0. Compared with Table 5, although GTFly achieves up to 40.2% time saving in fast CU split decision and 57.0% time saving in proposed decision, it also save 16.8% runtime by DCT decision. It’s satisfied that Kendo in proposed decision achieves 46.0% time reduction rather than 22.7% in fast CU split decision. Based on the above, DCT decision can save time by deciding whether to add DMMs into the candidate modes list. And it’s obvious that DCT decision performs well in distinguish smooth maps between maps with sharp edges. And proposed decision leads to 0.03 BD-rate increasing for video and 2.71 decreasing for depth on average. It’s observed that fast CU split decision only affects time reduction rather than video quality and DCT decision plays an important role in the quality of rebuilt videos. Our proposed decision achieves 52.45% complexity reduction of coding on average. And the proposed decision save time from 37.30% to 68.60% without significant performance loss.

Table 7. Experimental results compared with original encoder

Full size table

Table 8. Comparison result

Full size table

Table 8 compares the proposed algorithm with the state-of-arts for intra coding. The BD-Rate is measured on the synthesized views. Most researches on intra prediction mode decision achieve 27.8%–37.65% time reduction with negligible loss. Our decision can save 52.45% coding runtime while maintaining almost the same RD performance as the original 3D-HEVC encoder.

5 Conclusion

In this paper, we propose a fast intra mode decision algorithm based on DCT to reduce the computational complexity of 3D-HEVC encoder. Although DCT decision encodes better in small CUs, the ratio of big CUs whose $DCT_{lowerright}$ are all zero is extremely small, which leads to high complexity of DCT. We add existing fast CU split decision into the proposed decision to divide big CUs. The recent 3D-HEVC test model (HTM 13.0) is applied to evaluate the proposed decision. The experimental results show that the proposed decision can significantly save the encoding time while maintaining nearly the same RD performance as the original 3D-HEVC encoder. Meanwhile, it performs well in comparison with the state-of-art fast algorithm for 3D-HEVC.

References

Chen, Y., Tech, G., Wegner, K., Yea, S.: Test model 11 of 3D-HEVC and MV-HEVC. JCT-3V Document, JCT3V-J1003, Geneva, CH (2015)
Google Scholar
Gu, Z., Zheng, J., Ling, N., Zhang, P.: Fast depth modeling mode selection for 3D HEVC depth intra coding. In: IEEE International Conference on Multimedia and Expo Workshops, pp. 1–4 (2013)
Google Scholar
Gu, Z., Zheng, J., Ling, N., Zhang, P. Fast bi-partition mode selection for 3D HEVC depth intra coding. In: 2014 IEEE International Conference on Multimedia and Expo (ICME), pp. 1–6. IEEE (2014)
Google Scholar
Park, C.-S.: Edge-based intramode selection for depth-map coding in 3D-HEVC. IEEE Trans. Image Process. 24(1), 155–162 (2015)
Article MathSciNet Google Scholar
Peng, K.K., Chiang, J.C., Lie, W.N.: Low complexity depth intra coding combining fast intra mode and fast CU size decision in 3D-HEVC. In: IEEE International Conference on Image Processing, pp. 1126–1130 (2016)
Google Scholar
Sanchez, G., Saldanha, M., Balota, G., Zatt, B., Porto, M., Agostini, L.: A complexity reduction algorithm for depth maps intra prediction on the 3D-HEVC. In: Visual Communications and Image Processing Conference, pp. 49–57 (2015)
Google Scholar
Zhang, M., Zhao, C., Xu, J., Bai, H.: A fast depth-map wedgelet partitioning scheme for intra prediction in 3D video coding. In: 2013 IEEE International Symposium on Circuits and Systems (ISCAS), pp. 2852–2855. IEEE (2013)
Google Scholar
Conceição, R., Avila, G., Corrêa, G., Porto, M., Zatt, B., Agostini, L.: Complexity reduction for 3D-HEVC depth map coding based on early skip and early DIS scheme. In: IEEE International Conference on Image Processing, pp. 1116–1120 (2016)
Google Scholar
Zhang, H.B., Tsang, S.H., Chan, Y.L., Fu, C.H.: Early determination of intra mode and segment-wise DC coding for depth map based on hierarchical coding structure in 3D-HEVC. In: Asia-Pacific Signal and Information Processing Association Summit and Conference, pp. 374–378 (2015)
Google Scholar
Da Silva, T.L., Agostini, L.V., Da Silva Cruz, L.A.: Complexity reduction of depth intra coding for 3D video extension of HEVC. In: Visual Communications and Image Processing Conference, pp. 229–232 (2015)
Google Scholar
Rao, K.R., Kim, D.N., Hwang, J.-J.: Fast Fourier Transform-Algorithms and Applications. Springer, 10.1007/978-1-4020-6629-0 (2011). https://doi.org/10.1007/978-1-4020-6629-0
Book MATH Google Scholar
Bjontegarrd, G.: Calculation of average PSNR differences between RD-curves. VCEG-M33 (2001)
Google Scholar

Download references

Acknowledgements

This work is supported by the National Natural Science Foundation of China (No. 61471150, No. 61501402, No. U1509216), the Key Program of Zhejiang Provincial Natural Science Foundation of China (No. LZ14F020003). Thanks for support and assistance from Key Laboratory of Network Multimedia Technology of Zhejiang Province.

Author information

Authors and Affiliations

School of Computer Science and Technology, Hangzhou Dianzi University, Hangzhou, China
Renbin Yang, Guojun Dai, Hua Zhang, Wenhui Zhou & Shifang Yu
Key Laboratory of Network Multimedia Technology of Zhejiang Province, Zhejiang University, Hangzhou, China
Hua Zhang
Zhejiang SCI-Tech University, Hangzhou, China
Jie Feng

Authors

Renbin Yang
View author publications
You can also search for this author in PubMed Google Scholar
Guojun Dai
View author publications
You can also search for this author in PubMed Google Scholar
Hua Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Wenhui Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Shifang Yu
View author publications
You can also search for this author in PubMed Google Scholar
Jie Feng
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Hua Zhang .

Editor information

Editors and Affiliations

Sun Yat-sen University, Guangzhou, China
Jian-Huang Lai
Institute of Automation, Chinese Academy of Sciences, Beijing, China
Cheng-Lin Liu
Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China
Xilin Chen
Tsinghua University, Beijing, China
Jie Zhou
Institute of Automation, Chinese Academy of Sciences, Beijing, China
Tieniu Tan
Xi'an Jiaotong University, Xi'an, China
Nanning Zheng
Peking University, Beijing, China
Hongbin Zha

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Yang, R., Dai, G., Zhang, H., Zhou, W., Yu, S., Feng, J. (2018). Fast Depth Intra Mode Decision Based on DCT in 3D-HEVC. In: Lai, JH., et al. Pattern Recognition and Computer Vision. PRCV 2018. Lecture Notes in Computer Science(), vol 11256. Springer, Cham. https://doi.org/10.1007/978-3-030-03398-9_20

Download citation

DOI: https://doi.org/10.1007/978-3-030-03398-9_20
Published: 02 November 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-03397-2
Online ISBN: 978-3-030-03398-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Fast Depth Intra Mode Decision Based on DCT in 3D-HEVC

Abstract

Similar content being viewed by others

An Efficient Fast CU Depth and PU Mode Decision Algorithm for HEVC

Fast intra mode decision for depth coding in 3D-HEVC

Fast Coding Unit (CU) Depth Decision Algorithm for High Efficiency Video Coding (HEVC)

Keywords

1 Introduction

2 DCT in Depth

3 Proposed Decision

3.1 DCT Decision

3.2 Fast CU Split Decision

4 Experimental Results

5 Conclusion

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Fast Depth Intra Mode Decision Based on DCT in 3D-HEVC

Abstract

Similar content being viewed by others

An Efficient Fast CU Depth and PU Mode Decision Algorithm for HEVC

Fast intra mode decision for depth coding in 3D-HEVC

Fast Coding Unit (CU) Depth Decision Algorithm for High Efficiency Video Coding (HEVC)

Keywords

1 Introduction

2 DCT in Depth

3 Proposed Decision

3.1 DCT Decision

3.2 Fast CU Split Decision

4 Experimental Results

5 Conclusion

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation