Abstract
H.265/HEVC (high efficiency video coding) adopts in-loop filters to reduce artifacts such as blocking artifacts, ringing artifacts, but those above-mentioned artifacts are still obvious in the case of large quantization step. This paper presents a novel in-loop filtering mechanism based on three-dimensional (3D) sub-bands in adaptive group of frames and convolutional neural network (CNN) models for further improvement of in-loop filtering capability. Firstly, the video frame sequence is adaptively grouped; then, a group of video frames is decomposed into frequency sub-bands by 3D wavelet transform; afterward, frequency sub-bands with different types are filtered by corresponding CNN model; finally, the group of video frames is synthesized by 3D wavelet inverse transform. We apply the nonlinear mapping ability of CNN into the modification of wavelet coefficients. The sub-bands of high frequency and low frequency are filtered by four offline CNN models with different directional characteristics trained with training data generated by 3D wavelet transform, respectively. The details of the high frequency part are enhanced, while the quality of low-pass image is promoted, so that those artifacts can be effectively alleviated. By means of comparing experiments, the subjective and objective results show that our proposed method has better filtering performance than the in-loop filtering mechanism in HM16.18; especially the video frame quality can be effectively improved in the case of large quantization step.
Similar content being viewed by others
References
Sullivan, G.J., Ohm, J.R., Han, W.J., Wiegand, T.: Overview of the high efficiency video coding (HEVC) standard. IEEE Trans. Circuits Syst. Video Technol. 22(12), 1649–1668 (2013)
Norkin, A., Bjontegaard, G., Fuldseth, A., Narroschke, M., Ikeda, M., Andersson, K., Zhou, M., Auwera, G.V.D.: HEVC deblocking filter. IEEE Trans. Circuits Syst. Video Technol. 22(12), 1746–1754 (2013)
Fu, C.M., Alshina, E., Alshin, A., Huang, Y.W., Chen, C.Y., Tsai, C.Y., Hsu, C.W., Lei, S.M., Park, J.H., Han, W.J.: Sample adaptive offset in the HEVC standard. IEEE Trans. Circuits Syst. Video Technol. 22(12), 1755–1764 (2013)
Donoho, D.L.: Denoising via soft thresholding. IEEE Trans. Inf. Theory 41, 613–627 (1992)
Pang, K.K., Tan, T.K.: Optimum loop filter in hybrid coders. IEEE Trans. Circuits Syst. Video Technol. 4(2), 158–167 (2002)
Kim, S.D., Yi, J., Ra, J.B.: Deblocking filter with two separate modes in block-based video coding. In: Photonics West ‘98 Electronic Imaging, pp. 156–160 (1998)
Watanabe, T., Wada, N., Yasuda, G., Tanizawa, A., Chujoh, T., Yamakage, T.: In-loop filter using block-based filter control for video coding. In: IEEE International Conference on Image Processing, pp. 1013–1016 (2010)
Zhang, X., Lin, W., Wang, S., Ma, S.: Nonlocal adaptive in-loop filter via content-dependent soft-thresholding for HEVC. In: IEEE International Symposium on Multimedia, pp. 465–470 (2016)
Dong, C., Chen, C.L., He, K., Tang, X.: Image Super-resolution using deep convolutional networks. IEEE Trans. Pattern Anal. Mach. Intell. 38(2), 295–307 (2016)
Kim, J., Lee, J.K., Lee, K.M.: Accurate image super-resolution using very deep convolutional networks. In: Computer Vision and Pattern Recognition, pp. 1646–1654 (2016)
Park, W.S., Kim, M.: CNN-based in-loop filtering for coding efficiency improvement. In: Image, Video, and Multidimensional Signal Processing Workshop, pp. 1–5 (2016)
Zhang, Y., Shen, T., Ji, X., Zhang, Y., Xiong, R., Dai, Q.: Residual highway convolutional neural networks for in-loop filtering in HEVC. IEEE Trans. Image Process. Publ. IEEE Signal Process. Soc. 27(8), 3827 (2018)
Dai, Y., Liu, D., Wu, F.: A convolutional neural network approach for post-processing in hevc intra coding. In: International Conference on Multimedia Modeling, pp. 28–39 (2017)
Pearlman, W.A., Kim, B.J., Xiong, Z.: Embedded video subband coding with 3D SPIHT. Wavelet Image Video Compress. 450, 397–432 (2002)
Qing, L., Zeng, W.: Context-adaptive modeling for wavelet-domain distributed video coding. IEEE Multimedia 21(4), 84–93 (2014)
Shapiro, J.M.: Embedded image coding using zerotrees of wavelet coefficients. IEEE Trans. Signal Process. 41(12), 3445–3462 (2002)
Christopoulos, C., Skodras, A., Ebrahimi, T.: The JPEG2000 still image coding system: an overview. IEEE Trans. Consum. Electron. 46(4), 1103–1127 (2000)
Chen, G., Zhu, W.P., Xie, W.: Wavelet-based image denoising using three scales of dependency. IET Image Process. 6(6), 756–760 (2012)
Peng, Y.H.: An improved thresholding method in wavelet transform domain for denosing. J. China Inst. Commun. 24, 114–124 (2004)
Om, H., Biswas, M.: An improved image denoising method based on wavelet thresholding. J. Signal Inf. Process. 3(1), 109–116 (2012)
Donoho, D.L., Johnstone, I.M.: Adapting to unknown smoothness via wavelet shrinkage. Publ. Am. Stat. Assoc. 90(432), 1200–1224 (1995)
Chang, S.G., Yu, B., Vetterli, M.: Adaptive wavelet thresholding for image denoising and compression. IEEE Trans. Image Process. Publ. IEEE Signal Process. Soc. 9(9), 1532 (2000)
Itut, R.P.: Subjective video quality assessment methods for multimedia applications. Recomm. ITU-T P.910 12(2), 3665–3673 (1999)
Mallat, S.G.: A theory for multiresolution signal decomposition: the wavelet representation. IEEE Trans. Pattern Anal. Mach. Intell. 11(7), 674–693 (1989). https://doi.org/10.1109/34.192463
Jia, Y., Shelhamer, E., Donahue, J., Karayev, S., Long, J., Girshick, R., Guadarrama, S., Darrell, T.: Caffe: convolutional architecture for fast feature embedding. In: ACM International Conference on Multimedia, pp. 675–678 (2014)
HM version 16.18. https://hevc.hhi.fraunhofer.de/svn/svn_HEVCSoftware/tags/HM-16.18/
Bossen, F.: Common test conditions and software reference configurations. In: Joint Collaborative Team on Video Coding (2011)
Bjontegaard, G.: Calculation of average PSNR differences between RD curves. In: VCEG Meeting, Austin, Texas, USA (2001)
Wang, Z., Simoncelli, E.P., Bovik, A.C.: Multiscale structural similarity for image quality assessment. In: Proceedings of the IEEE Asilomar Conference on Signals, Systems and Computers, vol. 1392, pp. 1398–1402 (2003)
Acknowledgements
This work was supported by the National Natural Science Foundation of China (Nos. 61671141, 61401081), the Liaoning Provincial Natural Science Foundation of China (No. 20180551007), and the Ministry of Education-China Mobile Scientific Research Funds (No. MCM20150103).
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations
Rights and permissions
About this article
Cite this article
Zhang, D., Lei, W., Zhang, W. et al. A novel in-loop filtering mechanism of HEVC based on 3D sub-bands and CNN processing. SIViP 13, 1045–1053 (2019). https://doi.org/10.1007/s11760-019-01444-1
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11760-019-01444-1