Abstract
By examining the current block-based lossy video coding process and the compressed videos it produces, this paper identifies two distinctive characteristics of compressed video: quality fluctuation and pixel deficiency. We use a 3D convolutional neural network (3D-CNN) to make full use of the limited temporal and spatial information in compressed video, and build a compressed video quality enhancement network (CVQENet) to improve compressed video quality. Experimental results show that, compared with videos encoded by High Efficiency Video Coding (HEVC/H.265), the mean Peak Signal-to-Noise Ratio (PSNR) of the enhanced videos improves by 0.4652 dB under the Low Delay (LD) configuration with the Quantization Parameter (QP) set to 37.
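The reported gain is a difference in mean PSNR between enhanced and HEVC-compressed video, both measured against the uncompressed source. As a minimal sketch of how such a figure could be computed, the following assumes 8-bit samples (peak value 255), frames given as flat sequences of pixel values, and a per-frame average of the gain. The function names `psnr` and `mean_delta_psnr` are hypothetical, and the paper's exact evaluation protocol (for example, which colour channel is measured) is not stated in the abstract.

```python
import math
from typing import Sequence


def psnr(reference: Sequence[float], test: Sequence[float], peak: float = 255.0) -> float:
    """PSNR in dB between two equally sized frames given as flat pixel sequences.

    Assumes 8-bit content by default (peak = 255); this is an assumption,
    not something the abstract specifies.
    """
    if len(reference) != len(test) or len(reference) == 0:
        raise ValueError("frames must be non-empty and the same size")
    # Mean squared error over all pixels of the frame.
    mse = sum((r - t) ** 2 for r, t in zip(reference, test)) / len(reference)
    if mse == 0:
        return math.inf  # identical frames have unbounded PSNR
    return 10.0 * math.log10(peak ** 2 / mse)


def mean_delta_psnr(
    raw_frames: Sequence[Sequence[float]],
    compressed_frames: Sequence[Sequence[float]],
    enhanced_frames: Sequence[Sequence[float]],
    peak: float = 255.0,
) -> float:
    """Average per-frame PSNR gain (dB) of enhanced frames over compressed frames.

    Both PSNR values are measured against the raw (uncompressed) frame.
    """
    gains = [
        psnr(raw, enh, peak) - psnr(raw, comp, peak)
        for raw, comp, enh in zip(raw_frames, compressed_frames, enhanced_frames)
    ]
    if not gains:
        raise ValueError("at least one frame is required")
    return sum(gains) / len(gains)


# Toy one-frame example: the compressed frame is off by 4 per pixel,
# the enhanced frame by 2, so the MSE drops from 16 to 4.
raw = [[100, 100, 100, 100]]
compressed = [[104, 104, 104, 104]]
enhanced = [[102, 102, 102, 102]]
gain_db = mean_delta_psnr(raw, compressed, enhanced)
```

In this toy case the gain equals 10·log10(16/4), since the peak term cancels in the difference. A positive value means the enhanced frames are closer to the source than the compressed ones.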







Acknowledgements
This research is supported by the Beijing Natural Science Foundation (Grant No. 4212001) and by the Key R&D and Transformation Program of Qinghai Province (Grant No. 2022-QY-205).
About this article
Cite this article
Chen, S., Liu, P., Zhang, Y. et al. Compressed video quality enhancement algorithm based on 3D-CNNs. Wireless Netw 30, 6125–6133 (2024). https://doi.org/10.1007/s11276-023-03392-8
DOI: https://doi.org/10.1007/s11276-023-03392-8