Super-resolution reconstructed video coding scheme based on inter-frame information

Cao, Yinglie; Xu, Haoqi; Zhou, Zhiheng; Yue, Wanlin; Zhuge, Shang; Li, Fei

doi:10.1007/s11042-023-17441-z

Published: 31 October 2023

Volume 83, pages 47847–47863, (2024)

Multimedia Tools and Applications

Yinglie Cao ORCID: orcid.org/0000-0002-5821-7793¹,
Haoqi Xu²,
Zhiheng Zhou²,
Wanlin Yue²,
Shang Zhuge² &
…
Fei Li³

Abstract

In recent years, the explosive growth of video data has posed a challenge to the performance of traditional video coding frameworks. The industry is thus faced with the pressing issue of how to transmit high-quality videos under limited bandwidth conditions. To address this challenge, a super-resolution reconstruction video coding scheme has been proposed, which combines traditional coding frameworks with deep learning-based super-resolution technology. By reducing the data coding volume through pre-coding downsampling and reconstructing videos through post-decoding upsampling, this scheme shows great potential in solving the aforementioned problems. However, previous super-resolution reconstruction video coding schemes have failed to effectively utilize the inter-frame correlation of video sequences, which limits the coding efficiency of the scheme. To overcome this limitation, this paper proposes an upsampling reconstruction network based on inter-frame information exploration. Experimental results show that, compared with the HEVC standard, the proposed scheme achieves a reduction of 11.05%, 11.3%, and 8.84% in the BD-BR index under the All-Intra, Low Delay P, and Random Access coding configurations, respectively, demonstrating higher coding efficiency than the previous human-designed scheme.
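The BD-BR figures quoted above are Bjontegaard delta rates: the average bitrate difference between two rate-distortion curves at equal quality. The following is a minimal sketch of the commonly used calculation (cubic fit of log-bitrate against PSNR, integrated over the shared PSNR range). It is not taken from the paper; the function name `bd_rate` and the rate/PSNR points are hypothetical, made-up values used only for illustration.

```python
import numpy as np


def bd_rate(rate_anchor, psnr_anchor, rate_test, psnr_test):
    """Bjontegaard delta rate in percent of a test codec against an anchor.

    A negative result means the test codec needs less bitrate than the
    anchor to reach the same PSNR. Each curve needs at least four points.
    """
    log_ra = np.log(np.asarray(rate_anchor, dtype=float))
    log_rt = np.log(np.asarray(rate_test, dtype=float))
    pa = np.asarray(psnr_anchor, dtype=float)
    pt = np.asarray(psnr_test, dtype=float)

    # Fit log-bitrate as a cubic polynomial of PSNR for each curve.
    fit_a = np.polyfit(pa, log_ra, 3)
    fit_t = np.polyfit(pt, log_rt, 3)

    # Only the PSNR interval covered by both curves is compared.
    lo = max(pa.min(), pt.min())
    hi = min(pa.max(), pt.max())

    # Definite integrals of both fitted curves over [lo, hi].
    int_a = np.polyval(np.polyint(fit_a), hi) - np.polyval(np.polyint(fit_a), lo)
    int_t = np.polyval(np.polyint(fit_t), hi) - np.polyval(np.polyint(fit_t), lo)

    # Mean log-rate difference, converted back to a percentage.
    avg_log_diff = (int_t - int_a) / (hi - lo)
    return (np.exp(avg_log_diff) - 1.0) * 100.0


# Hypothetical rate (kbit/s) and PSNR (dB) points, NOT results from the paper.
anchor_rate = [1000.0, 2000.0, 4000.0, 8000.0]
anchor_psnr = [32.0, 35.0, 38.0, 41.0]
# A test codec that reaches the same PSNR with 10% less bitrate everywhere.
test_rate = [0.9 * r for r in anchor_rate]

print(round(bd_rate(anchor_rate, anchor_psnr, test_rate, anchor_psnr), 4))
```

With these made-up points the test curve is the anchor curve shifted by a constant factor of 0.9 in bitrate, so the result is very close to -10, read as a 10% bitrate saving at equal PSNR. Real evaluations would use four rate points per sequence from the actual encoder runs under each coding configuration.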

Availability of data and materials

The data that support the findings of this study are available from the corresponding author upon reasonable request.

Notes

https://hevc.hhi.fraunhofer.de/

Acknowledgements

This work is supported by the National Key Research and Development Program of China (2022YFF0607000), the National Natural Science Foundation of China (61871188), the Guangdong Basic and Applied Basic Research Foundation (2023A1515010993), the Guangdong Provincial Key Laboratory of Human Digital Twin (2022B1212010004), and the Guangzhou City Science and Technology Research Projects (2023B01J0011).

Author information

Authors and Affiliations

Department, Guangzhou City University of Technology, Guangzhou, 510030, China
Yinglie Cao
Department, South China University of Technology, Guangzhou, 510641, China
Haoqi Xu, Zhiheng Zhou, Wanlin Yue & Shang Zhuge
Department, Pengcheng Lab., Shenzhen, 518055, China
Fei Li

Corresponding author

Correspondence to Yinglie Cao.

Ethics declarations

Conflicts of interest

We declare that there is no conflict of interest regarding this submission.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

About this article

Cite this article

Cao, Y., Xu, H., Zhou, Z. et al. Super-resolution reconstructed video coding scheme based on inter-frame information. Multimed Tools Appl 83, 47847–47863 (2024). https://doi.org/10.1007/s11042-023-17441-z

Received: 22 May 2023
Revised: 25 August 2023
Accepted: 03 October 2023
Published: 31 October 2023
Issue Date: May 2024
DOI: https://doi.org/10.1007/s11042-023-17441-z

Keywords
