research-article

Blind Video Bit-Depth Expansion

Authors:

Ronggang Wang

MM '24: Proceedings of the 32nd ACM International Conference on Multimedia

Pages 9904 - 9912

https://doi.org/10.1145/3664647.3681308

Published: 28 October 2024

Abstract

With the rapid development of high-bit-depth display devices, bit-depth expansion (BDE) algorithms that extend low-bit-depth images to high-bit-depth images have received increasing attention. Due to the sensitivity of bit-depth distortions to tiny numerical changes in the least significant bits, the nuanced degradation differences in the training process may lead to varying degradation data distributions, causing the trained models to overfit specific types of degradations. This paper focuses on the problem of blind video BDE, proposing a degradation prediction and embedding framework, and designing a video BDE network based on a recurrent structure and dual-frame alignment fusion. Experimental results demonstrate that the proposed model can outperform some state-of-the-art (SOTA) models in terms of banding artifact removal and color correction, avoiding overfitting to specific degradations and obtaining better generalization ability across multiple datasets.

Repository: https://github.com/duanpanjun/BVBDE
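To make the point about least-significant-bit sensitivity concrete, the following minimal Python sketch contrasts two plausible ways of producing a low-bit-depth value (truncation and rounding) with two classical, non-learned expansions (zero padding and bit replication). It is an illustration only: the function names are hypothetical, the 8-bit to 4-bit setting is chosen for readability, and none of this is the degradation model or the network proposed in the paper.

```python
# Illustrative sketch only; names and the 8-bit -> 4-bit setting are assumptions,
# not the degradation pipeline or model described in the paper.

def truncate(value: int, high_bits: int, low_bits: int) -> int:
    """Reduce bit depth by dropping the least significant bits."""
    return value >> (high_bits - low_bits)


def round_quantize(value: int, high_bits: int, low_bits: int) -> int:
    """Reduce bit depth by rounding to the nearest low-bit-depth level."""
    shift = high_bits - low_bits
    if shift == 0:
        return value
    max_low = (1 << low_bits) - 1
    # Add half a quantization step before shifting, then clip to the valid range.
    return min((value + (1 << (shift - 1))) >> shift, max_low)


def zero_pad(value: int, low_bits: int, high_bits: int) -> int:
    """Naive expansion: shift left and fill the new LSBs with zeros."""
    return value << (high_bits - low_bits)


def bit_replicate(value: int, low_bits: int, high_bits: int) -> int:
    """Naive expansion: repeat the bit pattern to fill the new LSBs."""
    result = 0
    remaining = high_bits
    while remaining >= low_bits:
        result = (result << low_bits) | value
        remaining -= low_bits
    if remaining > 0:
        # Fill the last few bits with the most significant bits of the value.
        result = (result << remaining) | (value >> (low_bits - remaining))
    return result


if __name__ == "__main__":
    pixel = 200  # one 8-bit sample
    for name, degrade in (("truncate", truncate), ("round", round_quantize)):
        low = degrade(pixel, 8, 4)
        print(name, low, zero_pad(low, 4, 8), bit_replicate(low, 4, 8))

    # A smooth 8-bit ramp collapses to a handful of levels, which shows up as banding.
    ramp = range(256)
    print(len({truncate(v, 8, 4) for v in ramp}), "distinct levels after truncation")
```

For the sample value 200, truncation yields the 4-bit level 12 while rounding yields 13, so the same source pixel reaches a model as two different inputs depending on how the low-bit-depth data was produced. A model trained on only one of these conventions may not transfer to the other, which is the kind of mismatch a blind method has to handle.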

Index Terms

1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision tasks

Published In

MM '24: Proceedings of the 32nd ACM International Conference on Multimedia

October 2024

11719 pages

ISBN: 9798400706868

DOI: 10.1145/3664647

General Chairs:
Jianfei Cai (Monash University, Australia), Mohan Kankanhalli (NUS, Singapore), Balakrishnan Prabhakaran (UT Dallas, USA), Susanne Boll (University of Oldenburg, Germany)

Program Chairs:
Ramanathan Subramanian (University of Canberra & IIT Ropar, Australia), Liang Zheng (Australian National University, Australia), Vivek K. Singh (Rutgers University, USA), Pablo Cesar (Centrum Wiskunde & Informatica, Netherlands), Lexing Xie (Australian National University, Australia), Dong Xu (University of Hong Kong, Hong Kong)

Copyright © 2024 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

SIGMM: ACM Special Interest Group on Multimedia

Publisher

Association for Computing Machinery

New York, NY, United States

Funding Sources

The Fundamental Research Funds for the Central Universities
The Key R&D and Transformation Program of Qinghai Province
The National Natural Science Foundation of China

Conference

MM '24: The 32nd ACM International Conference on Multimedia

October 28 - November 1, 2024

Melbourne VIC, Australia

Acceptance Rates

MM '24 Paper Acceptance Rate 1,150 of 4,385 submissions, 26%;

Overall Acceptance Rate 2,145 of 8,556 submissions, 25%
