skip to main content
10.1145/3664647.3681308acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

Blind Video Bit-Depth Expansion

Published: 28 October 2024 Publication History

Abstract

With the rapid development of high-bit-depth display devices, bit-depth expansion (BDE) algorithms that extend low-bit-depth images to high-bit-depth images have received increasing attention. Due to the sensitivity of bit-depth distortions to tiny numerical changes in the least significant bits, the nuanced degradation differences in the training process may lead to varying degradation data distributions, causing the trained models to overfit specific types of degradations. This paper focuses on the problem of blind video BDE, proposing a degradation prediction and embedding framework, and designing a video BDE network based on a recurrent structure and dual-frame alignment fusion. Experimental results demonstrate that the proposed model can outperform some state-of-the-art (SOTA) models in terms of banding artifact removal and color correction, avoiding overfitting to specific degradations and obtaining better generalization ability across multiple datasets. https://github.com/duanpanjun/BVBDE

References

[1]
Gary Baugh, Anil Kokaram, and François Pitié. 2014. Advanced video debanding. In Proceedings of the European Conference on Visual Media Production. 1--10.
[2]
Junyoung Byun, Kyujin Shim, and Changick Kim. 2019. BitNet: Learning-based bit-depth expansion. In Asian Conference on Computer Vision. Springer, 67--82.
[3]
Zheng Chen, Yulun Zhang, Jinjin Gu, Linghe Kong, Xiaokang Yang, and Fisher Yu. 2023. Dual aggregation transformer for image super-resolution. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 12312--12321.
[4]
Cheuk-Hong Cheng, Oscar C Au, Ngai-Man Cheung, Chun-Hung Liu, and Ka-Yue Yip. 2009. Low color bit-depth image enhancement by contour-region dithering. In IEEE Pacific Rim Conference on Communications, Computers and Signal Processing. IEEE, 666--670.
[5]
Deband 2021. FFmpeg Filter Deband. Retrieved Aug 31, 2021 from https://ffmpeg. org/ffmpeg-filters.html#deband
[6]
Woo Kyoung Han, Byeonghun Lee, Sang Hyun Park, and Kyong Hwan Jin. 2023. Abcd: Arbitrary bitwise boefficient for de-quantization. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 5876--5885.
[7]
Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 770--778.
[8]
Andrew G Howard, Menglong Zhu, Bo Chen, Dmitry Kalenichenko, Weijun Wang, Tobias Weyand, Marco Andreetto, and Hartwig Adam. 2017. Mobilenets: Efficient convolutional neural networks for mobile vision applications. arXiv:1704.04861
[9]
Qin Huang, Hui Yong Kim, Wen-Jiin Tsai, Se Yoon Jeong, Jin Soo Choi, and CC Jay Kuo. 2016. Understanding and removal of false contour in hevc compressed images. IEEE Transactions on Circuits and Systems for Video Technology 28, 2 (2016), 378--391.
[10]
Xiang Li, Jiangxin Dong, Jinhui Tang, and Jinshan Pan. 2023. Dlgsanet: Lightweight dynamic local and global self-attention networks for image superresolution. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 12792--12801.
[11]
Jing Liu, Zhiwei Fan, Ziwen Yang, Yuting Su, and Xiaokang Yang. 2023. Multistage spatio-temporal fusion network for fast and accurate video bit-depth enhancement. IEEE Transactions on Multimedia 26 (2023), 2444--2455.
[12]
Jing Liu, Pingping Liu, Yuting Su, Peiguang Jing, and Xiaokang Yang. 2019. Spatiotemporal symmetric convolutional neural network for video bit-depth enhancement. IEEE Transactions on Multimedia 21, 9 (2019), 2397--2406.
[13]
Jing Liu, Wanning Sun, and Yutao Liu. 2017. Bit-depth enhancement via convolutional neural network. In International Forum on Digital TV and Wireless Multimedia Communications. Springer, 255--264.
[14]
Jing Liu, Wanning Sun, Yuting Su, Peiguang Jing, and Xiaokang Yang. 2019. Becalf: Bit-depth enhancement by concatenating all level features of dnn. IEEE Transactions on Image Processing 28, 10 (2019), 4926--4940.
[15]
Jing Liu, Xin Wen, Weizhi Nie, Yuting Su, Peiguang Jing, and Xiaokang Yang. 2021. Residual-guided multiscale fusion network for bit-depth enhancement. IEEE Transactions on Circuits and Systems for Video Technology 32, 5 (2021), 2773--2786.
[16]
Jing Liu, Ziwen Yang, Yuting Su, and Xiaokang Yang. 2021. Tanet: Target attention network for video bit-depth enhancement. IEEE Transactions on Multimedia 24 (2021), 4212--4223.
[17]
Jing Liu, Guangtao Zhai, Anan Liu, Xiaokang Yang, Xibin Zhao, and Chang Wen Chen. 2018. IPAD: Intensity potential for adaptive de-quantization. IEEE Transactions on Image Processing 27, 10 (2018), 4860--4872.
[18]
Yuqing Liu, Qi Jia, Jian Zhang, Xin Fan, Shanshe Wang, Siwei Ma, and Wen Gao. 2022. Learning weighting map for bit-depth expansion within a rational range. arXiv:2204.12039
[19]
Seungjun Nah, Sungyong Baik, Seokil Hong, Gyeongsik Moon, Sanghyun Son, Radu Timofte, and Kyoung Mu Lee. 2019. Ntire 2019 challenge on video deblurring and super-resolution: Dataset and study. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops. 0--0.
[20]
Weizhi Nie, Xin Wen, Jing Liu, and Yuting Su. 2022. Iterative residual feature refinement network for bit-depth enhancement. IEEE Signal Processing Letters 29 (2022), 1387--1391.
[21]
Abhijith Punnappurath and Michael S Brown. 2021. A little bit more: Bitplanewise bit-depth recovery. IEEE Transactions on Pattern Analysis and Machine Intelligence 44, 12 (2021), 9718--9724.
[22]
Long Sun, Jiangxin Dong, Jinhui Tang, and Jinshan Pan. 2023. Spatially-adaptive feature modulation for efficient image super-resolution. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 13190--13199.
[23]
Zhengzhong Tu, Jessie Lin, Yilin Wang, Balu Adsumilli, and Alan C Bovik. 2020. Adaptive debanding filter. IEEE Signal Processing Letters 27 (2020), 1715--1719.
[24]
Zhengzhong Tu, Jessie Lin, Yilin Wang, Balu Adsumilli, and Alan C Bovik. 2020. Bband index: A no-reference banding artifact predictor. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing. IEEE, 2712--2716.
[25]
Robert A Ulichney and Shiufun Cheung. 1998. Pixel bit-depth increase by bit replication. In Color Imaging: Device-Independent Color, Color Hardcopy, and Graphic Arts III, Vol. 3300. SPIE, 232--241.
[26]
Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, Lukasz Kaiser, and Illia Polosukhin. 2017. Attention is all you need. Advances in Neural Information Processing Systems 30 (2017).
[27]
Pengfei Wan, Oscar C Au, Ketan Tang, Yuanfang Guo, and Lu Fang. 2012. From 2d extrapolation to 1d interpolation: Content adaptive image bit-depth expansion. In IEEE International Conference on Multimedia and Expo. IEEE, 170--175.
[28]
Xintao Wang, Kelvin CK Chan, Ke Yu, Chao Dong, and Chen Change Loy. 2019. Edvr: Video restoration with enhanced deformable convolutional networks. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops. 0--0.
[29]
Xin Wen, Weizhi Nie, Jing Liu, and Yuting Su. 2023. Mrft: Multiscale recurrent fusion transformer based prior knowledge for bit-depth enhancement. IEEE Transactions on Circuits and Systems for Video Technology 33, 10 (2023), 5562-- 5575.
[30]
Tianfan Xue, Baian Chen, Jiajun Wu, Donglai Wei, and William T Freeman. 2019. Video enhancement with task-oriented flow. International Journal of Computer Vision 127 (2019), 1106--1125.
[31]
Syed Waqas Zamir, Aditya Arora, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, and Ming-Hsuan Yang. 2022. Restormer: Efficient transformer for high-resolution image restoration. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 5728--5739.
[32]
Yanni Zhang, Qiang Li, Miao Qi, Di Liu, Jun Kong, and Jianzhong Wang. 2023. Multi-scale frequency separation network for image deblurring. IEEE Transactions on Circuits and Systems for Video Technology 33, 10 (2023), 5525--5537.
[33]
Yang Zhao, Wei Jia, Yuan Chen, and Ronggang Wang. 2022. Fast blind decontouring network. IEEE Transactions on Circuits and Systems for Video Technology 33, 2 (2022), 478--490.
[34]
Yang Zhao, Ronggang Wang, Yuan Chen, Wei Jia, Xiaoping Liu, and Wen Gao. 2020. Lighter but efficient bit-depth expansion network. IEEE Transactions on Circuits and Systems for Video Technology 31, 5 (2020), 2063--2069.
[35]
Yang Zhao, Ronggang Wang, Wei Jia, Wangmeng Zuo, Xiaoping Liu, and Wen Gao. 2019. Deep reconstruction of least significant bits for bit-depth expansion. IEEE Transactions on Image Processing 28, 6 (2019), 2847--2859.

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
MM '24: Proceedings of the 32nd ACM International Conference on Multimedia
October 2024
11719 pages
ISBN:9798400706868
DOI:10.1145/3664647
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 28 October 2024

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. false contour removal
  2. quantization and dequantization.
  3. video bit-depth expansion

Qualifiers

  • Research-article

Funding Sources

  • The Fundamental Research Funds for the Central Universities
  • The Key R&D and Transformation Program of Qinghai Province
  • The National Natural Science Foundation of China

Conference

MM '24
Sponsor:
MM '24: The 32nd ACM International Conference on Multimedia
October 28 - November 1, 2024
Melbourne VIC, Australia

Acceptance Rates

MM '24 Paper Acceptance Rate 1,150 of 4,385 submissions, 26%;
Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 105
    Total Downloads
  • Downloads (Last 12 months)105
  • Downloads (Last 6 weeks)62
Reflects downloads up to 27 Feb 2025

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media