Abstract
With the popularization of smart terminals and multimedia technologies, the video coding standard — H.264/Advanced Video Coding (AVC) and H.265/High Efficiency Video Coding (HEVC) have been unable to meet the needs of various high-definition videos, so the next generation standard —H.266/ Versatile Video Coding (VVC) is under study. In the actual transmission of a video communication channel, rate control plays an important role. However, HEVC rate control based on R-λ model does not adequately take into account the characteristics of the human visual system (HVS). Also, the convergence speed of Least Mean Square (LMS) in HEVC is too slow. In this paper, an improved R-λ(Lambda) rate control model based on joint spatial-temporal domain information and HVS characteristics (IRLRC) is established. In this model, the joint spatial-temporal domain information based on gradient information is used to guide bit allocation for frame and CTU level, where the temporal coefficient is corrected adaptively. What’s more, the Broyden Fletcher Goldfarb Shanno (BFGS) algorithm is introduced, which speeds up the convergence of the proposed model. The experimental results have clearly shown that the proposed IRLRC can achieve better coding performance than HEVC, VVC and other models. In particular, the video sequence based on the proposed IRLRC can meet the needs of HVS and achieve higher optimization for subjective quality.
Similar content being viewed by others
References
Choi H, Nam J, Yoo J, Sim D, and Bajić I (2012) Rate control based on unified RQ model for HEVC. JCTVC of ITU-T SG16 WP3 and ISO/IEC JTC1/SC29/WG11, JCT-VC H0213 (m23088), San José, CA, USA
Feng Z, Liu P, Jia K (2018) Visual Perception Based Rate Control Algorithm for HEVC. In: Journal of Physics: Conference Series. vol 1. IOP Publishing, p 012041. doi: https://doi.org/10.1088/1742-6596/960/1/012041
Gao J, Xue W (2005) Analysis of the convergence rate on Newton iteration method. J Zhengzhou Univ Light Industry (Nat Sci) 20(4):100–102
Guo Y, Li B, Sun S, Xu J (2015) Rate control for screen content coding in HEVC. In: 2015 IEEE International Symposium on Circuits and Systems (ISCAS). IEEE, pp 1118–1121. https://doi.org/10.1109/ISCAS.2015.7168834
Hongwei G, Ce Z, Shengxi L, Yanbo G (2018) Optimal bit allocation at frame level for rate control in HEVC. IEEE Trans Broadcast 65(2):270–281. https://doi.org/10.1109/TBC.2018.2847445
Lee JS, Ebrahimi T (2012) Perceptual video compression: a survey. IEEE J Select Topics Signal Process 6(6):684–697. https://doi.org/10.1109/JSTSP.2012.2215006
Li Y, Jia H, Zhu C, Yang M, Xie X, Gao W (2014) Window-based rate control for video quality optimization with a novel inter-dependent rate-distortion model. Signal Process Image Commun 29(10):1046–1062. https://doi.org/10.1016/j.image.2014.09.004
Li S, Xu M, Deng X, Wang Z (2015) Weight-based R-λ rate control for perceptual HEVC coding on conversational videos. Signal Process Image Commun 38:127–140. https://doi.org/10.1016/j.image.2015.04.011
Li Y, Jia H, Xie X, Huang T (2016) Rate control for consistent video quality with inter-dependent distortion model for HEVC. In: 2016 Visual communications and image processing (VCIP), IEEE, pp 1–4. https://doi.org/10.1109/VCIP.2016.7805594
Li L, Li B, Li H, Chen CW (2018) lambda-Domain Optimal Bit Allocation Algorithm for High Efficiency Video Coding. In: IEEE Transactions on Circuits and Systems for Video Technology(TCSVT), vol. 28, no. 1, pp 130–142. https://doi.org/10.1109/TCSVT.2016.2598672
Liang X, Wang Q, Zhou Y, Luo B (2013) Men A A novel R-Q model based rate control scheme in HEVC. In: 2013 Visual communications and image processing (VCIP). IEEE, pp 1–6. https://doi.org/10.1109/VCIP.2013.6706343
Lin H, He X, Teng Q, FuW XS (2016) Adaptive bit allocation scheme for extremely low-delay intraframe rate control in high efficiency video coding. J Electron Imaging 25(4):043008. https://doi.org/10.1117/1.JEI.25.4.043008
Liu Z, Wang L, Li X, Ji X (2018) Optimize x265 rate control: an exploration of lookahead in frame bit allocation and slice type decision. IEEE Trans Image Process 28(5):2558–2573. https://doi.org/10.1109/TIP.2018.2887200
Nguyen T, Marpe D, Schwarz H, Wiegand T (2011) Reduced-complexity entropy coding of transform coefficient levels using truncated golomb-rice codes in video compression. In: 2011 18th IEEE International Conference on Image Processing. IEEE, pp 753–756. https://doi.org/10.1109/ICIP.2011.6116664
Osman G, Filiz G, Shevchenko O (2009) Duality in quasi-newton methods and new variational characterizations of the DFP and BFGS updates. Optimization Methods Software 24(1):45–62. https://doi.org/10.1080/10556780802367205
Wang M, Ngan KN, Li H (2015) An efficient frame-content based intra frame rate control for high efficiency video coding. IEEE Signal Process Lett 22(7):896–900. https://doi.org/10.1109/LSP.2014.2377032
Wang M, Ngan KN, Li H (2016) Low-delay rate control for consistent quality using distortion-based Lagrange multiplier. IEEE Trans Image Process 25(7):2943–2955. https://doi.org/10.1109/TIP.2016.2552646
Wang H, Song L, Xie R, Luo Z, Wang X (2018) Masking Effects Based Rate Control Scheme for High Efficiency Video Coding. In: 2018 IEEE International Symposium on Circuits and Systems (ISCAS), Florence. https://doi.org/10.1109/ISCAS.2018.8351023
Xue W, Zhang L, Mou X, Bovik AC (2013) Gradient magnitude similarity deviation: a highly efficient perceptual image quality index. IEEE Trans Image Process 23(2):684–695. https://doi.org/10.1109/TIP.2013.2293423
Yang A, Zeng H, Ma L, Chen J, Cai C, Ma K (2016) A perceptual-based rate control for HEVC.In: 2016 Sixth International Conference on Image Processing Theory, Tools and Applications (IPTA), Oulu. https://doi.org/10.1109/IPTA.2016.7820958
Ye Y, He X, Teng Q, Qing L, Lin H, Xia D (2017) Adaptive gradient information and bfgs based inter frame rate control for high efficiency video coding. Multimed Tools Appl 77(12):14557–14577. https://doi.org/10.1007/s11042-017-5047-5
Zhou M, Zhang Y, Li B, Hu H-M (2017) Complexity-based intra frame rate control by jointing inter-frame correlation for high efficiency video coding. J Vis Commun Image Represent 42:46–64. https://doi.org/10.1016/j.jvcir.2016.11.013
Acknowledgements
This work was supported by the National Natural Science Foundation of China under Grant 61871279, the Industrial Cluster Collaborative Innovation Project of Chengdu (No. 2016-XT00-00015-GX), the Sichuan Science and Technology Program (No. 2018HH0143) and the Sichuan Education Department Program (No. 18ZB0355).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Zhao, Z., Xiong, S., Sun, W. et al. An improved R-λ rate control model based on joint spatial-temporal domain information and HVS characteristics. Multimed Tools Appl 80, 345–366 (2021). https://doi.org/10.1007/s11042-020-09721-9
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-020-09721-9