Abstract
Recently, the H.264/AVC standard has been extended to incorporate Scalable Video Coding (SVC). SVC offers the advantage of scalable (layered) coding, but has the disadvantage of a highly increased computational complexity at the encoder side when dealing with spatial scalability. To restrict the increase in required processing power, fast mode decision models for spatial enhancement layers have been proposed in literature. We propose a novel generic fast mode decision model for spatial enhancement layers for both P and B frames based on both the quantization of the enhancement layer and the correlation between the macroblock type in the enhancement layer and the co-located macroblock in the reference layer. In this paper, an evaluation of the proposed model and comparison with a state-of-the-art model is given. Results show that the proposed technique performs exceptionally well for spatial scalability. For both dyadic and non-dyadic spatial scalability, we achieve an average time saving of 75%, while only a slight bit rate increase of 2.23% and a minor PSNR decrease of 0.46 dB are measured. Compared with state of the art techniques, we further halve the complexity while having comparable rate-distortion results.









Similar content being viewed by others
References
Akyol E, Mukherjee D, Lui Y (2007) Complexity control for real-time video coding. In: Proceedings of the international conference on image processing (ICIP). USA
Ates HF, Altunbasak Y (2008) Rate-distortion and complexity optimized motion estimation for H.264 video coding. IEEE Trans Circuits Syst Video Technol 18(2):159–171
Bjøntegaard G (2001) Doc. VCEG-M33: calculation of average PSNR differences between RD-curves. ITU-T SG16 Q.6, Tech. Rep., Austin, Texas
Bjøntegaard G (2008) Doc. VCEG-AI11: improvements of the BD-PSNR model. ITU-T SG16 Q.6, Tech. Rep., Berlin, Germany
De Wolf K, De Schrijver D, De Neve W, De Zutter S, Lambert P, Van de Walle R (2007) Analysis of prediction mode decision in spatial enhancement layers in H.264/AVC SVC. Computer analysis of images and patterns, vol 4673/2007, pp 848–855
De Wolf K, De Schrijver D, De Zutter S, Van de Walle R (2007) Scalable Video coding: analysis and coding performance of inter-layer prediction. In: Proc. of the 9th international symposium on signal processing and its applications (ISSPA)
Goh G, Kang J, Cho M, Chung K (2009) Fast mode decision for scalable video coding based on neighboring macroblock analysis. In: SAC. ACM, pp 1845–1846
Joint Video Team (JVT) of ISO/IEC MPEG & ITU-T VCEG (2007) Advanced Video Coding for Generic Audiovisual Services, ITU-T Rec. H.264 and ISO/IEC 14496-10 (MPEG-4 AVC), Version 8 (including SVC extension). MPEG / ITU-T, Tech. Rep
Joint Video Team (JVT) of ISO/IEC MPEG & ITU-T VCEG (2007) Doc. JVT-W203: joint scalable video model 10. MPEG / ITU-T, Tech. Rep
Jung S-W, Baek S-J, Park C-S, Ko S-J (2010) Fast mode decision using all-zero block detection for fidelity and spatial scalable video coding. IEEE Trans Circuits Syst Video Technol 20(2):201–206
Kim S-T, reddy Konda K, Cho C-S (2009) Fast mode decision algorithm for spatial and snr scalable video coding. In: Proceedings of the IEEE international symposium on circuits and systems (ISCAS), pp 872–875
Kim S-T, reddy Konda K, su Park C, Cho C-S, Ko S-J (2009) Fast mode decision algorithm for inter-layer coding in scalable video coding. IEEE Trans Consum Electron 55(3):1572–1580
Li H, Li ZG, Wen C (2006) Fast mode decision algorithm for inter frame coding in fully scalable video coding. IEEE Trans Circuits Syst Video Technol 16(7):889–895
Li H, Li Z, Wen C, Chau L-P (2006) Fast mode decision for spatial scalable video coding. In: International symposium on circuits and systems (ISCAS), pp 3005–3008
Li H, Li Z, Wen C, Xie S (2007) Fast mode decision for coarse granular scalability via switched candidate mode set. In: Proceedings of the IEEE international conference on multimedia and expo
Lin H, Peng W, Hang H (2007) Low-complexity macroblock mode decision algorithm for combined cgs and temporal scalability. MPEG/ITU-T, Tech. Rep
Park C-S, Baek S-J, Yoon M-S, Kim H-K, Ko S-J (2009) Selective Inter-layer residual prediction for SVC-based video streaming. IEEE Trans Consum Electron 55(1):235–239
Park C-S, Dan DB-K, Haechul C, Ko S-J (2009) A statistical approach for fast mode decision in scalable video coding. IEEE Trans Circuits Syst Video Technol 19(12):1915–1920
Ren J, Kehtarnavaz ND (2008) Fast adaptive early termination for mode selection in H.264 scalable video coding. In: Proceedings of the international conference on image processing (ICIP). USA, pp 2464–2467
Ren J, Kehtarnavaz ND (2009) Fast adaptive early termination for mode selection in H.264 scalable video coding. JRTIP 4(1):13–21
Schwarz H, Marpe D, Wiegand T (2007) Overview of the scalable video coding extension of the H.264/AVC standard. IEEE Trans Circuits Syst Video Technol 17(9):1103–1120
Segall C, Sullivan G (2007) Spatial scalability within the H.264/AVC scalable video coding extension. IEEE Trans Circuits Syst Video Technol 17(9):1121–1135
Van Leuven S, De Wolf K, Lambert P, Van de Walle R (2009) Probability analysis for macroblock types in spatial enhancement layers for SVC. In: Proceedings of the 11th IASTED international conference on signal and image processing
Wiegand T, Schwarz H, Joch A, Kossentini F, Sullivan GJ (2003) Rate-constrained coder control and comparison of video coding standards. IEEE Trans Circuits Syst Video Technol 13(7):688–703
Wiegand T, Sullivan GJ, Bjøntegaard G, Luthra A (2003) Overview of the H.264/AVC video coding standard. IEEE Trans Circuits Syst Video Technol 13(7):560–576
Yeh C-H, Fan K-J, Chen M-J, Li G-L (2010) Fast mode decision algorithm for scalable video coding using bayesian theorem detection and Markov process. IEEE Trans Circuits Syst Video Technol 20(4):563–574
Acknowledgements
The research activities as described in this paper were funded by Ghent University, the Interdisciplinary Institute for Broadband Technology (IBBT), Ph.D. and postdoctoral fellow grants of the Agency for Innovation by Science and Technology (IWT), the Fund for Scientific Research-Flanders (FWO-Flanders), and the European Union.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Van Leuven, S., Van Wallendael, G., De Wolf, K. et al. An enhanced fast mode decision model for spatial enhancement layers in scalable video coding. Multimed Tools Appl 58, 215–237 (2012). https://doi.org/10.1007/s11042-010-0716-7
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-010-0716-7