Abstract
In this paper, an efficient spatially scalable video coding scheme with a two-layered architecture is proposed. In this architecture, two spatial layers are referred to as a base layer and an enhancement layer. The base layer is coded to be compatible to H.264 standard, and when coding the enhancement layer, a new inter layer intra coding method (ILICM) is used to improve the coding efficiency. ILICM intends to use a few specific pixels in the up-sampled and decoded base layer block to predict the corresponding block in enhancement layer, when those original predictors are not available. Besides, in order to interpolate the base layer data, a graceful component-based up-sampling method (CUSM) is also introduced in this paper. Based on the human vision system, CUSM assigns a much simpler up-sampling filter for the chroma component due to its lower sensitivity for human eyes. Generally, proposed schemes including ILICM and CUSM are expected to increase the coding performance of enhancement layer and reduce the computing complexity of the decoder, respectively. Experimental results show that, the PSNR values of luma component of encoded frames are increased with no additional cost on coded bit-rate for ILICM method, while CUSM method can also maintain the coding performance under the theoretically significant reduction of computational complexity.
Similar content being viewed by others
References
Stockhammer T, Hannuksela MM, Wiegand T (2003) H.264/AVC in Wireless Environments. IEEE Trans. On Circuits. System and Video for Video Technology 13:657–673
Katsaggelos AK et al (2005) Advances in efficient resource allocation for packet-based real-time video transmission. Proceedings of the IEEE 93(1):135–147
Van der Schaar M, Radha H (2001) A hybrid temporal SNR fine granular scalability for internet video. IEEE Trans. On Circuits. System and Video for Video Technology 11:318–331
John, F., Michael, R., Wang, Y.Q., 2000. Efficient Drift-Free Signal-to-Noise ratio scalability. IEEE Trans. On Circuits, System and Video for Video Technology 11(1), 267–281
Reichel, J., Schwarz, H., Wien, M., 2005. Scalable Video Coding-Joint Draft 4. ISO/IEC JTC1/SC29/WG11/N7048
Schwarz, H. et al., 2005. Applications and Requirements for Scalable Video Coding. ISO/IEC JTC1/SC29/WG11/ N6880
Schwarz, H., Marpe, D., Wiegand, T., 2004. Scalable Extension of H.264/AVC. ISO/IEC JTC1/SC29/WG11, M10569/S03
Ohm JR (2005) Advances in Scalable Video Coding. Proceedings of the IEEE 93(1):42–56
Golwelkar A, Woods JW (2003) Scalable video compression using longer motion compensated temporal filters. Proc. SPIE Visual Communication Image Process 5150:1406–1417
Yang, L.B., Chen, Y., Zhai, J.F., Zhang, F., 2005. Low Complexity Intra Prediction for Enhancement Layer. ISO/ IEC JTC1/SC29/WG11/Q084
Matsui T, Hirahara S (1991) New human vision system model for spatio-temporal image signals. Proceedings of SPIE 1453:282–289
Reichel, J., Schwarz, H., Wien, M., 2005. Joint Scalable Video Model JSVM-3,” ISO/IEC JTC1/SC29/WG11, Doc. JVT-P202
Wiegand, T., Sullivan, G.J., Luthra, A., 2003. Overview of the H.264/AVC video coding standard. IEEE Trans. On Circuits, System and Video for Video Technology 13(7), 560–576
Zhang, P., Zhao, D.B., Ma, S.W., Lu, Y., Gao, W., 2004. Multiple Modes Intra-Prediction in Intra Coding. IEEE International Conference on Multimedia and Expo (ICME) 419–422
Lee, Y.L., Han, K.H., Sullivan, G.J., 2006. Improved Lossless Intra Coding for H.264/MPEG-4 AVC. IEEE Trans. On Image Processing 15(9) 2610–2615
He Z, Mitra SK (2001) A unified rate-distortion analysis framework for transform coding. IEEE Trans. On Circuits. System and Video for Video Technology 11:1221–1236
Lie, W. N., Yeh, H. C., Lin, T. C. Chen, C. F., 2005. Hardware-Efficient Computing Architecture for Motion Compensate Interpolation in H.264 Video Coding. Proc. of ISCAS, 2136–2139
Reichel, J., Schwarz, H., Wien, M., 2005. Joint Scalable Video Model (JSVM) 4.0 Reference Encoding Algorithm Description. ISO/IEC JTC1/SC29/WG11/N7556
Bjontegarrd, G., 2001. Calculation of average PSNR differences between RD-curves. 13th VCEG-M33
Author information
Authors and Affiliations
Corresponding author
Additional information
This work was supported by China Postdoctoral Science Foundation (No. 20080430454); National High Technology Research and Development Program of China (No.2007AA12Z151) ;Funded by Key Laboratory of Geo-informatics of State Bureau of Surveying and Mapping (No.200834)
Rights and permissions
About this article
Cite this article
Wang, Z., Zhang, J. & Li, H. Spatially scalable video coding with an efficient two-layered architecture. Multimed Tools Appl 48, 247–265 (2010). https://doi.org/10.1007/s11042-009-0327-3
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-009-0327-3