Spatially scalable video coding with an efficient two-layered architecture

Multimedia Tools and Applications

Abstract

In this paper, an efficient spatially scalable video coding scheme with a two-layered architecture is proposed. The two spatial layers are referred to as the base layer and the enhancement layer. The base layer is coded to be compatible with the H.264 standard, and a new inter-layer intra coding method (ILICM) is used to improve coding efficiency when coding the enhancement layer. ILICM uses a few specific pixels in the decoded and up-sampled base layer block to predict the corresponding block in the enhancement layer when the original predictors are not available. In addition, a component-based up-sampling method (CUSM) is introduced to interpolate the base layer data. Motivated by the human visual system, CUSM assigns a much simpler up-sampling filter to the chroma components, to which the human eye is less sensitive. ILICM is expected to increase the coding performance of the enhancement layer, and CUSM is expected to reduce the computational complexity of the decoder. Experimental results show that ILICM increases the PSNR of the luma component of encoded frames at no additional cost in coded bit rate, while CUSM maintains coding performance with a theoretically significant reduction in computational complexity.
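The component-based idea behind CUSM can be illustrated with a minimal sketch. The abstract does not specify the filters, so the choices here are assumptions: luma is up-sampled by a factor of two with the 6-tap kernel (1, -5, 20, 20, -5, 1)/32 that H.264 uses for half-sample interpolation, and chroma with a 2-tap average. The function names `upsample_row`, `upsample_plane`, and `upsample_frame` are hypothetical and are not taken from the paper.

```python
# Sketch of component-based 2x up-sampling: a longer filter for luma,
# a much cheaper one for chroma. Filter choices are assumptions.
LUMA_TAPS = (1, -5, 20, 20, -5, 1)   # H.264 half-sample kernel, sum = 32
CHROMA_TAPS = (1, 1)                 # simple 2-tap average, sum = 2


def upsample_row(row, taps):
    """Double a row: keep each sample and insert one interpolated sample after it."""
    n = len(row)
    half = len(taps) // 2
    norm = sum(taps)
    out = []
    for i in range(n):
        out.append(row[i])
        acc = 0
        for k, t in enumerate(taps):
            j = i - half + 1 + k           # taps straddle the gap between i and i+1
            j = min(max(j, 0), n - 1)      # clamp indices at the borders
            acc += t * row[j]
        val = (acc + norm // 2) // norm    # normalise with rounding
        out.append(min(max(val, 0), 255))  # clip to the 8-bit sample range
    return out


def upsample_plane(plane, taps):
    """Separable 2x up-sampling: filter the rows, then the columns."""
    rows = [upsample_row(r, taps) for r in plane]
    cols = [upsample_row(list(c), taps) for c in zip(*rows)]
    return [list(r) for r in zip(*cols)]


def upsample_frame(y, cb, cr):
    """Apply the long filter to luma and the short filter to both chroma planes."""
    return (upsample_plane(y, LUMA_TAPS),
            upsample_plane(cb, CHROMA_TAPS),
            upsample_plane(cr, CHROMA_TAPS))
```

In this sketch each interpolated chroma sample costs two multiply-accumulate steps per pass instead of six for luma, which is the kind of saving the abstract attributes to CUSM. The actual filters and savings in the paper may differ.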

Author information

Correspondence to Zhang Wang.

Additional information

This work was supported by the China Postdoctoral Science Foundation (No. 20080430454), the National High Technology Research and Development Program of China (No. 2007AA12Z151), and the Key Laboratory of Geo-informatics of the State Bureau of Surveying and Mapping (No. 200834).

About this article

Cite this article

Wang, Z., Zhang, J. & Li, H. Spatially scalable video coding with an efficient two-layered architecture. Multimed Tools Appl 48, 247–265 (2010). https://doi.org/10.1007/s11042-009-0327-3

  • DOI: https://doi.org/10.1007/s11042-009-0327-3