Learning-Based Scalable Video Coding with Spatial and Temporal Prediction | IEEE Conference Publication | IEEE Xplore

Learning-Based Scalable Video Coding with Spatial and Temporal Prediction


Abstract:

In this work, we propose a hybrid learning-based method for layered spatial scalability. Our framework consists of a base layer (BL), which encodes a spatially downsample...Show More

Abstract:

In this work, we propose a hybrid learning-based method for layered spatial scalability. Our framework consists of a base layer (BL), which encodes a spatially downsampled representation of the input video using Versatile Video Coding (VVC), and a learning-based enhancement layer (EL), which conditionally encodes the original video signal. The EL is conditioned by two fused prediction signals: a spatial inter-layer prediction signal, that is generated by spatially upsampling the output of the BL using super-resolution, and a temporal inter-frame prediction signal, that is generated by decoder-side motion compensation without signaling any motion vectors. We show that our method outperforms LCEVC and has comparable performance to full-resolution VVC for high-resolution content, while still offering scalability.
Date of Conference: 04-07 December 2023
Date Added to IEEE Xplore: 29 January 2024
ISBN Information:

ISSN Information:

Conference Location: Jeju, Korea, Republic of

Contact IEEE to Subscribe

References

References is not available for this document.