ABSTRACT
Further compression of encoded video bitstream is necessary to save storage space and bandwidth. However, video bitstream is a compressed version of raw data with video encoders according to different standards, and hence the signal redundancy is already reduced compared with original video data. Recompression of video stream requires further exploring the correlation remained. Transform coding as a part of hybrid video coding framework adopted in the latest video coding standards such as discrete cosine transform (DCT) decorrelates predictive residual signal for efficient quantization and entropy coding. Nevertheless, considerable amount of statistical correlation still remains in the transform coefficients that further reducing the redundancy can lead to improved coding efficiency. In this work, we propose a video stream recompression scheme based on further sparse representation of DCT coefficients. Dictionary-based sparse representation method is used after DCT coefficients are obtained as a secondary transform module. Moreover, the proposed scheme leverages the property of DPCM and avoids sending bits of dictionary by forming redundant dictionaries from DCT coefficients of previously decoded frames. Experimental results demonstrate that the proposed recompression framework further reduces the bitrate of original H.264 bitstream by more than while maintains similar subjective quality.
- Michal Aharon, Michael Elad, and Alfred Bruckstein. 2006. K-SVD: An algorithm for designing overcomplete dictionaries for sparse representation. IEEE Transactions on signal processing 54, 11 (2006), 4311–4322.Google ScholarDigital Library
- Gisle Bjontegaard. 2001. Calculation of average PSNR differences between RD-curves. VCEG-M33 (2001).Google Scholar
- Jianle Chen, Ying Chen, Marta Karczewicz, Xiang Li, Hongbin Liu, Li Zhang, and Xin Zhao. 2015. Coding tools investigation for next generation video coding based on HEVC. In Applications of Digital Image Processing XXXVIII, Vol. 9599. SPIE, 437–445.Google Scholar
- Je-Won Kang, Moncef Gabbouj, and C-C Jay Kuo. 2013. Sparse/DCT (S/DCT) two-layered representation of prediction residuals for video coding. IEEE transactions on image processing 22, 7 (2013), 2711–2722.Google Scholar
- Julien Mairal, Francis Bach, Jean Ponce, and Guillermo Sapiro. 2010. Online learning for matrix factorization and sparse coding.Journal of Machine Learning Research 11, 1 (2010).Google Scholar
- Tung Nguyen, Benjamin Bross, Paul Keydel, Heiko Schwarz, Detlev Marpe, and Thomas Wiegand. 2019. Extended transform skip mode and fast multiple transform set selection in VVC. In 2019 Picture Coding Symposium (PCS). IEEE, 1–5.Google ScholarCross Ref
- Ron Rubinstein, Michael Zibulevsky, and Michael Elad. 2008. Efficient implementation of the K-SVD algorithm using batch orthogonal matching pursuit. Technical Report. Computer Science Department, Technion.Google Scholar
- ITU Telecom 2003. Advanced video coding for generic audiovisual services. ITU-T Recommendation H. 264 (2003).Google Scholar
- ITU-T VCEG 2010. Joint call for proposals on video compression technology. VCEG-AM91 (2010).Google Scholar
Index Terms
- Video Bitstream Recompression Based on Sparse Representation of DCT Coefficients
Recommendations
Block Matching Video Compression Based on Sparse Representation and Dictionary Learning
This work presents a video compression method based on sparse representation and dictionary learning algorithms. The proposed scheme achieves superb rate-distortion performance and decent subjective quality, compared to modern standards, especially at ...
Multiwavelet video coding based on DCT time domain filtering
Transactions on Edutainment VIITo improve the video encoding efficiency and deal with the real-time demerits of the multiwavelet time-domain filtering in the 3D multiwavelet, a multiwavelet video coding scheme based on DCT(Digital Cosine Transform) time-domain filtering is proposed ...
Low Bit Rate Video Coding Using DCT-Based Fast Decimation/Interpolation and Embedded Zerotree Coding
In this paper, we propose a low bit rate video coding procedure in the discrete cosine transform (DCT) domain that is based in an embedded zerotree algorithm and uses decimation and interpolation. Theory for decimation/interpolation in the DCT domain is ...
Comments