research-article

Video Bitstream Recompression Based on Sparse Representation of DCT Coefficients

Authors:
Han Wang

Faculty of Information Technology, Beijing University of Technology, China

Faculty of Information Technology, Beijing University of Technology, China

0000-0002-0524-9345
View Profile

,
Luheng Jia

Faculty of Information Technology, Beijing University of Technology, China

Faculty of Information Technology, Beijing University of Technology, China

0000-0001-8221-3576
View Profile

,
Zuhai Zhang

Faculty of Information Technology, Beijing University of Technology, China

Faculty of Information Technology, Beijing University of Technology, China

0009-0009-6900-3633
View Profile

,
Kebin Jia

Faculty of Information Technology, Beijing University of Technology, China

Faculty of Information Technology, Beijing University of Technology, China

0000-0001-7620-2221
View Profile

ICCAI '23: Proceedings of the 2023 9th International Conference on Computing and Artificial IntelligenceMarch 2023Pages 126–130https://doi.org/10.1145/3594315.3594334

Published:02 August 2023Publication History

ICCAI '23: Proceedings of the 2023 9th International Conference on Computing and Artificial Intelligence

Pages 126–130

ABSTRACT

Further compression of encoded video bitstream is necessary to save storage space and bandwidth. However, video bitstream is a compressed version of raw data with video encoders according to different standards, and hence the signal redundancy is already reduced compared with original video data. Recompression of video stream requires further exploring the correlation remained. Transform coding as a part of hybrid video coding framework adopted in the latest video coding standards such as discrete cosine transform (DCT) decorrelates predictive residual signal for efficient quantization and entropy coding. Nevertheless, considerable amount of statistical correlation still remains in the transform coefficients that further reducing the redundancy can lead to improved coding efficiency. In this work, we propose a video stream recompression scheme based on further sparse representation of DCT coefficients. Dictionary-based sparse representation method is used after DCT coefficients are obtained as a secondary transform module. Moreover, the proposed scheme leverages the property of DPCM and avoids sending bits of dictionary by forming redundant dictionaries from DCT coefficients of previously decoded frames. Experimental results demonstrate that the proposed recompression framework further reduces the bitrate of original H.264 bitstream by more than while maintains similar subjective quality.

References

Michal Aharon, Michael Elad, and Alfred Bruckstein. 2006. K-SVD: An algorithm for designing overcomplete dictionaries for sparse representation. IEEE Transactions on signal processing 54, 11 (2006), 4311–4322.Google ScholarDigital Library
Gisle Bjontegaard. 2001. Calculation of average PSNR differences between RD-curves. VCEG-M33 (2001).Google Scholar
Jianle Chen, Ying Chen, Marta Karczewicz, Xiang Li, Hongbin Liu, Li Zhang, and Xin Zhao. 2015. Coding tools investigation for next generation video coding based on HEVC. In Applications of Digital Image Processing XXXVIII, Vol. 9599. SPIE, 437–445.Google Scholar
Je-Won Kang, Moncef Gabbouj, and C-C Jay Kuo. 2013. Sparse/DCT (S/DCT) two-layered representation of prediction residuals for video coding. IEEE transactions on image processing 22, 7 (2013), 2711–2722.Google Scholar
Julien Mairal, Francis Bach, Jean Ponce, and Guillermo Sapiro. 2010. Online learning for matrix factorization and sparse coding.Journal of Machine Learning Research 11, 1 (2010).Google Scholar
Tung Nguyen, Benjamin Bross, Paul Keydel, Heiko Schwarz, Detlev Marpe, and Thomas Wiegand. 2019. Extended transform skip mode and fast multiple transform set selection in VVC. In 2019 Picture Coding Symposium (PCS). IEEE, 1–5.Google ScholarCross Ref
Ron Rubinstein, Michael Zibulevsky, and Michael Elad. 2008. Efficient implementation of the K-SVD algorithm using batch orthogonal matching pursuit. Technical Report. Computer Science Department, Technion.Google Scholar
ITU Telecom 2003. Advanced video coding for generic audiovisual services. ITU-T Recommendation H. 264 (2003).Google Scholar
ITU-T VCEG 2010. Joint call for proposals on video compression technology. VCEG-AM91 (2010).Google Scholar

Index Terms

Video Bitstream Recompression Based on Sparse Representation of DCT Coefficients
1. Computing methodologies
  1. Computer graphics
    1. Image compression

Recommendations

Block Matching Video Compression Based on Sparse Representation and Dictionary Learning

This work presents a video compression method based on sparse representation and dictionary learning algorithms. The proposed scheme achieves superb rate-distortion performance and decent subjective quality, compared to modern standards, especially at ...
Read More
Multiwavelet video coding based on DCT time domain filtering
Transactions on Edutainment VII

To improve the video encoding efficiency and deal with the real-time demerits of the multiwavelet time-domain filtering in the 3D multiwavelet, a multiwavelet video coding scheme based on DCT(Digital Cosine Transform) time-domain filtering is proposed ...
Read More
Low Bit Rate Video Coding Using DCT-Based Fast Decimation/Interpolation and Embedded Zerotree Coding

In this paper, we propose a low bit rate video coding procedure in the discrete cosine transform (DCT) domain that is based in an embedded zerotree algorithm and uses decimation and interpolation. Theory for decimation/interpolation in the DCT domain is ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in

ICCAI '23: Proceedings of the 2023 9th International Conference on Computing and Artificial Intelligence
March 2023
824 pages
ISBN:9781450399029
DOI:10.1145/3594315

Copyright © 2023 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 2 August 2023
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
DCT
K-SVD
sparse representation
video coding
Qualifiers
- research-article
- Research
- Refereed limited
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 0
  Total Citations
  View Citations
- 17
  Total Downloads
- Downloads (Last 12 months)17
- Downloads (Last 6 weeks)2
Other Metrics
View Author Metrics
Cited By
This publication has not been cited yet

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format .

View HTML Format

Video Bitstream Recompression Based on Sparse Representation of DCT Coefficients

ICCAI '23: Proceedings of the 2023 9th International Conference on Computing and Artificial Intelligence

ABSTRACT

References

Cited By

Index Terms

Recommendations

Block Matching Video Compression Based on Sparse Representation and Dictionary Learning

Multiwavelet video coding based on DCT time domain filtering

Low Bit Rate Video Coding Using DCT-Based Fast Decimation/Interpolation and Embedded Zerotree Coding

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

HTML Format

Caption

Video Bitstream Recompression Based on Sparse Representation of DCT Coefficients

ICCAI '23: Proceedings of the 2023 9th International Conference on Computing and Artificial Intelligence

ABSTRACT

References

Cited By

Index Terms

Recommendations

Block Matching Video Compression Based on Sparse Representation and Dictionary Learning

Multiwavelet video coding based on DCT time domain filtering

Low Bit Rate Video Coding Using DCT-Based Fast Decimation/Interpolation and Embedded Zerotree Coding

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

HTML Format

Share this Publication link

Share on Social Media