Optimum word length allocation of integer DCT and its error analysis

doi:10.1016/j.image.2004.03.001

Signal Processing: Image Communication

Volume 19, Issue 6, July 2004, Pages 465-478

Abstract

Recently, the integer DCT (Int-DCT), which transforms an integer input to an integer output, has attracted many researchers’ attention as an effective method for DCT-based unified lossy/lossless coding. Many reports on the Int-DCT exist, but they have been limited to a few topics, such as reducing the number of multipliers with the four-point lossless Hadamard transform, and the non-separable two-dimensional Int-DCT. None of them focuses on how to express the multipliers’ word length as short as possible to reduce hardware complexity.

Introduction

The discrete cosine transform (DCT) is a well-known transform used in many international image compression standards such as JPEG [6] and MPEG [4]. DCT-based systems are highly advantageous for image applications because they provide a high compression ratio. However, these coding systems are limited to lossy coding because distortion of the decoded image is unavoidable with these algorithms.

On the other hand, the integer transform [1], which includes rounding operations in the lifting structure [9], is becoming popular as a key technique for unified lossless and lossy waveform coding [11]. The integer DCT [7], [2], [3] is especially attractive for unified coding because it is compatible with the conventional DCT-based algorithms. In Fig. 1, the encoder applies the conventional lossy DCT whereas the decoder applies the Int-DCT, which illustrates this compatibility. Note that the coding performance of the Int-DCT is similar to that of the conventional lossy DCT at low bit rates, but slightly worse at high bit rates because of the rounding error discussed in Section 4.2.
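The idea of rounding inside a lifting structure can be illustrated with a plane rotation. The following is a minimal sketch, assuming the common three-step lifting factorization of a rotation; the exact structure of the paper's integer rotation transform is not shown in this excerpt, and the function names and the angle are hypothetical. Because each step adds a rounded value that the inverse can recompute and subtract, integer inputs map to integer outputs and are recovered exactly.

```python
import math


def lift_rotate(x0, x1, theta):
    """Integer-to-integer approximation of a rotation by theta.

    Uses three lifting steps, each followed by rounding (assumed structure).
    """
    p = (math.cos(theta) - 1.0) / math.sin(theta)  # equals -tan(theta / 2)
    u = math.sin(theta)
    x0 = x0 + round(p * x1)  # first lifting step
    x1 = x1 + round(u * x0)  # second lifting step
    x0 = x0 + round(p * x1)  # third lifting step
    return x0, x1


def lift_rotate_inverse(y0, y1, theta):
    """Undo lift_rotate by subtracting the same rounded terms in reverse order."""
    p = (math.cos(theta) - 1.0) / math.sin(theta)
    u = math.sin(theta)
    y0 = y0 - round(p * y1)
    y1 = y1 - round(u * y0)
    y0 = y0 - round(p * y1)
    return y0, y1


if __name__ == "__main__":
    theta = math.pi / 8  # hypothetical rotation angle
    y = lift_rotate(100, -37, theta)
    print("forward:", y)
    print("inverse:", lift_rotate_inverse(*y, theta))
```

The output of `lift_rotate` differs from the real-valued rotation only by accumulated rounding error, which is the kind of error the report analyzes for the decoder side.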

So far, previous reports on the integer DCT have focused on reducing the rounding operations with a non-separable 2D structure [7] and reducing multipliers with the integer Hadamard transform [3]. Optimization of the basis functions of the orthogonal transform (integer KLT) has also been reported [10]. What seems to be lacking, however, is a way to express the multipliers’ word length as short as possible to reduce hardware complexity.

In this report, we define a new “SNR sensitivity” as an indicator of how the word length truncation of multiplier coefficients affects the quality of a reconstructed image. Based on this sensitivity, we propose a new word length allocation method. We also theoretically analyze the errors in a reconstructed signal to confirm the effectiveness of the proposed method. This report is organized as follows. Section 2 gives an overview of the integer DCT. Section 3 theoretically analyzes the error generated from finite word length allocation, and Section 4 theoretically analyzes the errors in a reconstructed signal. Section 5 defines the “SNR sensitivity” and applies it to an optimum word length allocation using the least squares method. Section 6 confirms the effectiveness of the proposed method.

Section snippets

The integer DCT (Int-DCT) [6–8]

The algorithm of the integer DCT (Int-DCT), illustrated in Fig. 2, is composed of the 4-point integer Hadamard transform (4-IHT) and the integer rotation transform (IRT), described in Sections 2.2 and 2.3, respectively. The integer DCT transforms an integer input vector x(n), (n=0,1,…,7), into an integer output vector y(n), (n=0,1,…,7). Therefore, it is possible to achieve effective lossless coding by applying an entropy coding directly to the

Finite word length expression

The multiplier coefficient m_j(i), (i=A, B, C, D, E and j=1,2,3), is expressed as h_k, (k=0,1,…,14), by $h_{k} = (-1)^{B_{0}} \cdot \sum_{j=1}^{\infty} B_{j} 2^{-j}, \quad k=0,1,\ldots,14,$ where B_j (j=0,1,…) is 0 or 1. Under the finite word length expression in this report, h_k is truncated into a W_k-bit binary value h_k′. Namely, $h_{k}' = (-1)^{B_{0}} \cdot \sum_{j=1}^{W_{k}} B_{j}' 2^{-j}, \quad k=0,1,\ldots,14.$

The value Δh_k is defined as the difference between the value h_k and the binary value h_k′: $\Delta h_{k} = h_{k} - h_{k}'.$
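The truncation above can be sketched numerically. This is a minimal illustration, assuming a sign-magnitude fraction with |h_k| < 1 as the sum from j=1 implies; the function name `truncate_coefficient` and the sample coefficient value are hypothetical and are not taken from the paper.

```python
import math


def truncate_coefficient(h, w):
    """Truncate a coefficient with |h| < 1 to w fractional bits.

    The sign is kept separately (the B_0 bit) and the magnitude is cut off
    after the bit of weight 2**-w, so bits j = 1..w are kept.
    """
    if not -1.0 < h < 1.0:
        raise ValueError("this sketch assumes |h| < 1")
    sign = -1.0 if h < 0 else 1.0
    magnitude = math.floor(abs(h) * 2**w) / 2**w
    return sign * magnitude


if __name__ == "__main__":
    h = -math.tan(math.pi / 16)  # hypothetical lifting coefficient
    for w in (4, 8, 12):
        h_trunc = truncate_coefficient(h, w)
        delta_h = h - h_trunc  # the truncation error, Delta h_k
        print(f"W={w:2d}  h'={h_trunc:+.6f}  delta_h={delta_h:+.2e}")
```

With truncation, the magnitude of Δh_k is always smaller than 2^(−W_k), so each additional bit roughly halves the worst-case coefficient error.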

An error generated from finite word length allocation

Considering errors generated from finite word length allocation, we can find an equivalent circuit of

Analysis on errors in a reconstructed signal

In this section, we analyze the errors between an original signal and a reconstructed signal. The variance of the errors (σ_E²) is calculated from $\sigma_{E}^{2} = \frac{1}{N} \sum_{n=0}^{N-1} \{x'(n) - x(n)\}^{2},$ where x(n) and x′(n) denote the original signal and the reconstructed signal, respectively, and n is the sample index of the input signal, n=0,1,2,…,N−1.
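The error variance defined above is the mean squared difference between the two signals. A minimal sketch follows; the function name `error_variance` and the sample values are hypothetical.

```python
def error_variance(original, reconstructed):
    """Return (1/N) * sum over n of (x'(n) - x(n))**2."""
    n = len(original)
    if n == 0 or n != len(reconstructed):
        raise ValueError("signals must be non-empty and of equal length")
    return sum((xr - x) ** 2 for x, xr in zip(original, reconstructed)) / n


if __name__ == "__main__":
    x = [12, 15, 9, 7]        # hypothetical original samples x(n)
    x_rec = [12, 14, 9, 8]    # hypothetical reconstructed samples x'(n)
    print(error_variance(x, x_rec))
```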

The SNR sensitivity

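From the preceding equations, we can rewrite the errors generated from finite word length allocation $(N_{TF})$ as $N_{TF} = \sum_{k=0}^{14} (S_{Hk} \cdot \Delta h_{k}),$ where $S_{Hk}$, called the “SNR sensitivity”, is defined as the effect of the finite word length expression on the quality of the decoded image. $S_{H(j)} = \begin{cases} F_{1}^{-1} \cdot N_{HF(j)} \cdot F_{1} \cdot X & \text{for } j=0,1,2, \\ F_{2}^{-1} \cdot N_{HF(j)} \cdot F_{2} \cdot X & \text{for } j=3,4,5,6,7,8, \\ F_{3}^{-1} \cdot N_{HF(j)} \cdot F_{3} \cdot X & \text{for } j=9,10,11,12,13,14, \end{cases}$ where $N_{HF(j)} = \begin{cases} \begin{bmatrix} HF_{j} & Z_{4} \\ Z_{4} & Z_{4} \end{bmatrix} & \text{for } j=0,1,2,\ldots,8, \\ \begin{bmatrix} Z_{4} & Z_{4} \\ Z_{4} & HF_{j} \end{bmatrix} & \text{for } j=9,10,11,\ldots,14, \end{cases}$ $HF_{0} = \begin{bmatrix} Z_{2} & Z_{2} \\ Z_{2} & G_{F_{1A}} \end{bmatrix}, \quad HF_{1} = \begin{bmatrix} Z_{2} & Z_{2} \\ Z_{2} & G_{F_{2A}} \end{bmatrix},$ $HF_{2} = \begin{bmatrix} Z_{2} & Z_{2} \\ Z_{2} & G_{F_{3A}} \end{bmatrix}, \quad HF_{3} = \begin{bmatrix} G_{F_{1B}} & Z_{2} \\ Z_{2} & Z_{2} \end{bmatrix},$ $HF_{4} = \begin{bmatrix} G_{F_{1B}} & Z_{2} \\ Z_{2} & Z_{2} \end{bmatrix},$ $HF_{5} = \begin{bmatrix} G_{F_{1B}} & Z_{2} \\ Z_{2} & Z_{2} \end{bmatrix}, \quad HF_{6}$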

Simulation results

In this section, we confirm the effectiveness of the optimum word length allocation in practice by applying an AR(1) model and standard images as input signals in Sections 6.1 and 6.2, respectively. In this report, we emphasize the finite-word-length effect (different coefficients are used in the encoder and the decoder), so we consider the effectiveness of the proposed method under two conditions: no quantization and a small

Conclusion

In this report, the “SNR sensitivity” was newly defined as an indicator of how the word length truncation of multiplier coefficients affects the quality of a reconstructed image. We proposed a new word length allocation method based on the SNR sensitivity. The optimum word length allocation depends on the frequency spectrum of the input signal. Both theoretical analysis and simulation results confirm the effectiveness of the proposed method.

References (12)

M.D. Adams et al., Reversible integer-to-integer wavelet transform for image compression: Performance evaluation and analysis, IEEE Trans. Image Process. (June 2000)
S. Chokchaitam et al., A bit-rate adaptive coding system based on lossless DCT, IEICE Trans. Fund. (February 2002)
S. Fukuma, K. Ohyama, M. Iwahashi, N. Kambayashi, Lossless 8-point fast discrete cosine transform using lossless...
ISO/IEC 11172, Information Technology—Coding of Moving Picture And Associated Audio for Digital Storage Media at up to...
N.S. Jayant et al., Digital Coding of Wave Forms (1984)
JPEG CD10918-1, Digital compression coding of continuous-tone still images, JPEG-9-R6, January...

There are more references available in the full text version of this article.

Cited by (8)

Channel scaling for integer implementation of minimum lifting 2D wavelet transform
2016, IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences
Digital image watermarking by using discrete wavelet transform and discrete cosine transform and comparison based on PSNR
2011, Proceedings - 2011 International Conference on Communication Systems and Network Technologies, CSNT 2011
Integer DCT based on direct-lifting of DCT-IDCT for lossless-to-lossy image coding
2010, IEEE Transactions on Image Processing
Digital watermarking based on discrete cosine transformation
2010, Proceedings - 2010 International Forum on Information Technology and Applications, IFITA 2010
The parallel achievement of expanded-dimension discrete polynomial transform on tight coupling multiprocessor
2008, 2008 IEEE International Symposium on Knowledge Acquisition and Modeling Workshop Proceedings, KAM 2008
A new Optimum-Word-Length-Assignment (OWLA) multiplierless integer DCT for lossless/lossy image coding and its performance evaluation
2007, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
