Correlation-based approach to color image compression

doi:10.1016/j.image.2007.04.001

Signal Processing: Image Communication

Volume 22, Issue 9, October 2007, Pages 719-733

https://doi.org/10.1016/j.image.2007.04.001 Get rights and content

Abstract

Most coding techniques for color image compression employ a de-correlation approach—the RGB primaries are transformed into a de-correlated color space, such as YUV or YCbCr, then the de-correlated color components are encoded separately. Examples of this approach are the JPEG and JPEG2000 image compression standards. A different method, of a correlation-based approach (CBA), is presented in this paper. Instead of de-correlating the color primaries, we employ the existing inter-color correlation to approximate two of the components as a parametric function of the third one, called the base component. We then propose to encode the parameters of the approximation function and part of the approximation errors. We use the DCT (discrete cosine transform) block transform to enhance the algorithm's performance. Thus the approximation of two of the color components based on the third color is performed for each DCT subband separately. We use the rate-distortion theory of subband transform coders to optimize the algorithm's bits allocation for each subband and to find the optimal color components transform to be applied prior to coding. This pre-processing stage is similar to the use of the RGB to YUV transform in JPEG and may further enhance the algorithm's performance. We introduce and compare two versions of the new algorithm and show that by using a Laplacian probability model for the DCT coefficients as well as down-sampling the subordinate colors, the compression results are further improved. Simulation results are provided showing that the new CBA algorithms are superior to presently available algorithms based on the common de-correlation approach, such as JPEG.

Introduction

Recently a new algorithm for color image compression was introduced by Goffman and Porat [9]. This algorithm utilizes the high inter-color correlations between RGB color components of natural images [2], [9], [11], [14], [18], [22] without transforming them into another color domain. It does so by dividing the image into square blocks, expanding two of the color primaries (the dependent colors) as a polynomial function of the third (base) color for each block. This way only the polynomial coefficients are encoded for each block of the dependent colors, whereas the base color component is encoded by any monochromatic compression technique. The choice of the base was examined in [9] with general tendency to choose the Green. The conclusion in [9] was that the new algorithm produces less color artifacts than the common de-correlation approach at high compression ratios. The de-correlation approach (such as JPEG [20] and JPEG2000 [10], [16]) consists of transforming the color components into a de-correlated color space, then separately encoding each of the obtained color components. This approach is the most common for color images; however, it is not necessarily the optimal one. Additional examples of this approach can be found in [12], [21].

The algorithm by Goffman and Porat may be considered as an example of a correlation-based approach (CBA) to color image compression. In this work we introduce two new algorithms based on this approach. We use the DCT (discrete cosine transform) block transform [17] to enhance the compression performance. Thus the expansion of the dependent colors is done for each DCT subband instead of for each image block. However, not only the expansion coefficients are coded, but also the approximation errors for part of the subbands. Since the DCT block transform is a special case of a subband transform, we propose to employ the recently developed rate-distortion (R-D) theory for subband transform coders [6]. This theory allows us not only to find the optimal bits allocation for the subbands in MSE (mean square error) sense, but also the optimal color components transform (CCT) as an efficient pre-processing stage.

The structure of this paper is as follows. In Sections 1.1, 1.2, and 1.3 we review the R-D theory for subband transform coders and the algorithms for calculating the optimal subband rates and the corresponding PCM quantization steps. In Sections 2 and 3 we introduce two versions of the new algorithm and discuss how the R-D theory is used in their optimization. Section 4 deals with optimizing the CCT. Then in Section 5 down-sampling (DS) of the subordinate colors and use of the Laplacian probability model for the DCT coefficients are described. This model allows reduction of the algorithms’ complexity and provides lower coding distortion for the same transmission rate. Another potential improvement of the algorithms’ performance is presented in Section 6. Simulation results of the new algorithms and their comparison to JPEG as a representative of the de-correlation approach are discussed in Section 7. Finally, conclusions and summary are given in Section 8.

In [6] a general subband transform coder for color images was considered, based on the following steps.

•
Pre-processing: apply a CCT to the RGB color components of the given image. Denoting the RGB components in vector form as $x = [R G B]^{T}$ and the new color components as $\tilde{x} = [C 1 C 2 C 3]^{T}$ , this stage can be written as $\tilde{x} = Mx$ for some $3 \times 3$ CCT matrix M.
•
Apply a subband transform, such as DCT or DWT (discrete wavelet tree) or any filter bank decomposition to each color component. The subband transform is usually assumed to be non-expansive, i.e., it transforms a signal of length N to a signal with the same length.
•
Quantize the coefficients of each subband of each color component. A uniform scalar quantizer was considered. We refer to this stage as applying the PCM (pulse code modulation) scheme.
•
Post-processing: encode the quantized coefficients in a lossless manner, such as run-length coding, delta modulation or entropy coding.

At high rates R the R-D behavior of the basic PCM scheme applied to a random signal x with variance

σ_{x}^{2}

is [7], [19]

d (R) = ε^{2} σ_{x}^{2} 2^{- 2 R},

where

ε^{2}

is a constant dependent upon the distribution of x. This is only an approximate behavior or model; however, based on (2) the R-D model of a general monochromatic subband coder with B subbands can be expressed as

d^{SC} ({R_{b}}) = \sum_{b = 0}^{B - 1} η_{b} G_{b} d_{b} (R_{b}) = \sum_{b = 0}^{B - 1} η_{b} G_{b} σ_{b}^{2} ε^{2} 2^{- 2 R_{b}} .

Here

d_{b} (R_{b})

is the MSE of subband b

(b \in 0, 1, \dots, B - 1)

σ_{b}^{2}

is its variance,

G_{b}

is its energy gain [19],

R_{b}

is the rate allocated to it, and

η_{b}

is its sample rate. The sample rate of a subband is equal to the relative part of the number of coefficients in it from the total number of samples in the subband transformed signal. For a uniform subband transform, such as the block DCT,

η_{b} = 1 / B

Consider now a color image. The coding algorithm described in the beginning of this section may be regarded as applying a CCT to the image, followed by monochromatic subband coding of each of the new color components. The R-D model of this algorithm is $d ({R_{bi}}, M) = \frac{1}{3} \sum_{i = 1}^{3} \sum_{b = 0}^{B - 1} η_{b} G_{b} σ_{bi}^{2} ε_{i}^{2} e^{- {aR}_{bi}} (({MM}^{T})^{- 1})_{ii},$ where $a = 2 \ln 2$ and $σ_{bi}^{2}$ and $R_{bi}$ remain the same, but for subband b of color component i $(i \in 1, 2, 3)$ . Note that standard entropy coding is assumed here in the post-processing stage or alternatively this stage is not taken into account.

Minimizing the expression of Eq. (4) under the rate constraint $\sum_{i = 1}^{3} \sum_{b = 0}^{B - 1} η_{b} R_{bi} = R$ as well as non-negativity constraints for the rates lead to the optimal subband rates allocation. If we denote the set of non-zero (or active) rates in the color component i by ${Act}_{i}$ , i.e., ${Act}_{i} ≜ {b \in [0, B - 1] | R_{bi} > 0}$ , then the active optimal rates are given by $R_{bi} = \frac{R}{\sum_{j = 1}^{3} ξ_{j}} + \frac{1}{a} \ln (\frac{ε_{i}^{2} G_{b} σ_{bi}^{2} (({MM}^{T})^{- 1})_{ii}}{\prod_{k = 1}^{3} [(ε_{k}^{2} {GMA}_{k} ({MM}^{T})^{- 1})_{kk}]^{ξ_{k} / \sum_{j = 1}^{3} ξ_{i}}}),$ where $ξ_{i} ≜ \sum_{b \in {Act}_{i}} η_{b}, {GMA}_{i} ≜ \prod_{b \in {Act}_{i}} (G_{b} σ_{bi}^{2})^{η_{b} / ξ_{i}} .$ The rate constraint of (5) and the optimal rates of (6) do not account for DS of some of the color components as, for example, is done in JPEG [20]. To do so a DS factor $α_{i}$ is introduced for color component i, so that if the DS is by a factor of 2 horizontally and vertically then $α_{i} = \{\begin{matrix} 1 full component, \\ 0.25 down-sampled component . \end{matrix}$ Thus the rate constraint becomes $\sum_{i = 1}^{3} α_{i} \sum_{b = 0}^{B - 1} η_{b} R_{bi} = R,$ and the solution for the rates is $R_{bi} = \frac{R}{\sum_{j = 1}^{3} α_{j} ξ_{j}} + \frac{1}{a} \ln (\frac{\frac{ε_{i}^{2} G_{b} σ_{bi}^{2} (({MM}^{T})^{- 1})_{ii}}{α_{i}}}{\prod_{k = 1}^{3} {(\frac{(({MM}^{T})^{- 1})_{kk} ε_{k}^{2} {GMA}_{k}}{α_{k}})}^{α_{k} ξ_{k} / \sum_{j = 1}^{3} α_{j} ξ_{j}}}), (b \in {Act}_{i}) .$ There is still a question of how the active rates are determined. The iterative algorithm introduced in [6] for this purpose is described in the next subsection.

The following algorithm can be used to find the active subbands iteratively:

(1)
Assume all the subbands are active and calculate the rates.
(2)
While some $R_{bi} < 0$ :
- $•$
  Set ${Act}_{i} = {b \in [0, B - 1] | R_{bi} > 0}$ .
- $•$
  Calculate new rates.
(3)
Check that the Lagrange multipliers $μ_{bi} ⩾ 0$ , where $μ_{bi} = \{\begin{matrix} \frac{a}{3} η_{b} (({MM}^{T})^{- 1})_{ii} ε_{i}^{2} & b \notin {Act}_{i}, \\ (G_{0} σ_{0 i}^{2} e^{- {aR}_{0 i}} - G_{b} σ_{bi}^{2}), \\ 0, & b \in {Act}_{i} . \end{matrix}$

Here

σ_{0 i}^{2}

and

R_{0 i}

are the variance and rate, respectively, of subband 0 of component i. This subband can be any subband that can be assumed to be active always. Usually we would choose the subband with the maximal energy (variance) for that component, e.g., the DC subband for the DCT.

Once the optimal rates have been determined, there is still the question of the size of the PCM quantization steps to choose to achieve these rates. The quantization steps algorithm proposed in [3] is given next.

The algorithm consists of the following stages:

•
Calculate the optimal rates $R_{bi}^{*}$ (using (6) or (9)).
•
Set some initial quantization steps $Δ_{bi}$ and calculate the resulting rates $R_{bi}$ . The rate $R_{bi}$ is the entropy of subband b of color component i.
•
Update the quantization steps according to $Δ_{bi}^{new} = Δ_{bi} 2^{- (R_{bi}^{*} - R_{bi})}$ until the optimal rates $R_{bi}^{*}$ are sufficiently close, i.e., $E (| R_{bi}^{*} - R_{bi} |) < ε$ for some small constant $ε$ . $E ()$ stands here for statistical mean.

This algorithm provides a means of how to deal with the active subbands. The coefficients of the non-active subbands are zeroed.

Section snippets

The basic CBA algorithm

We begin this section with the introduction of the basic framework for the CBA algorithms. Given a color image in the RGB domain denoted by $x = [R G B]^{T}$ at each pixel, we first apply a CCT to the color components to obtain $\tilde{x} = [C 1 C 2 C 3]^{T} = Mx$ as in Eq. (1). This stage is optional, i.e., the C1, C2, C3 components can be simply the original R, G, B (possibly with some order change). Then we apply the DCT block transform on each of the new color components and group the DCT coefficients into B subbands.

Enhanced CBA algorithm

Examining the approximation errors ${\hat{e}}_{bi}$ coded for the same subband b of C2 and C3, significant correlations can still be noted. This implies that, for example, ${\hat{e}}_{b 2}$ can be coded ‘as is’ and ${\hat{e}}_{b 3}$ can be expanded using ${\hat{e}}_{b 2}$ . For simplicity, linear approximation can once again be used in this second stage expansion. Denoting the new expansion coefficients $δ_{b 1}$ and $δ_{b 0}$ , we can easily derive the optimal values for these coefficients $δ_{b 1} = \frac{cov ({\hat{e}}_{b 2}, {\hat{e}}_{b 3})}{var ({\hat{e}}_{b 2})}, δ_{b 0} = E ({\hat{e}}_{b 3}) - \frac{cov ({\hat{e}}_{b 2}, {\hat{e}}_{b 3})}{var ({\hat{e}}_{b 2})} \cdot E ({\hat{e}}_{b 2}$

The optimal CCT transform

Considering the subband coder, described here in the beginning of Section 1.1, the target function, minimized by the optimal CCT was found to be [5] $f (M) = \prod_{k = 1}^{3} (({MM}^{T})^{- 1})_{kk} \prod_{b = 0}^{B - 1} (σ_{bk}^{2})^{η_{b}} .$ The same target function can be used for the CBA algorithms when the approximation error variances are substituted for $σ_{bk}^{2}$ . These variances can be expressed by the variances of the subbands of C2 and C3 according to Eq. (18) for CBA-2-2 and according to Eqs. (18) and (25) for CBA-2-3. To express these variances

Down-sampling the approximation errors

In JPEG [20] the YUV CCT is employed and the chrominance components U and V are sometimes down-sampled to provide for less MSE distortion for the same image rate. We propose the same procedure in the case of the approximation errors of the CBA algorithms. However, the DS is to be performed in the image domain, thus the following changes in the algorithms of the previous sections are performed:

•
The optimal rates are calculated according to (9) instead of (6).
•
Prior to quantization the errors are

The zero order coefficients

In Stage 4 of the CBA-2-2 algorithm (Section 2.4) we have proposed transmitting both the first order $τ_{b 1}, β_{b 1}$ coefficients and the zero order $τ_{b 0}, β_{b 0}$ coefficients. However, as we saw in the discussion following Eq. (12), it is the first order coefficients that minimize the variances of the approximation errors, thus allowing better compression of the errors. The zero order coefficients on the other hand only bring the mean value of the errors to zero. If we consider a non-active subband, the

Simulation and comparison

In this section we compare the CBA algorithms to the popular JPEG algorithm as a representative of the de-correlation approach. First we consider the case of no DS of the color components. A comparison of the basic CBA-2-2, CBA-2-3 algorithms and JPEG is shown in Fig. 1. Better visual quality can be observed for the images produced by the CBA algorithms when compared to JPEG images. Also quantitative gains of more than 2 dB and up to almost 3 dB can be measured both in the PSNR and the PSPNR for

Summary

A new approach to color image compression has been introduced. The approach is based on exploiting the inter-color correlations between the color primaries instead of transforming them into a de-correlated color space. Based on this approach, two new compression algorithms have been introduced, employing the DCT block transform. Both algorithms use first-order linear approximation of two of the color components (C2 and C3) in each DCT subband based on the third color component (C1). However,

Acknowledgments

This research was supported in part by the HASSIP Research Program HPRN-CT-2002-00285 of the European Commission, and by the Ollendorff Minerva Center. Minerva is funded through the BMBF.

References (23)

E. Gershikov et al.
On color transforms and bit allocation for optimal subband image compression
Signal Process. Image Commun.
(January 2007)
R.K. Kouassi et al.
Approximation of the Karhunen–Loéve transformation and its application to colour images
Signal Process. Image Commun.
(2001)
M. Rabbani et al.
An overview of the JPEG2000 still image compression standard
Signal Process. Image Commun.
(2002)
Y. Roterman et al.
Color image coding using regional correlation of primary colors
Image Vision Comput.
(2007)
E. Gershikov et al.
On a rate-distortion model for color images using a correlation-based approach
E. Gershikov et al.
Does decorrelation really improve color image compression?
E. Gershikov et al.
A rate-distortion approach to optimal color image compression
E. Gershikov, M. Porat, An optimization based approach to color image compression using a rate-distortion model, in:...
E. Gershikov, M. Porat, Data compression of color images using a probabilistic linear transform approach, in: The Sixth...
A. Gersho et al.
Vector Quantization and Signal Compression
(1992)

L. Goffman-Vinopal, The effect of intercolor correlation on color image compression, Graduate School Thesis, Technion,...

Cited by (31)

Novel adaptive color space transform and application to image compression
2011, Signal Processing: Image Communication
Citation Excerpt :
It has been well established by Pratt that RGB channels are highly correlated hence it is extremely unsuitable for optimum compression [14]. In prolongation to development few correlation and de-correlation approaches (CBA and DBA) exploiting inter-channel/inter-color primaries correlation, have also been reported [15–25]. In case of DBA, coding is followed by some de-correlating color component transform (CCT) whereas the CBA utilizes the inter-channel correlation for effective compression.
Various contemporary standards by Joint Picture Expert Group, which is used for compression, exploited the correlation among the color components using a component color space transform before the subband transform stage. The transforms used to de-correlate the colors are primarily the fixed kernel transforms, which are not suitable for large class of images. In this paper an image dependent color space transform (ID-CCT), exploiting the inter-channel redundancy optimally and which is very much suitable for compression has been proposed. Also the comparative performance has been evaluated and a significant improvement has been observed, objectively as well as subjectively over other quantifiable methods.
On texture and image interpolation using Markov models
2009, Signal Processing: Image Communication
Citation Excerpt :
Thus, further research in this field is desirable, and may be developed according to the following directions: modeling of colored textures using Markov random field, auto-regressive processes or other image models [30]; finding mathematical relations of the model parameters in multi-resolutions of the texture image;
Markov-type models characterize the correlation among neighboring pixels in an image in many image processing applications. Specifically, a wide-sense Markov model, which is defined in terms of minimum linear mean-square error estimates, is applicable to image restoration, image compression, and texture classification and segmentation. In this work, we address first-order (auto-regressive) wide-sense Markov images with a separable autocorrelation function. We explore the effect of sampling in such images on their statistical features, such as histogram and the autocorrelation function. We show that the first-order wide-sense Markov property is preserved, and use this result to prove that, under mild conditions, the histogram of images that obey this model is invariant under sampling. Furthermore, we develop relations between the statistics of the image and its sampled version, in terms of moments and generating model noise characteristics. Motivated by these results, we propose a new method for texture interpolation, based on an orthogonal decomposition model for textures. In addition, we develop a novel fidelity criterion for texture reconstruction, which is based on the decomposition of an image texture into its deterministic and stochastic components. Experiments with natural texture images, as well as a subjective forced-choice test, demonstrate the advantages of the proposed interpolation method over presently available interpolation methods, both in terms of visual appearance and in terms of our novel fidelity criterion.
Model-based color natural stochastic textures processing and classification
2016, 2015 IEEE Global Conference on Signal and Information Processing, GlobalSIP 2015
A novel data hiding scheme for block truncation coding -compressed images using dynamic programming strategy
2015, Proceedings of SPIE - The International Society for Optical Engineering
A hybrid image compression scheme using DCT and fractal image compression
2013, International Arab Journal of Information Technology
Color and Image Compression
2013, Digital Color

View all citing articles on Scopus

View full text

Correlation-based approach to color image compression

Abstract

Introduction

Section snippets

The basic CBA algorithm

Enhanced CBA algorithm

The optimal CCT transform

Down-sampling the approximation errors

The zero order coefficients

Simulation and comparison

Summary

Acknowledgments

Signal Process. Image Commun.

Signal Process. Image Commun.

Signal Process. Image Commun.

Image Vision Comput.

On a rate-distortion model for color images using a correlation-based approach

Does decorrelation really improve color image compression?

A rate-distortion approach to optimal color image compression

Vector Quantization and Signal Compression