Correlation-based approach to color image compression

https://doi.org/10.1016/j.image.2007.04.001Get rights and content

Abstract

Most coding techniques for color image compression employ a de-correlation approach—the RGB primaries are transformed into a de-correlated color space, such as YUV or YCbCr, then the de-correlated color components are encoded separately. Examples of this approach are the JPEG and JPEG2000 image compression standards. A different method, of a correlation-based approach (CBA), is presented in this paper. Instead of de-correlating the color primaries, we employ the existing inter-color correlation to approximate two of the components as a parametric function of the third one, called the base component. We then propose to encode the parameters of the approximation function and part of the approximation errors. We use the DCT (discrete cosine transform) block transform to enhance the algorithm's performance. Thus the approximation of two of the color components based on the third color is performed for each DCT subband separately. We use the rate-distortion theory of subband transform coders to optimize the algorithm's bits allocation for each subband and to find the optimal color components transform to be applied prior to coding. This pre-processing stage is similar to the use of the RGB to YUV transform in JPEG and may further enhance the algorithm's performance. We introduce and compare two versions of the new algorithm and show that by using a Laplacian probability model for the DCT coefficients as well as down-sampling the subordinate colors, the compression results are further improved. Simulation results are provided showing that the new CBA algorithms are superior to presently available algorithms based on the common de-correlation approach, such as JPEG.

Introduction

Recently a new algorithm for color image compression was introduced by Goffman and Porat [9]. This algorithm utilizes the high inter-color correlations between RGB color components of natural images [2], [9], [11], [14], [18], [22] without transforming them into another color domain. It does so by dividing the image into square blocks, expanding two of the color primaries (the dependent colors) as a polynomial function of the third (base) color for each block. This way only the polynomial coefficients are encoded for each block of the dependent colors, whereas the base color component is encoded by any monochromatic compression technique. The choice of the base was examined in [9] with general tendency to choose the Green. The conclusion in [9] was that the new algorithm produces less color artifacts than the common de-correlation approach at high compression ratios. The de-correlation approach (such as JPEG [20] and JPEG2000 [10], [16]) consists of transforming the color components into a de-correlated color space, then separately encoding each of the obtained color components. This approach is the most common for color images; however, it is not necessarily the optimal one. Additional examples of this approach can be found in [12], [21].

The algorithm by Goffman and Porat may be considered as an example of a correlation-based approach (CBA) to color image compression. In this work we introduce two new algorithms based on this approach. We use the DCT (discrete cosine transform) block transform [17] to enhance the compression performance. Thus the expansion of the dependent colors is done for each DCT subband instead of for each image block. However, not only the expansion coefficients are coded, but also the approximation errors for part of the subbands. Since the DCT block transform is a special case of a subband transform, we propose to employ the recently developed rate-distortion (R-D) theory for subband transform coders [6]. This theory allows us not only to find the optimal bits allocation for the subbands in MSE (mean square error) sense, but also the optimal color components transform (CCT) as an efficient pre-processing stage.

The structure of this paper is as follows. In Sections 1.1, 1.2, and 1.3 we review the R-D theory for subband transform coders and the algorithms for calculating the optimal subband rates and the corresponding PCM quantization steps. In Sections 2 and 3 we introduce two versions of the new algorithm and discuss how the R-D theory is used in their optimization. Section 4 deals with optimizing the CCT. Then in Section 5 down-sampling (DS) of the subordinate colors and use of the Laplacian probability model for the DCT coefficients are described. This model allows reduction of the algorithms’ complexity and provides lower coding distortion for the same transmission rate. Another potential improvement of the algorithms’ performance is presented in Section 6. Simulation results of the new algorithms and their comparison to JPEG as a representative of the de-correlation approach are discussed in Section 7. Finally, conclusions and summary are given in Section 8.

In [6] a general subband transform coder for color images was considered, based on the following steps.

  • Pre-processing: apply a CCT to the RGB color components of the given image. Denoting the RGB components in vector form as x=[RGB]T and the new color components as x˜=[C1C2C3]T, this stage can be written asx˜=Mxfor some 3×3 CCT matrix M.

  • Apply a subband transform, such as DCT or DWT (discrete wavelet tree) or any filter bank decomposition to each color component. The subband transform is usually assumed to be non-expansive, i.e., it transforms a signal of length N to a signal with the same length.

  • Quantize the coefficients of each subband of each color component. A uniform scalar quantizer was considered. We refer to this stage as applying the PCM (pulse code modulation) scheme.

  • Post-processing: encode the quantized coefficients in a lossless manner, such as run-length coding, delta modulation or entropy coding.

At high rates R the R-D behavior of the basic PCM scheme applied to a random signal x with variance σx2 is [7], [19]d(R)=ε2σx22-2R,where ε2 is a constant dependent upon the distribution of x. This is only an approximate behavior or model; however, based on (2) the R-D model of a general monochromatic subband coder with B subbands can be expressed asdSC({Rb})=b=0B-1ηbGbdb(Rb)=b=0B-1ηbGbσb2ε22-2Rb.Here db(Rb) is the MSE of subband b (b0,1,,B-1), σb2 is its variance, Gb is its energy gain [19], Rb is the rate allocated to it, and ηb is its sample rate. The sample rate of a subband is equal to the relative part of the number of coefficients in it from the total number of samples in the subband transformed signal. For a uniform subband transform, such as the block DCT, ηb=1/B.

Consider now a color image. The coding algorithm described in the beginning of this section may be regarded as applying a CCT to the image, followed by monochromatic subband coding of each of the new color components. The R-D model of this algorithm isd({Rbi},M)=13i=13b=0B-1ηbGbσbi2εi2e-aRbi((MMT)-1)ii,where a=2ln2 and σbi2 and Rbi remain the same, but for subband b of color component i (i1,2,3). Note that standard entropy coding is assumed here in the post-processing stage or alternatively this stage is not taken into account.

Minimizing the expression of Eq. (4) under the rate constrainti=13b=0B-1ηbRbi=Ras well as non-negativity constraints for the rates lead to the optimal subband rates allocation. If we denote the set of non-zero (or active) rates in the color component i by Acti, i.e., Acti{b[0,B-1]|Rbi>0}, then the active optimal rates are given byRbi=Rj=13ξj+1alnεi2Gbσbi2((MMT)-1)iik=13[(εk2GMAk(MMT)-1)kk]ξk/j=13ξi,whereξibActiηb,GMAibActi(Gbσbi2)ηb/ξi.The rate constraint of (5) and the optimal rates of (6) do not account for DS of some of the color components as, for example, is done in JPEG [20]. To do so a DS factor αi is introduced for color component i, so that if the DS is by a factor of 2 horizontally and vertically then αi=1full component,0.25down-sampled component.Thus the rate constraint becomesi=13αib=0B-1ηbRbi=R,and the solution for the rates isRbi=Rj=13αjξj+1alnRbi=εi2Gbσbi2((MMT)-1)iiαik=13((MMT)-1)kkεk2GMAkαkαkξk/j=13αjξj,(bActi).There is still a question of how the active rates are determined. The iterative algorithm introduced in [6] for this purpose is described in the next subsection.

The following algorithm can be used to find the active subbands iteratively:

  • (1)

    Assume all the subbands are active and calculate the rates.

  • (2)

    While some Rbi<0:

    • Set Acti={b[0,B-1]|Rbi>0}.

    • Calculate new rates.

  • (3)

    Check that the Lagrange multipliers μbi0, where μbi=a3ηb((MMT)-1)iiεi2bActi,(G0σ0i2e-aR0i-Gbσbi2),0,bActi.

Here σ0i2 and R0i are the variance and rate, respectively, of subband 0 of component i. This subband can be any subband that can be assumed to be active always. Usually we would choose the subband with the maximal energy (variance) for that component, e.g., the DC subband for the DCT.

Once the optimal rates have been determined, there is still the question of the size of the PCM quantization steps to choose to achieve these rates. The quantization steps algorithm proposed in [3] is given next.

The algorithm consists of the following stages:

  • Calculate the optimal rates Rbi* (using (6) or (9)).

  • Set some initial quantization steps Δbi and calculate the resulting rates Rbi. The rate Rbi is the entropy of subband b of color component i.

  • Update the quantization steps according to Δbinew=Δbi2-(Rbi*-Rbi)until the optimal rates Rbi* are sufficiently close, i.e., E(|Rbi*-Rbi|)<ε for some small constant ε. E() stands here for statistical mean.

This algorithm provides a means of how to deal with the active subbands. The coefficients of the non-active subbands are zeroed.

Section snippets

The basic CBA algorithm

We begin this section with the introduction of the basic framework for the CBA algorithms. Given a color image in the RGB domain denoted by x=[RGB]T at each pixel, we first apply a CCT to the color components to obtain x˜=[C1C2C3]T=Mx as in Eq. (1). This stage is optional, i.e., the C1, C2, C3 components can be simply the original R, G, B (possibly with some order change). Then we apply the DCT block transform on each of the new color components and group the DCT coefficients into B subbands.

Enhanced CBA algorithm

Examining the approximation errors e^bi coded for the same subband b of C2 and C3, significant correlations can still be noted. This implies that, for example, e^b2 can be coded ‘as is’ and e^b3 can be expanded using e^b2. For simplicity, linear approximation can once again be used in this second stage expansion. Denoting the new expansion coefficients δb1 and δb0, we can easily derive the optimal values for these coefficientsδb1=cov(e^b2,e^b3)var(e^b2),δb0=E(e^b3)-cov(e^b2,e^b3)var(e^b2)·E(e^b2

The optimal CCT transform

Considering the subband coder, described here in the beginning of Section 1.1, the target function, minimized by the optimal CCT was found to be [5]f(M)=k=13((MMT)-1)kkb=0B-1(σbk2)ηb.The same target function can be used for the CBA algorithms when the approximation error variances are substituted for σbk2. These variances can be expressed by the variances of the subbands of C2 and C3 according to Eq. (18) for CBA-2-2 and according to Eqs. (18) and (25) for CBA-2-3. To express these variances

Down-sampling the approximation errors

In JPEG [20] the YUV CCT is employed and the chrominance components U and V are sometimes down-sampled to provide for less MSE distortion for the same image rate. We propose the same procedure in the case of the approximation errors of the CBA algorithms. However, the DS is to be performed in the image domain, thus the following changes in the algorithms of the previous sections are performed:

  • The optimal rates are calculated according to (9) instead of (6).

  • Prior to quantization the errors are

The zero order coefficients

In Stage 4 of the CBA-2-2 algorithm (Section 2.4) we have proposed transmitting both the first order τb1,βb1 coefficients and the zero order τb0,βb0 coefficients. However, as we saw in the discussion following Eq. (12), it is the first order coefficients that minimize the variances of the approximation errors, thus allowing better compression of the errors. The zero order coefficients on the other hand only bring the mean value of the errors to zero. If we consider a non-active subband, the

Simulation and comparison

In this section we compare the CBA algorithms to the popular JPEG algorithm as a representative of the de-correlation approach. First we consider the case of no DS of the color components. A comparison of the basic CBA-2-2, CBA-2-3 algorithms and JPEG is shown in Fig. 1. Better visual quality can be observed for the images produced by the CBA algorithms when compared to JPEG images. Also quantitative gains of more than 2 dB and up to almost 3 dB can be measured both in the PSNR and the PSPNR for

Summary

A new approach to color image compression has been introduced. The approach is based on exploiting the inter-color correlations between the color primaries instead of transforming them into a de-correlated color space. Based on this approach, two new compression algorithms have been introduced, employing the DCT block transform. Both algorithms use first-order linear approximation of two of the color components (C2 and C3) in each DCT subband based on the third color component (C1). However,

Acknowledgments

This research was supported in part by the HASSIP Research Program HPRN-CT-2002-00285 of the European Commission, and by the Ollendorff Minerva Center. Minerva is funded through the BMBF.

References (23)

  • L. Goffman-Vinopal, The effect of intercolor correlation on color image compression, Graduate School Thesis, Technion,...
  • Cited by (31)

    • Novel adaptive color space transform and application to image compression

      2011, Signal Processing: Image Communication
      Citation Excerpt :

      It has been well established by Pratt that RGB channels are highly correlated hence it is extremely unsuitable for optimum compression [14]. In prolongation to development few correlation and de-correlation approaches (CBA and DBA) exploiting inter-channel/inter-color primaries correlation, have also been reported [15–25]. In case of DBA, coding is followed by some de-correlating color component transform (CCT) whereas the CBA utilizes the inter-channel correlation for effective compression.

    • On texture and image interpolation using Markov models

      2009, Signal Processing: Image Communication
      Citation Excerpt :

      Thus, further research in this field is desirable, and may be developed according to the following directions: modeling of colored textures using Markov random field, auto-regressive processes or other image models [30]; finding mathematical relations of the model parameters in multi-resolutions of the texture image;

    • Model-based color natural stochastic textures processing and classification

      2016, 2015 IEEE Global Conference on Signal and Information Processing, GlobalSIP 2015
    • A novel data hiding scheme for block truncation coding -compressed images using dynamic programming strategy

      2015, Proceedings of SPIE - The International Society for Optical Engineering
    • A hybrid image compression scheme using DCT and fractal image compression

      2013, International Arab Journal of Information Technology
    • Color and Image Compression

      2013, Digital Color
    View all citing articles on Scopus
    View full text