Lossless region of interest coding

doi:10.1016/S0165-1684(99)00044-4

Signal Processing

Volume 78, Issue 1, October 1999, Pages 1-17

https://doi.org/10.1016/S0165-1684(99)00044-4 Get rights and content

Abstract

This paper describes a method that uses the well known S+P and TT transforms to encode a specially important part of an image, a region of interest (ROI) in a lossless mode. Other parts of the image (the background) are given lower quality levels allowing higher compression. The ROI coding is done in the framework of an embedded wavelet-based image compression algorithm. Entire coding, regional and full image, is done in a naturally progressive manner all the way up to lossless.

Zusammenfassung

Dieser Artikel beschreibt eine Methode, die die wohlbekannten S+P und TT Transformationen benutzt, um einen besonders wichtigen Teil eines Bildes, die Region of Interest (ROI) ohne Informationsverlust zu kodieren. Andere Teile des Bildes (der Hintergrund) erhalten geringere Qualitätsstufen, die eine höhere Kompression erlauben. Die ROI-Kodierung wird im Rahmen eines eingebetteten und auf Wavelets basierenden Bildkompressionsalgorithmus durchgeführt. Die gesamte Kodierung (der ROI und des Gesamtbildes) wird auf eine natürliche, bis hin zu einer informationsverlustfreien, fortschreitenden Art und Weise durchgeführt.

Résumé

Cet article décrit une méthode utilisant les transformations S+P et TT bien connues pour encoder une partie spécialement importante d'une image, une région d'intérêt (ROI), en mode sans perte. D'autres parties de l'image (l'arrière-plan) se voient donner des niveaux de qualité moins élevés ce qui permet une compression accrue. Le codage de la ROI est opéré dans le cadre d'un algorithme de compression d'images basé sur les ondelettes. Tout le codage, aussi bien local que sur l'image complète, est réalisé de façon naturellement progressive jusqu'au codage sans perte.

Introduction

When handling digital images the main concern should of course always be the quality of the image delivered to the user. However, quality should be achieved with as compact representation of the image as possible. The research efforts in the field of digital image compression have been immense for many years. One possible classification of the techniques is into two categories, lossy and lossless algorithms.

The more common lossy algorithms take advantage of the fact that in most cases small losses can be tolerated. This gives more freedom on the choice of the algorithm. Higher compression efficiency can also be achieved if one is willing to sacrifice quality.

Lossless compression is used for high demanding applications where no loss of quality can be tolerated. However, this requirement will make the compression performance fairly modest.

Recently lossy and lossless algorithms have partially merged. This is due to new schemes that combine the two without any substantial loss of performance for either mode [11], [14]. This is done inside the framework of the recently very popular schemes which produce an embedded bitstream [8], [9], [11], [13], [15]. An embedded bitstream has the property that the information is encoded in the order of its importance. This means that with one single bitstream the encoder performs its best independently of where the bitstream is truncated. It is therefore possible to hit a specific bitrate or to easily provide different users with different quality images depending on their demands and access link. The embedded bitstream will also make the transmission progressive so that the receiver can use the transmitted information to display an image of gradually improving quality while receiving. It should be pointed out however that which information is the most important is not always an easy choice and sometimes a highly subjective one. For example in some applications or for some users different regions of an image might have different importance. Therefore it might be desirable to have a representation which is not only compact but also selective [16].

This paper presents an image compression algorithm that achieves progressive encoding and also achieves different quality levels for different parts of the image. Specifically an arbitrary region of interest (ROI) in the image can be encoded progressively up to lossless.

The different quality levels represent a rate distortion trade-off. For example, lossless quality might be required for a small part of the image where the user's real interest is located. The rest of the image, which is only important in a contextual sense, is then assigned a lower quality. This can maintain a fairly high compression while meeting the lossless requirement.

The algorithm is based on the completely reversible integer wavelet transforms S [4], S+P [12] and Two Ten (TT) [11]. The transformation coefficients are then encoded in an embedded fashion. The idea of embedded coding originated with wavelet transform and zero-tree coding [15]. Embedded coding can also be done in a DCT framework [9]. However, when lossless encoding is desired, reversible wavelet transforms is the method of choice, at least if the encoding should be naturally progressive all the way.

Another option is to use residual coding [16], where a lossy algorithm is first used to generate an approximation of the original image and then a simple or more elaborate lossless algorithm is used to encode the remaining differences. This puts restrictions also on the lossy part of the scheme since the lossy reconstruction has to be the same on all platforms. However, with careful implementation this is possible with any lossy scheme. For a transform coding scheme it means that the transform should be implemented to yield the same result on all platforms. Other lossy schemes, like vector quantization for example, naturally give the same results.

It should be noted that although it should be desirable in the same range of applications, there is a technical difference between achieving an ROI with better quality than the background and achieving a lossless ROI. In the former case the coefficients affecting the ROI the most can be improved upon to give it a better quality but when lossless encoding is the goal, all coefficients with a possibility of affecting the ROI should be reversibly encoded and should preferably be as few as possible.

The proposed scheme solves the ROI problem in such a way that:

•
The coding is naturally progressive and without switching algorithm (no residual coding as in [10]).
•
The ROI or ROIs can have arbitrary shape.
•
The quality of the ROI is guaranteed and then the quality of the surroundings is degrading gracefully so that there are no visually annoying edges around the ROI.
•
The functionality of the solution is not dependent on the entropy coding scheme. It is also independent of the method used for coding the shape of the ROIs.

The paper is organized as follows. Section 2 describes the method as it was implemented. In Section 2.1, an algorithm overview is given. Section 2.2 is a review of the S, S+P and TT transforms. Section 2.3 describes how to derive a mask so that these transforms can be used for progressive coding of images with perfect reconstruction of selected ROIs. Section 2.4 describes the normalization of the transform. 2.5 Successive refinement, 2.6 Coefficient ordering and coding give information on the progressive coding of the coefficients with modifications of the method described in [13]. Section 3 gives results and comparisons. Section 4 is a short discussion. Conclusions are drawn in Section 5.

Section snippets

Algorithm overview

The following steps are taken in the algorithm:

1.
Calculate the S, S+P or TT transform of the image and find its normalization.
2.
If an ROI is chosen, a mask is derived, indicating a set of coefficients in the transform which is sufficient for lossless ROI reconstruction.
3.
Transmit the transform coefficients progressively, with the most important information first. Enough information for the ROI can be encoded, while less for the background.
4.
Entropy encode the resulting symbol stream.

The decoder then

Results and comparisons

To check the effectiveness of the proposed way of encoding a lossless region of interest, tests were performed on the JPEG2000 test images, with various ROIs and bitrates. Readers interested in JPEG2000 can find more information in [3], [6], [7]. The test images can be found in [5]. The conditions for the experiments are:

•
For every test image one region of interest is specified.
•
For every test image only one bitstream is produced. The same bitstream is decoded to several bitrates and results are

Discussion

The present method has been described as progressive in pixel fidelity. However, there is no inherent problem in using it in a hierarchically progressive mode also. The choice of progression is only a choice of coded bit depth on the wavelet coefficients at different stages. The key point of the method is to find out the coefficients that eventually should be coded to their full bit depth. This does not restrict the way to reach the full depth. However, the entropy coding scheme would have to

Conclusions

The well known S, S+P and TT transforms have been used to allow progressive transmission with perfect reconstruction of selected ROIs in the image. The proposed scheme calculates a mask that specifies which coefficients are needed for a certain region. This makes it possible to have progressive transmission while at the last stage of the transmission selected ROIs are reconstructed without any loss in quality. The algorithm is useful in high demanding applications, where compression is

Acknowledgements

The authors would like to thank the anonymous reviewers for their constructive comments that helped significantly in improving the paper.

References (17)

C. Brislawn
Classification of nonexpansive symmetric extension transforms for multirate filter banks
Appl. Comput. Harmonic Anal.
(1996)
J. Ström et al.
Medical image compression with lossless regions of interest
Signal Processing
(June 1997)
A.R. Calderbank, I. Daubechies, W. Sweldens, B. Yeo, Lossless image compression using integer to integer wavelet...
C. Christopoulos (Ed.), JPEG 2000 verification model version 3.0 (B), ISO/IEC JTC1/SC29/WG1 N1135, December 1,...
V.K. Heer et al.
A comparison of reversible methods for data compression
Proc. SPIE
(1990)
ISO/IEC JTC1/SC29/WG1, N461, JPEG2000 Algorithm Test and Submission Package, 15 November...
ISO/IEC JTC1/SC29/WG1 N505, Call for contributions for JPEG 2000 (JTC 1.29.14, 15444): Image Coding System, 21 March...
ISO/IEC JTC1/SC29/WG1 N390R, New work item: JPEG 2000 image coding system, 21 March...

There are more references available in the full text version of this article.

Cited by (39)

An HVS-based adaptive coder for perceptually lossy image compression
2003, Pattern Recognition
In this paper a locally adaptive wavelet image coder is presented. It is based on an embedded human visual system model that exploits the space- and frequency-localization properties of wavelet decompositions for tuning the quantization step for each discrete wavelet transforms coefficient, according to the local properties of the image. A coarser quantization is performed in the areas of the image where the visibility of errors is reduced, thus decreasing the total bit rate without affecting the resulting visual quality. The size of the quantization step for each DWT coefficient is computed by taking into account the multiresolution structure of wavelet decompositions, so that there is no need for any side information to be sent to the decoder or for prediction mechanisms.
Perceptually lossless as well as perceptually lossy compression is supported: the desired visual quality of the compressed image is set by means of a quality factor. Moreover, the technique for tuning the target visual quality allows the user to define arbitrarily shaped regions of interest and to set for each one a different quality factor.
Application of a two-dimensional lifting wavelet transform to rotating mechanical vibration data compression
2009, Proceedings of the Institution of Mechanical Engineers, Part C: Journal of Mechanical Engineering Science
Efficient data reduction techniques for remote applications of a wireless visual sensor network
2013, International Journal of Advanced Robotic Systems
Detecting and coding region of interests in bi-level images for data reduction in wireless visual sensor network
2012, International Conference on Wireless and Mobile Computing, Networking and Communications
Error-resilient scheme for wavelet video codec using automatic ROI detection and wyner-ziv coding over packet erasure channel
2010, IEEE Transactions on Broadcasting
JPEG 2000 Core Coding System (Part 1)
2009, The JPEG 2000 Suite

View all citing articles on Scopus

View full text