Lossless region of interest coding
Introduction
When handling digital images the main concern should of course always be the quality of the image delivered to the user. However, quality should be achieved with as compact representation of the image as possible. The research efforts in the field of digital image compression have been immense for many years. One possible classification of the techniques is into two categories, lossy and lossless algorithms.
The more common lossy algorithms take advantage of the fact that in most cases small losses can be tolerated. This gives more freedom on the choice of the algorithm. Higher compression efficiency can also be achieved if one is willing to sacrifice quality.
Lossless compression is used for high demanding applications where no loss of quality can be tolerated. However, this requirement will make the compression performance fairly modest.
Recently lossy and lossless algorithms have partially merged. This is due to new schemes that combine the two without any substantial loss of performance for either mode [11], [14]. This is done inside the framework of the recently very popular schemes which produce an embedded bitstream [8], [9], [11], [13], [15]. An embedded bitstream has the property that the information is encoded in the order of its importance. This means that with one single bitstream the encoder performs its best independently of where the bitstream is truncated. It is therefore possible to hit a specific bitrate or to easily provide different users with different quality images depending on their demands and access link. The embedded bitstream will also make the transmission progressive so that the receiver can use the transmitted information to display an image of gradually improving quality while receiving. It should be pointed out however that which information is the most important is not always an easy choice and sometimes a highly subjective one. For example in some applications or for some users different regions of an image might have different importance. Therefore it might be desirable to have a representation which is not only compact but also selective [16].
This paper presents an image compression algorithm that achieves progressive encoding and also achieves different quality levels for different parts of the image. Specifically an arbitrary region of interest (ROI) in the image can be encoded progressively up to lossless.
The different quality levels represent a rate distortion trade-off. For example, lossless quality might be required for a small part of the image where the user's real interest is located. The rest of the image, which is only important in a contextual sense, is then assigned a lower quality. This can maintain a fairly high compression while meeting the lossless requirement.
The algorithm is based on the completely reversible integer wavelet transforms S [4], S+P [12] and Two Ten (TT) [11]. The transformation coefficients are then encoded in an embedded fashion. The idea of embedded coding originated with wavelet transform and zero-tree coding [15]. Embedded coding can also be done in a DCT framework [9]. However, when lossless encoding is desired, reversible wavelet transforms is the method of choice, at least if the encoding should be naturally progressive all the way.
Another option is to use residual coding [16], where a lossy algorithm is first used to generate an approximation of the original image and then a simple or more elaborate lossless algorithm is used to encode the remaining differences. This puts restrictions also on the lossy part of the scheme since the lossy reconstruction has to be the same on all platforms. However, with careful implementation this is possible with any lossy scheme. For a transform coding scheme it means that the transform should be implemented to yield the same result on all platforms. Other lossy schemes, like vector quantization for example, naturally give the same results.
It should be noted that although it should be desirable in the same range of applications, there is a technical difference between achieving an ROI with better quality than the background and achieving a lossless ROI. In the former case the coefficients affecting the ROI the most can be improved upon to give it a better quality but when lossless encoding is the goal, all coefficients with a possibility of affecting the ROI should be reversibly encoded and should preferably be as few as possible.
The proposed scheme solves the ROI problem in such a way that:
- •
The coding is naturally progressive and without switching algorithm (no residual coding as in [10]).
- •
The ROI or ROIs can have arbitrary shape.
- •
The quality of the ROI is guaranteed and then the quality of the surroundings is degrading gracefully so that there are no visually annoying edges around the ROI.
- •
The functionality of the solution is not dependent on the entropy coding scheme. It is also independent of the method used for coding the shape of the ROIs.
Section snippets
Algorithm overview
The following steps are taken in the algorithm:
- 1.
Calculate the S, S+P or TT transform of the image and find its normalization.
- 2.
If an ROI is chosen, a mask is derived, indicating a set of coefficients in the transform which is sufficient for lossless ROI reconstruction.
- 3.
Transmit the transform coefficients progressively, with the most important information first. Enough information for the ROI can be encoded, while less for the background.
- 4.
Entropy encode the resulting symbol stream.
Results and comparisons
To check the effectiveness of the proposed way of encoding a lossless region of interest, tests were performed on the JPEG2000 test images, with various ROIs and bitrates. Readers interested in JPEG2000 can find more information in [3], [6], [7]. The test images can be found in [5]. The conditions for the experiments are:
- •
For every test image one region of interest is specified.
- •
For every test image only one bitstream is produced. The same bitstream is decoded to several bitrates and results are
Discussion
The present method has been described as progressive in pixel fidelity. However, there is no inherent problem in using it in a hierarchically progressive mode also. The choice of progression is only a choice of coded bit depth on the wavelet coefficients at different stages. The key point of the method is to find out the coefficients that eventually should be coded to their full bit depth. This does not restrict the way to reach the full depth. However, the entropy coding scheme would have to
Conclusions
The well known S, S+P and TT transforms have been used to allow progressive transmission with perfect reconstruction of selected ROIs in the image. The proposed scheme calculates a mask that specifies which coefficients are needed for a certain region. This makes it possible to have progressive transmission while at the last stage of the transmission selected ROIs are reconstructed without any loss in quality. The algorithm is useful in high demanding applications, where compression is
Acknowledgements
The authors would like to thank the anonymous reviewers for their constructive comments that helped significantly in improving the paper.
References (17)
Classification of nonexpansive symmetric extension transforms for multirate filter banks
Appl. Comput. Harmonic Anal.
(1996)- et al.
Medical image compression with lossless regions of interest
Signal Processing
(June 1997) - A.R. Calderbank, I. Daubechies, W. Sweldens, B. Yeo, Lossless image compression using integer to integer wavelet...
- C. Christopoulos (Ed.), JPEG 2000 verification model version 3.0 (B), ISO/IEC JTC1/SC29/WG1 N1135, December 1,...
- et al.
A comparison of reversible methods for data compression
Proc. SPIE
(1990) - ISO/IEC JTC1/SC29/WG1, N461, JPEG2000 Algorithm Test and Submission Package, 15 November...
- ISO/IEC JTC1/SC29/WG1 N505, Call for contributions for JPEG 2000 (JTC 1.29.14, 15444): Image Coding System, 21 March...
- ISO/IEC JTC1/SC29/WG1 N390R, New work item: JPEG 2000 image coding system, 21 March...
Cited by (39)
An HVS-based adaptive coder for perceptually lossy image compression
2003, Pattern RecognitionApplication of a two-dimensional lifting wavelet transform to rotating mechanical vibration data compression
2009, Proceedings of the Institution of Mechanical Engineers, Part C: Journal of Mechanical Engineering ScienceEfficient data reduction techniques for remote applications of a wireless visual sensor network
2013, International Journal of Advanced Robotic SystemsDetecting and coding region of interests in bi-level images for data reduction in wireless visual sensor network
2012, International Conference on Wireless and Mobile Computing, Networking and CommunicationsError-resilient scheme for wavelet video codec using automatic ROI detection and wyner-ziv coding over packet erasure channel
2010, IEEE Transactions on BroadcastingJPEG 2000 Core Coding System (Part 1)
2009, The JPEG 2000 Suite