Abstract
The goal of lossy image compression ought to be reducing entropy while preserving the perceptual quality of the image. Using gaze-tracked change detection experiments, we discover that human vision attends to one scale at a time. This evidence suggests that saliency should be treated on a per-scale basis, rather than aggregated into a single 2D map over all the scales. We develop a compression algorithm which adaptively reduces the entropy of the image according to its saliency map within each scale, using the Laplacian pyramid as both the multiscale decomposition and the saliency measure of the image. We finally return to psychophysics to evaluate our results. Surprisingly, images compressed using our method are sometimes judged to be better than the originals.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Burt, P., Adelson, E.H.: The Laplacian pyramid as a compact image code. IEEE Trans. Communication 31 (1983)
Mallat, S.: A theory for multiresolution signal decomposition: The wavelet representation. PAMI (1989)
Simoncelli, E., Adelson, E.: Noise removal via Bayesian wavelet coring. In: IEEE International Conference on Image Processing, vol. 1 (1996)
Donoho, D.L.: De-noising by soft-thresholding. IEEE Trans. Information Theory 4, 613–627 (1995)
Chambolle, A., DeVore, R.A., Lee, N., Lucier, B.J.: Nonlinear wavelet image processing: variational problems, compression, and noise removal through wavelet shrinkage. IEEE Trans. Image Processing 7, 319–335 (1998)
DeVore, R., Jawerth, B., Lucier, B.: Image compression through wavelet transform coding. IEEE Transactions on Information Theory 32 (1992)
Golner, M.A., Mikhael, W.B., Krishnang, V.: Modified jpeg image compression with region-dependent quantization. Circuits, Systems, and Signal Processing 21, 163–180 (2002)
Lee, S.-H., Shin, J.-K., Lee, M.: Non-uniform image compression using biologically motivated saliency map model. In: Intelligent Sensors, Sensor Networks and Information Processing Conference (2004)
Itti, L.: Automatic foveation for video compression using a neurobiological model of visual attention. IEEE Trans. Image Processing 13, 669–673 (2003)
Itti, L., Koch, C.: Computational modelling of visual attention. Nature Neuroscience, 194–203 (2001)
Torralba, A.: Contextual influences on saliency. In: Itti, L., Rees, G., Tsotsos, J. (eds.) Neurobiology of Attention, pp. 586–593. Academic Press, London (2004)
Rensink, R.A.: Change detection. Annual Review of Psychology 53, 4245–4277 (2002)
O’Regan, J.K., Deubel, H., Clark, J.J., Rensink, R.A.: Picture changes during blinks: looking without seeing and seeing without looking. Visual Cognition 7, 191–211 (2000)
Tomasi, C., Manduchi, R.: Bilateral filtering for gray and color images. In: International Conference on Computer Vision (1998)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Yu, S.X., Lisin, D.A. (2009). Image Compression Based on Visual Saliency at Individual Scales. In: Bebis, G., et al. Advances in Visual Computing. ISVC 2009. Lecture Notes in Computer Science, vol 5875. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-10331-5_15
Download citation
DOI: https://doi.org/10.1007/978-3-642-10331-5_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-10330-8
Online ISBN: 978-3-642-10331-5
eBook Packages: Computer ScienceComputer Science (R0)