Abstract
Image distortion analysis is a fundamental issue in many image processing problems, including compression, restoration, recognition, classification, and retrieval. Traditional image distortion evaluation approaches tend to be heuristic and are often limited to specific application environments. In this work, we investigate the problem of image distortion measurement based on the theory of Kolmogorov complexity, which has rarely been studied in the context of image processing. This work is motivated by the normalized information distance (NID) measure, which has been shown to be a valid and universal distance metric applicable to similarity measurement of any two objects (Li et al. in IEEE Trans Inf Theory 50:3250–3264, 2004). Like Kolmogorov complexity, NID is non-computable. A useful practical solution is to approximate it using the normalized compression distance (NCD) (Li et al. in IEEE Trans Inf Theory 50:3250–3264, 2004), which has led to impressive results in many applications, such as the construction of phylogeny trees from DNA sequences (Li et al. in IEEE Trans Inf Theory 50:3250–3264, 2004). In our earlier work, we showed that direct use of NCD in image processing problems is difficult and proposed a normalized conditional compression distance (NCCD) measure (Nikvand and Wang, 2010), which has significantly wider applicability than existing image similarity/distortion measures. To assess the distortions between two images, we first transform them into the wavelet domain. Assuming stationarity and good decorrelation of wavelet coefficients beyond local regions and across wavelet subbands, the Kolmogorov complexity may be approximated using Shannon entropy (Cover et al. in Elements of information theory. Wiley-Interscience, New York, 1991).
Inspired by Sheikh and Bovik (IEEE Trans Image Process 15(2):430–444, 2006), we adopt a Gaussian scale mixture model for clusters of neighboring wavelet coefficients and a Gaussian channel model for the noise distortions in the human visual system. Combining these assumptions with the NID framework, we derive a novel normalized perceptual information distance measure, where maximum likelihood estimation and least squares regression are employed for parameter fitting. We validate the proposed distortion measure using three large-scale, publicly available, subject-rated image databases, which include a wide range of practical image distortion types and levels. Our results demonstrate that the proposed method predicts perceptual image distortions well.
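To make the NCD approximation mentioned above concrete, the following is a minimal sketch, not the method proposed in this article. It uses the standard definition from Li et al. (2004), \(\mathrm{NCD}(x,y) = \frac{C(xy) - \min\{C(x), C(y)\}}{\max\{C(x), C(y)\}}\), where \(C(\cdot)\) is the compressed length under some real compressor. The choice of `zlib` as that compressor and the helper names `compressed_size` and `ncd` are assumptions made purely for illustration.

```python
import os
import zlib


def compressed_size(data: bytes) -> int:
    """Compressed length in bytes; a computable stand-in for Kolmogorov complexity.

    zlib is an assumed, general-purpose compressor chosen for illustration only.
    """
    return len(zlib.compress(data, 9))


def ncd(x: bytes, y: bytes) -> float:
    """Normalized compression distance between two byte strings."""
    cx = compressed_size(x)
    cy = compressed_size(y)
    cxy = compressed_size(x + y)  # compress the concatenation xy
    return (cxy - min(cx, cy)) / max(cx, cy)


if __name__ == "__main__":
    text = b"the quick brown fox jumps over the lazy dog " * 50
    noise = os.urandom(len(text))  # essentially incompressible data
    print("NCD(text, text): ", ncd(text, text))
    print("NCD(text, noise):", ncd(text, noise))
```

With a general-purpose compressor such as this one, an object compared with itself should score well below an object compared with unrelated random data. Applying the same recipe directly to raw image bytes tends to behave poorly, which is the difficulty with direct use of NCD that the abstract refers to.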
Notes
Hence \(\cdot |s^N\) is dropped in all notations.
References
Li, M., Chen, X., Li, X., Ma, B., Vitányi, P.M.B.: The similarity metric. IEEE Trans. Inf. Theory 50, 3250–3264 (2004)
Cilibrasi, R., Vitányi, P.M.B.: Clustering by compression. IEEE Trans. Inf. Theory 51, 1523–1545 (2005)
Nikvand, N., Wang, Z.: Generic image similarity based on Kolmogorov complexity. In: Proceedings of International Conference on Image Processing (ICIP), 2010
Tran, N.: The normalized compression distance and image distinguishability. In: The 19th IS&T/SPIE Symposium on Electronic Imaging Science and Technology. San Jose, Jan 2007
Sow, D.M., Eleftheriadis, A.: Complexity distortion theory. IEEE Trans. Inf. Theory 49(3), 604–608 (2003)
Sheikh, H.R., Bovik, A.C.: Image information and visual quality. IEEE Trans. Image Process. 15(2), 430–444 (2006)
Li, M., Vitányi, P.: An Introduction to Kolmogorov Complexity and Its Applications, 2nd edn. Springer, Berlin (1997)
Wainwright, M.J., Simoncelli, E.P.: Scale mixtures of Gaussians and the statistics of natural images. In: Solla, S.A., Leen, T.K., Müller, K.-R. (eds.) Advances in Neural Information Processing Systems, vol. 12, pp. 855–861. MIT Press, Cambridge (2000)
Wang, Z., Li, Q.: Information content weighting for perceptual image quality assessment. IEEE Trans. Image Process. 20(5), 1185–1198 (2011)
Burt, P.J., Adelson, E.H.: The Laplacian pyramid as a compact image code. IEEE Trans. Commun. 31, 532–540 (1983)
Wang, Z., Simoncelli, E.P., Bovik, A.C.: Multi-scale structural similarity for image quality assessment. In: Proceedings of IEEE Asilomar Conference on Signals, Systems, and Computers (Pacific Grove, CA), Nov 2003, pp. 1398–1402
Sheikh, H.R., Seshadrinathan, K., Moorthy, K., Wang, Z., Bovik, A.C., Cormack, L.K.: Image and video quality assessment research at LIVE. (Online) Available http://live.ece.utexas.edu/research/quality
Ponomarenko, N., Egiazarian, K.: Tampere image database 2008 TID2008. (Online) Available http://www.ponomarenko.info/tid2008.htm
Larson, E.C., Chandler, D.M.: Categorical image quality (CSIQ) database. (Online) Available http://vision.okstate.edu/csiq
Wang, Z., Bovik, A.C., Sheikh, H.R., Simoncelli, E.P.: Image quality assessment: from error visibility to structural similarity. IEEE Trans. Image Process. 13(4), 600–612 (2004)
Chandler, D.M., Hemami, S.S.: VSNR: a wavelet-based visual signal-to-noise ratio for natural images. IEEE Trans. Image Process. 16, 2284–2298 (2007)
Ponomarenko, N., Silvestri, F., Egiazarian, K., Carli, M., Astola, J., Lukin, V.: On between-coefficient contrast masking of DCT basis functions. In: 3rd International Workshop on Video Processing and Quality Metrics. Scottsdale, Arizona, USA, Jan 2007
Larson, E.C., Chandler, D.M.: Most apparent distortion: full reference image quality assessment and the role of strategy. J. Electron. Imaging 19(1), 011006:1–21 (2010)
Cover, T.M., Thomas, J.A.: Elements of Information Theory. Wiley-Interscience, New York (1991)
Acknowledgments
This research was supported in part by the Natural Sciences and Engineering Research Council of Canada and in part by the Ontario Early Researcher Award program, both of which are gratefully acknowledged.
Cite this article
Nikvand, N., Wang, Z. Image distortion analysis based on normalized perceptual information distance. SIViP 7, 403–410 (2013). https://doi.org/10.1007/s11760-013-0443-4
DOI: https://doi.org/10.1007/s11760-013-0443-4