Image distortion analysis based on normalized perceptual information distance

  • Original Paper
  • Published in: Signal, Image and Video Processing

Abstract

Image distortion analysis is a fundamental issue in many image processing problems, including compression, restoration, recognition, classification, and retrieval. Traditional image distortion evaluation approaches tend to be heuristic and are often limited to specific application environments. In this work, we investigate the problem of image distortion measurement based on the theory of Kolmogorov complexity, which has rarely been studied in the context of image processing. This work is motivated by the normalized information distance (NID) measure, which has been shown to be a valid and universal distance metric applicable to similarity measurement of any two objects (Li et al. in IEEE Trans Inf Theory 50:3250–3264, 2004). Like Kolmogorov complexity, NID is non-computable. A useful practical solution is to approximate it using the normalized compression distance (NCD) (Li et al. in IEEE Trans Inf Theory 50:3250–3264, 2004), which has led to impressive results in many applications, such as the construction of phylogeny trees from DNA sequences (Li et al. in IEEE Trans Inf Theory 50:3250–3264, 2004). In our earlier work, we showed that direct use of NCD on image processing problems is difficult and proposed a normalized conditional compression distance (NCCD) measure (Nikvand and Wang, 2010), which has significantly wider applicability than existing image similarity/distortion measures. To assess the distortions between two images, we first transform them into the wavelet domain. Assuming stationarity and good decorrelation of wavelet coefficients beyond local regions and across wavelet subbands, the Kolmogorov complexity may be approximated using Shannon entropy (Cover and Thomas in Elements of information theory. Wiley-Interscience, New York, 1991). Inspired by Sheikh and Bovik (IEEE Trans Image Process 15(2):430–444, 2006), we adopt a Gaussian scale mixture model for clusters of neighboring wavelet coefficients and a Gaussian channel model for the noise distortions in the human visual system. Combining these assumptions with the NID framework, we derive a novel normalized perceptual information distance measure, where maximum likelihood estimation and least-squares regression are employed for parameter fitting. We validate the proposed distortion measure using three large-scale, publicly available, and subject-rated image databases, which include a wide range of practical image distortion types and levels. Our results demonstrate the good prediction power of the proposed method for perceptual image distortions.
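
As background for the NCD approximation mentioned above, the following is a minimal Python sketch of the generic compression-based distance, NCD(x, y) = (C(xy) - min{C(x), C(y)}) / max{C(x), C(y)}, where C(·) denotes compressed length. It assumes zlib as a stand-in compressor acting on raw byte strings, and the function names are illustrative. It is not the NCCD or the perceptual measure proposed in this paper, both of which go beyond direct compression of the inputs.

```python
import random
import zlib


def compressed_size(data: bytes) -> int:
    """Length in bytes of the zlib-compressed input, used as a proxy for K(data)."""
    return len(zlib.compress(data, 9))


def ncd(x: bytes, y: bytes) -> float:
    """Normalized compression distance between two byte strings.

    Assumption: zlib stands in for the ideal compressor, so the result is
    only a rough approximation of the normalized information distance.
    """
    cx = compressed_size(x)
    cy = compressed_size(y)
    cxy = compressed_size(x + y)  # compress the concatenation of both inputs
    return (cxy - min(cx, cy)) / max(cx, cy)


if __name__ == "__main__":
    # Hypothetical inputs: a highly redundant string and pseudo-random bytes.
    a = b"the quick brown fox jumps over the lazy dog " * 50
    rng = random.Random(0)
    b = bytes(rng.randrange(256) for _ in range(len(a)))

    print(f"NCD(a, a) = {ncd(a, a):.3f}")  # identical inputs: small distance
    print(f"NCD(a, b) = {ncd(a, b):.3f}")  # unrelated inputs: distance near 1
```

With a practical compressor the distance between identical inputs is small but not exactly zero, because the compressor still spends some bytes on the repeated copy.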

Notes

  1. Hence \(\cdot |s^N\) is dropped in all notations.

References

  1. Li, M., Chen, X., Li, X., Ma, B., Vitányi, P.M.B.: The similarity metric. IEEE Trans. Inf. Theory 50, 3250–3264 (2004)

  2. Cilibrasi, R., Vitányi, P.M.B.: Clustering by compression. IEEE Trans. Inf. Theory 51, 1523–1545 (2005)

  3. Nikvand, N., Wang, Z.: Generic image similarity based on Kolmogorov complexity. In: Proceedings of International Conference on Image Processing (ICIP) (2010)

  4. Tran, N.: The normalized compression distance and image distinguishability. In: The 19th IS&T/SPIE Symposium on Electronic Imaging Science and Technology, San Jose, Jan 2007

  5. Sow, D.M., Eleftheriadis, A.: Complexity distortion theory. IEEE Trans. Inf. Theory 49(3), 604–608 (2003)

  6. Sheikh, H.R., Bovik, A.C.: Image information and visual quality. IEEE Trans. Image Process. 15(2), 430–444 (2006)

  7. Li, M., Vitányi, P.: An Introduction to Kolmogorov Complexity and Its Applications, 2nd edn. Springer, Berlin (1997)

  8. Wainwright, M.J., Simoncelli, E.P.: Scale mixtures of Gaussians and the statistics of natural images. In: Solla, S.A., Leen, T.K., Müller, K.-R. (eds.) Advances in Neural Information Processing Systems, vol. 12, pp. 855–861. MIT Press, Cambridge (2000)

  9. Wang, Z., Li, Q.: Information content weighting for perceptual image quality assessment. IEEE Trans. Image Process. 20(5), 1185–1198 (2011)

  10. Burt, P.J., Adelson, E.H.: The Laplacian pyramid as a compact image code. IEEE Trans. Commun. 31, 532–540 (1983)

  11. Wang, Z., Simoncelli, E.P., Bovik, A.C.: Multi-scale structural similarity for image quality assessment. In: Proceedings of IEEE Asilomar Conference on Signals, Systems, and Computers (Pacific Grove, CA), Nov 2003, pp. 1398–1402

  12. Sheikh, H.R., Seshadrinathan, K., Moorthy, K., Wang, Z., Bovik, A.C., Cormack, L.K.: Image and video quality assessment research at LIVE. (Online) Available http://live.ece.utexas.edu/research/quality

  13. Ponomarenko, N., Egiazarian, K.: Tampere image database 2008 TID2008. (Online) Available http://www.ponomarenko.info/tid2008.htm

  14. Larson, E.C., Chandler, D.M.: Categorical image quality (CSIQ) database. (Online) Available http://vision.okstate.edu/csiq

  15. Wang, Z., Bovik, A.C., Sheikh, H.R., Simoncelli, E.P.: Image quality assessment: from error visibility to structural similarity. IEEE Trans. Image Process. 13(4), 600–612 (2004)

  16. Chandler, D.M., Hemami, S.S.: VSNR: a wavelet-based visual signal-to-noise ratio for natural images. IEEE Trans. Image Process. 16, 2284–2298 (2007)

  17. Ponomarenko, N., Silvestri, F., Egiazarian, K., Carli, M., Astola, J., Lukin, V.: On between-coefficient contrast masking of DCT basis functions. In: 3rd International Workshop on Video Processing and Quality Metrics for Consumer Electronics, Scottsdale, Arizona, USA, Jan 2007

  18. Larson, E.C., Chandler, D.M.: Most apparent distortion: full-reference image quality assessment and the role of strategy. J. Electron. Imaging 19, 011006:1–21 (2010)

  19. Cover, T.M., Thomas, J.A.: Elements of Information Theory. Wiley-Interscience, New York (1991)

Acknowledgments

This research was supported in part by the Natural Sciences and Engineering Research Council of Canada and in part by the Ontario Early Researcher Award program, both of which are gratefully acknowledged.

Author information

Corresponding author

Correspondence to Nima Nikvand.

About this article

Cite this article

Nikvand, N., Wang, Z. Image distortion analysis based on normalized perceptual information distance. SIViP 7, 403–410 (2013). https://doi.org/10.1007/s11760-013-0443-4

  • DOI: https://doi.org/10.1007/s11760-013-0443-4
