Elsevier

Pattern Recognition

Volume 43, Issue 12, December 2010, Pages 4055-4068
Pattern Recognition

Image quality assessment by discrete orthogonal moments

https://doi.org/10.1016/j.patcog.2010.05.026Get rights and content

Abstract

This paper proposes a novel full-reference quality assessment (QA) metric that automatically assesses the quality of an image in the discrete orthogonal moments domain. This metric is constructed by representing the spatial information of an image using low order moments. The computation, up to fourth order moments, is performed on each individual (8×8) non-overlapping block for both the test and reference images. Then, the computed moments of both the test and reference images are combined in order to determine the moment correlation index of each block in each order. The number of moment correlation indices used in this study is nine. Next, the mean of each moment correlation index is computed and thereafter the single quality interpretation of the test image with respect to its reference is determined by taking the mean value of the computed means of all the moment correlation indices. The proposed objective metrics based on two discrete orthogonal moments, Tchebichef and Krawtchouk moments, are developed and their performances are evaluated by comparing them with subjective ratings on several publicly available databases. The proposed discrete orthogonal moments based metric performs competitively well with the state-of-the-art models in terms of quality prediction while outperforms them in terms of computational speed.

Introduction

Advancement made in digital imaging and communication techniques over the past two decades have led to an abundance of new applications for human end-use. At the same time, demand for better quality has also grown in tandem. Measuring and interpreting the quality of acquired images against a database of known, but diverse reference images has become one of the most challenging tasks in visual form analysis. Given the situation, probably the best way to evaluate the quality of an image is to seek the opinions of human observers as they are representative of the ultimate receivers in most image processing environment. Towards this end, the subjective metric, mean opinion scores (MOS) based on the characteristics of the human visual system (HVS), has been developed to measure perceptual quality [1]. Another subjective metric with similar characteristics is the difference mean opinion scores (DMOS) [2]. Nevertheless, based on the feedback from human observers, subjective evaluation is not possible to be incorporated into automatic systems because of their inability to render observations in real time.

As a result of aforementioned limitation, a different approach which evaluates the image quality objectively that is consistent with the subjective human evaluation has become the major research direction in the image/video quality assessment (QA). It has been proven to be invaluable in many real world applications particularly in the telecommunication industry, multimedia and computer vision operations.

The objective evaluation involves designing a mathematically defined metric that is low in computational complexity, and independent of viewing conditions and human observers. The standard approach for this assessment is by using a full-reference (FR) QA, in which the image similarity or fidelity is measured against a ‘reference’ image/video of ‘perfect quality’ (no loss in fidelity with the original scene). Based on this concept, an abundance of objective metrics have been developed but differ in approaches. In spite of this, they deliver a single consistent score that generalizes the quality of an image or video sequence in various viewing conditions [3], [4], [5], [6], [7], [8], [9], [10].

Of these, the most widely used FR objective metrics are mean squared error (MSE) and its variants such as root mean squared error (RMSE), signal-to-noise ratio (SNR) and peak signal-to-noise ratio (PSNR). The metrics are based on the statistical error sensitivity approach and Minkowski error pooling concept that make them computationally attractive. Nevertheless, they do not correlate well with perceived quality measurement because human perception of image/video distortions and artifacts is not taken into consideration. It has been justified that by integrating with some HVS characteristics, the performance of MSE or its variants can be improved upon [11], [12], [13].

It has also been reported that none of the other objective image quality metrics thus far has shown any clear advantage over simple mathematical measures such as PSNR under strict testing conditions and different image distortion environment [2], [14], [15]. Therefore, researchers are currently focused on improving PSNR or introducing more efficient assessment algorithms.

Orthogonal moments have been widely used in various areas in image processing for the past few decades. Their applications include pattern or object classification [16], [17], [18], [19], [20], [21], [22], text and characters recognition [23], [24], classification of multi-spectral texture [25], texture retrieval [26], image registration [27], image segmentation and edge detection [28], [29], [30], image compression [31], face recognition [32], [33], stereo vision [34], palmprint verification [35], content-based image retrieval [36], pose estimation of three-dimensional objects [37], focus measure [38], image segmentation watermarking [39], [40], etc. The successful implementation of the orthogonal moments in the aforementioned applications is mainly due to their inherent properties and some manipulations on the computed moment values. Their inherent properties include invariance properties, information compactness, oscillating kernels, conveying spatial and phase information of an image.

Orthogonal moments are good signal descriptors with their low order components1 are sufficient to provide discriminant power in pattern or object recognition [41], [42]. This low order moments set provides a global, general but coarse description about the object. With the moment order increases, more and more fine information about the object is included in the computed moments. This is due to the inherent characteristics of orthogonal polynomials which oscillate like sine and cosine functions. The oscillating functions sample the image pixels of different location with different sampling rate when different moment orders are used. For the case of discrete orthogonal moments such as Tchebichef and Krawtchouk moments, image reconstruction can be performed without any loss if up to the maximum order2 of the discrete orthogonal moments are used.

One of the important applications of orthogonal moments in image process is the edge detection of the objects inside an image. This is accomplished through the ability of orthogonal polynomials to function as a filters bank of different spatial frequencies, by changing the order of sine and cosine-like polynomials. Edge information of the objects (such as in the directions of horizontal, vertical and diagonal) is able to be extracted when this filters bank is moving across the whole image. Different combinations of orthogonal polynomials extract the edge information of different directions. The phenomena can obviously observed through the orthogonal moments basis functions image.

Inspired by their successful applications in various areas of image processing, a novel metric for quantifying the quality of images objectively based on the correlation indices defined in the discrete orthogonal moment domain is proposed in this paper. It is believed that the orthogonal moments can be applied as a features set in measuring how an image is departure from its reference due to their favorable properties such as information compactness in representation, spatial globality, spatial-frequency filters bank, image spatial information representation. These properties facilitate recognition through global structures corresponding to spatial frequency and spatial localization. The reader can refer to [43] for the details of the essential characteristics and computation properties of various moment functions and their roles in image analysis. Nevertheless, one of the main drawbacks of orthogonal moments is their computational complexity. This problem is usually handled by simplifying their computation via recurrence relationships and symmetrical property between successive polynomials [44], [45], [46], [47], [48].

The organization of this paper is as follows. A brief background of the state-of-the-art FR QA models based on error sensitivity, structural similarity, natural scene statistics, and human vision system (HVS) is given in Section 2. Brief information on the characteristics of discrete orthogonal moments and their correlation relationships with image contents are given in Section 3. The mathematical formulation of the proposed discrete orthogonal moments-based models is presented in Section 4. The performance of the proposed model is validated through experiments using natural images of various distortions as well as comparative analysis with other models as shown in Section 5. The study is concluded in Section 6.

Section snippets

State-of-the-art QA models

This section briefly reviews several state-of-the-art QA models which are based on different approaches in quantifying the perceptual quality of distorted images. They are the error sensitivity-based (MSE and PSNR), structural similarity-based (UQI and SSIM), natural sense statistics-based (IFC and VIF) and HVS-based (VDP, PDM, Sarnoff's JND model and VSNR) models.

Discrete orthogonal moments

In this section, the basic properties and mathematical models of two discrete orthogonal moments used in this analysis namely the Tchebichef and Krawtchouk moments are presented briefly.

Discrete orthogonal moments-based image quality metric

How accurate and efficient a features set encodes the distortions or differences of an image from its perfect physical realism is one of the crucial factors in providing precise quantitative prediction on the image fidelity. A good image descriptor is able to represent image information efficiently even for small changes in magnitude (pixel intensity) and location (spatial information). Besides that, the effects of adjacent pixels also significantly influence the perceptual quality of an image

Experimental study

In the experimental study, the performance of the proposed discrete orthogonal moments (Tchebichef and Krawtchouk moments) based image quality metric is compared with several existing objective metrics. Three experiments are reported in this paper, i.e. subjective evaluation using LIVE database [63], cross validation across independent databases, and computational speed. The existing objective metrics which are used for comparison consist of (1) PSNR, (2) weighted signal-to-noise-ratio (WSNR)

Conclusion

In this paper, a novel full-reference metric is developed based on discrete orthogonal moments to objectively and quantitatively predict how the test image is deviated or distorted from its perfect reference. Two types of discrete orthogonal moments, i.e. Tchebichef and Krawtchouk moments, are chosen as the features to convey the image quality information. The performance of the proposed metrics over subjective ratings from several publicly accessible datasets is validated with PSNR, WSNR,

Acknowledgements

The authors would like to thank Dr. H.R. Sheikh for supplying the LIVE image dataset, Dr. Z. Wang for supplying the routines used in VQEG Phase I FR-TV test for the regression analysis of subjective/objective data comparison and the SSIM MATLAB routines, and the anonymous reviewers for their valuable and insightful comments for making this manuscript more readable.

Chong-Yaw Wee received his B.Eng., M.Eng.Sc. and Ph.D. degrees in electrical engineering from University of Malaya, Malaysia, in 2001, 2003 and 2007, respectively. He was with Centre for Signal and Image Processing (CISIP), Department of Electrical Engineering at University of Malaya as a post-doctoral research fellow from 2007 to 2008. He is currently with the Centre for Signal Processing (CSP), Nanyang Technological University, Singapore as a research fellow. His research interests lie in the

References (70)

  • T.N. Pappas et al.

    Perceptual criteria for image quality evaluation

  • Final report from the video quality experts group on the validation of objective models of video quality assessment,...
  • A.B. Watson, Dctune: a technique for visual optimization of dct quantization matrices for individuals images, in:...
  • M. Miyahara et al.

    Objectives picture quality scale (pqs) for image cosing

    IEEE Transactions on Communications

    (1998)
  • Z. Wang et al.

    A universal image quality

    IEEE Signal Processing Letters

    (2002)
  • I. Avcibas et al.

    Statistical evaluation of image quality measure

    Journal of Electronic Imaging

    (2002)
  • Z. Wang et al.

    Objective video quality assessment

  • D.V. Weken et al.

    Using similarity measures and homogeneity for the comparison of images

    Image and Vision Computing

    (2004)
  • H.R. Sheikh et al.

    An information fidelity criterion for image quality assessment using natural scene statistics

    IEEE Transactions on Image Processing

    (2005)
  • H.R. Sheikh et al.

    Image information and visual quality

    IEEE Transactions on Image Processing

    (2006)
  • A.M. Eskicioglu et al.

    Image quality measures and their performance

    IEEE Transactions on Communications

    (1995)
  • N. Damera-Venkata et al.

    Image quality assessment based on a degradation model

    IEEE Transactions on Image Processing

    (2000)
  • D.M. Chandler et al.

    Vsnr: a wavelet-based visual signal-to-noise ratio for natural images

    IEEE Transactions on Image Processing

    (2007)
  • Final report from the video quality experts group on the validation of objective models of video quality assessment,...
  • J.-B. Martens et al.

    Image dissimilarity

    Signal Processing

    (1998)
  • S. Dudani et al.

    Aircraft identification by moment invariants

    IEEE Transactions on Computers

    (1977)
  • M.R. Teague

    Image analysis via the general theory of moments

    Journal of the Optical Society of America

    (1980)
  • Y.S. Abu-Mostafa et al.

    Recognitive aspects of moment invariants

    IEEE Transactions on Pattern Analysis and Machine Intelligence

    (1984)
  • M. Tuceryan

    Moment based texture segmentation

    Pattern Recognition Letters

    (1994)
  • R.R. Bailey et al.

    Orthogonal moment features for use with parametric and non-parametric classifiers

    IEEE Transactions on Pattern Analysis and Machine Intelligence

    (1996)
  • S. Ghosal et al.

    A moment-based unified approach to image feature detection

    IEEE Transactions on Image Processing

    (1997)
  • T. Suk et al.

    Projective moment invariants

    IEEE Transactions on Pattern Analysis and Machine Intelligence

    (2004)
  • M. Dehghan, K.Faez, Farsi handwritten character recognition with moment invariants, in: Proceedings of the 13th...
  • A. Khotanzad, C. Chung, Handwritten digit recognition using combination of neural network classifiers, in: 1998 IEEE...
  • L. Wang et al.

    Using zernike moments for the illumination and geometry invariant classification of multispectral texture

    IEEE Transactions on Image Processing

    (1998)
  • D.-G. Sim et al.

    Invariant texture retrieval using modified Zernike moments

    Image Vision Computing

    (2004)
  • J. Flusser et al.

    A moment-based approach to registration of images with affine geometric distortion

    IEEE Transactions on Geoscience and Remote Sensing

    (1994)
  • S. Ghosal, R. Mehrotra, Edge detection using orthogonal moment-based operators, in: Proceedings of 11th Image, Speech...
  • S. Ghosal et al.

    Segmentation of range images: an orthogonal moment-based integrated approach

    IEEE Transactions on Robotics and Automation

    (1993)
  • A.P. Reeves, M.L. Akey, O.R. Mitchell, A moment based two-dimensional edge detector, in: Proceedings of the CVPR,...
  • H.S. Hsu

    Moment preserving edge detection and its application to image data compression

    Optical Engineering

    (1993)
  • N.H. Foon, Y.-H. Pang, A.T.B. Jin, D.N.C. Ling, An efficient method for human face recognition using wavelet transform...
  • J. Haddadnia, M. Ahmadi, K. Raahemifar, An effective feature extraction method for face recognition, in: Proceedings of...
  • R. Mukundan, A. Pang, Stereo image analysis: a new approach using orthogonal moments, in: Proceedings of the 7th Asian...
  • Y.-H. Pang et al.

    Palmprint verification with moments

    Journal of WSCG

    (2003)
  • Cited by (48)

    • Comparative study of orthogonal moments for human postures recognition

      2023, Engineering Applications of Artificial Intelligence
    • ASSP: An adaptive sample statistics-based pooling for full-reference image quality assessment

      2022, Neurocomputing
      Citation Excerpt :

      Mutual information between reference and distorted images is calculated to construct the fidelity in the metrics of information fidelity criterion (IFC) [24] and visual information fidelity (VIF) [25]. Moment correlation between reference and distorted images is considered in [26]. In the second step, various pooling strategies have been adopted.

    • Multiparameter discrete transforms based on discrete orthogonal polynomials and their application to image watermarking

      2021, Signal Processing: Image Communication
      Citation Excerpt :

      Recently, the importance and applications of DOPs in signal and image processing received more attention [11–13]. As Fourier moments have been implemented in many applications [14], DOPs were exploited in various digital signal and image processing applications including image watermarking [11,15–18], recognition [12,19,20], image analysis [21,22], image quality assessment [23,24], image blur assessment [25] as well as coding and reconstruction [26–28]. In this paper, we introduce new families of multiparameter (fractional) discrete transforms defined in terms of three types of DOPs, namely Krawtchouk, Chebyshev, and Charlier DOPs.

    • Blind image quality assessment via learnable attention-based pooling

      2019, Pattern Recognition
      Citation Excerpt :

      FR methods evaluate visual quality by quantifying the differences between the distorted and corresponding reference images. In the past decades, researchers have made many efforts to develop effective FR-IQA metrics according to the known characteristics of the HVS [3–9]. RR approaches are desirable when only partial information of the reference signal is available [10,11].

    View all citing articles on Scopus

    Chong-Yaw Wee received his B.Eng., M.Eng.Sc. and Ph.D. degrees in electrical engineering from University of Malaya, Malaysia, in 2001, 2003 and 2007, respectively. He was with Centre for Signal and Image Processing (CISIP), Department of Electrical Engineering at University of Malaya as a post-doctoral research fellow from 2007 to 2008. He is currently with the Centre for Signal Processing (CSP), Nanyang Technological University, Singapore as a research fellow. His research interests lie in the field of image and signal analysis, statistical pattern recognition, special polynomials, computer vision and evolutionary computation.

    Raveendran Paramesran received his B.Sc. and M.Sc. degrees in electrical engineering from South Dakota State University, Brookings, and the Dr. Eng. degree from University of Tokushima, Japan, in 1984, 1985, and 1994, respectively. He is currently a Professor at the Department of Electrical Engineering at University of Malaya. His research interests include image and signal analysis, video coding and Brain–Computer-Interface (BCI) technology. He is a Senior Member of IEEE. He is also a reviewer for IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), Systems, Man, And Cybernetics (SMC) Part B: Cybernetics, Image Processing and also several other international journals.

    R. Mukundan received the Ph.D degree from the Indian Institute of Science, Bangalore, in 1996, for his research work on “Image Based Attitude and Position Estimation Using Moment Functions.” He was a Senior Scientist with the Control Systems Group, Indian Space Research Organization (ISRO) Satellite Centre, Bangalore, from 1982 to 1997. Later, he joined the Faculty of Information Science and Technology at Multimedia University (MMU), Melaka, Malaysia, where he held the positions of Associate Professor and the Chairman of the Centre for Mathematical Modeling and Computational Science until December 2001. Currently, he is a Senior Lecturer in the Department of Computer Science, University of Canterbury, Christchurch, New Zealand. His primary research interests are in the areas of moment-based feature descriptors and their applications in computer vision, computer graphics algorithms, and image-based rendering.

    Xudong Jiang received the B.Eng. and M.Eng. degrees from the University of Electronic Science and Technology of China (UESTC) in 1983 and 1986, respectively, and the Ph.D. degree from Helmut Schmidt University Hamburg, Germany, in 1997, all in electrical and electronic engineering. From 1986 to 1993, he was a lecturer at UESTC, where he received two Science and Technology Awards from the Ministry for Electronic Industry of China. From 1993 to 1997, he was with Helmut Schmidt University Hamburg, as a scientific assistant. From 1998 to 2002, he was with Nanyang Technological University (NTU), Singapore, as a senior research fellow, where he developed a fingerprint verification algorithm that achieved the most efficient and the second most accurate fingerprint verification at the International Fingerprint Verification Competition (FVC’00). From 2002 to 2004, he was a lead scientist and the head of the Biometrics Laboratory at the Institute for Infocomm Research, Singapore. He joined NTU as a faculty member in 2004. Currently, he serves as the director of the Centre for Information Security, the School of Electrical and Electronic Engineering, NTU, Singapore. His research interest includes pattern recognition, signal and image processing, computer vision, and biometrics.

    View full text