Single-image super-resolution using orthogonal rotation invariant moments

https://doi.org/10.1016/j.compeleceng.2017.02.009Get rights and content

Highlights

  • ORIMs-based single image super-resolution approach has been presented.

  • The kernel functions of ORIMs have been used to estimate HR pixel values.

  • Among ORIMs, ZMs, PZMs, and OFMMs have been considered in the proposed framework.

  • Experiments conducted on both noise-free and noisy LR test images.

  • The proposed approach is very fast as compared to NLM and NLM+ZMs/PZMs-based methods.

Abstract

In this paper, we propose an interpolation-based single-frame image super-resolution approach using orthogonal rotation invariant moments (ORIMs). The ORIMs have several useful characteristics in addition to having the property of image reconstruction. Therefore, they have been used successfully in many image processing applications. Among the various ORIMs, Zernike moments (ZMs), pseudo-Zernike moments (PZMs) and orthogonal Fourier-Mellin moments (OFMMs) have been considered in our proposed framework. The SR performance of the proposed approach has been compared with the classical interpolation-based approaches like bicubic, cubic B-spline, and Lanczos, as well as with nonlocal-means (NLM), and recently developed NLM+ZMs and NLM+PZMs-based SR approaches on twelve standard test images. The experiments have been conducted on both noise-free and noisy LR images corrupted with uniform blur and Gaussian noise. The results demonstrate the superiority of the proposed ORIMs-based approach in super-resolving both noise-free and noisy images over recently developed NLM+ORIMs-based SR approaches. Also, a comparative performance analysis, among various ORIMs (ZMs, PZMs, and OFMMs), is also presented to determine the ORIM which performs better over others under a given condition. A time complexity analysis shows that the proposed method is very fast as compared to NLM, NLM+ZMs and NLM+PZMs-based methods.

Introduction

The primary goal of a super-resolution (SR) technique is to obtain a high resolution (HR) image from one or more observations of a low resolution (LR) image. Since an HR image can offer more details that may be crucial, it has attracted a lot of demand in various applications involving security surveillance, biomedical sciences, remote sensing, object recognition (such as face, fingerprint, iris, vehicle number plate), etc. [1], [2], [3], [4], and has become one of the most active research areas in the field of image processing.

The techniques for SR can be divided into two different categories: single-frame SR and multi-frame SR. In single-image (or single-frame) SR, the HR pixel values are either interpolated or estimated using the information contained in single (degraded) LR image. Whereas, in multi-frame SR, information from all available (multiple) LR images (of the same scene) is accumulated to form a single high quality HR image. The single-image SR methods differ from the multi-frame SR methods as the latter requires more than one frame (or image) for image super-resolution which may not be available in many cases such as LR images of fingerprints reported from a crime scene, LR images of the number plate of over-speeding vehicles, LR images of old relics and documents, etc. Multi-frame methods require multiple images for their proper functioning. They are designed to extract inter-frame redundancies required to generate a high quality HR image, whereas, single-frame SR methods are designed specifically to work on one (and the only) LR image available for HR image reconstruction. Thus, single-frame SR cannot be treated as a special case of multi-frame SR. Single-image super-resolution has been widely used in several applications [1], [2], [3], [4], such as medical image processing, infrared imaging, recognition of facial images, fingerprint image enhancement, signature and number plate reading, and for improvement of text documents. In this paper, we focus our attention towards single-image super-resolution and propose a novel SR approach based on orthogonal rotation invariant moments (ORIMs).

Before going into the details of the proposed approach, we present a brief overview of various existing single SR techniques. Single SR techniques can broadly be classified into two groups. First group of the techniques are learning based-approaches that require an additional set of training images for generating an optimal HR image. These approaches estimate the missing information by determining the similarities between the given LR images and the training images present in the external dictionaries. Among the learning-based approaches, the sparse representation-based approaches [5], [6] are most predominant and provide very superior SR results. In sparse representation-based approaches, the sparse coefficients of the LR patches (extracted from the given LR image) are used to obtain the most suitable HR patch from the external HR dictionary. This external HR dictionary stores the sparse coefficients of the LR-HR patches extracted from the given set of training images. Finally, all super-resolved patches are combined to form an HR image. Different approaches vary in their methodologies to construct these HR dictionaries. Another learning-based approach which has become very popular because of its high SR performance is the example-based (EB) SR method proposed by Kim and Kwon [7]. In EB method, pairs of LR-HR image patches are collected and stored in the training stage. In the learning stage, each LR patch of the input image is then compared with the stored LR patches, and a nearest LR patch and its corresponding HR pair is selected by using sparse kernel ridge regression method. Recently, few single SR approaches [8], [9], [10] have emerged that present several ways to improve the example-based single-image super-resolution. Among them, anchored neighbor regression (ANR) [8] and adjusted anchored neighbor regression (A+) [9] are the two approaches which are faster versions of sparse representation-based approaches. A fast execution is achieved by using the neighbor-embedding and ridge-regression to learn sparse dictionaries offline and then using these neighborhoods to pre-compute projections to map LR patches onto the HR domain. The A+ is an improved variant of ANR approach. SRCNN [10] (super-resolution convolutional neural network) is another recently proposed single SR approach that uses convolutional neural networks for learning directly the end-to-end mapping between the LR and HR images. The SR performance reported by SCNN is observed to be the best among various existing state-of-the-art learning-based methods. Although the learning-based single-frame SR techniques are very effective, they are complex, computation intensive and their performance is dependent highly on the choice of training images and the training methods. Therefore, they need proper training to achieve high SR performance.

The second category of single-frame SR methods consists mainly of the interpolation-based approaches [11], [12]. These methods are very popular as they do not need any training process. Moreover, they are very simple and fast to compute. Since they do not need training, a new class of images can be super-resolved without a new set of training images. However, their performance is not as good as the learning-based methods. In interpolation-based methods, intensities in HR image are derived using information contained in LR image itself. However, artifacts like Gibb's phenomena (ringing effect), smearing near edges and stair-case effects are easily visible in the HR images super-resolved by these methods.

Attempts have been made to overcome the limitations of the interpolation-based methods and remove the effects of noise in the LR image. One such solution is to divide the entire SR process into three stages. In the first stage, interpolation methods are used to derive an initial estimate of the HR image, which then undergoes the denoising and deblurring operations in the second and the third stage to enhance further its SR quality. The bilateral-based [13] and the NLM-based [14] single-image SR methods belong to this category. These approaches are simple to implement and provide high quality HR images. The level of improvement depends largely in the manner these filters are used to denoise the HR image in the second stage because the deblurring process is handled by the standard deblurring algorithms in the third stage. Thus, stage two of the whole process is very crucial for the success of these approaches.

The NLM-based image super-resolution is very effective for images with smooth areas and repetitive textures for which the redundancy is high. However, its performance suffers due to three major issues: rare patch effect, patch jittering blur effect, and false patch detections due to rotation [15], [16]. The rare patch effect refers to the inability of the NLM method to find enough similar patches for singular structures, such as edges and corners, thus, performing insufficient denoising of the noisy image. The patch jittering blur effect causes over-smoothing of the image by averaging several pixels that do not truly belong to the same underlying texture. The presence of noise in the compared patches can lead to false detections. Moreover, when two patches are similar but rotated, the similarity value turns out to be low which prevents it from detecting many redundant regions of the image. These limitations are well addressed by the NLM-ORIMs-based image SR methods [16], [17], [18], [19]. These methods use the magnitude of ORIMs coefficients, instead of the intensity values, which are rotation invariant and robust to image noise. As a result, the NLM-ORIMs-based SR methods are very capable of determining similar patches even if the compared patches are noisy and rotated. One of the important properties of ORIMs is that they represent an image signal in various frequency components. The low frequency components represent the holistic view of an image and the high frequency components represent the fine details such as edges and corners in the image. Being orthogonal and complete, they are very useful in reconstructing the signal and hence they can be used for interpolating the signal. Being rotation-invariant, they preserve edge information in any orientation. In fact, ORIMs have been used for sub-pixel edge detection in images [20]. The moments are computed as an integration process, and hence they have been observed to be robust to image noise. Because of their useful properties, ORIMs have been very successful in several image processing applications including noise removal [21], [22] and image super-resolution [16], [17], [18], [19] which are relevant to the present work. Recently, Gao et al. [16], Kanan and Salkhordeh [17], [18], and Nayak and Patra [19] have used ZMs and PZMs, respectively, for multi-frame image super-resolution using the general framework of NLM-based image super-resolution [14]. However, no attempt has been made to use ORIMs for the single-frame image super-resolution. Therefore, in this paper, a novel idea of using ORIMs for single-image super-resolution has been presented. The functioning of the proposed ORIMs-based single SR approach is based on the three-stage strategy of SR, similar to the one adopted by [14], [16], [17], [18], [19]. However, the proposed ORIMs-based SR approach employs ORIMs kernel functions as opposed to conventional Bicubic or Lanczos interpolation methods used in NLM-based and NLM-ORIMs-based SR approaches to obtain an initial estimate of the HR image. The use of ORIMs kernel functions for interpolation in the first stage offers a good advantage over Bicubic or Lanczos interpolation in obtaining high quality HR images by preserving important image features with minimum amount of artifacts. The three ORIMs considered in this paper are: Zernike moments (ZMs), pseudo-Zernike moments (PZMs), and orthogonal Fourier-Mellin moments (OFMMs) because they have been used widely in several image processing applications.

The rest of the paper is organized as follows. Section 2 presents an introduction to the ORIMs and describes how the proposed ORIMs-based approach can estimate the desired HR image from the observed degraded LR image. Section 3 presents an overall algorithm for single-image super-resolution using various ORIMs. Section 4 presents a detailed experimental analysis for the performance evaluation of the proposed ORIMs-based approach and its comparison with other interpolation-based methods in terms of their PSNR and MSSIM values. Also, a comparative performance analysis of the proposed SR methods with recent state-of-the-art learning-based methods has been presented in Section 5. Time complexity analysis and speed enhancement techniques are discussed in Section 6. Section 7 concludes our work.

Section snippets

Image model for single-frame super-resolution

Let fHR(x, y) be the original (ground truth) HR image which undergoes the process of blurring, decimation and additive Gaussian noise corruption to obtain the observed (degraded) LR image gLR(x, y) of size M × N. The input-output relationship is represented as: gLR(x,y)=D(H(fHR(x,y)))+η(x,y)=fLR(x,y)+η(x,y),where H is space-independent linear blur operator, D is the decimation operator which depends on the decimation factor, η(x, y) is the additive Gaussian noise with zero mean and σ2 noise

Steps for the proposed approach for single-image super-resolution using ORIMs

The proposed ORIMs-based approach for single-image super-resolution is explained using following steps:

  • 1.

    Let gLR(x, y) be the noisy LR image of size M × N which is obtained after blurring, decimation and corruption by additive Gaussian noise with zero mean and σ2 noise variance. Let s be the scaling factor. We wish to obtain a noise-free estimate of HR image, i.e., f^HR(x,y) of size sM × sN.

  • 2.

    a) Around each pixel (k, l) of gLR(x, y), form a block of size t × t. In our experiments, we have taken t = 8. The

Experimental results

The performance of the proposed ORIMs-based single-frame SR method has been compared with various interpolation-based SR methods such as bicubic, cubic B-spline, and Lanczos interpolation. Also, comparisons are made with NLM [14], and recently developed ORIMs-based methods, NLM+ZMs [16], and NLM+PZMs [18]-based SR methods, which have been developed for multi-frame super-resolution. We recast these methods for single-frame super-resolution to compare our results because these methods have shown

Comparison of the proposed ORIMs-based SR approach with state-of-the-art learning-based methods

In this section, the SR performance of the proposed PZMs-based SR approach (referred to as Method A) has been compared with the recent and the state-of-the-art learning-based methods namely, sparse representation method of Zeyde et al. [5] (Method B), ANR [8] (Method C), A+ [9] (Method D), and SCNN [10] (Method E), on both noise-free as well as noisy (σ = 5 and 15) LR test images. It is observed in Section 4.3 that amongst the three proposed ORIMs-based approaches (ZMs, PZMs, and OFMMs), the

Time complexity and speed enhancement

The proposed method computes ORIMs on MN blocks each of size t × t. The order of time complexity of ZMs and PZMs is O(MNt2pmax3) and for OFMMs it is O(MNt2pmaxqmax) when all moments upto a maximum order pmax and repetition qmax are computed [31]. The order of time complexity is very high. We reduce the time complexity keeping in mind that the kernel functions Vpq*(x,y) and Vpq(x, y) are data independent. Therefore, we perform the fast computation of ORIMs by first computing the kernel functions Vp

Conclusion

In this paper, we propose a new interpolation-based method for single-image super-resolution using ORIMs. Among the various ORIMs, the three commonly used ORIMs in image processing applications, ZMs, PZMs, and OFMMs, have been considered. Detailed experiments have been conducted on both noise-free and noisy images corrupted with additive white Gaussian noise. The following conclusions are drawn:

  • The proposed ORIMs-based approach outperforms all other interpolation-based SR approaches for both

Acknowledgments

The comments and suggestions provided by the anonymous reviewers to raise the standard of the paper are highly appreciated. The authors are grateful to the University Grants Commission (UGC), New Delhi, India, for providing financial grants for the Major Research Project entitled, “Development of Efficient Techniques for Feature Extraction and Classification for Invariant Pattern Matching and Computer Vision Applications”, vide its File No.: 43-275/2014(SR). The financial grant provided by the

Chandan Singh received Ph.D. degree in mathematics from Indian Institute of Technology, Kanpur, India, in 1982. Currently, He is serving as professor in the Department of Computer Science, Punjabi University, Patiala, India. His areas of research are pattern recognition, noise removal, and image super-resolution. He has published 80 papers in international journals and 44 papers in national and international conferences.

References (31)

  • R. Zeyde et al.

    On single-image scale-up using sparse-representations

  • S. Mandal et al.

    Noise adaptive super-resolution from single-image via non-local mean and sparse representation

    Signal Process

    (2017)
  • K.I. Kim et al.

    Example-based learning for single-image super-resolution and JPEG artifact removal

    (2008)
  • R. Timofte et al.

    Anchored neighborhood regression for fast example-based super-resolution

  • R. Timofte et al.

    A+: adjusted anchored neighborhood regression for fast super-resolution

  • Cited by (9)

    • A survey on rotation invariance of orthogonal moments and transforms

      2021, Signal Processing
      Citation Excerpt :

      There exist a number of moment and transform-based rotation invariant descriptors both for the gray-scale and color images. These invariants have been used in many applications in the area of computer vision and pattern recognition [90-104] as global descriptors. Lately, their use as local descriptors has been observed to be very effective in many applications [21,105-110].

    • Novel fractional-order generic Jacobi-Fourier moments for image analysis

      2020, Signal Processing
      Citation Excerpt :

      Also, image intensity functions could be reconstructed by using a finite number of orthogonal moments [3]. Based on these important characteristics, orthogonal moments successfully used in image representation [4–8], blind image quality assessment [9], SAR image classification [10], image registration in remote sensing [11], image denoising [12], image super-resolution [13–14], analysis of textures [15], detection of pedestrians [16], object recognition [17], digital image watermarking [18–21], medical image analysis [22–24], multimedia security [25–26] and digital image forensics [27–30]. Ping et al. [31] defined generic Jacobi-Fourier moments (JFMs) where these orthogonal moments are defined by multiplying two kernel functions.

    • A local Zernike moment-based unbiased nonlocal means fuzzy C-Means algorithm for segmentation of brain magnetic resonance images

      2019, Expert Systems with Applications
      Citation Excerpt :

      The higher is the redundancy, the better is its performance. Although NLM provides excellent denoising performance, it suffers from three major drawbacks (Singh & Aggarwal, 2017; Sutour, Deledalle, & Aujol, 2014). First, it fails to perform well on singular structures, such as the edges and corners, due to its inability to find similar blocks for these regions, called rare patch effect.

    • A transform-based fast fuzzy C-means approach for high brain MRI segmentation accuracy

      2019, Applied Soft Computing Journal
      Citation Excerpt :

      NLM filter provides good performance on the smooth areas where the redundancy is high. However, on singular structures, such as edges and corners, the algorithm fails to find enough similar patches and thus provides poor denoising [30,31] results, also called rare patch effect. Additionally, the presence of noise in the compared patches may lead to false detections.

    • Quaternion generalized Chebyshev-Fourier and pseudo-Jacobi-Fourier moments for color object recognition

      2018, Optics and Laser Technology
      Citation Excerpt :

      Moments and moment invariants are among the most popular global shape descriptors which have been most widely used in the applications related to digital image processing such as object classification [1], image watermarking [2], face recognition [3], image segmentation [4], image denoising [5], image retrieval [6], image reconstruction [7], image super-resolution [8], etc.

    View all citing articles on Scopus

    Chandan Singh received Ph.D. degree in mathematics from Indian Institute of Technology, Kanpur, India, in 1982. Currently, He is serving as professor in the Department of Computer Science, Punjabi University, Patiala, India. His areas of research are pattern recognition, noise removal, and image super-resolution. He has published 80 papers in international journals and 44 papers in national and international conferences.

    Ashutosh Aggarwal received B.Tech. degree from Guru Nanak Dev University, Amritsar, India, and M.Tech. degree from B.R. Ambedkar National Institute of Technology, Jalandhar, India in 2010 and 2012, respectively. He is currently pursuing Ph.D. degree in computer science from Punjabi University, Patiala, India. His research interests include pattern recognition, image and video interpolation, and super-resolution.

    Reviews processed and recommended for publication to the Editor-in-Chief by Area Editor Dr. E. Cabal-Yepez.

    View full text