Learning local Gaussian process regression for image super-resolution

doi:10.1016/j.neucom.2014.11.064

Neurocomputing

Volume 154, 22 April 2015, Pages 284-295

https://doi.org/10.1016/j.neucom.2014.11.064 Get rights and content

Abstract

Learning based super-resolution (SR) methods, which predict the high-resolution pixel values but not directly provide an estimation of uncertainty, are typically non-probabilistic and have limited generalization ability. Gaussian processes can provide a framework for deriving regression techniques with explicit uncertainty models, but Gaussian Process Regression (GPR) has a significant drawback in being time consuming. The computational complexity of GPR is cubic in the number of training examples, which is prohibitively expensive for a large-scale training set. In this article, we have proposed learning local GPR for image SR. Two algorithms are developed to support local GPR for super resolution. A data-driven GPR based super-resolution algorithm is first developed to learn a local GPR model for every LR patch on an input oriented training dataset with moderate size. In order to further improve the running speed, a prototype based GPR algorithm is developed for super resolution. The proposed algorithm is about one-order faster than the data-driven GPR solution because it makes models for the prototypes of image patches rather than for each image patch. Thus, the local regression efforts are greatly reduced to just finding the nearest prototype for each LR image patch and applying its corresponding pre-computed projective matrix for super-resolution prediction. Our algorithms have greater robustness and usability as they provide a formularized way to automatically learn the hyper-parameters introduced for optimizing the covariance function, while most of the state-of-the-art super-resolution methods could only utilize these parameters in a cross-verification way. Moreover, our algorithms offer confidence values at the test points which benefit the pixels’ post-processing. Our algorithms are evaluated on popular datasets that are widely used in the super-resolution literature, and the experimental results have demonstrated that the efficiency and effectiveness of our proposed algorithms are comparative with several state-of-the-arts super-resolution methods.

Introduction

Image super-resolution (SR) is widely used in many practical applications such as satellite and medical imaging, where the analysis or diagnosis from low-resolution (LR) images can be difficult. Therefore, SR has become a hot topic in the field of computer vision. The goal of image SR is to generate a high-resolution (HR) image from one or multiple low-resolution (LR) input images. The SR problem is ill-posed because a low-resolution image can be generated by many different high-resolution images under different transformations. Up to now, a large number of learning based SR approaches [11], [12], [13], [14], [15], [16], [17], [18], [19] have been developed and they can yield promising results. However, the state-of-the-art learning based super-resolution methods are typically non-probabilistic and can predict the high-resolution pixel values but not directly provide an estimate of uncertainty. Thus, they are inadequate when the uncertainty is required. Moreover, the parameters used in these methods are specified a priori. In this article, we present an explicit probabilistic model for SR. We introduce Gaussian processes regression (GPR) for SR and our probabilistic formulation provides a principled way to learn hyper-parameters.

The computer vision community has paid little attention to Gaussian processes (GP) due to the fact that the Gaussian processes conventionally limit the amount of training data, because the computational complexity of GP is $O (N^{3})$ , which is cubic in the number of training examples and it is prohibitively expensive when the training dataset is large scale. In this article, we focus on how to make GPR work when the training dataset is large-scale. We propose an approach which learns local GPR models for the SR problem. We make a local GPR model for an LR image patch rather than make a global GPR model. It is worth noting that each query LR patch has its special local model. Two SR algorithms are developed to support the local GPR solution. In the first algorithm, a training collection is searched for an LR image patch from the training dataset, and then a GPR model is made depending on the training collection. Because each image patch has its special GPR model, the local GPR model is data-driven, and we name it data-driven Gaussian Process Regression (DDGPR). In the second algorithm, we first find prototypes for image patches and make GPR models for the prototypes of the image patches. We name it prototype-based GPR (PGPR). DDGPR focuses on the practicability of GPR and the super-resolution accuracy. And PGPR focuses on low running time.

The three main contributions of this article are: 1) an explicit probabilistic model is made for super-resolution based on Gaussian Process regression which could not only predict the high-resolution values but also give their confidence values; 2) the data-driven GPR based SR is developed, which is a local model and achieves the promising SR performance; 3) the prototype-based GPR scheme is developed, which reduces the computational complexity and is one-order faster than DDGPR. We have shown that the proposed approach can achieve state-of-the-art super-resolution results on the benchmark image set with superior scaling ability.

This article extends our previous work [23] by proposing a novel algorithm named PGPR. PGPR is different from DDGPR which is proposed in [23] in the two major ways: 1) in DDGPR, the local GPR models are made for image patches, while in PGPR, the local GPR models are made for prototypes of image patches; 2) when an HR image is generated from its LR image, we need to build an image training dataset and learn a GPR model for each image patch in DDGPR, while in PGPR we just anchor an image patch to its nearest prototype and look up the corresponding prototype model which is pre-computed and stored as a reference model. The two differences lead to much fewer GPR models and significantly less computational time of the HR image reconstruction for PGPR than for DDGPR. Thus, PGPR is about one-order faster than DDGPR without scarifying much accuracy. However, DDGPR can achieve more accurate results than PGPR. Moreover, we also add additional empirical results and making an in-depth analysis of our approach’s performance.

The remainder of this article is organized as follows. We introduce the related work in Section 2, and we give a brief overview of Gaussian Process regression in Section 3, and then we describe the local Gaussian Process regression algorithms for image super resolution in Section 4. The experimental results are shown in Section 5. In the last section, we give our conclusions.

Section snippets

Related work

Generally, the state-of-the-art SR techniques can be categorized into three classes: the interpolation based methods [1], [2], [3], [4], [5], [6], the reconstruction based methods [7], [8], [9], [10] and the learning based methods [11], [12], [13], [14], [15], [16], [17], [18], [19].

The interpolation-based methods are simple and fast, but the quality of the interpolated super-resolution image is very limited, because such methods cannot recover high frequency details. The reconstruction based

Gaussian process regression

Gaussian processes provide an appealing probabilistic framework which models the uncertainty conditioned on the observations. It turns out that the computations of Gaussian Processes required for inference and learning become relatively easy. The supervised learning problem in computer vision which can be thought of as learning a function from examples can be cast directly into Gaussian process framework. Usually, a regression problem requires a specified function $f (x)$ , which may be, for

The local GPR based super-resolution approach

Our approach adopts the framework of Freeman’s work [11]. Given an LR image patch, we estimate the corresponding missing high-frequency details based on its interpolation into the desired scale. We initially apply an interpolation method, such as the bi-cubic interpolation method in this article, to the LR image and the result is called the blurred HR image in this article. We subtract the blurred HR image from the ground truth HR image to create the high-frequency HR image. And then we

Training set generation

We have downloaded 100 natural HR images from the Internet for algorithm evaluation. The downloaded image collections consist of four categories of images: animals, plants, buildings and people. Fig. 4 shows some representative examples. An HR image is downsampled by using the bi-cubic interpolation with the desired scale to generate the LR image. And then a low-resolution image is interpolated into the blurred HR image with the desired scale. A sampled blurred HR image patch and its

Conclusions and future work

In this article, an explicit uncertainty framework is developed by using GPR (Gaussian Process Regression) to solve the SR problem. Considering that GPR has a significant shortage of being time consuming, we propose learning local GPR models for SR rather than a global GPR model. Two algorithms are developed to support local GPR learning: DDGPR and PGPR, which are more flexible than the deterministic SR method. Furthermore, the proposed algorithms are data-driven without specifying fitting

Acknowlegements

This research work is supported by the National Natural Science Foundation of China Under Grant No. 61373077, the Natural Science Foundation of Fujian Province of China Under Grant No. 2013J01257, and the Scientific Research Foundation for the Introduction of Talent at Xiamen University of Technology No. YKJ12023R.

Jianmin Li received the B.S. degree and the M.S. degree in Computer Science Department from Xiamen University, Xiamen, China, in 2006 and 2009 respectively. He is currently working toward the Ph.D. degree in Department of Automation of Xiamen University. His research interests include computer vision and machine learning.

References (29)

M. Irani et al.
Motion analysis for image enhancement: Resolution, occlusion, and transparency
J. Vis. Commun. Image R
(1993)
Tak-Ming Chan et al.
Neighbor embedding based super-resolution algorithm through edge detection and feature selection
Pattern Recogn. Lett
(2009)
M. Irani et al.
Improving resolution by image registration
CVGIP: Graph. Model Image Process
(1991)
X. Li et al.
New edge-directed interpolation
IEEE T. Image Process
(2001)
R. Schultz et al.
A Bayesian approach to image expansion for improved definition
IEEE T. Image Process
(1994)
J. Sun et al.
Image super-resolution using gradient profile prior
Comput. Vis. Pattern Recogn.
(2008)
H. Hou et al.
Cubic splines for image interpolation and digital filtering
IEEE T. Acoust. Speech.
(1978)
W.-S. Tam et al.
Modified edge-directed interpolation for images
J. Electron. Imaging
(2010)
S. Baker et al.
Limits on super-resolution and how to break them
IEEE T. Pattern Anal
(2002)
K.-W. Hung et al.
New motion compensation model via frequency classification for fast video super-resolution
ICIP
(2009)

B.S. Morse et al.

Image magnification using level-set reconstruction

CVPR

(2001)

Q. Shan et al.

Fast image/video upsampling

ACM T. Graphics

(2008)

W.T. Freeman et al.

Example-based super-resolution

IEEE Comput. Graphics

(2002)

H. Chang et al.

Super-resolution through neighbor embedding

CVPR

(2004)

Cited by (0)

Yanyun Qu received the B.S. and the M.S. degrees in Computational Mathematics from Xiamen University and Fudan University, China, in 1995 and 1998, respectively, and received the Ph.D. degrees in Automatic Control from Xi’an Jiaotong University, China, in 2006. She joined the faculty of Department of Computer Science in Xiamen University since 1998. She was appointed as a lecturer from 2000 to 2007 and was appointed as an associate professor since 2007. She is a member of IEEE and a member of ACM. Her current research interests include pattern recognition, computer vision, etc.

Cuihua Li received the Ph.D. degree in Automatic Control from Xi’an Jiaotong University, China, in 1999. He was the director of Computer Science Department in Xiamen University from 2009 to 2013. He is currently a Professor of Computer Science Department, Xiamen University, China. His research interests include signal processing, pattern recognition and wavelet analysis.

Yuan Xie received the M.S. degree in Computer Science Department from Xiamen University in 2010 and the Ph.D. degree in the State Key Laboratory of Intelligent Control and Management of Complex Systems at Institute of Automation, Chinese Academy of Sciences in 2013. He is currently a faculty of Institute of Automation. His research interests include image processing, computer vision, machine learning and pattern recognition.

Yang Wu received the B.S. degree in information engineering and the Ph.D degree in pattern recognition and intelligent systems from Xi’an Jiaotong Unviversity in 2004 and 2010, respectively. He is currently a post-doctoral researcher of Academic Center for Computing and Media Studies, Kyoto University. From Sep. 2007 to Dec. 2008, he was a visiting student in the General Robotics, Automation, Sensing and Perception (GRASP) lab at University of Pennsylvania. From Jul. 2014 to Aug. 2014, he was an invited academic visitor at the Big Data Institute of University College London. His research is in the fields of computer vision, pattern recognition, and image/video search and retrieval, with particular interests in the detecting, tracking and recognizing humans and generic objects. He is also interested in pursuing general data analysis models applicable to large data sets.

Jianping Fan received the M.S. degree in theory physics from Northwest University, Xi’an, China, in 1994, and the Ph.D degree in optical storage and computer science from Shanghai Institute of Optics and Fine Mechanics, Chinese Academy of Sciences, Shanghai, China, in 1997. He was a postdoctoral researcher with Fudan University, Shanghai, China, during 1998. From 1998 to 1999, he was a researcher with the Japan Society of Promotion of Science (JSPS), Department of Information System Engineering, Osaka University, Japan. From September 1999 to 2001, he was a postdoctoral researcher with the department of computer Science, Purdue University, West Lafayette, IN. He is currently a professor in the Department of Computer Science, University of North Carolina at Charlotte. From 2012, he is also a professor at Northwest University in China. His research interests include image/video analysis, semantic image/video classification, personalized image/video recommendation, surveillance videos, and statistical machine learning.

View full text

Learning local Gaussian process regression for image super-resolution

Abstract

Introduction

Section snippets

Related work

Gaussian process regression

The local GPR based super-resolution approach

Training set generation

Conclusions and future work

Acknowlegements

J. Vis. Commun. Image R

Pattern Recogn. Lett

CVGIP: Graph. Model Image Process

New edge-directed interpolation

IEEE T. Image Process

A Bayesian approach to image expansion for improved definition

IEEE T. Image Process

Image super-resolution using gradient profile prior

Comput. Vis. Pattern Recogn.

Cubic splines for image interpolation and digital filtering

IEEE T. Acoust. Speech.

Modified edge-directed interpolation for images

J. Electron. Imaging

Limits on super-resolution and how to break them

IEEE T. Pattern Anal

New motion compensation model via frequency classification for fast video super-resolution

ICIP

Image magnification using level-set reconstruction

CVPR

Fast image/video upsampling

ACM T. Graphics

Example-based super-resolution

IEEE Comput. Graphics

Super-resolution through neighbor embedding

CVPR