
1 Introduction

In block compressive sensing (BCS) [5], an image \(\mathbf {X}\) is divided into a certain number of image blocks with size \(B \times B\). For the ith block,

$$\begin{aligned} \mathbf {y}_{i}=\varvec{\varPhi }_{B} \mathbf {x}_{i} + \mathbf {n}_{i} \end{aligned}$$
(1)

where \(\varvec{\varPhi }_{B} \in R^{M_{B}\times B^{2}}\) represents the sensing matrix, \(\mathbf {y}_{i} \in R^{M_{B}}\) is the observed vector for the ith block, \(\mathbf {x}_{i} \in R^{B^{2}}\) denotes the ith vectorized block and \(\mathbf {n}_{i}\) denotes the additive noise.
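As a concrete illustration, the block measurement model in Eq. (1) can be sketched as follows. The block size, sampling rate, Gaussian sensing matrix, and noise level below are illustrative assumptions for the sketch, not the paper's exact settings.

```python
import numpy as np

B = 16                        # block size (B x B pixels), an assumed value
sr = 0.25                     # assumed sampling rate M_B / B^2
M_B = int(sr * B * B)         # number of measurements per block

rng = np.random.default_rng(0)
# Illustrative Gaussian sensing matrix Phi_B in R^{M_B x B^2}
Phi_B = rng.standard_normal((M_B, B * B)) / np.sqrt(M_B)

x_i = rng.random(B * B)                    # one vectorized image block
n_i = 0.01 * rng.standard_normal(M_B)      # additive noise
y_i = Phi_B @ x_i + n_i                    # measurement of the i-th block

print(y_i.shape)  # (64,)
```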

Given \(\varvec{\varPhi }_{B}\), designing efficient approaches to recover \(\mathbf {x}_{i}\) from \(\mathbf {y}_{i}\) is critical. Most existing BCS methods are based on iterative approaches [2,3,4, 6]. To further improve the quality of the reconstructed image, more comprehensive models have been proposed, such as the bilevel optimization model [10], multi-objective optimization models [11] and sparsity-enhanced models [2, 6]. Although careful modeling can yield very competitive results, these models are usually difficult to solve and require different iterative approaches, e.g. iterative shrinkage-thresholding methods, gradient-based methods or evolutionary algorithms. Satisfactory results typically demand a large number of iterations, which increases the computational complexity. Moreover, the initial solution is also very important; it is currently obtained either by random initialization or by the least squares (LS) method. When the sampling rate (SR) is small (less than 20%), which makes the problem severely ill-posed, these initializations may not guide the iterative algorithm to a high-quality solution quickly enough.

In this paper, we propose an efficient method that combines the advantages of the deep learning framework and iterative approaches. First, a deep multi-layer perceptron (DMLP) is trained to learn the mapping from BCS measurements to the original image blocks. Then, an efficient iterative approach is applied to the whole image to further reduce the blocking artifacts between adjacent blocks. The proposed method is compared with several state-of-the-art competitors on a benchmark database and demonstrates its efficacy. The rest of the paper is organized as follows. Section 2 introduces the proposed BCS reconstruction approach, the experimental studies are presented in Sect. 3, and the conclusion is drawn in Sect. 4.

2 The Proposed Method

2.1 Deep MLP

Following the suggestions in [1], the structure of the deep MLP (DMLP) is shown in Fig. 1; it consists of five layers.

Fig. 1. Structure of the proposed DMLP

Fig. 2. Flowchart of our proposed approach

For the input layer, the number of neurons is set equal to the length of the vectorized image block, \(B^{2}\). The connection weights between the input layer and the sensing layer form the sensing matrix, which maps the signal \(\mathbf {x}_{i} \in R^{B^{2}}\) to \(\mathbf {y}_{i} \in R^{M_{B}}\) (\(M_{B} \ll B^{2}\)). Two reconstruction layers perform the recovery of \(\mathbf {x}_{i}\) from \(\mathbf {y}_{i}\), each followed by a rectified linear unit (ReLU) activation function [7]. The width of each reconstruction layer is set to \(B^{2} \times T\), where T denotes the redundancy parameter for reconstruction. The output layer has the same size as the input layer and produces the reconstructed signal, \(\hat{\mathbf {x}}_{i}\). Overall, the mathematical expression of the DMLP can be written as:

$$\begin{aligned} f(\mathbf {x}) = \mathbf {W}_{4} \cdot ReLU\left( \mathbf {W}_{3} \cdot ReLU \left( \mathbf {W}_{2} \left( \mathbf {W}_{1}\mathbf {x}+\mathbf {b}_{1} \right) + \mathbf {b}_{2} \right) + \mathbf {b}_{3} \right) + \mathbf {b}_{4} \end{aligned}$$
(2)

where \(\cdot \) denotes matrix-vector multiplication and \(ReLU(\mathbf {x})=max(\mathbf {0},\mathbf {x})\) is the element-wise activation function.
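For clarity, Eq. (2) can be sketched as a plain NumPy forward pass. The dimensions (\(B=16\), \(M_B=64\), \(T=8\)) and the random weights below are illustrative assumptions, since the trained parameters are not reproduced here.

```python
import numpy as np

def relu(v):
    # element-wise ReLU(x) = max(0, x)
    return np.maximum(0.0, v)

def dmlp_forward(x, W1, b1, W2, b2, W3, b3, W4, b4):
    """Forward pass of Eq. (2): a linear sensing layer, two ReLU
    reconstruction layers, and a linear output layer."""
    y = W1 @ x + b1                 # sensing layer: y in R^{M_B}
    h1 = relu(W2 @ y + b2)          # first reconstruction layer
    h2 = relu(W3 @ h1 + b3)         # second reconstruction layer
    return W4 @ h2 + b4             # reconstructed block x_hat in R^{B^2}

# Illustrative dimensions; H = B^2 * T is the reconstruction-layer width.
B2, M_B, T = 16 * 16, 64, 8
H = B2 * T
rng = np.random.default_rng(0)
W1, b1 = 0.01 * rng.standard_normal((M_B, B2)), np.zeros(M_B)
W2, b2 = 0.01 * rng.standard_normal((H, M_B)), np.zeros(H)
W3, b3 = 0.01 * rng.standard_normal((H, H)), np.zeros(H)
W4, b4 = 0.01 * rng.standard_normal((B2, H)), np.zeros(B2)

x_hat = dmlp_forward(rng.random(B2), W1, b1, W2, b2, W3, b3, W4, b4)
print(x_hat.shape)  # (256,)
```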

Table 1. Statistical results of PSNR and SSIM for BCS considering the overlap

2.2 Smoothed Projected Landweber Algorithm

The Landweber iteration is a classical approach for solving ill-posed linear inverse problems. Its projected variant transforms the signal into a sparse or compressible domain where sparsity can be enforced, so it applies naturally to CS image reconstruction. To also remove blocking artifacts, the procedure of BCS combined with the smoothed projected Landweber (SPL) algorithm is given in Algorithm 1, where \(\varvec{\varPsi }\) denotes the sparse basis, the dual-tree discrete wavelet transform (DDWT) [6], used in our method.

Algorithm 1. BCS reconstruction with the smoothed projected Landweber (SPL) iteration
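A minimal single-block sketch of the projected Landweber iteration is given below. The soft-thresholding step is a simplified stand-in for the DDWT projection of the paper's SPL algorithm, the inter-block smoothing filter is omitted, and all dimensions and thresholds are illustrative assumptions.

```python
import numpy as np

def soft_threshold(v, tau):
    # sparsity-enforcing projection; a stand-in for thresholding
    # the DDWT coefficients in the full SPL algorithm
    return np.sign(v) * np.maximum(np.abs(v) - tau, 0.0)

def projected_landweber(y, Phi, x0, n_iter=100, tau=0.005):
    """One-block sketch: a Landweber (gradient) step toward
    Phi x = y, followed by a sparsity projection."""
    mu = 1.0 / np.linalg.norm(Phi, 2) ** 2   # step size for stability
    x = x0.copy()
    for _ in range(n_iter):
        x = x + mu * Phi.T @ (y - Phi @ x)   # Landweber update
        x = soft_threshold(x, tau)           # projection onto sparse signals
    return x

rng = np.random.default_rng(0)
M, N = 32, 128
Phi = rng.standard_normal((M, N)) / np.sqrt(M)
x_true = np.zeros(N)
x_true[rng.choice(N, 5, replace=False)] = 1.0    # 5-sparse test signal
y = Phi @ x_true
x_rec = projected_landweber(y, Phi, x0=np.zeros(N))
print(np.linalg.norm(y - Phi @ x_rec) < np.linalg.norm(y))  # True
```

The step size \(\mu\) is chosen below the inverse squared spectral norm of \(\varvec{\varPhi}\) so the Landweber updates remain stable.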

2.3 The Flowchart of Our Approach

As presented in Fig. 2, the proposed method consists of two stages. In the first stage, we establish an end-to-end deep neural network based on the MLP and reconstruct the image block by block via inference. In the second stage, the reconstructed image from the first stage serves as the initial solution, \({\mathbf {x}}^{(0)}\), and the SPL iteration is applied to obtain the final recovered image.

3 Experimental Studies

3.1 Parameter Settings

In our experiment, we randomly choose 5,000,000 image blocks from 50,000 images in the LabelMe database [8]. The learning rate is set to 0.005, the batch size is 16, the block size is \(16\times 16\), and the redundancy parameter T is equal to 8. All the training images for the DMLP are converted to grayscale.

3.2 Numerical and Visual Results

Table 1 reports the peak signal-to-noise ratio (PSNR) and structural similarity index (SSIM) [9] for 10 test images with SR equal to 0.1, 0.125, 0.15 and 0.175; each image is tested 20 times. For simplicity, we fix the overlap to 8 in all simulations. The results show that the proposed method achieves significantly higher PSNR and SSIM. Compared with the DL-based method, the proposed approach gains up to 0.9 dB in PSNR, and the improvement is more pronounced when the SR is smaller. Our method exploits the powerful regression ability of the DNN in the first stage, which provides a good initial solution for the iterative process in the second stage, especially under a very low SR.
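For reference, PSNR, the primary metric in Table 1, can be computed as follows for 8-bit images; the constant test image is only a toy example, not one of the paper's test images.

```python
import numpy as np

def psnr(x, x_hat, peak=255.0):
    """Peak signal-to-noise ratio in dB:
    PSNR = 10 * log10(peak^2 / MSE)."""
    mse = np.mean((x.astype(np.float64) - x_hat.astype(np.float64)) ** 2)
    return 10.0 * np.log10(peak ** 2 / mse)

img = np.full((16, 16), 128.0)
noisy = img + 10.0                 # constant error of 10 gray levels
print(round(psnr(img, noisy), 2))  # MSE = 100 -> 28.13 dB
```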

Table 2. Average computational time for different BCS methods

3.3 Computational Time

To evaluate the efficiency of the proposed method, the computational time of the different methods, with and without overlap between adjacent blocks, is recorded in Table 2. BCS-DNN only requires fast inference, which completes in around 3 s. In our method, the initial solution obtained by the DNN is usually near-optimal, which greatly reduces the number of iterations. As a result, its computational time is comparable to that of BCS-DNN while achieving higher reconstruction quality.

4 Conclusion

In this paper, we proposed an efficient CS recovery method. First, a DMLP is built, where the recovered image block is obtained by inference from the learned regression parameters and the input measurements. Then, SPL is applied to the whole image to further reduce blocking artifacts and improve pixel consistency and smoothness between adjacent blocks. Experimental results demonstrate that our method not only achieves competitive numerical results but also outperforms the compared methods in visual quality.