
1 Introduction

From capture to encoding and transmission, different stages of an image acquisition pipeline produce different types of noise (additive vs. multiplicative, structured vs. random, signal-dependent vs. independent, etc.). The noise present in an image not only creates visual disruption, but also obscures significant details. Whether to meet quality expectations or to extract valuable information from the image, denoising has therefore become an indispensable part of the processing chain, and numerous denoising techniques have been proposed. These techniques most commonly aim at recovering the latent signal \(f \in \mathbb {R}^{NC}\) from the observation \(g \in \mathbb {R}^{NC}\) by considering a forward corruption model of the form \(g=f+\eta \), where \(\eta \in \mathbb {R}^{NC}\) is additive noise assumed to be i.i.d. and normally distributed \((\eta \sim \mathcal {N}(\underline{0},\sigma _\eta ^2 {I}))\). Note that the images are assumed to be stacked into vectors, where N and C respectively denote the number of pixels in a single channel and the number of channels. For scalar-valued images, \(C=1\).
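As a minimal illustration of this forward model, the following sketch (assuming a NumPy image array with intensities in [0, 1]; the function name and noise level are purely illustrative) simulates the observation \(g = f + \eta \):

```python
import numpy as np

def corrupt(f, sigma_eta=0.1, seed=0):
    """Simulate g = f + eta with i.i.d. Gaussian noise eta ~ N(0, sigma_eta^2 I)."""
    rng = np.random.default_rng(seed)
    eta = rng.normal(0.0, sigma_eta, size=f.shape)
    return f + eta
```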

Variational methods invert the given forward model by minimizing an energy functional of the form:

$$\begin{aligned} E(f) = \frac{1}{2} \Vert g - f\Vert _2^2 + \tau R(f) \end{aligned}$$
(1)

where R(f) stands for the regularizer, whose role is to encode a prior on the unknown image. The prior may reflect real characteristics of the image, or may simply presume them in order to reduce the size of the search space. \(\tau \ge 0\) is a parameter that tunes the contribution of R(f) to E(f); for small values, fidelity to the observed data dominates the prior. The choice of R(f) is crucial. Over the years, intensive research has been pursued on designing good regularizers that adequately characterize the latent image while remaining efficiently solvable. The theory of compressive sensing (CS) shows that the recovery of an undersampled signal is possible as long as it exhibits sparsity in a certain domain, and this sparsity can be encoded by a regularizer. The total variation (TV) is such a regularizer: the image’s gradient is assumed to be sparse (yielding a piecewise constant – PWC – image) as a special case of CS. It is a convex functional, and thus ensures a tractable solution. TV has given rise to a diverse range of TV-inspired regularizers that aim to cope with TV’s limitations (especially the staircase artifacts caused by the PWC assumption) while preserving its convexity. One of the most popular is the total generalized variation (TGV) [4], which favors piecewise smoothness rather than PWC behavior by encoding not only first- but also higher-order information. Another such TV variant is the structure tensor total variation (STV) [8], which forms the basis of this paper together with a further variant: the directional total variation (DTV) [1], designed for uni-directional images. STV penalizes non-linear combinations of the first-order derivatives within a neighborhood (i.e. the eigenvalues of the structure tensor), while DTV penalizes weighted and rotated gradients of each pixel. STV applies semi-local regularization, whereas DTV is purely local. Moreover, STV can intrinsically decide and adaptively apply the amount of penalization, while DTV requires this to be determined beforehand through its parameters.

STV and DTV are both analysis-based convex regularizers. Here the term analysis-based refers to regularizers of the form (in the discrete setting):

$$\begin{aligned} R(f) = \sum \limits _{i=1}^{N} \phi ((\varGamma f)[i]) \end{aligned}$$
(2)

which operate directly on the image, rather than applying the regularization in a transform domain (as synthesis-based regularizers such as wavelets, curvelets, and shearlets do). In Eq. (2), \(\varGamma : \mathbb {R}^{N}\rightarrow \mathbb {R}^{N \times D}\) is a regularization operator with \((\varGamma f)[i]\) denoting the D-dimensional i-th element of the mapping \(\varGamma f\), and \(\phi : \mathbb {R}^D \rightarrow \mathbb {R}\) is a potential function. For TV regularization, \(\varGamma = \nabla \) is the gradient operator and \(\phi = \Vert \cdot \Vert _p\), where \(p=2\) for the original isotropic TV [10]. There are also anisotropic alternatives, designed to capture convex shapes with angled boundaries. For example, [7] suggested a convex TV variant that uses \(p=1\), later referred to as anisotropic TV. In [6], the authors preferred \(p \in (0,1)\), and the recent work [9] proposed a weighted mixed-norm design of the form \(\phi = w_1\Vert \cdot \Vert _1+w_2\Vert \cdot \Vert _2\). The common drawback of the last two is their non-convexity. There are also applications with \(p=0\), as in [12], which is intrinsically NP-hard. This work aims at designing such a regularizer by combining the ideas behind the STV and DTV regularizers, which are introduced in the following section.
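To make the role of \(\phi \) concrete, the sketch below (an illustration only, assuming a 2D grayscale NumPy array and forward-difference gradients) evaluates the isotropic (\(p=2\)) and anisotropic (\(p=1\)) TV choices discussed above:

```python
import numpy as np

def gradient(f):
    """Forward-difference gradient with zeros at the last row/column."""
    gx = np.zeros_like(f)
    gy = np.zeros_like(f)
    gx[:-1, :] = f[1:, :] - f[:-1, :]
    gy[:, :-1] = f[:, 1:] - f[:, :-1]
    return gx, gy

def tv(f, p=2):
    """R(f) = sum_i || (grad f)[i] ||_p over all pixels."""
    gx, gy = gradient(f)
    if p == 2:      # isotropic TV [10]
        return np.sum(np.sqrt(gx ** 2 + gy ** 2))
    if p == 1:      # anisotropic TV [7]
        return np.sum(np.abs(gx) + np.abs(gy))
    raise ValueError("only p in {1, 2} are sketched here")
```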

2 Background

By recalling Eq. (2), STV plugs in \(\varGamma = \mathcal {S}_k\) and \(\phi = \Vert \sqrt{\lambda (\cdot )}\Vert _p\). \((\mathcal {S}_kf)[i]\) stands for the structure tensor, a symmetric positive semi-definite (PSD) matrix normally defined as \((\mathcal {S}_{k}f)[i] = k_{\sigma } * ((J f)[i]^T (J f)[i])\), where \(k_\sigma \) is a Gaussian kernel of standard deviation \(\sigma \), and \(\lambda \) returns the eigenvalues of the given matrix. Here J denotes the Jacobian operator, which extends the gradient to vector-valued images: \((Jf)[i] = \big [(\nabla f^{1})[i] \ (\nabla f^{2})[i] \ \cdots \ (\nabla f^{C})[i]\big ]^T\), where the superscripted f’s denote the channels. Thus for scalar-valued images, \(J(\cdot ) = \nabla (\cdot )\). Compared to local differential operators, the semi-local structure tensor characterizes directional variations better, since it also takes the neighboring pixels into account. This information is carried by its rooted eigenvalues and their associated unit eigenvectors, which summarize the gradients within a patch supported by \(k_\sigma \) and centered at the i-th pixel.
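The following sketch (for a scalar-valued image, using SciPy's Gaussian filter as \(k_\sigma \) and central-difference gradients for brevity; all names are illustrative) computes the per-pixel structure tensor and its rooted eigenvalues:

```python
import numpy as np
from scipy.ndimage import gaussian_filter

def structure_tensor_eigvals(f, sigma=0.5):
    """Rooted eigenvalues of (S_k f)[i] = k_sigma * ((J f)[i]^T (J f)[i]) per pixel."""
    gx, gy = np.gradient(f)                    # central differences, for brevity
    sxx = gaussian_filter(gx * gx, sigma)      # smoothed entries of the 2x2 tensor
    sxy = gaussian_filter(gx * gy, sigma)
    syy = gaussian_filter(gy * gy, sigma)
    tr = sxx + syy
    disc = np.sqrt((sxx - syy) ** 2 + 4.0 * sxy ** 2)
    lam_plus = 0.5 * (tr + disc)
    lam_minus = np.clip(0.5 * (tr - disc), 0.0, None)  # PSD: clip tiny negative values
    return np.sqrt(lam_plus), np.sqrt(lam_minus)
```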

In [8], the authors also defined the structure tensor in terms of another operator they proposed, called the patch-based Jacobian \(J_k: \mathbb {R}^{N \times C} \mapsto \mathbb {R}^{N \times LC \times 2}\), which embeds the convolution kernel of support size L. It is defined as:

$$\begin{aligned} (J_k f)[i] = \big [(\tilde{\nabla } f^{1})[i], \ (\tilde{\nabla } f^{2})[i], \ \cdots , \ (\tilde{\nabla } f^{C})[i]\big ]^T \end{aligned}$$
(3)

where \((\tilde{\nabla } f^{c})[i] = \big [(\mathcal {T}_{1}\nabla f^c)[i], (\mathcal {T}_{2}\nabla f^c)[i], \cdots , (\mathcal {T}_{L}\nabla f^c)[i]\big ]\). Each l-th entry corresponds to a shifted and weighted gradient, \((\mathcal {T}_{l} \nabla f^c)[i] = \omega [p_l](\nabla f^c)[x_i-p_l]\), where \(x_i\) denotes the actual 2D coordinates of the i-th pixel, \(p_l \in \mathcal {P}\) is the shift amount such that each \(x_i-p_l\) lies within the \(\mathcal {P}\)-neighborhood of \(x_i\), and the weight is determined by the smoothing kernel \({k}_{\sigma }\) as \(\omega [p_l] = \sqrt{k_\sigma [p_l]}\). This new operator allows us to decompose the nonlinear \((\mathcal {S}_k f)[i]\) into the linear \((J_k f)[i]\)’s as follows:

$$\begin{aligned} (\mathcal {S}_{k}f)[i] = (J_k f)[i]^T (J_k f)[i] \end{aligned}$$
(4)

so that the new formulation reveals an easier way of employing optimization tools. The rooted eigenvalues of \((\mathcal {S}_{k}f)[i]\) coincide with the singular values of \((J_kf)[i]\), which paves the way for using Schatten p-norms to redefine STV, this time by plugging \(\varGamma = J_k\) and \(\phi = \Vert \cdot \Vert _{S_p}\) into Eq. (2). The \(S_p\) matrix norm is simply the \(\ell _p\)-norm of the vector containing the singular values of the matrix in question, which is \((J_k f)[i]\) in our case. Therefore, the STV regularizer, in both of its equivalent forms, is given below:

$$\begin{aligned} STV(f) = \sum _{i=1}^{N} \Vert \sqrt{\lambda ((\mathcal {S}_{k}f)[i])}\Vert _p = \sum _{i=1}^{N} \Vert (J_kf)[i] \Vert _{S_p} \end{aligned}$$
(5)

where \({\lambda ((\mathcal {S}_{k}f)[i])} = [ {\lambda ^+((\mathcal {S}_{k}f)[i])}, {\lambda ^-((\mathcal {S}_{k}f)[i])} ]^T\).
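A sketch of Eq. (5) for a scalar-valued image (\(C=1\)) is given below; it builds the per-pixel patch-based Jacobian with the \(\sqrt{k_\sigma }\) weighting described above and sums the Schatten p-norms of the resulting \(L \times 2\) blocks. The periodic boundary handling, the normalized Gaussian support, and all function names are assumptions of this sketch:

```python
import numpy as np

def patch_jacobian(f, sigma=0.5, radius=2):
    """(J_k f)[i] as an (L x 2) block per pixel, L = (2*radius+1)**2, for C = 1.

    Row l holds sqrt(k_sigma[p_l]) * (grad f)[x_i - p_l]; the shift is implemented
    with np.roll, i.e. periodic boundaries (an assumption of this sketch).
    """
    gx, gy = np.gradient(f)                       # central differences, for brevity
    offsets = [(a, b) for a in range(-radius, radius + 1)
                      for b in range(-radius, radius + 1)]
    k = np.exp(-np.array([a * a + b * b for a, b in offsets]) / (2.0 * sigma ** 2))
    k /= k.sum()                                  # normalized Gaussian weights k_sigma
    J = np.zeros(f.shape + (len(offsets), 2))
    for l, (a, b) in enumerate(offsets):
        w = np.sqrt(k[l])
        J[..., l, 0] = w * np.roll(np.roll(gx, a, axis=0), b, axis=1)
        J[..., l, 1] = w * np.roll(np.roll(gy, a, axis=0), b, axis=1)
    return J

def stv(f, p=1, sigma=0.5, radius=2):
    """STV(f) = sum_i || (J_k f)[i] ||_{S_p}, via the singular values of each block."""
    J = patch_jacobian(f, sigma, radius)
    sv = np.linalg.svd(J, compute_uv=False)       # per-pixel singular values
    return np.sum(np.linalg.norm(sv, ord=p, axis=-1))
```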

The DTV, on the other hand, simply substitutes \(\varGamma = \nabla \) and \(\phi = \Vert \varLambda _{\alpha } R_{-\theta } (\cdot ) \Vert _2\), where \({R}_{\theta } = \left[ \begin{array}{cc} \cos \theta & -\sin \theta \\ \sin \theta & \cos \theta \end{array} \right] \) and \(\varLambda _\alpha = \left[ \begin{array}{cc} \alpha & 0 \\ 0 & 1 \end{array} \right] \). Here \(\theta \) corresponds to the dominant direction (which has to be pre-determined), and \(\alpha \) tunes the amount of penalization along that direction. DTV stipulates that the image to be recovered exhibits a global directional dominance, and it is only designed for scalar-valued images.
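A corresponding sketch of the DTV penalty (scalar image; \(\theta \) in radians and \(\alpha \) assumed given; central-difference gradients for brevity):

```python
import numpy as np

def dtv(f, theta, alpha):
    """DTV(f) = sum_i || Lambda_alpha R_{-theta} (grad f)[i] ||_2 for a scalar image."""
    gx, gy = np.gradient(f)
    c, s = np.cos(theta), np.sin(theta)
    rx = c * gx + s * gy        # R_{-theta}: rotate gradients into the dominant frame
    ry = -s * gx + c * gy
    return np.sum(np.sqrt((alpha * rx) ** 2 + ry ** 2))   # Lambda_alpha scales one axis
```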

The structure tensor has the ability to capture first-order information within a local neighborhood. This neighborhood-awareness makes it more robust to noise (or other types of deterioration) than the plain gradient. Since STV is designed around the rooted eigenvalues of the structure tensor, it codifies the image variation at a point better than regularizers that penalize the gradient magnitude (such as TV and DTV). However, STV may not always distinguish the edges under excessive noise, and may therefore smooth them out. Even though DTV’s choice of penalizing the gradient magnitude makes it vulnerable to noise, it copes with noise well along the edges, since its gradients are rotated toward a favorable direction and scaled according to a favorable amount of smoothness.

3 Method

Let us define a new operator \(\varPi \) (a composition of \(\varLambda _{\alpha }\) and \(R_{-\theta }\)) that, when applied to the patch-based Jacobian, acts on the gradients within the patch as follows:

$$\begin{aligned} (\varPi J_kf)[i] = \left[ \begin{array}{cccc} (\mathcal {T}_{1} \varLambda _{\alpha } R_{-\theta } \nabla f^1)[i] & (\mathcal {T}_{2}\varLambda _{\alpha } R_{-\theta }\nabla f^1)[i] & \cdots & (\mathcal {T}_{L}\varLambda _{\alpha } R_{-\theta }\nabla f^1)[i] \\ (\mathcal {T}_{1} \varLambda _{\alpha } R_{-\theta } \nabla f^2)[i] & (\mathcal {T}_{2}\varLambda _{\alpha } R_{-\theta }\nabla f^2)[i] & \cdots & (\mathcal {T}_{L}\varLambda _{\alpha } R_{-\theta }\nabla f^2)[i]\\ \vdots & \vdots & \ddots & \vdots \\ (\mathcal {T}_{1} \varLambda _{\alpha } R_{-\theta } \nabla f^C)[i] & (\mathcal {T}_{2}\varLambda _{\alpha } R_{-\theta }\nabla f^C)[i] & \cdots & (\mathcal {T}_{L}\varLambda _{\alpha } R_{-\theta }\nabla f^C)[i] \end{array} \right] \end{aligned}$$
(6)

We call this operator the directional patch-based Jacobian. Building on it, we propose a new regularizer, the DSTV functional, of the form:

$$\begin{aligned} DSTV(f) = \sum _{i = 1}^N \Vert (\varPi J_kf)[i] \Vert _{S_p} \end{aligned}$$
(7)

where DSTV abbreviates direction-guided STV, here and hereafter. By means of Eq. (7), the eigenvalues of the structure tensor can be penalized in favor of the predetermined direction: they now summarize the rotated and scaled gradients, thus incorporating the prior knowledge. Moreover, the presence of the \(\varLambda _\alpha R_{-\theta }\)’s does not destroy the convexity of the regularizer.
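Reusing the patch_jacobian sketch from Sect. 2, DSTV only changes the per-pixel blocks by applying \(\varLambda _\alpha R_{-\theta }\) to every shifted gradient before taking the Schatten norm (again a sketch for \(C=1\), with illustrative names):

```python
import numpy as np

def dstv(f, theta, alpha, p=1, sigma=0.5, radius=2):
    """DSTV(f) = sum_i || (Pi J_k f)[i] ||_{S_p} for a scalar image."""
    J = patch_jacobian(f, sigma, radius)       # (n1, n2, L, 2), from the STV sketch
    c, s = np.cos(theta), np.sin(theta)
    A = np.array([[alpha * c, alpha * s],      # Lambda_alpha @ R_{-theta}
                  [-s,        c       ]])
    PiJ = J @ A.T                              # apply to every 2-vector (row) of J
    sv = np.linalg.svd(PiJ, compute_uv=False)
    return np.sum(np.linalg.norm(sv, ord=p, axis=-1))
```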

Equation (7) can equivalently be written as a support function of the set \(B_{F} = \lbrace \varUpsilon \in \mathbb {R}^{LC \times 2}: \Vert \varUpsilon \Vert _{S_2} \le 1 \rbrace \), which is the unit ball of the \(S_2\) (Frobenius) matrix norm:

$$\begin{aligned} DSTV(f) = \sum _{i=1}^{N} \underset{\varUpsilon \in B_{F}}{\mathrm{sup}} \langle (\varPi J_k f)[i], \varUpsilon \rangle \end{aligned}$$
(8)

where \(\langle \cdot , \cdot \rangle \) denotes the inner product. The right-hand side of Eq. (8) can be rewritten in the compact form \(\underset{\varPsi [i] \in B_{F}}{\mathrm{sup}} \langle \varPi J_k f, \varPsi \rangle \) by introducing another variable \(\varPsi \in \mathbb {R}^{N \times LC \times 2}\), where \(\varPsi [i]\) is its i-th submatrix. This expression allows us to use the available algorithms for orthogonal projection onto the convex set \(B_{F}\).

Therefore, DSTV-regularized denoising requires solving the following minimization problem:

$$\begin{aligned} \underset{f \in \mathcal {C}}{\mathrm{min}} \ \frac{1}{2}\Vert g - f \Vert _2^2 + \tau \underset{\varPsi [i] \in B_{F}}{\mathrm{sup}} \langle {\varPi J_k}f, \varPsi \rangle \end{aligned}$$
(9)

where \(\mathcal {C}\) is simply a set encoding an additional constraint on f (e.g. a box constraint); in the unconstrained case, \(\mathcal {C} = \mathbb {R}^{NC}\).
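For instance, when \(\mathcal {C}\) is the box of valid intensities [0, 1], the projection \(P_\mathcal {C}\) used in the remainder of this section reduces to a simple clip (a minimal sketch):

```python
import numpy as np

def proj_box(f, lo=0.0, hi=1.0):
    """Orthogonal projection onto the box constraint set C = [lo, hi]."""
    return np.clip(f, lo, hi)
```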

Due to DSTV’s nonsmoothness, solving Eq. (9) is nontrivial. However, since the convexity proven for STV in [8] is preserved in Eq. (8), the minimizer is given by the proximal map of DSTV, which can be written as:

$$\begin{aligned} \hat{f} = \text {prox}_{\tau DSTV(f)}(g) := \underset{f \in \mathcal {C}}{\mathrm{arg min}} \ \frac{1}{2}\Vert g - f \Vert _2^2 + \tau \underset{\varPsi [i] \in B_{F}}{\mathrm{sup}} \langle f, {J^*_k}\varPi ^*\varPsi \rangle \end{aligned}$$
(10)

where \(J^*_k\) and \(\varPi ^*\), which arise once f is isolated, denote the adjoint operators. \(J^*_k\) was defined in [8] as:

$$\begin{aligned} (J^*_k X) [k] = \sum _{l=1}^{L} - \text {div}(\mathcal {T}^*_{l}X[i,s]) \end{aligned}$$
(11)

where \(s = (c-1)L+l\) and \(k=(c-1)N+n\) with \(1 \le n \le N\) and \(1 \le c \le C\). \(\mathcal {T}^*_{l}\) corresponds to the adjoint of \(\mathcal {T}_{l}\), and it acts on X[i, s] in a column-wise manner, where \(X[i,s] \in \mathbb {R}^2\) is the s-th row of the i-th submatrix of an arbitrary \(X \in \mathbb {R}^{N \times LC \times 2}\). In Eq. (10), \(X = \varPi ^*\varPsi \), where the operator \(\varPi ^*\) acts in the same way as \(\varPi \), except that the operator pair \(\varLambda _\alpha R_{-\theta }\) applied to the two-dimensional vector elements (\(\varPsi [i,s]\) in this case) is replaced by \(R_\theta \varLambda _\alpha \). Also, in Eq. (11), \(\text {div}\) is the discrete divergence, whose definition depends on the discretization scheme used for the gradient; in [8], it is defined using backward differences, since the gradient is discretized using forward differences.
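A standard way to validate an adjoint implementation such as Eq. (11) is the inner-product test \(\langle J_k f, \varPsi \rangle = \langle f, J^*_k \varPsi \rangle \) on random inputs. The sketch below assumes hypothetical callables Jk and Jk_adj for the forward and adjoint operators:

```python
import numpy as np

def adjoint_test(Jk, Jk_adj, shape, trials=5, tol=1e-10, seed=0):
    """Check <J_k f, Psi> == <f, J_k^* Psi> for random f and Psi."""
    rng = np.random.default_rng(seed)
    for _ in range(trials):
        f = rng.standard_normal(shape)
        Jf = Jk(f)
        Psi = rng.standard_normal(Jf.shape)
        lhs = np.vdot(Jf, Psi)                 # <J_k f, Psi>
        rhs = np.vdot(f, Jk_adj(Psi))          # <f, J_k^* Psi>
        assert abs(lhs - rhs) <= tol * max(1.0, abs(lhs)), (lhs, rhs)
    return True
```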

To solve Eq. (10), we follow the fast gradient projection (FGP) method [2], which combines the dual approach introduced in [5] for TV-based denoising with the convergence accelerator FISTA [3]. Equation (10) can first be expressed as a minimax problem for the objective \(\mathcal {L}(f,\varPsi ) = \frac{1}{2}\Vert g - f \Vert _2^2 + \tau \langle f, {J^*_k}\varPi ^* \varPsi \rangle \), which is convex w.r.t. f and concave w.r.t. \(\varPsi \); thus, it has a saddle point that does not change when the minimum and the maximum are swapped, as shown below:

$$\begin{aligned} \underset{f \in \mathcal {C}}{\mathrm{min}} \ \underset{\varPsi [i] \in B_{F}}{\mathrm{max}} \ \mathcal {L}(f,\varPsi ) = \mathcal {L}(\hat{f},\hat{\varPsi }) = \underset{\varPsi [i] \in B_{F}}{\mathrm{max}} \ \underset{f \in \mathcal {C}}{\mathrm{min}} \ \mathcal {L}(f,\varPsi ) \end{aligned}$$
(12)

Maximizing the dual problem \(d(\varPsi ) = \underset{f \in \mathcal {C}}{\mathrm{min}} \ \mathcal {L}(f,\varPsi )\) on the right-hand side is equivalent to minimizing the primal problem on the left-hand side. Once the maximizer \(\hat{\varPsi }\) of \(d(\varPsi )\) is found, it can be used to obtain the minimizer \(\hat{f}\) of the primal problem. In order to find \(\hat{\varPsi }\), we first derive the minimizer \(\hat{f}\), in terms of \(\varPsi \), as:

$$\begin{aligned} \hat{f} = \underset{f \in \mathcal {C}}{\mathrm{argmin}} \ \mathcal {L}(f,{\varPsi }) \end{aligned}$$
(13)

where, by expanding the quadratic and completing the square, \(\mathcal {L}(f,{\varPsi }) = \frac{1}{2}\Vert f - (g - \tau J^*_k(\varPi ^* {\varPsi })) \Vert ^2_2 + M\), with M collecting the terms that do not depend on f. The solution of Eq. (13) is therefore \(\hat{f} = P_\mathcal {C}(g - \tau J^*_k \varPi ^*{\varPsi })\), where \(P_\mathcal {C}(\cdot )\) is the orthogonal projection onto the convex set \(\mathcal {C}\). We then plug \(\hat{f}\) back into \(\mathcal {L}(f,\varPsi )\) to obtain the dual problem \(d(\varPsi ) = \mathcal {L}(\hat{f},\varPsi ) = \frac{1}{2} \Vert w - P_\mathcal {C}(w)\Vert ^2_2 + \frac{1}{2} (\Vert g\Vert ^2_2 - \Vert w \Vert ^2_2)\), where \(w = g - \tau J^*_k \varPi ^*{\varPsi }\). As opposed to the primal problem, \(d(\varPsi )\) is smooth with a well-defined gradient:

$$\begin{aligned} \nabla d(\varPsi ) = \tau \varPi J_k P_\mathcal {C}(g - \gamma \tau J^*_k \varPi ^*{\varPsi }) \end{aligned}$$
(14)

obtained based on the derivation in Lemma 4.1 of [2]. From this point, the maximizer \(\hat{\varPsi }\) of \(d(\varPsi )\) can be found using the idea of projected gradients [5], which iteratively alternates between gradient steps and projections onto a set. In our case, each \(\varPsi [i]\) is projected onto the set \(B_F\), where the projection is defined as:

$$\begin{aligned} P_{B_F}(\varPsi [i]) = \frac{\varPsi [i]}{\text {max}(1,\Vert \varPsi [i] \Vert _F)} \end{aligned}$$
(15)

in [8], to which the reader is referred for the derivation. For the gradient ascent, an appropriate step size that ensures convergence is needed. Since the gradient in Eq. (14) is Lipschitz continuous, a constant step size 1/L(d) can be used, where L(d) is the Lipschitz constant with upper bound \(L(d) \le 8\sqrt{2}(\alpha ^2+1)\gamma ^2\tau ^2\) (see Appendix A for the derivation).
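When a tighter constant is desired, the squared operator norm \(\Vert \varPi J_k\Vert _2^2\) can also be estimated numerically by power iteration; since \(P_\mathcal {C}\) is nonexpansive, \(\tau ^2\Vert \varPi J_k\Vert _2^2\) then bounds \(L(d)\) (taking \(\gamma = 1\)). The sketch below assumes hypothetical callables PiJk and PiJk_adj for \(\varPi J_k\) and its adjoint:

```python
import numpy as np

def op_norm_sq(PiJk, PiJk_adj, shape, n_iter=50, seed=0):
    """Estimate ||Pi J_k||_2^2 by power iteration on the operator J_k^* Pi^* Pi J_k."""
    rng = np.random.default_rng(seed)
    f = rng.standard_normal(shape)
    f /= np.linalg.norm(f)
    for _ in range(n_iter):
        f = PiJk_adj(PiJk(f))
        f /= np.linalg.norm(f)
    return float(np.vdot(f, PiJk_adj(PiJk(f))))   # Rayleigh quotient ~ largest eigenvalue
```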

As a consequence, the overall procedure is summarized in Algorithm 1, and a sketch is given below. The FISTA procedure refers to the fast iterative shrinkage-thresholding algorithm proposed in [3], and is given in Appendix B.

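A compact sketch of the resulting FGP iteration for Eq. (9) is given below. It assumes hypothetical callables PiJk (the operator \(\varPi J_k\)), PiJk_adj (its adjoint \(J^*_k\varPi ^*\)), and proj_C (the projection onto \(\mathcal {C}\)); the per-submatrix projection onto \(B_F\) follows Eq. (15), and \(\gamma \) is taken as 1:

```python
import numpy as np

def proj_BF(Psi):
    """Project every submatrix Psi[i] onto the Frobenius unit ball, cf. Eq. (15)."""
    norms = np.linalg.norm(Psi, axis=(-2, -1), keepdims=True)
    return Psi / np.maximum(1.0, norms)

def fgp_denoise(g, tau, PiJk, PiJk_adj, proj_C, L_d, n_iter=100):
    """FISTA-accelerated gradient projection on the dual of Eq. (9)."""
    Psi = np.zeros_like(PiJk(g))                 # dual variable of shape (N, L*C, 2)
    Y = Psi.copy()                               # extrapolated point
    t = 1.0
    for _ in range(n_iter):
        # ascent step on d(Psi), cf. Eq. (14), followed by projection onto B_F
        grad = tau * PiJk(proj_C(g - tau * PiJk_adj(Y)))
        Psi_new = proj_BF(Y + grad / L_d)
        # FISTA extrapolation
        t_new = 0.5 * (1.0 + np.sqrt(1.0 + 4.0 * t * t))
        Y = Psi_new + ((t - 1.0) / t_new) * (Psi_new - Psi)
        Psi, t = Psi_new, t_new
    return proj_C(g - tau * PiJk_adj(Psi))       # recover the primal minimizer
```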

4 Experimental Results

In this section, we perform experiments to compare the performance of our regularizer with its predecessor STV and its influencer DTV. Since STV's superiority over the baseline TV and over TGV has already been shown in [8], they are not included among the competing algorithms. Figure 1 shows all the uni-directional grayscale and color images used in the experiments. Among these images, reed, sea shell, cotton bud, and feather were taken from the Amsterdam Library of Textures (ALOT) dataset, while the others are public domain images. We use the peak signal-to-noise ratio (PSNR) and the structural similarity index (SSIM) [11] to assess the performance of the algorithms. DSTV has been implemented on top of STV, whose source code is publicly available on the author's (Lefkimmiatis, S.) website. All methods were implemented in MATLAB and run on the CPU only. The runtimes were measured on a computer equipped with an Intel Core i7-7500U processor (2.70 GHz) and 16 GB of memory.

Fig. 1. Thumbnails of the grayscale cotton bud, feather, and straw images, and of the color reed, grass, paper, sea shell, palm, and cat fur images (all of size 256 \(\times \) 256) used in the experiments. (Color figure online)

Fig. 2. The restored versions of the noisy (\(\sigma _{\eta }=0.2\)) feather and straw images (col. 2), using the DTV (col. 3), STV (col. 4), and DSTV (col. 5) regularizers. The value pairs shown at the bottom of each image are the corresponding (PSNR, SSIM) scores. Detail patches cropped from each image are also provided.

For both STV and DSTV, the convolution kernel \(k_\sigma \) of the structure tensor is chosen to be a \(5 \times 5\) pixel Gaussian window with standard deviation \(\sigma = 0.5\) pixels. This choice prioritizes reconstruction quality; selecting a \(3 \times 3\) window instead reduces the computational complexity, and hence the reported runtimes (nearly down to DTV's runtimes), at the cost of a small loss in quality. In all experiments, \(\alpha \) for DTV and DSTV, and the regularization parameter \(\tau \) for all competing regularizers, are optimized such that they yield the best possible PSNR. Furthermore, the \(\theta \) value used by DTV and DSTV is estimated from the observed image by computing the DTV measure over a range of angles while keeping \(\alpha > 1\) constant; the angle yielding the smallest DTV is picked as the dominant direction.
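The dominant-direction search described above can be sketched as follows (reusing the dtv function from Sect. 2; the angular grid and the fixed \(\alpha \) value are assumptions of this sketch):

```python
import numpy as np

def estimate_theta(g, alpha=2.0, n_angles=180):
    """Pick the angle that minimizes the DTV measure of the noisy observation g."""
    angles = np.linspace(0.0, np.pi, n_angles, endpoint=False)
    scores = [dtv(g, th, alpha) for th in angles]   # dtv from the DTV sketch in Sect. 2
    return angles[int(np.argmin(scores))]
```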

The grayscale results of DSTV against DTV and STV are reported in Table 1. The PSNR and SSIM performances are listed, along with the respective runtimes, for four different noise levels applied to the first three images shown in Fig. 1. Although DSTV clearly yields better recovery than the others, for the cotton bud and feather images DTV almost catches up with DSTV, especially in terms of PSNR. This is because those images are coherently uni-directional (except for the bottom left and top right corners of the feather image), with smooth, non-textural, and continuous stripes. Denoising this kind of image is an easy task for DTV, since its enforced direction of smoothness coincides with the correct pattern. Even so, for the higher-frequency feather image, the SSIM gap between DTV and DSTV is noticeably larger than that obtained for the cotton bud image. When it comes to the straw image, deviations from the dominant direction work against both DTV and DSTV. The results on the straw image better reflect DSTV's superiority over DTV, which arises from the semi-localness of the structure tensor; this semi-localness makes DSTV more robust to errors in the pre-determined \(\theta \). For visual assessment, Fig. 2 shows the restored versions of the feather and straw images for the three regularizers under consideration. As one can observe, STV causes oil-painting-like artifacts on the edges, yet it is the only regularizer that manages to restore the texture at the bottom left corner of the feather image, where the local direction does not match the dominant one. DTV, on the other hand, over-smooths and damages the rough textures in the details, which are well preserved by DSTV (see the detail images in Fig. 2).

Table 1. Assessing the PSNR/SSIM performances of the DTV [1], STV [8], and DSTV regularizers (along with their elapsed runtimes) on the scalar-valued (grayscale) images shown in Fig. 1.
Table 2. Assessing the PSNR/SSIM performances of the STV [8] and DSTV denoisers (along with their elapsed runtimes) on the vector-valued (color) images shown in Fig. 1.
Fig. 3. The restored versions of the noisy (\(\sigma _{\eta }=0.2\)) reed, paper, sea shell, and palm images (col. 2), using the STV (col. 3) and DSTV (col. 4) regularizers. The value pairs shown at the bottom of each image are the corresponding (PSNR, SSIM) scores. Detail patches cropped from each image are also provided. (Color figure online)

Table 2 and Fig. 3 report the results obtained on the color images. Since DTV is only designed for scalar-valued images, it is excluded from these experiments. According to the results in the table, DSTV systematically outperforms STV on our uni-directional image denoising problem. Its superiority becomes more apparent for high-frequency images with fewer deviations from the pre-determined dominant direction (e.g. palm). For the sea shell image, on the other hand, the PSNR and SSIM values obtained by the two regularizers are almost the same, owing to the smooth transitions of the stripes on the shell from one direction to another. For the same image, however, the benefit of DSTV can be seen visually by comparing the detail images in Fig. 3(k) and (l). In contrast to the quantitative results, these images reflect DSTV's ability to preserve texture. The same applies to the cat fur image, which arguably may not even be categorized as a uni-directional image. To evaluate DSTV's robustness to small deviations from the dominant direction, one can also inspect the first row of Fig. 3. Overall, DSTV's idea of steering the STV minimization process in favor of the dominant direction produces more appealing denoising results for nearly uni-directional images, whereas STV fails to restore high-frequency details under excessive noise, since the equal contributions of the rapidly changing gradients within a neighborhood may mislead the process.

5 Conclusion and Future Work

In this paper, we approached the denoising problem from a variational perspective and proposed a new analysis-based regularizer for uni-directional images. We utilized DTV's idea of incorporating prior knowledge of the latent image's directional dominance into the inversion process, while designing our regularizer DSTV as a variant of STV. Throughout the paper, only uni-directional images were considered. Even though this scenario is biased toward DTV and DSTV, it provides valuable insight into how the guidance of a directional prior can improve one of the state-of-the-art regularizers. As shown by the experimental results, DSTV systematically outperformed its two predecessors. This encourages us to pursue an extension of DSTV that can work with spatially varying \(\theta \)'s and \(\alpha \)'s extracted from the observed image. A relevant research question is whether it is possible to develop a preprocessing algorithm that extracts robust derivative-based directional descriptors and feeds them into the DSTV-based denoising machinery.