Adaptive total variation based image segmentation with semi-proximal alternating minimization

doi:10.1016/j.sigpro.2021.108017

Signal Processing

Volume 183, June 2021, 108017

https://doi.org/10.1016/j.sigpro.2021.108017 Get rights and content

Highlights

•
We propose a novel adaptive total variation-based segmentation model combining the weighted matrix with the gradient operator.
•
Our model has a unique minimizer and can be solved effectively by the semi-proximal alternating direction method of multipliers with guaranteed convergence.
•
We utilize the K-means method to select the thresholds automatically and get good restoration and segmentation results for complex situations.

Abstract

To improve the image segmentation quality, it is important to adequately describe the local features of targets in images. In this paper, we develop a novel adaptive total variation based two-stage segmentation approach to restore and segment images under complex degradations. To find a smooth approximation solution in the first stage, we introduce an effective regularization term that combines an adaptive weighted matrix with the gradient operator. The adaptive weighted matrix gives different penalties in the axis directions to enhance the diffusion along the tangent direction of the edge. It can filter out the details far away from the edge and preserve the main structure of targets. For the convex objective function in the first stage, a semi-proximal alternating direction method of multipliers (sPADMM) with guaranteed convergence is successfully employed. We utilize the K-means method to select thresholds automatically and complete the segmentation by thresholding the image into different regions in the second stage. Extensive experimental comparisons between our method and some state-of-the-art methods including a deep learning approach are provided. All numerical results illustrate clearly that our method has better performance for different kinds of segmentation and restoration tasks.

Introduction

Image segmentation has always been a hot topic, which aims to divide an image domain into mutually disjoint regions reasonably. This plays a significant role in many application fields, such as medical imaging [1], [2], vehicle license plate recognition [3], [4], etc. During the past decades, many excellent approaches have been proposed [5], [6], [7], [8], [9], [10]. In 1989, one of the most important segmentation models called MS model was proposed by Mumford and Shah [6], which was a landmark achievement in segmentation fields. Let $Ω \subset R^{2}$ be a bounded open connected set, and $Γ$ be a compact curve in $Ω,$ the MS model can be formulated as $E_{MS} (u, Γ; Ω) = \frac{λ}{2} \int_{Ω} {(f - u)}^{2} d x + \frac{μ}{2} \int_{Ω ∖ Γ} {| \nabla u |}^{2} d x + Length (Γ),$ where $λ, μ$ are positive parameters, $f : Ω \to R$ is the degraded image and $u : Ω \to R$ is the ideal image. Here, the length of $Γ$ can be written as $H^{1} (Γ),$ i.e., the 1-dimensional Hausdorff measure in $R^{2}$ . Model (1) adopted the energy minimization equation to find an approximate solution, but it is difficult to solve due to its nonconvexity.

Some early attempts [11], [12] to solve (1) were done by approximating it using a sequence of simpler elliptic problems. Meanwhile, more works have focused on exploring the MS model such as active contour methods [13], [14], graph cut methods [15] and convex relaxation approaches [16], [17], etc. In 2013, Cai et al. proposed a two-stage image segmentation strategy using a convex variant of the MS model and thresholding [18], which got good segmentation results. The first stage is to find a smooth approximation solution $u$ by minimizing the energy functional: $inf_{u} {\frac{λ}{2} \int_{Ω} {(f - A u)}^{2} d x + \frac{μ}{2} \int_{Ω} {| \nabla u |}^{2} d x + \int_{Ω} | \nabla u | d x},$ where $A$ is the problem related linear operator. Once $u$ is obtained, the segmentation is done by thresholding $u$ properly in the second stage. Since then, many novel image segmentation methods [19], [20], [21], [22], [23], [24] have been proposed based on the two-stage idea [18], and they show the subtle connection between image restoration and segmentation to some extent. The two-stage strategy and corresponding improved models have noticeable virtues in image segmentation, refer to [18], [19], [23] for details.

As we know, image restoration (including denoising, deblurring, inpainting, etc.) is to estimate a clean image from the degraded image. And the segmentation can be used as the preprocessing or postprocessing for restoration. In [19], Chan et al. used the two-stage method for segmenting blurry images with Poisson or multiplicative Gamma noise. Duan et al. [20] introduced Euler’s Elastica as the regularization in the MS model based on a two-stage segmentation strategy to capture the geometry of object shapes in noisy images with missing pixels. In [24], the discrete-MS model was proposed, and the proximal alternating linearized minimization algorithm was employed to deal with the objective function to restore a degraded image and extract its contours. In [23], authors explored a linkage between the PCMS model [6] and the ROF model [25], then derived a novel thresholded-ROF (T-ROF) segmentation method to illustrate the advantages of image segmentation through image restoration techniques. However, as the regularizers of the aforementioned methods are isotropic, non-ideal results can be generated, especially when images with complex geometry structures.

Many previous works [18], [25], [26], [27] have considered the total variation (TV) regularization and anisotropic filtering as standard methods for image restoration because of their capability to detect and maintain image edges. However, the anisotropic filtering produces artifacts and the TV regularization causes the staircase effect [25], [28]. In order to overcome these drawbacks, many approaches [28], [29], [30], [31], [32] were proposed by combining the TV regularization and anisotropic filtering, where the isotropic TV norm is replaced by an anisotropic term. For example, Pang et al. proposed an anisotropic TV-based restored model combining a weighted matrix with the gradient operator [30]. A model based on the adaptive weighted TV $^{p}$ regularization in [31] was subsequently proposed for image denoising, where the rotation matrix and weight matrix are embedded in TV $^{p}$ regularization.

Different from the reweighted $l_{1}$ algorithm [33], [34], the weighted $l_{1} - l_{2}$ algorithm [35], [36], TV $^{p}$ [37], [38], [39] and the nuclear/ $l_{2, 1}$ -norm regularization [40], we often need that the target model diffuses along the tangent direction of the local features. Generally, the setting for $\nabla_{x}$ and $\nabla_{y}$ with the same weight cannot efficiently couple with the local features. To manifest the geometry structures of targets in images adequately, we give different penalties in the $x$ -axis and $y$ -axis directions according to the gradient information $\nabla_{x}$ and $\nabla_{y}$ . Based on the above facts, we propose an adaptive TV-based two-stage segmentation model with the following novelties:

a.
We present a robust and efficient anisotropic segmentation model, where the regularization combines the adaptive weighted matrix $T$ (see details in Section 2.2) and gradient operator ( $\nabla_{x},$ $\nabla_{y}$ ) to describe geometry structure details and more edge information of segmentation objects in images. The adaptive weighted matrix $T$ has good properties in capturing elongated fine structures, which are useful for fine structure segmentation.
b.
To deal with the adaptive TV-based convex objective function, an elegant and effective numerical method is designed through the sPADMM algorithm with guaranteed convergence [41], [42]. Furthermore, extensive experiments demonstrate that the proposed algorithm and model outperform advanced methods on kinds of image segmentation problems under various degraded situations such as noise, blur and missing pixels, etc.

The outline of this paper is as follows. In Section 2, we introduce our anisotropic segmentation model and present the properties of the model. In Section 3, the sPADMM algorithm is carefully applied to solve the convex optimization model and the convergence analysis of the algorithm is shown. We list some preparation works of numerical experiments in Section 4. In Section 5, numerical examples are presented to compare different methods including a celebrated U-Net approach. Finally, the conclusions are drawn in Section 6.

Section snippets

The proposed model

This section mainly introduces our proposed method. And we show that our model has a unique solution under mild conditions. Recall that $Ω \subset R^{2}$ is a bounded open connected set. Let $X$ be the function from $Ω$ to $R$ . Then $f, u \in X$ are images living on $Ω$ . Throughout the paper, we do not distinguish the linear operator and the corresponding discrete matrix.

Algorithm

This section focuses on our algorithm, where we exhibit the solving process of model (3) and the convergence analysis of the algorithm. In the discrete settings, we assume that the gray image is an $M \times N$ matrix, denote the Euclidean space $R^{M \times N}$ by $X$ and set $Y = X \times X$ .

Preparation for numerical experiments

To verify the effectiveness and robustness of our method, we compare our method with state-of-the-art approaches (CV [13], SaT [18], BCEEMS [20], ITCV [58], ICTM-LIF [10], T-ROF [23], WBHMS [32], U-Net [1], SLAT [59]) on some natural images. Except for the operation of the U-Net [1] experiment, all other experiments are performed via using MATLAB R2020a and Windows 10(x64) on a PC with Intel Core (TM) i7 10750H CPU 2.60GHz and 16.0GB memory. The U-Net is conducted under PyTorch and Windows

Image segmentation with different levels of noise and blur

In this example, we test our method on images with different noise levels and blur to show the stability and robustness of our model. In Fig. 8(b) and (c), we add Gaussian noise with mean 0 and $σ^{2}$ =0.3, $σ^{2}$ =0.5, respectively. In Fig. 8(d)--(f), we add $B_{d}$ =fspecial(‘disk’, 5), $B_{m}$ =fspecial(‘motion’, 15, 45), $B_{g}$ =fspecial(‘gaussian’, 12, 12) respectively, and then Gaussian noise with mean value 0 and variance $σ^{2}$ =0.01 on all of them, where $B_{d},$ $B_{m}$ and $B_{g}$ means disk blur, motion blur and Gaussian blur.

Conclusions

In this paper, we developed an adaptive TV-based two-stage segmentation model. We introduced an effective adaptive TV regularization, which can smooth out unimportant details in intrinsic regions of targets and preserve characteristic structures such as edges, sharp corners, etc. A suitable weighted matrix can make our method more robust by coupling with the local structure information of the object. The convex minimization problem can be solved by the efficient sPADMM algorithm with guaranteed

CRediT authorship contribution statement

Tingting Wu: Conceptualization, Methodology, Writing - original draft. Xiaoyu Gu: Data curation, Software. Youguo Wang: Formal analysis, Validation. Tieyong Zeng: Project administration, Supervision.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgment

The research was supported by the Natural Science Foundation of China (No. 61971234, 11501301, 11671002, 62071248), CUHK start-up and CUHK DAG 4053296, 4053342, 4053405, RGC 14300219, RGC 14302920, NSFC/RGC N_CUHK 415/19, Hunan Provincial Key Laboratory of Mathematical Modeling and Analysis in Engineering (2018MMAEYB03), Postgraduate Research & Practice Innovation Program of Jiangsu Province (Grant No. SJCX20_0229), the “1311 Talent Plan” of NUPT, and the QingLan Project for Colleges and

References (63)

V. Khare et al.
A novel character segmentation-reconstruction approach for license plate recognition
Expert Syst. Appl.
(2019)
C. Liu et al.
Weighted variational model for selective image segmentation with application to medical images
Pattern Recognit.
(2018)
C. Liu et al.
Weighted variational model for selective image segmentation with application to medical images
Pattern Recognit.
(2018)
L.I. Rudin et al.
Nonlinear total variation based noise removal algorithms
Physica D
(1992)
Z.-F. Pang et al.
Image denoising via a new anisotropic total-variation-based model
Signal Process.
(2019)
Z.-F. Pang et al.
Image denoising based on the adaptive weighted TV $^{p}$ regularization
Signal Process.
(2020)
Y. Yang et al.
A weighted bounded Hessian variational model for image labeling and segmentation
Signal Process.
(2020)
F. Li et al.
Variable exponent functionals in image restoration
Appl. Math. Comput.
(2010)
Z.F. Pang et al.
Image restoration via the adaptive TV $^{p}$ regularization
Comput. Math. Appl.
(2020)
J.C. Bezdek et al.
FCM: the fuzzy c-means clustering algorithm
Comput. Geosci.
(1984)

D. Wang et al.

An efficient iterative thresholding method for image segmentation

J. Comput. Phys.

(2017)

O. Ronneberger et al.

U-Net: Convolutional networks for biomedical image segmentation

International Conference on Medical Image Computing and Computer-Assisted Intervention

(2015)

M. Falcone et al.

A high-order scheme for image segmentation via a modified level-set method

SIAM J. Imaging Sci.

(2020)

M. Pan et al.

Vehicle license plate character segmentation

Int. J. Autom. Comput.

(2008)

M. Kass et al.

Snakes: active contour models

Int. J. Comput. Vis.

(1988)

D. Mumford et al.

Optimal approximations by piecewise smooth functions and associated variational problems

Commun. Pure Appl. Math.

(1989)

J. Yuan et al.

A continuous max-flow approach to Potts model

European Conference on Computer Vision

(2010)

A. Chien et al.

Frame based segmentation for medical images

Commun. Math. Sci.

(2011)

D. Wang et al.

The iterative convolution-thresholding method (ICTM) for image segmentation

Comput. Vis. Pattern Recognit.

(2019)

L. Ambrosio et al.

Approximation of functionals depending on jumps by elliptic functionals via $Γ$ -convergence

Commun. Pure Appl. Math.

(1990)

L. Ambrosio et al.

On the approximation of free discontinuity problems

Bollettino Della Unione Matematica Italiana B

(1992)

T.F. Chan et al.

Active contours without edges

IEEE Trans. Image Process.

(2001)

L.A. Vese et al.

A multiphase level set framework for image segmentation using the Mumford and Shah model

Int. J. Comput. Vis.

(2002)

L. Grady et al.

Reformulating and optimizing the Mumford-Shah functional on a graph–a faster, lower energy solution

European Conference on Computer Vision

(2008)

T. Pock et al.

A convex relaxation approach for computing minimal partitions

2009 IEEE Conference on Computer Vision and Pattern Recognition

(2009)

A. Chambolle et al.

A convex approach to minimal partitions

SIAM J. Imaging Sci.

(2012)

X. Cai et al.

A two-stage image segmentation method using a convex variant of the Mumford–Shah model and thresholding

SIAM J. Imaging Sci.

(2013)

R. Chan et al.

A two-stage image segmentation method for blurry images with poisson or multiplicative gamma noise

SIAM J. Imaging Sci.

(2014)

Y. Duan et al.

A two-stage image segmentation method using Euler’s elastica regularized Mumford-Shah model

2014 22nd International Conference on Pattern Recognition

(2014)

Q. Ma et al.

Image segmentation via mean curvature regularized Mumford-Shah model and thresholding

Neural Process. Lett.

(2018)

X. Cai et al.

Linkage between piecewise constant Mumford–Shah model and Rudin–Osher–Fatemi model and its virtue in image segmentation

SIAM J. Sci. Comput.

(2019)

Cited by (35)

A fractional-order image segmentation model with application to low-contrast and piecewise smooth images[Formula presented]
2024, Computers and Mathematics with Applications
In this paper, we propose a two-stage image segmentation model based on structure tensor and fractional-order regularization. In the first stage, we use the fractional-order regularization to approximate the Hausdorff measure of the Mumford-Shah (MS) model. The existence and uniqueness of the solution is proved and the alternating direction implicit (ADI) scheme is used to find the solution of the modified MS model. In the second stage, a thresholding is used to induce the segmentation of the target. The superior performances of the proposed model are demonstrated by some comparative experimental results with several state-of-art methods.
Image Segmentation Based on the Hybrid Bias Field Correction
2023, Applied Mathematics and Computation
Image segmentation is the foundation for analyzing and understanding high-level images. How to effectively segment an intensity inhomogeneous image into several meaningful regions in terms of human visual perception and ensure that the segmented regions are consistent at different resolutions is still a very challenging task. In order to describe the structure information of the intensity inhomogeneous efficiently, this paper proposes a novel hybrid bias field correction model by decoupling the multiplicative bias field and the additive bias field. These kinds of bias fields are assumed to be smooth, so can employ the Sobolev space $W^{1, 2}$ to feature them and use a constraint to the multiplicative bias field. Since the proposed model is a constrained optimization problem, we use the Lagrangian multiplier method to transform it into an unconstrained optimization problem, and then the alternating direction method can be used to solve it. In addition, we also discuss some mathematical properties of our proposed model and algorithm. Numerical experiments on the natural images and the medical images demonstrate performance improvement over several state-of-the-art models.
Multi-phase image segmentation by the Allen–Cahn Chan–Vese model
2023, Computers and Mathematics with Applications
This paper proposes an Allen–Cahn Chan–Vese model to settle the multi-phase image segmentation. We first integrate the Allen–Cahn term and the Chan–Vese fitting energy term to establish an energy functional, whose minimum locates the segmentation contour. The subsequent minimization process can be attributed to variational calculation on fitting intensities and the solution approximation of several Allen–Cahn equations, wherein n Allen–Cahn equations are enough to partition $m = 2^{n}$ segments. The derived Allen–Cahn equations are solved by efficient numerical solvers with exponential time integrations and finite difference space discretization. The discrete maximum bound principle and energy stability of the proposed numerical schemes are proved. Finally, the capability of our segmentation method is verified in various experiments for different types of images.
General nonconvex total variation and low-rank regularizations: Model, algorithm and applications
2022, Pattern Recognition
Total Variation and Low-Rank regularizations have shown significant successes in machine learning, data mining, and image processing in past decades. This paper develops the general nonconvex composite regularized model, which contains previous regularizers and motivates novel ones. Although the classical Alternating Direction Methods of Multiplier (ADMM) algorithm is applicable for this model, the nonconvexity of the problem and the complicacy of choosing the parameters increase the difficulty in the use of ADMM. Thus, by the penalty method, we propose the Alternating Minimization (AM) algorithm, whose convergence results are proved under mild assumptions. The proposed model and algorithm are applied to the image restoration problem. Numerical results demonstrate the efficiency of our model and algorithm.
Learning multi-level structural information for small organ segmentation
2022, Signal Processing
Deep neural networks have achieved great success in medical image segmentation problems such as liver, kidney, the accuracy of which already exceeds the human level. However, small organ segmentation (e.g., pancreas) is still a challenging task. To tackle such problems, extracting and aggregating multi-scale robust features become essentially important. In this paper, we develop a multi-level structural loss by integrating the region, boundary, and pixel-wise information to supervise feature fusion and precise segmentation. The novel pixel-wise term can provide information complementary to the region and boundary loss, which helps to discover more local information from the image. We further develop a multi-branch network with a saliency guidance module to better aggregate the three levels of features. The coarse-to-fine segmentation architecture is adopted to use the prediction on the coarse stage to obtain the bounding box for the fine stage. Comprehensive evaluations are performed on three benchmark datasets, i.e., the NIH pancreas, ISICDM pancreas, and MSD spleen dataset, showing that our models can achieve significant increases in segmentation accuracy compared to several state-of-the-art pancreas and spleen segmentation methods. Furthermore, the ablation study demonstrates the multi-level structural features help both the training stability and the convergence of the coarse-to-fine approach.
An ℓ<inf>0</inf>-overlapping group sparse total variation for impulse noise image restoration
2022, Signal Processing: Image Communication
Citation Excerpt :
One of the regularization functions proposed was the total variation (TV) norm. It was first introduced by Rudin et al. in [7] which has successfully used to solve denoising [8], deblurring [9], segmentation [10], and superresolution [11], due to its ability to allows sharp recovery that preserves edges in the restored image [12]. However, in the presence of noise in piecewise affine signal regions, TV restoration suffers the staircase artifacts [13,14].
Total variation (TV) based methods are effective models in image restoration. For eliminating impulse noise, an effective way is to use the $ℓ_{1}$ -norm total variation model. However, the TV image restoration always yields staircase artifacts, especially in high-density noise levels. Additionally, the $ℓ_{1}$ -norm tends to over penalize solutions and is not robust to outlier characteristics of impulse noise. In this paper, we propose a new total variation model to effectively remove the staircase effects and eliminate impulse noise. The proposed model uses the $ℓ_{0}$ -norm data fidelity to effectively remove the impulse noise while the overlapping group sparse total variation (OGSTV) acts as a regularizer to eliminate the staircase artifacts. Since the proposed method requires solving an $ℓ_{0}$ -norm and an OGSTV optimization problem, a formulation using the mathematical program with equilibrium constraints (MPEC) and the majorization–minimization (MM) method are respectively used together with the alternating direction method of multipliers (ADMM). Experiments demonstrate that our proposed model performs better than several state-of-the-art algorithms such as the $ℓ_{1}$ total generalized variation, $ℓ_{0}$ total variation, and the $ℓ_{1}$ overlapping group sparse total variation in terms of the peak signal-to-noise ratio (PSNR) and the structural similarity index measure (SSIM).

View all citing articles on Scopus

View full text

Adaptive total variation based image segmentation with semi-proximal alternating minimization

Highlights

Abstract

Introduction

Section snippets

The proposed model

Algorithm

Preparation for numerical experiments

Image segmentation with different levels of noise and blur

Conclusions

CRediT authorship contribution statement

Declaration of Competing Interest

Acknowledgment

Expert Syst. Appl.

Pattern Recognit.

Pattern Recognit.

Physica D

Signal Process.

Signal Process.

Signal Process.

Appl. Math. Comput.

Comput. Math. Appl.

Comput. Geosci.

J. Comput. Phys.

U-Net: Convolutional networks for biomedical image segmentation

International Conference on Medical Image Computing and Computer-Assisted Intervention

A high-order scheme for image segmentation via a modified level-set method

SIAM J. Imaging Sci.

Vehicle license plate character segmentation

Int. J. Autom. Comput.

Snakes: active contour models

Int. J. Comput. Vis.

Optimal approximations by piecewise smooth functions and associated variational problems

Commun. Pure Appl. Math.

A continuous max-flow approach to Potts model

European Conference on Computer Vision

Frame based segmentation for medical images

Commun. Math. Sci.

The iterative convolution-thresholding method (ICTM) for image segmentation

Comput. Vis. Pattern Recognit.

Approximation of functionals depending on jumps by elliptic functionals via Γ-convergence

Commun. Pure Appl. Math.

On the approximation of free discontinuity problems

Bollettino Della Unione Matematica Italiana B

Active contours without edges

IEEE Trans. Image Process.

A multiphase level set framework for image segmentation using the Mumford and Shah model

Int. J. Comput. Vis.

Reformulating and optimizing the Mumford-Shah functional on a graph–a faster, lower energy solution

European Conference on Computer Vision

A convex relaxation approach for computing minimal partitions

2009 IEEE Conference on Computer Vision and Pattern Recognition

A convex approach to minimal partitions

SIAM J. Imaging Sci.

A two-stage image segmentation method using a convex variant of the Mumford–Shah model and thresholding

SIAM J. Imaging Sci.

A two-stage image segmentation method for blurry images with poisson or multiplicative gamma noise

SIAM J. Imaging Sci.

A two-stage image segmentation method using Euler’s elastica regularized Mumford-Shah model

2014 22nd International Conference on Pattern Recognition

Image segmentation via mean curvature regularized Mumford-Shah model and thresholding

Neural Process. Lett.

Linkage between piecewise constant Mumford–Shah model and Rudin–Osher–Fatemi model and its virtue in image segmentation

SIAM J. Sci. Comput.

Approximation of functionals depending on jumps by elliptic functionals via $Γ$ -convergence