Statistical active grid for segmentation refinement

doi:10.1016/S0167-8655(01)00037-X

Pattern Recognition Letters

Volume 22, Issue 10, August 2001, Pages 1125-1132

https://doi.org/10.1016/S0167-8655(01)00037-X

Abstract

We present a new statistical method, based on a deformable partition called “active grid”, for the semi-supervised segmentation of an image composed of several homogeneous regions. This approach allows one to efficiently refine a rough pre-segmentation with a computing time below half a second for 256×256 images (on a standard 700 MHz PC).

Introduction

Since the work of Geman and Geman (1984), there has been a growing interest in statistical techniques for image segmentation. Recently, deformable models have been coupled with the statistical approach to provide efficient estimation and regularization of the contours. Some of these methods aim at segmenting a single object in the scene (Staib and Duncan, 1992; Kervrann and Heitz, 1994; Storvik, 1994; Nguyen et al., 1992; Figueiredo et al., 2000), while others are able to segment the scene into several regions. For example, Zhu and Yuille (1996) proposed a global technique that allows region growing and competition in order to segment complex images. We propose here an analogous but simpler technique, which leads to a fast algorithm based on a rigorous statistical criterion but which needs a more supervised approach. In a recent paper (Germain and Réfrégier, 1996), we presented a method based on active contours (snakes) to estimate the contour of an object in a statistical framework. The contour deformation was driven by the optimization of a closed-form criterion that is optimal for given models of the scene. It has been shown that this technique is efficient when the edges are difficult to detect (Chenaud et al., 1999) and that it can be used to correct the bias observed in SAR edge location (Germain and Réfrégier, 2000). However, the main limitation of this approach was the assumption that the image is made of only two regions. If the background of the scene is not homogeneous, this assumption is violated and the method can fail.

In this letter, we propose an original statistical method to segment images made of several (more than two) homogeneous regions, when the number of regions as well as their approximate positions are known a priori. The aim is then to refine and regularize an initial rough segmentation with a deformable partition that we call “active grid”. Note that contour regularization is classical (Chenaud et al., 1999) and will not be detailed here. In the following, we will focus on two original points: the active grid and its fast implementation. In Section 2, we give the mathematical formulation of the problem and study the solution in the case where the pdf of the intensities belongs to the exponential family. In Section 3, we address the implementation of the method and we show that a fast algorithm, analogous to the one proposed in (Chenaud et al., 1999), can be applied. Finally, some segmentation results with computing times are presented in Section 4.
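To make the idea of a likelihood-based partition criterion concrete, here is a minimal sketch under assumptions that are not taken from this excerpt: intensities in each region are taken to be independent Gaussian samples (one member of the exponential family) with unknown mean and variance, and the partition is given as a pixel label map rather than a polygonal grid. Under that assumption, maximizing the likelihood over the region parameters amounts, up to constants, to minimizing the sum over regions of N_r log(σ̂_r²). The function name `gaussian_partition_criterion` and the synthetic test scene are hypothetical, and the criterion actually used in the letter may differ.

```python
import numpy as np

def gaussian_partition_criterion(image, labels):
    """Sum over regions of N_r * log(ML variance); lower means more likely.

    Assumes i.i.d. Gaussian intensities per region (illustrative assumption).
    """
    image = np.asarray(image, dtype=float)
    labels = np.asarray(labels)
    total = 0.0
    for r in np.unique(labels):
        pixels = image[labels == r]
        var = pixels.var()  # ML variance estimate (normalized by N_r)
        total += pixels.size * np.log(max(var, 1e-12))  # guard against log(0)
    return total

# Synthetic two-region scene: left half has mean 0, right half has mean 5.
rng = np.random.default_rng(0)
image = rng.normal(0.0, 1.0, size=(32, 32))
image[:, 16:] += 5.0

cols = np.arange(32)
true_labels = np.tile((cols >= 16).astype(int), (32, 1))   # boundary at column 16
rough_labels = np.tile((cols >= 12).astype(int), (32, 1))  # misplaced boundary

j_true = gaussian_partition_criterion(image, true_labels)
j_rough = gaussian_partition_criterion(image, rough_labels)
print(j_true < j_rough)
```

The misplaced boundary mixes pixels from both populations into one region, which inflates that region's estimated variance and therefore the criterion; moving the boundary toward its true position lowers it, which is the sense in which a rough pre-segmentation can be refined.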

Section snippets

Maximum likelihood estimation

In the following mathematical development, bold font symbols will denote vectors. Let us consider a scene $\mathbf{s} = \{s(x,y)\}$. This scene is modeled as a tessellation of R statistically independent and simply connected regions. We will not address the estimation of R; below, we assume that this parameter is known a priori. In each region $\Omega_r$ (r∈{1,2,…,R}), we assume that the pixel intensities are realizations of independent and identically distributed random variables with a pdf of

Topology of the grid

The active grid includes P nodes and R polygonal regions. This grid is described by two structures:

• one structure contains the spatial coordinates of the P nodes. This structure changes during the convergence;
• the other one is relative to the grid topology, i.e., the relationship between nodes and regions. This structure remains invariant during the convergence.
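The separation between the two structures can be sketched as follows. This is only an illustration under stated assumptions: the class names `GridTopology` and `ActiveGrid`, the method names, and the choice to store each region as an ordered tuple of node indices are hypothetical simplifications, not the oriented, valued graph used in the letter.

```python
from dataclasses import dataclass
import numpy as np

@dataclass(frozen=True)
class GridTopology:
    """Node-to-region relationships; frozen because they never change."""
    regions: tuple  # regions[r] = node indices of polygon r, in boundary order

class ActiveGrid:
    def __init__(self, nodes, topology):
        self.nodes = np.asarray(nodes, dtype=float)  # shape (P, 2), updated during convergence
        self.topology = topology                     # invariant during convergence

    def move_node(self, p, dx, dy):
        """Deform the grid by displacing node p; the topology is untouched."""
        self.nodes[p] += (dx, dy)

    def region_area(self, r):
        """Polygon area of region r via the shoelace formula."""
        idx = list(self.topology.regions[r])
        x, y = self.nodes[idx, 0], self.nodes[idx, 1]
        return 0.5 * abs(np.dot(x, np.roll(y, -1)) - np.dot(y, np.roll(x, -1)))

# Two unit squares sharing the edge between nodes 1 and 4 (P = 6, R = 2).
topology = GridTopology(regions=((0, 1, 4, 5), (1, 2, 3, 4)))
grid = ActiveGrid([(0, 0), (1, 0), (2, 0), (2, 1), (1, 1), (0, 1)], topology)

grid.move_node(1, 0.5, 0.0)  # shift a shared node to the right
print(grid.region_area(0), grid.region_area(1))  # 1.25 0.75
```

Because node 1 belongs to both regions, moving it changes both polygons at once while the total area is preserved, so the partition stays a partition without any bookkeeping on the topology side.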

The topology of the grid is represented by an oriented, valued graph (Fig. 1). To each node in the grid corresponds a vertex in the

Results

In this section, we present segmentation results with approximate computing times to illustrate the performance of the active grid (Table 3). These results were obtained on a PC running Linux (Mandrake 7.0) with a 700 MHz Pentium III processor and 256 MB of RAM. The same optimization scheme (see Section 2.2) is applied to all images. The number of nodes is progressively increased in a three-step convergence: d=20 at the end of the first step and d=15 at the end of the second one.

In Fig. 4, an

Conclusion

In this letter, we have presented a new statistical method based on a deformable model for the semi-supervised segmentation of images into several homogeneous regions. This approach allows one to improve the accuracy of a rough initial segmentation. Thanks to a fast algorithm, a typical computing time of 400 ms for a 256×256 image has been obtained on a 700 MHz Pentium III PC. The addition of smoothness constraints to regularize the estimation of the contour is easy to introduce and can thus

Acknowledgments

This work was supported by the French Space Agency (CNES) which supplied the SAR data. The authors are grateful to Christophe Chesnaud for fruitful discussions.

References (14)

C. Chenaud et al. Statistical region snake-based segmentation adapted to different physical noise models. IEEE Trans. Pattern Anal. Machine Intell. (1999)
M. Figueiredo et al. Unsupervised contour representation and estimation using B-splines and a minimum description length criterion. IEEE Trans. Image Process. (2000)
R. Fjørtoft et al. An optimum multiedge detector for SAR image segmentation. IEEE Trans. Geosci. Remote Sensing (1998)
S. Geman et al. Stochastic relaxation, Gibbs distribution and the Bayesian restoration of images. IEEE Trans. Pattern Anal. Machine Intell. (1984)
O. Germain et al. Optimal snake-based segmentation of a random luminance target on a spatially disjoint background. Opt. Lett. (1996)
O. Germain et al. On the bias of the likelihood ratio edge detector for SAR images. IEEE Trans. Geosci. Remote Sensing (2000)
Kervrann, C., Heitz, F., 1994. A hierarchical statistical framework for the segmentation of deformable objects in image...

There are more references available in the full text version of this article.

Cited by (13)

Smooth contour coding with minimal description length active grid segmentation techniques
2011, Pattern Recognition Letters
We analyze the influence of the contour coding term in segmentation techniques based on active grids and on the minimum description length (MDL) principle. These segmentation techniques have been developed up to now with a contour coding term adapted to polygonal objects. However, this approach can lead to degraded segmentation results for smooth contours of objects which can be observed for example in geoscience, medicine or microscopy. We demonstrate that an appropriate choice of the contour coding term can improve segmentation results with MDL active grid approaches in the presence of regions with smooth boundaries. This improvement opens a large class of application domains and still allows one to obtain low computational time.
Multi-component image segmentation in homogeneous regions based on description length minimization: Application to speckle, Poisson and Bernoulli noise
2005, Pattern Recognition
Citation Excerpt :
Using a classical approach of region-based segmentation [1–5], we will consider that the image is made up of piecewise homogeneous regions whose pixel grey levels are spatially independent. Although this assumption may not be representative of many real-world images, it has been widely used in the past [2,6,5,3] since it can fairly represent some types of satellite images for example, and it can lead to simple techniques. Furthermore, the obtained partition of the image into homogeneous regions can be viewed as a first step preceding more complex processing tasks such as image understanding, object recognition, etc.
In this article, a minimum description length (MDL) criterion adapted to independent multi-component image segmentation into homogeneous regions is proposed. This approach, based on a deformable polygonal grid, allows us to segment noisy multi-component images perturbed with spatially independent speckle, Poisson or Bernoulli noise. The advantages of using such a multi-component approach rather than a mono-component one is demonstrated on synthetic and real images. This segmentation method is also applicable to multi-component images whose components do not follow the same noise statistics or have not been previously registered.
Multiphase SAR image segmentation with $G^0$-statistical-model-based active contours
2013, IEEE Transactions on Geoscience and Remote Sensing
A symmetric scheme for building reconstruction from a couple of HR optical and SAR data
2011, 2011 Joint Urban Remote Sensing Event, JURSE 2011 - Proceedings
Image models
2011, Springer Topics in Signal Processing
Unsupervised segmentation based on Von Mises circular distributions for orientation estimation in textured images
2011, Proceedings of SPIE - The International Society for Optical Engineering

