Statistical active grid for segmentation refinement

https://doi.org/10.1016/S0167-8655(01)00037-XGet rights and content

Abstract

We present a new statistical method, based on a deformable partition called “active grid” for the semi-supervised segmentation of an image composed of several homogeneous regions. This approach allows one to efficiently refine a rough pre-segmentation with a computing time below half a second for 256×256 images (on a standard 700 MHz PC).

Introduction

Since the work of Geman and Geman (1984), there has been a growing interest in statistical techniques for image segmentation. Recently, deformable models have been coupled with the statistical approach to provide efficient estimation and regularization of the contours. Some of these methods aim at segmenting a unique object in the scene (Staib and Duncan, 1992, Kervrann and Heitz, 1994, Storvik, 1994, Nguyen et al., 1992, Figueiredo et al., 2000), while others are able to segment the scene into several regions. For example, in (Zhu and Yuille, 1996), a global technique which allows region growing and competition has been proposed in order to segment complex images. We propose here, an analogous but simpler technique, which leads to a fast algorithm based on rigorous statistical criterion but which needs a more supervised approach. In a recent paper (Germain and Réfrégier, 1996), we have presented a method based on active contours (snakes) to estimate the contour of an object in a statistical framework. The contour deformation was driven by the optimization of a closed-form criterion that is optimal for given models of the scene. It has been shown that this technique is efficient when the edges are difficult to detect (Chenaud et al., 1999) and that it could be used to correct the bias observed in SAR edge location (Germain and Réfrégier, 2000). However, the main limitation of this approach was the assumption that the image is only made of two regions. If the background of the scene is not homogeneous, this assumption is violated and thus, the method can fail in this case.

In this letter, we propose an original statistical method to segment images made of several (more than two) homogeneous regions, when the number of regions as well as their approximate positions are known a priori. The aim is then to refine and regularize an initial rough segmentation with a deformable partition that we call “active grid”. Note that contour regularization is classical (Chenaud et al., 1999) and will not be detailed here. In the following, we will focus on two original points: the active grid and its fast implementation. In Section 2, we give the mathematical formulation of the problem and study the solution in the case where the pdf of the intensities belong to the exponential family. In Section 3, we address the implementation of the method and we show that a fast algorithm, analogous to the one proposed in (Chenaud et al., 1999) can be applied. Finally, some segmentation results with computing times are presented in Section 4.

Section snippets

Maximum likelihood estimation

In the following mathematical development, bold font symbols will denote vectors. Let us consider a scene s={s(x,y)}. This scene is modeled as a tessellation of R statistically independent, and simply connected regions. Note that we will not address the estimation of R and below, we will assume that this parameter is known a priori. In each region Ωr (r∈{1,2,…,R}), we assume that the pixel intensities are realizations of independent and identically distributed random variables with a pdf of

Topology of the grid

The active grid includes P nodes and R polygonal regions. This grid is described by two structures:

  • •

    one structure contains the spatial coordinates of the P nodes. This structure changes during the convergence;

  • •

    the other one is relative to the grid topology, i.e., the relationship between nodes and regions. This structure remains invariant during the convergence.

The topology of the grid is represented by an oriented, valued graph (Fig. 1). To each node in the grid corresponds a vertex in the

Results

In this section, we present segmentation results with approximative computing times to illustrate the performance of the active grid (Table 3). These results were obtained on a PC under Linux (Mandrake 7.0) with a 700 MHZ Pentium III processor and a 256 Mo RAM. The same optimization scheme (see Section 2.2) is applied to all images. The number of nodes is progressively increased in a three-step convergence: d=20 at the end of the first step and d=15 at the end of the second one.

In Fig. 4, an

Conclusion

In this letter, we have presented a new statistical method based on a deformable model for the semi-supervised segmentation of images into several homogeneous regions. This approach allows one to improve the accuracy of a rough initial segmentation. Thanks to a fast algorithm, a typical computing time of 400 ms for a 256×256 image has been obtained on a 700 MHz Pentium III PC. The adjunction of smoothness constraints to regularize the estimation of the contour is easy to introduce and can thus

Acknowledgments

This work was supported by the French Space Agency (CNES) which supplied the SAR data. The authors are grateful to Christophe Chesnaud for fruitful discussions.

References (14)

  • C. Chenaud et al.

    Statistical region snake-based segmentation adapted to different physical noise models

    IEEE Trans. Pattern Anal. Machine Intell.

    (1999)
  • M. Figueiredo et al.

    Unsupervised contour representation and estimation using b-splines and a minimum description length criterion

    IEEE Trans. Image Process.

    (2000)
  • R. Fjørtoft et al.

    An optimum multiedge detector for SAR image segmentation

    IEEE Trans. Geosci. Remote Sensing

    (1998)
  • S. Geman et al.

    Stochatic relaxation, Gibbs distribution and the Bayesian restoration of images

    IEEE Trans. Pattern Anal. Machine Intell.

    (1984)
  • O. Germain et al.

    Optimal snake-based segmentation of a random luminance target on a spatially disjoint background

    Opt. Lett.

    (1996)
  • O. Germain et al.

    On the bias of the likelihood ratio edge detector for SAR images

    IEEE Trans. Geosci. Remote Sensing

    (2000)
  • Kervrann, C., Heitz, F., 1994. A hierarchical statistical framework for the segmentation of deformable objects in image...
There are more references available in the full text version of this article.

Cited by (13)

  • Multi-component image segmentation in homogeneous regions based on description length minimization: Application to speckle, Poisson and Bernoulli noise

    2005, Pattern Recognition
    Citation Excerpt :

    Using a classical approach of region-based segmentation [1–5], we will consider that the image is made up of piecewise homogeneous regions whose pixel grey levels are spatially independent. Although this assumption may not be representative of many real-world images, it has been widely used in the past [2,6,5,3] since it can fairly represent some types of satellite images for example, and it can lead to simple techniques. Furthermore, the obtained partition of the image into homogeneous regions can be viewed as a first step preceding more complex processing tasks such as image understanding, object recognition, etc.

  • A symmetric scheme for building reconstruction from a couple of HR optical and SAR data

    2011, 2011 Joint Urban Remote Sensing Event, JURSE 2011 - Proceedings
  • Image models

    2011, Springer Topics in Signal Processing
  • Unsupervised segmentation based on Von Mises circular distributions for orientation estimation in textured images

    2011, Proceedings of SPIE - The International Society for Optical Engineering
View all citing articles on Scopus
View full text