Reduced set density estimator for object segmentation based on shape probabilistic representation

doi:10.1016/j.jvcir.2012.07.006

Journal of Visual Communication and Image Representation

Volume 23, Issue 7, October 2012, Pages 1085-1094

https://doi.org/10.1016/j.jvcir.2012.07.006 Get rights and content

Abstract

In this paper, a nonparametric statistical shape model based on shape probabilistic representation is proposed for object segmentation. Given a set of training shapes, Cremers et al.’s probabilistic method is adopted to represent the shape, and then principal components analysis (PCA) on shape probabilistic representation is computed to capture the variation of the training shapes. To encode complex shape variation in training set, reduced set density estimator is used to model nonlinear shape distributions in a finite-dimensional subspace. This statistical shape prior is integrated to convex segmentation functional to guide the evolving contour to the object of interest. In addition, in contrast to the commonly used signed distance functions, PCA on shape probabilistic representation needs less number of eigenmodes to capture certain details of the training shapes. Numerical experiments show promising results and the potential of the model for object segmentation.

Highlights

► A nonparametric statistical shape model for object segmentation is proposed. ► PCA on shape probabilistic representation needs less number of eigenmodes than SDF. ► RSDE can provide a high-accuracy and low-cost density estimator.

Introduction

Object segmentation is a fundamental task in image processing and computer vision. Its essential goal is to extract desired objects from the given images. Since the object and background may exhibit very similar intensity characteristics in numerous real-world applications, it is normally not enough to only use the low-level information of the images, such as intensity, color or texture for segmentation, especially when misleading information due to occlusion, clutter and noises exist in the input images. This naturally leads to a need for integrating prior knowledge such as shape information into the segmentation process in order to improve segmentation results. In this paper, by assuming that prior knowledge given by a set of training shapes of expected objects, we focus on the problem of how to exploit such shape priors for object segmentation.

Level set methods were introduced by Osher and Sethian [1]. Since such methods allow implicit representation of the evolving object boundary and automatic changes of its topology, level set methods have become increasingly popular for image segmentation [2], [3]. Recently, to segment images of low quality or with missing data, level set based variational approaches have gained significant attention toward the integration of shape prior into the image segmentation processes [4], [5], [6], [7], [8], [9], [10], [11], [12], [13]. Almost all these works can be considered as a linear combination of two terms: a data-driven term and a shape constraint term. Geometric active contours model [14] and Chan-Vese’s model [15] have become two popular data-driven terms to guide the motion of the active contour. There are two ways to define the shape constraint term. One is commonly defined by an explicit dissimilarity measure between the evolving contour and a given prior contour, and the other is to estimate a statistical distribution from training shapes to guide the evolving contour to the most likely shape of the estimated distribution. Given a set of training shapes, one may impose simple or more complicated distribution functions such as uniform distribution [7], Gaussian distribution [16], or non-parametric estimator [17] to improve segmentation results in the presence of noise or occlusion. In applications, the distribution of training shapes is generally not uniform distribution or Gaussian distribution due to a large variability of shape. Kernel density estimation (KDE) is an efficient approach to model nonlinear distributions of training shapes [12], [13]. In this technique, the density function is estimated by a sum of kernel functions. The kernel number is equal to the size of the training data. When the training data set is very large, the KDE suffers from high computational cost and becomes intractable for subsequent use (e.g., in a real-time applications). Reduced set density estimator (RSDE) was proposed by Girolami and He [18] to solve the above problem by providing a kernel density estimator which employs a small subset of the available data sample to provide similar levels of performance.

Shape is represented implicitly by signed distance function (SDF), and can be easily integrated into level set variational methods as a shape constraint term. These representations have gained much popularity in recent years [4], [5], [6], [7], [8], [9], [10], [11], [12], [13]. The idea is to represent the shape contour C by embedding it in a higher dimension level set functional ϕ, as follows: $ϕ (x) = \{\begin{matrix} Dist (x, C), & x \in in (C) \\ 0, & x \in C \\ - Dist (x, C), & x \in out (C) \end{matrix},$ where Dist(x, C) denotes the Euclidean distance from x to the closet point on C, and out(C) and in(C) represent the regions outside and inside of the contour C, respectively. The contour C can be reconstructed from such representation by taking its zero level set C = {x|ϕ(x) = 0}. Hence, any shape in the plane corresponds to a unique SDF. This shape representation is consistent with the level set framework, and has its advantages since parameterization free and easy handling of topological changes. However, the use of principal component analysis (PCA) on a set of SDF embedding a set of sample shapes has two drawbacks:

1.
The space of SDF is not a linear space, e.g., the mean shape and linear combinations of sample shapes are typically no longer SDF. Most existing works only consider very similar shape priors.
2.
While the first few principal components are used to capture the most variation on the space of SDF, they will not necessarily capture the variation on the space of the embedded shape contours. Therefore, in contrast to PCA on explicit shape contours, PCA on SDF need to include a larger number of eigenmodes in order to capture certain details of the sample shapes.

Recently, there has been significant research exploring methods to solve these non-convex problems by using convex relaxation methods [19], [20], [21], [22]. In [20], Cremers et al. proposed a shape probabilistic representation (SPR) by relaxing the binary constraint and allowing the binary function to take on values in the interval [0, 1], defined as a mapping $q = Ω \to [0, 1],$ that assigns to every pixel x of the shape domain $Ω \subset R^{2}$ the probability that this pixel is inside the given shape. In traditional definition of shape, pixels are part of the shape, and only take values 1 (members) or 0 (non-members). It can be described as q:Ω → {0, 1}. Based on the probabilistic definitions, it is easy to get the shape region of the object $(q)_{τ} = {x | q (x) ⩾ τ}$ and the background of image $(q)_{τ}^{C} = 1 - (q)_{τ}$ by selecting a τ ϵ [0, 1]. In the experiment, τ is chosen as 0.5. It was shown that the space Q of all probabilistic shapes forms a convex set, and the space spanned by a few training shapes χ = {q₁, q₂, ⋯q_N} forms a convex subset. Arbitrary convex combinations of the set again correspond to a valid shape. For example, the mean $μ (x) = \frac{1}{N} \sum_{i = 1}^{N} q_{i} (x), μ (x) \in [0, 1]$ is a function which assigns to each point x ϵ Ω the average of all probabilities (Fig. 1). This shape probabilistic representation leads to convex segmentation functional on convex shape spaces.

In this paper, we are building up on the above developments and propose two contributions in order to overcome the discussed limitations:

1.
We use probabilistic representation to model the shape prior, and then compute PCA on shape probabilistic representation to capture the variation of the training shapes. In contrast to the commonly used signed distance functions, PCA on shape probabilistic representation needs less number of eigenmodes to capture certain details of the training shapes.
2.
RSDE is used to learn the shape prior information in the low-dimensional subspace, and then is integrated to convex segmentation functional to restrict the segmenting contour to a manifold of training shapes during the segmentation process. In contrast to existing statistic approaches of shape priors (uniform distribution, Gaussian distribution, KDE), RSDE can provide a high accuracy estimator of a probability density function which employs a small percentage of the available data sample.

The remainder of this paper is organized as follows. In Section 2, the shape prior based level set segmentation and RSDE are described. RSDE based on shape probabilistic representation for variational segmentation is presented in Section 3. The implementation procedure is proposed in Section 4 to minimize the energy. Experimental results are provided in Section 5, and the conclusions are given in Section 6.

Section snippets

Shape prior-based level set segmentation

Shape prior based image segmentation, which incorporates the shape prior information into the segmenting process, makes the final result more robust, accurate and efficient. In general, most level set segmentation models can be cast as variational minimization problem as follows $\min_{ϕ} {E_{i} (ϕ) + γ E_{s} (ϕ)},$ where ϕ is the level set function, and γ > 0 determines the relative importance of the two energy terms. E_i is a data-driven term which aims at driving the segmenting curve to the object boundaries, and E

A compact low-dimensional representation

It is well-known that statistical shape models can be performed more reliably and more efficiently in low-dimensional representations. For this reason, principal components analysis (PCA) of shape probabilistic representation (SPR) is used to capture the variations of shapes while removing redundant information. Using PCA, we compute the eigenmodes of the shape set χ = {q₁, q₂, ⋯q_N}. We use only a subspace of χ spanned by the first $n ⩽ N$ eigenmodes {ψ₁, ψ₂, ⋯ψ_n}. The value of n must be chosen large

Energy minimization

In practice, the edge indicator term ∫ _Ω|∇q_α(T_ρ(x))|dx in (16) can be neglected, since the proposed model is directly optimized in the linear subspace spanned by the principal components. Then, we get the following energy functional: $E (α, ρ) = \int_{Ω} (R_{o} (u) - R_{b} (u)) q_{α} (T_{ρ} (x)) dx - γ \log (\sum_{i = 1}^{N} ω_{i} \exp (\frac{(α - α_{i})^{2}}{2 σ^{2}})) .$

As for the region descriptors, several descriptors [13], [15], [24], [25] may be considered. In this work, we choose the following as the region descriptors. $R_{o} (u) = - \log p_{o} (u), R_{b} (u) = - \log p_{b} (u),$ where p_o(u) and p_b(u

Tracking a walking person

In this section, we apply the proposed model to track a walking person. Here four different data sets we used. One is used as training data, and the rest three data sets are used for testing purpose. The training set consists of 151 shapes from a sequence (showing a different person walking at a different pace), which is publicly available [20]. The RSDE is applied to this training set, and reduces the number of the kernels to 26 (near 17 percent of the original sample). Given an approximate

Conclusion

In this paper, a reduced set density estimator model based on shape probabilistic representation is constructed for image segmentation. If the sample size is very large, RSDE is an efficient approach to model nonlinear distributions of training shapes by providing a kernel density estimator which employs a small subset of the available data sample to provide similar levels of performance. In contrast to the commonly used signed distance functions, shape probabilistic representation can capture

Acknowledgments

This work was supported by a National Key Basic Research Project of China (973 Program No. 2012CB316400) and NSFC (No. 60872069).

References (36)

S. Osher et al.
Fronts propagation with curvature dependent speed: algorithms based on Hamilton–Jacobi formulations
Journal of Computational Physics
(1988)
D. Cremers et al.
Shape statistics in kernel space for variational image segmentation
Pattern Recognition
(2003)
S. Osher et al.
Level Set Methods and Dynamic Implicit Surfaces
(2002)
A. Tsai et al.
Curve evolution implementation of the Mumford–Shah functional for image segmentation, denoising, interpolation, and magnification
IEEE Trans. Image Processing
(2001)
M. Leventon et al.
Statistical shape influence in geodesic active contours
IEEE International Conference on Computer Vision and Pattern Recognition
(2000)
S. Dambreville et al.
A framework for image segmentation using shape models and kernel space shape priors
IEEE Transactions on Pattern Analysis and Machine Intelligence
(2008)
M. Rousson, N. Paragios, Shape priors for level set representations, in: Proceedings of the European Conference on...
A. Tsai et al.
A shape-based approach to the segmentation of medical imagery using level sets
IEEE Transactions on Medical Imaging
(2003)
D. Cremers et al.
Towards recognition-based variational segmentation using shape priors and dynamic labeling
International Conference on Scale Space Methods in Computer Vision
(2003)
D. Cremers et al.
A multiphase dynamic labeling model for variational recognition-driven image segmentation”
International Journal of Computer Vision
(2006)

T. Chan et al.

Level set based shape prior segmentation

IEEE International Conference Computer Vision and Pattern Recognition

(2005)

D. Cremers

Nonlinear dynamical shape priors for level set segmentation

Journal of Scientific Computing

(2008)

D. Cremers et al.

Kernel density estimation and intrinsic alignment for shape priors in level set segmentation

International Journal of Computer Vision

(2006)

M. Rousson et al.

Efficient kernel density estimation of shape and intensity priors for level set segmentation

International Conference on Medical Image Computing and Computer-Assisted Intervention

(2005)

V. Caselles et al.

Geodesic active contours

International Journal of Computer Vision

(1997)

T. Chan et al.

Active contours without edges

IEEE Transactions on Image Processing

(2001)

M. Rousson et al.

Implicit active shape models for 3d segmentation in MRI imaging

MICCAI

(2004)

M. Girolami et al.

Probability density estimation from optimally condensed data samples

IEEE Transactions on Pattern Analysis and Machine Intelligence

(2003)

Cited by (7)

Mammogram image visual enhancement, mass segmentation and classification
2015, Applied Soft Computing Journal
Citation Excerpt :
Variant feature transformation methods are mostly effective in segmenting micro-calcification along with mammogram masses and have been applied in [8,59,26,54,6,62,36] [57,44,55,1]. Other image segmentation methods that deal with color images include the work of [17,63,13]. Various mammogram image enhancement and segmentation techniques were proposed in literature such as the state-of-the-art approaches that include the work in [2,32,24,31].
Mammography is the most effective technique for breast cancer screening and detection of abnormalities. However, early detection of breast cancer is dependent on both the radiologist's ability to read mammograms and the quality of mammogram images. In this paper, the researchers have investigated combining several image enhancement algorithms to enhance the performance of breast-region segmentation. The masses that appear in mammogram images are further analyzed and classified into four categories that include: benign, probable benign and possible malignant, probable malignant and possible benign, and malignant. The main contribution of this work is to reveal the optimal combination of various enhancement methods and to segment breast region in order to obtain better visual interpretation, analysis, and classification of mammogram masses to assist radiologists in making more accurate decisions. The experimental dataset consists of a total of more than 1300 mammogram images from both the King Hussein Cancer Center and Jordan Hospital. Results achieved tumor classification accuracy values of 90.7%. Moreover, the results showed a sensitivity of 96.2% and a specificity of 94.4% for the mass classifying algorithm. Radiologists from both institutes have acknowledged the results and confirmed that this work has lead to better visual quality images and that the segmentation and classification of tumors has aided the radiologists in making their diagnoses.
Ultrasound kidney segmentation with a global prior shape
2013, Journal of Visual Communication and Image Representation
Citation Excerpt :
Afterwards, a prior shape is incorporated in a GAC model [16,17]. More related works can be found in [18–21]. In this paper, we replace the Rayleigh noise statistics used for prostate ultrasound images (the application in [35]) with the Fisher–Tippett statistics which are accepted to be appropriate for kidney ultrasound.
In this paper, we focus on segmentation of ultrasound kidney images. Unlike previous work by using trained prior shapes, we employ a parametric super-ellipse as a global prior shape for a human kidney. The Fisher–Tippett distribution is employed to describe the grey level statistics. Combining the grey level statistics with a global character of a kidney shape, we propose a new active contour model to segment ultrasound kidney images. The proposed model involves two subproblems. One subproblem is to optimize the parameters of a super-ellipse. Another subproblem is to segment an ultrasound kidney image. An alternating minimization scheme is used to optimize the parameters of a super-ellipse and segment an image simultaneously. To segment an image fast, a convex relaxation method is introduced and the split Bregman method is incorporated to propose a fast segmentation algorithm. The efficiency of the proposed method is illustrated by numerical experiments on both simulated images and real ultrasound kidney images.
TRUS image segmentation with non-parametric kernel density estimation shape prior
2013, Biomedical Signal Processing and Control
Citation Excerpt :
However, most of the methods are based on the assumption that the training shapes form a Gaussian distribution, while non-Gaussian shape prior used in several medical image segmentation tasks demonstrates superior results [11,12]. On the other hand, many studies have been carried out to embed the shape prior into both the explicit active contour model (ACM) [13] and the implicit level sets models [14], using non-Gaussian shape prior [11,15–25]. However, very few of them are specially designed for ultrasound images, and the prostate segmentation procedure cannot favor from the advantages of these methods for the following reasons.
Due to noises, speckles, etc., automatic prostate segmentation is rather challenging, and using only low-level information such as intensity gradient is insufficient and unable to tackle the problem. In this paper, we propose an automatic prostate segmentation method combining intrinsic properties of TRUS images with the high-level shape prior information. First, intrinsic properties of TRUS images, such as the intensity transition near the prostate boundary as well as the speckle induced texture features obtained by Gabor filter banks, are integrated to deform the model to the target contour. These properties make our method insensitive to high gradient regions introduced by noises and speckles. Then, the preliminary segmentation is fine-tuned by the non-parametric shape prior, which is optimally distilled by non-parametric kernel density estimation as it can approximate arbitrary distributions. The refinement is along the direction of mean shift vector, and considerably strengthens the robustness of the method. The performance of our method is validated by experimental results. Compared with the state of the art, the accuracy and robustness of the method is quite promising, and the mean absolute distance is only 1.21 ± 0.85 mm.
Bayesian Robust Principal Component Analysis with Adaptive Singular Value Penalty
2020, Circuits, Systems, and Signal Processing
Implicit kernel sparse shape representation: A sparse-neighbors-based objection segmentation framework
2017, Journal of the Optical Society of America A: Optics and Image Science, and Vision
PCB CT image segmentation based on level set with shape prior
2016, Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics

View all citing articles on Scopus

View full text

Reduced set density estimator for object segmentation based on shape probabilistic representation

Abstract

Highlights

Introduction

Section snippets

Shape prior-based level set segmentation

A compact low-dimensional representation

Energy minimization

Tracking a walking person

Conclusion

Acknowledgments

Journal of Computational Physics

Pattern Recognition

Level Set Methods and Dynamic Implicit Surfaces

Curve evolution implementation of the Mumford–Shah functional for image segmentation, denoising, interpolation, and magnification

IEEE Trans. Image Processing

Statistical shape influence in geodesic active contours

IEEE International Conference on Computer Vision and Pattern Recognition

A framework for image segmentation using shape models and kernel space shape priors

IEEE Transactions on Pattern Analysis and Machine Intelligence

A shape-based approach to the segmentation of medical imagery using level sets

IEEE Transactions on Medical Imaging

Towards recognition-based variational segmentation using shape priors and dynamic labeling

International Conference on Scale Space Methods in Computer Vision

A multiphase dynamic labeling model for variational recognition-driven image segmentation”

International Journal of Computer Vision

Level set based shape prior segmentation

IEEE International Conference Computer Vision and Pattern Recognition

Nonlinear dynamical shape priors for level set segmentation

Journal of Scientific Computing

Kernel density estimation and intrinsic alignment for shape priors in level set segmentation

International Journal of Computer Vision

Efficient kernel density estimation of shape and intensity priors for level set segmentation

International Conference on Medical Image Computing and Computer-Assisted Intervention

Geodesic active contours

International Journal of Computer Vision

Active contours without edges

IEEE Transactions on Image Processing

Implicit active shape models for 3d segmentation in MRI imaging

MICCAI

Probability density estimation from optimally condensed data samples

IEEE Transactions on Pattern Analysis and Machine Intelligence