Adaptive multi-scale segmentation of surface data using unsupervised learning of seed positions

https://doi.org/10.1016/j.engappai.2011.03.004Get rights and content

Abstract

This paper presents a method for multi-scale segmentation of surface data using scale-adaptive region growing. The proposed segmentation algorithm is initiated by an unsupervised learning of optimal seed positions through the surface attribute clustering with a two-criterion score function. The seeds are selected as consecutive local maxima of the clustering map, which is computed by an aggregation of the local isotropic contrast and local variance maps. The proposed method avoids typical segmentation errors caused by an inappropriate choice of seed points and thresholds used in the region-growing algorithm. The scale-adaptive threshold estimate is based on the image local statistics in the neighborhoods of seed points. The performance of this method was evaluated on LiDAR surface images.

Introduction

Surface segmentation for automatic or interactive data interpretation is encountered in many engineering applications, which deal with three-dimensional (3D) data obtained by scanning objects of interest. A typical application example, which is the subject of this paper, belongs to the LiDAR (Light Detection and Ranging) technology in remote sensing. This technology can provide a high-resolution representation of object and terrain surfaces in the form of a digital surface model (DSM), which is the height map of 3D LiDAR data (Campbell, 2002). Consequently, digital image processing techniques such as the image segmentation can be applied to segment 3D objects based on their detailed DSM images. The possibility of automatic (remote) height measurements makes this technique attractive for engineering projects connected with size measurements of particular objects of interest. Another example of surface segmentation is the automatic segmentation of range images, which are reconstructed from distance data of range scanners (Bab-Hadiashar and Gheissari, 2006). A different potential application is the industrial and medical diagnostic imaging based on ultrasound images of objects surfaces. The method presented in this paper focuses on LiDAR image segmentation of mostly natural scenes such as forests, parks and residential areas with buildings and trees.

The method of seeded region growing stands out from the multitude of image (surface) segmentation techniques due to its simplicity, time-efficient implementation and relatively good segmentation results (Adams and Bischof, 1994). Its theoretical foundation lies in the data clustering approach to image segmentation (Pal and Pal, 1993). The direct application of data clustering to image segmentation, however, does not take into account the spatial adjacency of regions and pixel connectivity, and it often results in noisy and sparse segmented regions. Advantageously, the region growing procedure as an iterative label-assignment algorithm directly analyzes region adjacency and connectivity of pixels belonging to the same region. Several algorithmic versions of the seeded region growing exist and they differ by the label-assignment procedure. For instance, the popular watershed segmentation algorithm can also be classified as a region-growing method (Beucher and Meyer, 1993). The weak side of the region growing approach is the seed-point selection, which is interactive or heuristic if based on a fully automatic procedure. Another problem, which may appear when applying a classical clustering approach to regional segmentation, is the presence of multiple regions (objects of interest) with close values of image attributes, e.g., the height attribute in the case of LiDAR data. They will be labeled as a single region, although in reality they are different isolated regions corresponding to distinct objects of interest.

Some modifications of the region-growing method were made in order to automatically select seed points. One of the existing methods consists in selecting seed locations at the regional borders using an edge detection operator (Fan et al., 2001). This procedure requires a heuristic and time-consuming post-processing to remove irrelevant seeds or merge over-segmented regions. In another similar method, a probability map is constructed to select seeds as the local minima points of an edge probability score (Mičušík and Hanbury, 2006). However, the seed selection by this method allocates multiple seeds at quasi-regular positions and requires sophisticated and time-consuming post-processing algorithms to select the optimal segmentation case. As opposed to the edge-area selection, a seed selection scheme using centers of flat areas has been proposed in Brunner and Soille (2007). A pre-segmentation by thresholding is used in the segmentation algorithm (Shan et al., 2008) to score the relevance of the obtained regions. This method is based on several existing image analysis methods, which are also present in the probability map method (Mičušík and Hanbury, 2006). A novel image segmentation scheme based on case-based reasoning was proposed to improve the performance of the watershed segmentation method (Frucci et al., 2008). It selects particular cases from a case base, which have image characteristics similar to that of the current input image. The segmentation parameters associated with the most similar case are then applied to the current image. The approach of case-base reasoning for seed selection requires a case base of segmentation outcomes, which is not always available in practice.

With regard to segmentation methods specially developed for LiDAR data, there are two major approaches. One is to process directly 3D LiDAR data and perform segmentation of surface point clouds in the 3D space (Söderman et al., 2004, Zhang et al., 2006). A significant part of the existing algorithms is based on 3D geometrical models of LiDAR surfaces, which limits their application scope to man-made or highly structured object surfaces only. Another approach, which is also adopted in this paper, is to implement a data segmentation method on the DSM image obtained from the initial LiDAR data by surface point triangulation and subsequent smoothing (Tiede et al., 2005). This method relies on the extraction of local maxima on the canopy surface to locate seed points. A different method of individual tree segmentation in forest areas is the crown delineation method based on the valley following algorithm (Leckie et al., 2003). However, this algorithm is not fully automatic since it requires a preliminary determination of forest areas and prior knowledge of key algorithm parameters. Another example is the method of gray-scale morphology used to segment treetops in a dense forest area (Andersen et al., 2001). A texture segmentation method using a seeded region growing was applied to segment coastal landforms in LiDAR imagery (Lucieer and Stein, 2005). In order to determine seed points, this method identifies image locations with minimum local variances similarly to the seed selection method (Brunner and Soille, 2007).

The main goal in the development of our segmentation method is a fully automatic selection of seed points for reliable image segmentation. Another goal is to obtain estimates for the threshold value for accurate region growing. In particular, the thresholds have to be estimated as functions of image local statistics in the neighborhoods of the seed points. The adaptive approach to surface (image) segmentation outlined in this paper can also be considered as an unsupervised learning of seed positions through local clustering of surface data with the constraint of region adjacency and pixel connectivity. The seed points are selected as the centers of local sub-clusters with high homogeneity and compactness with respect to their local inter-cluster distances.

Multi-scale image analysis is an important characteristic of the proposed approach, which provides invariance to variability of region sizes and their relative locations. The multi-scale and multi-resolution approaches to image segmentation are not new and many single-scale algorithms have been modified to implement the multi-scale image segmentation (Frucci et al., 2006, Jung, 2006, Wang, 1997, Saha et al., 2000, Duarte-Carvajalino et al., 2008). The main difference between the multi-resolution image analysis and the multi-scale approach is that the multi-scale analysis is performed once but at different local scales simultaneously, which depend on the image locations, i.e., the local scale is a function of pixel coordinates. The existing methods mostly perform the classical multi-scale image analysis based on multiple Gaussian filters at various scales. Here, a completely different scale selection principle is adopted for image segmentation, which is based on region homogeneity and its local contrast with adjacent regions. This principle substantially contributes to optimized selection of seed points and adaptive thresholds for the region growing.

The balance of this paper focuses on our multi-scale segmentation method with scale-adaptive selection of seed points and thresholds in the region growing. Section 2 describes a local clustering aspect of the conventional image segmentation, which is based on a two-criterion principle for optimized attribute clustering. The flowchart of the proposed region-growing segmentation is outlined in Section 3. Computation of local clustering map and selection of seed points are described in 4 Time-efficient computation of local clustering map, 5 Selection of seed points, respectively. Section 6 describes a modified region-growing algorithm using scale-adaptive thresholds. The experimental results of LiDAR surface segmentation are presented in Section 7 and concluding remarks are given in Section 8.

Section snippets

Local clustering aspect of the conventional image segmentation

Data clustering as a method of unsupervised learning of image segmentation into homogeneous regions can also be used to solve the seed selection problem in LiDAR image segmentation (Tou and Gonzalez, 1974). In the case of DSM images, which represent surfaces of real objects or their parts, the segmentation consists in labeling regions – one label per isolated region – in order to delineate objects of interest such as building, trees, forests, lakes, etc. on the background in LiDAR images. The

Multi-scale segmentation using adaptive region growing

In this section, we outline the proposed region-growing segmentation of surface data, details of which are described in the following sections. Our multi-scale algorithm can be distinguished by two main phases: (1) selection of multi-scale seed points using a pre-computed local clustering map and local scale values and (2) scale-adaptive region growing from the selected seeds. The first phase is aimed at the solution of the seed selection problem: incorrect seed selection results in erroneous

Time-efficient computation of local clustering map

Computation of the local clustering map is an extension of the score function estimation in Eq. (2.4) to the case of image segmentation. The local score function has to be modified in order to take into account the pixel connectivity on the image discrete grid and the region adjacency conditions in Eqs. (2.2), (2.3). Similarly, the inter-cluster distance has to be substituted by a multi-scale function serving as an estimator of the inter-cluster distance with respect to the current region and

Selection of seed points

In order to ensure an effective and fully automatic selection of seed points, their extraction has to be based on some formal conditions for an image pixel to be a good seed point. The seed selection process usually means the determination of an image location, intensity of which can serve as the seed intensity value in the region-growing segmentation. It corresponds to the assignment of initial values to the cluster centers (Section 2, Fig. 2). We can define three goodness-of-seed conditions

Scale-adaptive region growing

We propose a modified version of the conventional region-growing algorithm (Adams and Bischof, 1994) to improve image segmentation in the case of LiDAR surface data. The first modification is to use an adaptive threshold in the basic condition for region growing, which depends on statistical characteristics of the region under growth. The second one is the simultaneous consideration of two basic conditions for stopping the region growing instead of using only one based on a single attribute.

Experimental results

The main goal of the experiments was to evaluate the accuracy of the proposed multi-scale segmentation for the case of LiDAR images of natural scenes including forests, parks and individual trees in residential sub-urban areas. The main application is forestry, namely, segmentation of individual trees for automatic measurement of their characteristics. Majority of the testing areas are selected from forest stands located in a boreal forest. This is a natural terrain area without buildings and

Conclusions

A novel multi-scale method for surface (image) segmentation using seeded region growing is proposed. The seed points for region growing are selected as consecutive local maxima of the multi-scale clustering map, which indicate the centers of homogeneous regions with high contrast. This method has several advantageous characteristics as compared with other existing techniques in the context of LiDAR image segmentation. First, the seed points can be localized in a computationally efficient manner

References (30)

  • J.B. Campbell

    Introduction to Remote Sensing

    (2002)
  • D. Comaniciu et al.

    Mean shift: a robust approach toward feature space analysis

    IEEE Transactions on Pattern Analysis and Machine Intelligence

    (2002)
  • J.M. Duarte-Carvajalino et al.

    Multi-scale representation and segmentation of hyperspectral imagery using geometric partial differential equations and algebraic multi-grid methods

    IEEE Transactions on Geoscience and Remote Sensing

    (2008)
  • J. Fan et al.

    Automatic image segmentation by integrating color-based extraction and seeded region growing

    IEEE Transactions on Image Processing

    (2001)
  • M. Frucci et al.

    Using resolution pyramids for watershed image segmentation

    Image and Vision Computing

    (2006)
  • Cited by (0)

    View full text