Comparison of sampling strategies for object-based classification of urban vegetation from Very High Resolution satellite images

https://doi.org/10.1016/j.jag.2016.04.005Get rights and content

Highlights

  • Mapping of urban vegetation.

  • Very high resolution satellite imagery and object based image analysis.

  • Sampling strategies, active learning, Random Forest classification.

Abstract

Vegetation monitoring is becoming a major issue in the urban environment due to the services they procure and necessitates an accurate and up to date mapping. Very High Resolution satellite images enable a detailed mapping of the urban tree and herbaceous vegetation. Several supervised classifications with statistical learning techniques have provided good results for the detection of urban vegetation but necessitate a large amount of training data. In this context, this study proposes to investigate the performances of different sampling strategies in order to reduce the number of examples needed. Two windows based active learning algorithms from state-of-art are compared to a classical stratified random sampling and a third combining active learning and stratified strategies is proposed. The efficiency of these strategies is evaluated on two medium size French cities, Strasbourg and Rennes, associated to different datasets. Results demonstrate that classical stratified random sampling can in some cases be just as effective as active learning methods and that it should be used more frequently to evaluate new active learning methods. Moreover, the active learning strategies proposed in this work enables to reduce the computational runtime by selecting multiple windows at each iteration without increasing the number of windows needed.

Introduction

With the increasing urbanization, urban green spaces become more important for urban environments and urban ecosystems, notably for the services they procure (Tzoulas et al., 2007). These services are numerous and allow for instance, reducing pollutants from transportation and the effect of urban heat islands, to maintain biodiversity in cities or to develop leisure facilities. In Europe, urbanisation results in urban sprawl that is fragmenting natural and agricultural landscapes (European Environment Agency, 2006) and the developing green infrastructures at the European level is a permanent task for several decades (Jongman et al., 2004). In France, the implementation of the Grenelle 1 (2009) and Grenelle 2 (2010) environmental policy aimed at creating through regional schemes of ecological coherence, a network of green and blue networks. The Green and Blue Infrastructure consists of a green component, i.e. natural and semi-natural environments on land, and a blue component, i.e. the water and wetland network (rivers, streams, canals, ponds, wetlands, etc.) (COMOP, 2009). In France, the topographic database produced by the French National Geographic Institute (BD Topo®—IGN) is often used to map the green part of these networks (Hubert-Moy et al., 2012). This database is not necessarily regularly updated and is not complete for the whole national territory. Only tree areas of over 0.5 ha are represented and herbaceous vegetation is not considered. Thus this database is not accurate enough or sufficiently up to date to map ecological continuities at the infra-communal scale (Hubert-Moy et al., 2012). The development of satellites at Very High Resolution (VHR) providing images with a resolution of less than a metre has continued since the early 2000s, allowing up to date maps to be obtained that are exhaustive in terms of the type of vegetation.

With the increasing availability of satellite images, numerous methods have been developed to retrieve the urban vegetation. These methods may be supervised or not and are often uniquely based on spectral values of pixels (Xie et al., 2008). However, with VHR images, the spectral values of objects of the same type become more heterogeneous and it may become necessary to use new features such as texture or neighbourhood to improve the results of classifications (Tuominen and Pekkarinen, 2005, Sheeren et al., 2009). Rather than using a per-pixel classification, object-based image analysis has been proposed as an alternative (Blaschke, 2010). These approaches allow pixels to be re-grouped within homogenous segments and a large set of features to be computed, which can be spectral, textural, contextual or spatial (Mathieu et al., 2007, Blaschke et al., 2011). The use of these approaches for the detection of urban vegetation allows increased performance compared to per-pixel approaches (Cleve et al., 2008, Zhang et al., 2010). Rule-based classifications are often used to retrieve the vegetation from VHR images (Pham et al., 2011; Van Delm and Gulink, 2011). The relevant features and thresholds are often determined by trial and error, which can be very time-consuming and is strongly influenced by the knowledge of the expert (Belgiu et al., 2014). Additionally, the created rule sets are difficult to transpose to other sites or images, whether they come from the same sensor, or not and need to be adapted (Hussain and Shan, 2016).

Supervised classifications with statistical learning techniques have provided good results for the detection of urban trees (Cleve et al., 2008, Zhang et al., 2010; Tigges et al., 2013; Rougier and Puissant, 2014). These methods have the advantage of being easily reproducible, regardless of the study site or the image considered. However, to obtain good results with a supervised algorithm, it is often necessary to collect large amounts of training data, particularly for the most heterogeneous classes (Chen and Stow, 2002) or require representative training data which allow to consider the whole diversity of the space or the classes studied (Foody and Mathur, 2004). The collection of this data is typically obtained by field survey or visual image interpretation which can be an important work and cost. The image interpretation is very time consuming and may vary depending on the interpreter or external and technical factors (Van Coillie et al., 2014). To obtain representative examples, several schemes are traditionally used in remote sensing, such as stratified random sampling (Congalton and Green, 2009). This strategy generally allow to reduce the size of the training data needed but require prior knowledge on the study site to construct the stratification.

More recently, active learning methods have been proposed, to reduce the cost needed for the production of training data by reducing the training set size and thus the labelling time. Using an iterative process, the aim of this approach is to select the most informative and representative examples for the active classifier at each iteration, so that they can be annotated by the user (Settles, 2010). Using these methods generally allow to obtain more accurate results than with simple random sampling (Settles, 2010, Tuia et al., 2009). These methods also permit to increase interaction with the user (Hichri et al., 2013) and are recently used with good results, both for multispectral and hyperspectral image classification (Tuia et al., 2009, Demir et al., 2011). Tuia et al. (2011) identified three major families of strategies that are commonly used for the selection of new examples: strategies based on the disagreement of a model committee, on the distance to the separators and on posteriori probability measurements.

A problem of the active learning strategies is the computational runtime needed to carry out the classification, as the number of iterations can sometimes be large. Several solutions have been proposed to reduce the number of iterations by selecting a batch (Demir et al., 2011) or a window (Stumpf et al., 2014) at each iteration. Other algorithms have been proposed, to integrate diversity criteria and to reduce redundant information of new examples and obtain more rapidly a series of examples that represent the full complexity of the image being studied (Demir et al., 2011, Tuia et al., 2009, Volpi et al., 2012, Stumpf et al., 2014). Different methods have been also proposed to take spatial constraints into account (Liu et al., 2009, Xu et al., 2014, Pasolli et al., 2014) or to take terrain specificities into account (Demir et al., 2014) to obtain less correlated and thus more representative examples.

In this context, the main objective of this work is the mapping of urban vegetation (tree and herbaceous vegetation) from VHR images extending to multiclass classifications an active learning method initially proposed by Stumpf et al. (2014) for the detection of landslides. Most applications of active learning in the field of remote sensing compare these methods to simple random sampling. Moreover, as it has been shown that other selection strategies allow obtaining better results, this work addresses this issue and provides a benchmark of different sampling strategies for the mapping of urban vegetation. To challenge random stratified sampling, three strategies of active learning are evaluated. They are based on a committee model to select the most informative windows in the framework of an object based analysis. The first will choose the most uncertain window and the second integrates a diversity criterion. The third method proposes to combine active leaning and stratified sampling strategies in order to further reduce the computational and the labelling costs. In order to check the reproducibility of the proposed methods for the mapping of tree and herbaceous vegetation in different urban contexts, two medium-sized French cities with different urban morphologies, Strasbourg and Rennes, are analysed. Thus, complementary aims of this research are to check what active learning brings to different datasets and also to verify the contribution of auxiliary data for the improvement of the classification results.

This paper is structured in five sections. In Section 2 both study sites and available datasets are presented. In Section 3, the methodology of active learning and the three algorithms used for the selection of the new examples, as well as the reference method used to evaluate the results of active learning, are detailed. The influence of the tested parameters for each algorithm is also analysed. Section 4 allows evaluating and discussing their performances. Finally, Section 5 concludes this work and presents several research perspectives.

Section snippets

Study sites and data

Two study sites were selected to investigate the performances of the active learning and stratified sampling methods for the mapping of tree and herbaceous vegetation. The two cities are situated in France and are representative for numerous medium-sized towns in Western Europe (Nabucet et al., 2015). The first is Strasbourg, situated in the East of France, while the second is Rennes in the West (Fig. 1c).

Strasbourg is a city characterised by a dense centre inherited from the Middle-Ages, with

Methods

The first step of the object-based classification scheme (Section 3.1) is a segmentation and computation of object attributes using the Pléiades multispectral images. In the second step, active learning methods and different sampling strategies are described (Section 3.2), before explaining the experimental setup (Section 3.3). Fig. 2 provides an overview of the tested processing strategies.

Results and discussion

To evaluate the performances of the different methods put forward in this work, two accuracy measures are considered. The first is overall accuracy which is commonly used in active learning studies. The overall accuracy has the advantage of being global accuracy measure, but can be strongly biased when classes are unbalanced, as is the case here for our two study sites where the “other” class is highly dominant (Table 1). Thus, the F-measure is used to evaluate the classification performances

Conclusion

The questions and issues concerning vegetation in urban areas are multiple and of significant importance both for determining public policies and in academic research. Obtaining an accurate prior cartography of all tree and herbaceous elements for a given area is therefore important and the analysis of VHR satellite images provide a feasible solution in this context.

In this research, three active learning strategies are proposed to extract tree or herbaceous vegetation from VHR images of two

Acknowledgements

The authors are grateful to the RTU Pleiades, Program ORFEO Accompaniment Program (CNES) for making the Pléiades images available. We also thank the IGN services for the orthorectification of the images, and to the CIGAL regional agency for the regional land cover database of Strasbourg.

References (57)

  • J. Qi et al.

    A modified soil adjusted vegetation index

    Remote Sens. Environ.

    (1994)
  • A. Stumpf et al.

    Object-oriented mapping of landslides using Random Forests

    Remote Sens. Environ.

    (2011)
  • J. Tigges et al.

    Urban vegetation classification: benefits of multitemporal RapidEye satellite data

    Remote Sens. Environ.

    (2013)
  • S. Tuominen et al.

    Performance of different spectral and textural aerial photograph features in multi-source forest inventory

    Remote Sens. Environ.

    (2005)
  • K. Tzoulas et al.

    Promoting ecosystem and human health in urban areas using Green Infrastructure: a literature review

    Landscape Urban Plann.

    (2007)
  • A. Van Delm et al.

    Classification and quantification of green in the expanding urban and semi-urban complex: application of detailed field data and IKONOS-imagery

    Ecol. Indic.

    (2011)
  • M. Baatz et al.

    Multiresolution Segmentation: an optimization approach for high quality multi-scale image segmentation

  • A. Beygelzimer et al.

    FNN: Fast Nearest Neighbor Search Algorithms and Applications

    (2013)
  • T. Blaschke et al.

    Object-Based image analysis for vegetation mapping and monitoring

    Advances in Environmental Remote Sensing: Sensors, Algorithms, and Applications., London, Remote Sensing Applications Series

    (2011)
  • S. Boltz et al.

    kNN-based high-dimensional Kullback-Leibler distance for tracking

    Presented at the Eighth International Workshop on Image Analysis for Multimedia Interactive Services, 2007. WIAMIS’07

    (2007)
  • L. Breiman

    Random forests

    Mach. Learn.

    (2001)
  • COMOP, 2009. Orientations nationales pour la préservation et la restauration des continuités écologiques. Guide 2–Appui...
  • D.M. Chen et al.

    The effect of training strategies on supervised classification at different spatial resolutions

    Photogramm. Eng. Remote Sens.

    (2002)
  • C. Chen et al.

    Using Random Forest to Learn Imbalanced Data (No. 666)

    (2004)
  • R.G. Congalton et al.

    Assessing the Accuracy of Remotely Sensed Data: Principles and Practices

    (2009)
  • L. Copa et al.

    Unbiased query-by-bagging active learning for VHR image classification

  • B. Demir et al.

    Batch-mode active-learning methods for the interactive classification of remote sensing images

    IEEE Trans. Geosci. Remote Sens.

    (2011)
  • B. Demir et al.

    Definition of effective training sets for supervised classification of remote sensing images by a novel cost-sensitive active learning method

    IEEE Trans. Geosci. Remote Sens.

    (2014)
  • Cited by (25)

    • Remote sensing of urban green spaces: A review

      2021, Urban Forestry and Urban Greening
    • Urban landscape extraction and analysis in the mega-city of China's coastal regions using high-resolution satellite imagery: A case of Shanghai, China

      2018, International Journal of Applied Earth Observation and Geoinformation
      Citation Excerpt :

      It was difficult to determine an appropriate threshold related to the selection of land cover classes with large differences. In a follow-up study, to reduce the error rate and inefficiency caused by threshold selection, the method of active learning from the ancillary data will be used to solve the problem of threshold determination (Rougier et al., 2016), thereby improving the overall accuracy of the delineation methods. The present study focuses on an analysis of the land cover and landscape pattern of an ecologically fragile city using China’s largest city in a coastal (Shanghai) area.

    • Land cover mapping in urban environments using hyperspectral APEX data: A study case in Baden, Switzerland

      2018, International Journal of Applied Earth Observation and Geoinformation
      Citation Excerpt :

      In our implementation, we used eCognition® (Trimble 2018) to obtain the image objects, which were then used in an ensuing classification step applied outside of eCognition®. This software has been widely used in object-based image analysis methods for remote sensing data (Zhang and Xie, 2012; Rougier et al., 2016; Zhou and Troy, 2008). After careful visual inspection of the segmentation result and evaluation of the final classification accuracy, we set the shape parameter to 0.1, the compactness parameter to 0.5 and the scale parameter to 50, which were observed to obtain the optimal classification result.

    • A review of accuracy assessment for object-based image analysis: From per-pixel to per-polygon approaches

      2018, ISPRS Journal of Photogrammetry and Remote Sensing
      Citation Excerpt :

      Authors frequently report measures that derive from the confusion matrix. These measures include the Kappa coefficient (e.g., Su et al., 2012; Han et al., 2015; Novelli et al., 2016), the F measure (e.g., Güttler et al., 2016; Rougier et al., 2016), McNemar’s test statistic (e.g., Belgiu et al., 2014; Kavzoglu et al., 2015), and quantity & allocation (e.g., Doxani et al., 2012; Cai and Liu, 2013; Müllerová et al., 2013; Katagis et al., 2014; Pham et al., 2016). Kappa was most used: 84 of the publications reported Kappa as a summary metric.

    • Multi-sensor feature fusion for very high spatial resolution built-up area extraction in temporary settlements

      2018, Remote Sensing of Environment
      Citation Excerpt :

      Several strategies for an automated and objective identification of optimized segmentations are proposed in literature (Espindola et al., 2006; Esch et al., 2008; Drăgut et al., 2010; Martha et al., 2011; Geiß et al., 2016a). Nonetheless, numerous studies have shown that accuracies of object-based supervised classifications are less dependent on segmentation (Belgiu and Drăgut, 2014), and indicate that over-segmentation is generally preferable (e.g. Stumpf and Kerle, 2011; Johnson and Xie, 2013; Rougier et al., 2016). Thus, in order to keep computational costs low, we applied segmentation without an extensive optimization.

    View all citing articles on Scopus
    View full text