Mapping of aggregated floodplain plant communities using image fusion of CASI and LiDAR data

https://doi.org/10.1016/j.jag.2008.09.001Get rights and content

Abstract

Combined optical and laser altimeter data offer the potential to map and monitor plant communities based on their spectral and structural characteristics. A problem unresolved is, however, that narrowly defined plant communities, i.e. plant communities at a low hierarchical level of classification in the Braun-Blanquet system, often cannot be linked directly to remote sensing data for vegetation mapping. We studied whether and how a floristic dataset can be aggregated into a few major discrete, mappable classes without substantial loss of ecological meaning. Multi-source airborne data (CASI and LiDAR) and floristic field data were collected for a floodplain along the river Waal in the Netherlands. Mapping results based on floristic similarity alone did not achieve highest levels of accuracy. Ordination of floristic data showed that terrain elevation and soil moisture were the main underlying environmental drivers shaping the floodplain vegetation, but grouping of plant communities based on their position in the ordination space is not always obvious. Combined ordination-based grouping with floristic similarity clustering led to syntaxonomically relevant aggregated plant assemblages and yielded highest mapping accuracies.

Introduction

Mapping is one of the most efficient methods to visualize trends of plant community (PC) patterns in space and time. The problem is that, although in most cases plant communities can clearly be recognized in the field, it is not always easy to detect the borders between different vegetation types. Vegetation units are abstractions of reality and boundaries between them are not always defined (Whittaker, 1973). Between two adjacent plant communities there is a transition that is either long or short (Sykora, 1984a, Sykora, 1984b). According to Barkman (1990) real boundaries may be absent (local continua) or sharp (usually man made), but both cases are rare and, in general, real boundaries are usually fuzzy. Glavac et al. (1992) state that in natural vegetation transitions between plant communities may be continuous or discontinuous, but usually the “non-linear but continuous” model (Scott, 1974) holds. In this spatial model, areas in which species composition changes very slowly are alternated by areas with quick species turn-over.

Core emphasis in natural vegetation mapping is on how to simplify vegetation gradients into discernible, mappable and ecologically meaningful units. Key challenges herewith are the delineation of classes with an adequate level of detail and solving the problem of transitional zones (Cingolani et al., 2004, Fortin et al., 2000). Remotely sensed reflectance data offer capabilities to map vegetation but require spectral signatures of pre-identified vegetation classes. Plant communities need to be defined prior to mapping. Various classification approaches were developed in the past years to cope with transitional zones, e.g. by means of fuzzy classifiers (Foody, 1996, Foody and Atkinson, 2002; Zhang and Foody, 2001). However, fuzzy approaches also need pre-identified classes as input and thus the problem of class delineation remains. Therefore, the emphasis of this work was on the development of a methodology that delineates plant communities that are discernible by RS data on one hand, and preserve maximal ecological significance on the other hand.

Plant ecologists have been developing methods to identify species assemblages for some time now (Hill, 1979), yet matching these methodologies with remote sensing (RS) techniques has only recently been tackled (Nilsen et al., 1999, Schmidtlein and Sassin, 2004, Thomas et al., 2003). In this respect, the use of multivariate statistical methods, such as cluster analysis and ordination, and ancillary abiotic field data is suggested to organize plant community data along environmental gradients across space. Such organization asks for the use of multispectral data of high spatial resolution to spectrally delineate such classes. Yet spectral discrimination of vegetation classes with similar species composition or environmental conditions remains subject to inaccuracies (Lawrence et al., 2006, Thomas et al., 2003).

In a previous study, multi-source remote sensing data were acquired over a semi-natural floodplain along the river Waal in the Netherlands (Geerling et al., 2007). Use was made of a compact airborne spectrographic imager (CASI) and light detection and ranging sensor technology (LiDAR). The authors concluded that a fusion of spectral and altimetry data can improve the classification of floodplain vegetation, particularly vegetation classes differentiated by their vertical structure. These classes are important to predict the hydraulic roughness of a floodplain, but were almost exclusively based on structure and not on species composition. As such, they lack syntaxonomic relevance.

Plant communities are usually described by plant ecologists at the lowest possible syntaxonomical level in the Braun-Blanquet system (Westhoff and van der Maarel, 1978). The Braun-Blanquet system describes plant assemblages at different hierarchical syntaxonomic levels with increasing number of floristic specifications: class, order, alliance and association. Differentiation of plant communities at lowest levels is based on the presence of indicator species, which are often to be found within the dominant plant cover. Since this level of differentiation is beyond the detection capacity of remote sensing instruments, the purpose of this work was to develop a methodology that enables to group these plant communities into broader vegetation classes in such a way that they can be discriminated by the CASI + LiDAR dataset and, at the same time, maintain phytosociological significance. Adopting ecological field data into the RS analysis can speed up the mapping process not only due to its potential use as ground truth data, but also by analyzing its inherent informative value for improving classification quality. Accordingly, the objective of this study was to evaluate the adequacy of implementing an ecological methodology using clustering and ordination techniques for optimizing floodplain vegetation mapping based on a fusion of spectral (CASI) and structural (LiDAR) data.

To do so, we first describe the phytosociological way to identify and describe plant communities, followed by a brief description of remote sensing as a means to identify and map plant communities. Subsequently we propose a synergism of both approaches by relying on plant community similarity analysis and ordination techniques to achieve an ecologically sound mapping.

Phytosociology identifies discrete plant assemblages by grouping species abundance data through cluster analysis and ordination (Austin and Smith, 1989, Fortin et al., 2000). Cluster analysis in field ecological studies essentially seeks to group floristic abundance data into classes or groups in such a way that within-group similarity is maximized and between-group similarity is minimized according to some objective criteria (Jongman et al., 1995). Basically, cluster analysis techniques divide plot samples of species (relevés) into a hierarchy of statistically similar clusters and then examine their dissimilarity. Amongst the most widely employed techniques are: two-way indicator species analysis (TWINSPAN) (Gauch and Whittaker, 1981, Hill, 1979) and cluster analysis by dendrogram method (McCune and Mefford, 1999). TWINSPAN is a polythetic, hierarchical clustering technique, which involves the joint clustering of samples and species by successive partitions of ordination axes generated at each step by reciprocal averaging. A dendrogram is a treelike plot that depicts the agglomeration sequence in cluster analysis, in which entities are enumerated along one axis and the dissimilarity level at which each fusion of clusters occurs is placed on the other axis. These methods show the hierarchy of clustering as a vegetation table showing all plots (TWINSPAN) and as a graphical representation (dendrogram), respectively. Due to differences in algorithm, the clusters made by TWINSPAN and dendrogram do not fully correspond. TWINSPAN classifies the plots in a divisive way, dividing the clusters using ordination space partitioning, whereas dendrogram groups the plots together based on a cluster distance measure in an agglomerative way.

Nevertheless, as vegetation units are abstractions of reality and boundaries between them are not always defined, in practice, clustering is often used in combination with ordination. Ordination analysis, as a kind of gradient analysis, seeks to detect a set of factors that account for the major patterns across all the original variables without a substantial loss of information. Detrended correspondence analysis (DCA; Hill and Gauch, 1980) is an ‘unconstrained’ ordination technique, which means that it refers to plot-based species data without considering any explanatory variables. A correspondence analysis (CA) is a form of weighted averaging that constructs theoretical environmental variables that best explain the species data (Jongman et al., 1995). This is done by maximizing the dispersion of the species scores along an ordination axis and the correlation between species and sites. The first axis symbolizes the ‘longest’ environmental gradient in species composition; the second one describes the ‘longest’ gradient in the remaining floristic variation and so on. Multiple axes can be constructed, with the constraint that they are uncorrelated with the previous axes. In a DCA two additional steps, detrending and rescaling, are added to remove two major faults: the arch effect that is caused by the unimodal distribution of species along gradients, and axis compression that occurs at the end of the gradients (McGarigal et al., 2000). Special problems arise with ordination in that it is subject to a number of assumptions about joint relationships of variables (Gauch, 1982).

In fact, despite case-specific shortcomings, ordination and cluster analysis may be seen as complementary, and when applied together they may provide useful information about the relationships among species (e.g. Thomas et al., 2003). In practice more than one solution is possible for defining plant communities. For instance, multiple plant assemblages can be defined, depending on the required level of detail, applied method, skills of the user and expert knowledge. Once plant assemblages are formed, they can be classified in syntaxa according to the Braun-Blanquet system. The philosophy of the Braun-Blanquet approach is that plant assemblages are biological units that can be typified and described into a hierarchical system in analogy to plant taxonomy (Westhoff and van der Maarel, 1978). The benefit of this classification system is that most of the described plant assemblages are quite well studied, particularly their synecology: the environmental conditions of plant communities.

Detection of plant assemblages through combined optical reflectance and laser altimeter data concerns the spectral and structural differentiation of a group of species, rather than the specific response of one species. Species assemblages have distinct spectral and structural characteristics when compared to single plants. The spectral influence of the non-vegetation elements, the multiple scattering between different plants and the structural layering of the vegetation can be seen as characteristic of a plant community (Schmidt and Skidmore, 2003). The success of PC mapping by optical remote sensing data depends therefore mainly on the type of remotely sensed data and the degree of syntaxonomical detail used for classification. Concerning the type of data, there are numerous examples of studies where high spatial resolution imaging spectroscopy data were applied to derive thematic maps of floristic composition (e.g. Schmidtlein and Sassin, 2004, Treitz and Howarth, 2000). Previous studies of river dynamics using multispectral and hyperspectral sensors include Winterbottom and Gilvear (1997), Malthus and George (1997) and Schmidt and Skidmore (2003). In addition, LiDAR data have significant potential for generating maps of complex river channel environments and vegetation characteristics. Airborne laser scanning is an active remote sensing technique that involves high-resolution elevation measurements by means of laser pulses. Over the past years various studies showed that laser data are able to distinguish structural differences both between and within plant communities (e.g. see review Straatsma and Middelkoop, 2006). For instance, Cobby et al. (2001), Mason et al. (2003) and Straatsma and Middelkoop (2007) employed laser altimetry for the assessment of floodplain vegetation height. The LiDAR observations of vegetation structure can thus present an independent information source complementing the spectral information content for a comprehensive floodplain characterization (Geerling et al., 2007, Gillespie et al., 2004, Held et al., 2003, Hill and Thomson, 2005).

Regardless of the type of remotely sensed data and the applied method, the syntaxonomical assignment of the ground truth data that precedes the classification step is usually taken for granted by remote sensing analysts. Given that various syntaxonomical assignments in the Braun-Blanquet system are possible from the same floristic data, it is this ‘grey zone’ of identification and, thus, the possibilities for improvement of mapping that will be further explored. The aforementioned methods of vegetation sampling, clustering and ordination will be used to delineate and optimize characterization of PC classes that can be discriminated through spectral and altimetry data. In Section 2 we will present a floristic dataset of a nature area that will be used for testing the map-ability of vegetation clusters.

Section snippets

Study area

A semi-natural floodplain called “Millingerwaard” (51.5°N and 5°E) was chosen as the study area. The Millingerwaard is a floodplain along the river Waal, one of the main branches of the river Rhine in the Netherlands (Fig. 1). The size of the study area is 58 ha with sand dunes, low vegetation, bushes and forest included within its boundaries. In 1989, the nature area was formed and left to develop. In subsequent years, cattle, horses and beavers were introduced to stimulate spontaneous

Image classification using dendrogram-based aggregation

In total, six steps of the dendrogram-based grouping were carried out. The first two steps took place at a level of >80% similarity (Fig. 3). Some plots of the TWINSPAN-based PCs were divided over one group of branches while other branches incorporated plots of more than one TWINSPAN-based PC. This intermixture facilitated PC aggregation because it can reasonably be assumed that PCs mixed up in one branch, or a dense group of branches, are closely interrelated. Hence such mixed up PCs were

Discussion

We evaluated sampling, classification and delineation of floristic assemblages according to the Braun-Blanquet method and through clustering and ordination techniques for their potential to map ecologically relevant floodplain vegetation types. Use of dendrogram-based grouping as input into remote sensing schemes rests on the assumption that similarities in floristic composition encompass similarities in spectral and height properties. DCA-based grouping (ordination space partitioning) rests on

Acknowledgements

The authors want to thank Isabel van Geloof and Iris de Ronde, who within their MSc-thesis work collected and analyzed the ecological data under supervision of the third author. We also want to thank the reviewers who helped to improve the manuscript.

References (58)

  • J.J. Barkman

    Controversies and perspectives in plant ecology and vegetation science

    Phytocoenologia

    (1990)
  • Brügelmann, R., 2003. Quality test of the LiDAR dataset. Internal document. Personal Communication (Delft: Ministry of...
  • C. Fernández-Aláez et al.

    Spatial distribution pattern of the riparian vegetation in a basin in the NW Spain

    Plant Ecology

    (2005)
  • K. Fitzpatrick-Lins

    Comparison of sampling procedures and data analysis for a land-use and land-cover map

    Photogrammetric Engineering & Remote Sensing

    (1981)
  • G.M. Foody

    Approaches for the production and evaluation of fuzzy land cover classifications from remotely-sensed data

    International Journal of Remote Sensing

    (1996)
  • G.M. Foody et al.

    Uncertainty in Remote Sensing and GIS

    (2002)
  • M.J. Fortin et al.

    Issues related to the detection of boundaries

    Landscape Ecology

    (2000)
  • H.G. Gauch

    Multivariate Analysis in Community Ecology

    (1982)
  • H.G. Gauch et al.

    Hierarchical classification of community data

    Journal of Ecology

    (1981)
  • G.W. Geerling et al.

    Classification of floodplain vegetation by data fusion of spectral (CASI) and LiDAR data

    International Journal of Remote Sensing

    (2007)
  • T.W. Gillespie et al.

    Prospects for quantifying structure, floristic composition and species richness of tropical forests

    International Journal of Remote Sensing

    (2004)
  • V. Glavac et al.

    On the nature of vegetation boundaries, undisturbed flood plain forest communities as an example—a contribution to the continuum/discontinuum controversy

    Vegetatio

    (1992)
  • F. Grevilliot et al.

    Phytogeographical and phenological comparison of the Meuse and the Saone valley meadows (France)

    Journal of Biogeography

    (1998)
  • F. Grevilliot et al.

    Grassland ecotopes of the upper Meuse as references for habitats and biodiversity restoration: A synthesis

    Landscape Ecology

    (2002)
  • A. Held et al.

    High resolution mapping of tropical mangrove ecosystems using hyperspectral and radar remote sensing

    International Journal of Remote Sensing

    (2003)
  • M.O. Hill

    TWINSPAN—A FORTRAN Program for Arranging Multivariate Data in an Ordered Two-way Table by Classification of the Individuals and Attributes

    (1979)
  • M.O. Hill et al.

    Detrended correspondence analysis: an improved ordination technique

    Vegetatio

    (1980)
  • R.A. Hill et al.

    Mapping woodland species composition and structure using airborne spectral and LiDAR data

    International Journal of Remote Sensing

    (2005)
  • R.H.G. Jongman et al.

    Data Analysis in Community and Landscape Ecology

    (1995)
  • Cited by (43)

    • Optical trait indicators for remote sensing of plant species composition: Predictive power and seasonal variability

      2017, Ecological Indicators
      Citation Excerpt :

      For dates on which reflectance had a higher degree of explanatory power than the OTI set, many vegetation plots featured high cover fractions of litter (Feilhauer and Schmidtlein, 2011). Litter has a considerable effect on the canopy reflectance spectrum (Asner, 1998) and causes spectral differences among plant assemblages (Verrelst et al., 2009) that were not considered in the tested OTIs. The superior performance of OTIs may be attributed to the fact that PROSAIL-based optical trait estimates include information inherent to spectral data in a straight-forward manner while excluding irrelevant spectral information, although not all drivers of canopy reflectance are considered in detail.

    • Remote sensing for mapping natural habitats and their conservation status - New opportunities and challenges

      2015, International Journal of Applied Earth Observation and Geoinformation
    View all citing articles on Scopus
    View full text