Elsevier

Environmental Modelling & Software

Volume 84, October 2016, Pages 467-481
Environmental Modelling & Software

Effect of raster resolution and polygon-conversion algorithm on landslide susceptibility mapping

https://doi.org/10.1016/j.envsoft.2016.07.016Get rights and content

Highlights

  • Landslide susceptibility maps using ANN.

  • Effects of raster resolution and vector-to-raster conversion algorithms.

  • The finest resolutions do not necessarily lead to the highest model performances.

  • The algorithm of conversion data may significantly affect the ANN training.

Abstract

The choice of the proper resolution in landslide susceptibility mapping is a worth considering issue. If, on the one hand, a coarse spatial resolution may describe the terrain morphologic properties with low accuracy, on the other hand, at very fine resolutions, some of the DEM-derived morphometric factors may hold an excess of details. Moreover, the landslide inventory maps are represented throughout geospatial vector data structure, therefore a conversion procedure vector-to-raster is required.

This work investigates the effects of raster resolution on the susceptibility mapping in conjunction with the use of different algorithms of vector-raster conversion. The Artificial Neural Network technique is used to carry out such analyses on two Sicilian basins. Seven resolutions and three conversion algorithms are investigated. Results indicate that the finest resolutions do not lead to the highest model performances, whereas the algorithm of conversion data may significantly affect the ANN training procedure at coarse resolutions.

Introduction

Landslides occur systematically all over the world, thus requiring landslide hazard researches to support an appropriate landuse planning. From this perspective, the recognition of landslide-prone terrains is traditionally considered one of the most useful approach (Hansen, 1984), which leads to the definition of landslide susceptibility maps, i.e., the spatial distribution of probability of landslide occurrence over a region. The assessment of landslide susceptibility is based on empirical or modeled relationships between landslide events occurred in the past and landslide inducing (or triggering) factors (Varnes and IAEG, 1984) of the region of interest, thus resulting into a typical spatial correlation analysis.

The scientific literature offers various methods for the landslide susceptibility mapping (Francipane et al., 2014, Arnone et al., 2014). These methods can be generally classified as: heuristic, mainly based on the opinion of geomorphologic experts (e.g., Francipane et al., 2014, Puglisi et al., 2013); bivariate statistical, mainly based on the frequency ratio approach (e.g., Lepore et al., 2012, Lee and Pradhan, 2007); multivariate statistical, mainly based on the logistic regression (e.g., Lee et al., 2004, Lee and Pradhan, 2007; Lepore et al., 2012, Francipane et al., 2014); and multivariate data-driven, mainly based on the use of Artificial Neural Network, ANN (e.g., Francipane et al., 2014, Arnone et al., 2014, Melchiorre et al., 2008). A comparison of these methods is provided by Francipane et al. (2014), who demonstrated the strength of the statistical and data-driven methods as compared to the heuristic methods.

The application of such methods requires the use of spatial distributed data generally represented by means of a raster structure. Specifically, all the input data (i.e., landslide inducing factors) and historical landslide distributions must be consistent in terms of spatial representation and resolution. The inducing factors refer commonly to continuous (e.g., elevation, slope, distance from river network or roads, etc.) or categorical (e.g., landuse, geology, soil type, etc.) variables and are described as raster data or polygons in vector data structure, respectively; events of landslide inventory, instead, are generally represented by points (landslide locations) or polygons (landslide sources and propagation area) in a vector data structure, e.g. Yilmaz (2010).

In order to make all data consistent, a sound preprocessing of data is, then, generally required, which is commonly carried out throughout spatial analysis techniques. The main issues of the preprocessing phase are i) the choice of the proper algorithm required to convert vector data (polygons or point features) into a raster structure and ii) the choice of the proper spatial resolution for the data.

In the case of continuous variables (e.g., elevation), a coarser spatial resolution describes the terrain less accurately, since the information of a portion of terrain identified as a pixel are synthesized with a value. This lack of accuracy is propagated on the morphological secondary attributes extracted from the DEM (Digital Elevation Model), such as the slope, the curvature, the aspect, etc. (Wu et al., 2007, Vaze et al., 2010, Yang et al., 2014). As an example, it has been demonstrated that the slope frequency distribution is very sensitive to the DEM resolution (e.g., Chang and Tsai, 1991, Kienzle, 2004, Claessens et al., 2005, Grohmann, 2015). In other words, uncertainties due to the raster structure (Pogson and Smith, 2015) of the inducing factors increase as the resolution used for the description of the variables decreases, affecting somehow the reliability of the derived susceptibility maps (Guzzetti et al., 1999, Lee et al., 2004, Lee et al., 2010, Yuan et al., 2008, Cama et al., 2016).

Across the studies available in literature, a large variety of resolutions, also significantly different, have been used for the development of various maps. However, not always the choice of the resolution is justified on the basis of objective criteria. Guzzetti et al. (1999) suggested the use of multiple resolutions and to opt for that which provides the best performances in terms of reliability. As demonstrated by Li and Zhou (2003), who compare the susceptibility maps obtained using data having various resolutions (from 2 m to 300 m), the best performances cannot be always attributed to the higher resolutions; in this case, the authors suggest an intermediate resolution as best option. Lee et al. (2004) considered data at 5, 10, 30, 100, and 200 m resolution, demonstrating that the higher resolutions (i.e., 5, 10, and 30 m) lead to very similar susceptibility maps. Interesting insights are given by Yuan et al. (2008), who concluded that the optimal resolution depends on the size of the study area and on the average landslide scale. In particular, in their experiment conducted on eleven different resolutions (from 5 m to 190 m), the best performance was obtained for the 90 m resolution and the worst one for the 150 m resolution; moreover, they found out that the resolution of 90 m was very close to the average area of the landslides occurred in the study area, suggesting the presence of a correlation between the best resolution and the average landslide area. Similar outcomes were found out by Lee et al. (2010), who derived landslide susceptibility maps for 12 sub-watersheds of the Chi-Shan River, southern Taiwan, at different resolutions. In this case, the pixel size derives from the spatial resolution of images from different existing sensors used to identify the landslide areas at fine resolutions (i.e., 2.5, 8, and 20 m); they, then, obtained spatial layers at coarser spatial resolutions (i.e., 10, 20, 40, 60, 80, and 100 m) by means of resampling methods. If, on the one hand, they found out that greater pixel sizes leaded to a rougher landslide map and larger landslide areas, on the other hand, they found that also the use of finer resolutions can lead to wrong results, being the optimal resolution depending on the real landslide dimensions. Recently, Cama et al. (2016) evaluated the relationship between the cell size and the model performances for susceptibility mapping of debris-flow by means of the binary logistic regression (Hosmer and Lemeshow, 2000). Specifically, they focused on the effect of landslide inducing factors on the global model performances by investigating four different resolutions (from 2 to 32 m). They found out small differences in the global accuracy and precision of models, whereas effects were detected in the role of each inducing factor as the resolution changed.

Other works explored the effect of resolution on landslide susceptibility modeling through the use of physically-based hydrological-stability models (Montgomery and Dietrich, 1994, Rosso et al., 2006, Arnone et al., 2011, Lepore et al., 2013, Capparelli and Versace, 2014, Arnone et al., 2016). By using such approaches, the cell size plays an important role mainly in the description of the terrain-based processes and the DEM derived information layers. Among these works, Claessens et al. (2005) carried out an investigation on the resolution effects using the landscape evolution model LAPSUS (Claessens et al., 2007); they concluded that a perfect resolution for a DEM cannot exist, because no resolution can generally represent perfectly the size of all landslides located throughout the analyzed area. A terrain stability model was also used by Tarolli and Tarboton (2006), who estimated the most likely initiation points for landslides from potential instability by using the Stability INdex MAPping (SINMAP, Pack et al., 1998) model and by comparing five different raster resolutions (from 2 to 50 m). They concluded that DEMs at resolutions larger than 10 m lead to a loss of resolution that degrades the results, while for DEM smaller than 10 m, the physical processes responsible for triggering landslides are obscured by the smaller scale terrain variability.

In all the above discussed works, landslide locations and areas are represented as point and polygon features, respectively, requiring a procedure to convert data from vector to raster. This aspect may considerably affect the outcome of the susceptibility analysis, even more significantly than the proper resolution that, in turn, may be related to the proper representation of the landslide polygons. Nonetheless, while the choice of the proper spatial resolution has received considerable attention, the issue of the proper representation of landslide locations and the choice of the best method to convert data from a vector to a raster structure has not been widely discussed yet.

In this work we investigated the effect of the raster resolution on susceptibility mapping in conjunction with the effect of the algorithm used to transform landslide polygons into a raster structure. We selected a well-known case study in northeastern Sicily, Italy, where, in 2009, an extreme rainfall event triggered a number of landslides on the Briga and Giampilieri basins that caused 37 casualties. Specifically, the landslide inventory consists of more than one thousands polygons, which delineate the landslide area. The inventory map was realized by the UTPRA-PREV (Technical Unit for Environmental Characterization, Prevention and Remediation – Natural risks prevention and effect mitigation) of ENEA (Italian National Agency for New Technologies, Energy, and Sustainable Economic Development) through a detailed geomorphological and morphometric field survey and an aerial photos analysis (Puglisi et al., 2013). Most of the phenomena were classified as debris and mud flows; for each event, both the landslide locations, as point information, and the landslide sources and propagation area, as polygonal information, are available. The susceptibility mapping has been conducted by applying the Artificial Neural Network technique (Arnone et al., 2014, Pumo et al., 2015) on different resolutions. Specifically, starting from the 2 m available DEM, the 10, 20, 30, 40, and 50 m DEMs have been derived, together with the associated topographic-derived variables; concurrently, three algorithms to convert vector into raster (available in ESRI GIS software) have been used and then compared.

Section snippets

Artificial Neural Network – ANN

The use of Artificial Neural Networks (ANNs) for the landslide susceptibility mapping has become very popular in literature, due especially to their efficiency in pattern recognition and multivariate analyses (e.g., Pradhan and Lee, 2010, Friedel, 2011). An ANN consists of a set of layers containing some neurons and connected by weighted channels.

The ANN here used is a feed-forward MultiLayer Perceptron (MLP) network, which is one of the most suitable and adopted ANNs for landslide

Case study: The Briga and Giampilieri basins

Climate, hydrology, DEM, and landuse data were collected in the Briga and Giampilieri catchments, which are located within the Messina district in northeastern Sicily, Italy (38° 11′ N, 15° 34′ E). Both the catchments are approximately 10 km2 in size and present a rugged morphology with mountains up to about 1000 m above the sea level, narrow valleys, and very steep hillslopes. Fig. 2 shows the DEM of the study area at the resolution of 2 m. The vegetation is heterogeneous and mainly dominated

Model setup

In order to assess the sensitivity of the susceptibility map to the size resolution, raster files at seven different cell size were adopted. The finer raster resolution here adopted is 2 m, which corresponds to that of the available DEM for the study area. Additionally, six coarser resolutions, i.e., 5, 10, 20, 30, 40, and 50 m were considered. In order to establish how the conversion methods influence the susceptibility mapping, three algorithms from the “vector to raster conversion tool” of

Results

This section describes the results of the ANNs simulations for all the configurations obtained by combining the seven resolutions with the three resampling methods defined in Section 2.2. At a first stage we made a simple image-driven analysis of the susceptibility maps (Fig. 9, Fig. 10, Fig. 11) and, then, a quantitative assessment of the results evaluating the ROC curves (Fig. 12) and the AUCs (Fig. 13a) for each case. Results are discussed in Section 6.

Discussion

The results described above, both in terms of susceptibility maps, ROC and relative AUC, provide interesting outcomes relatively to both the effect of pixel size and algorithms of data conversion applied to the landslide inventory map.

With regard to the best resolution, the results clearly demonstrated that, independently from the resampling method, the best performances were provided by the map at 10 m. These outcomes confirm the results of other studies which revealed that the finest

Conclusions

The use of the ANNs in landslide susceptibility analyses has become a valid alternative to the use of statistical methods, due to the particular capability in analyzing spatial correlation data.

The effect of the resolution of the initial data in the analysis of susceptibility constitutes one of the aspects of most interest in the field of derivation maps of landslide susceptibility. The choice of an not proper resolution can affect the whole analysis, returning unreliable results. Moreover,

Acknowledgements and data availability

Landslide inventory dataset used in this work was realized by the UTPRA-PREV of ENEA through a detailed geomorphological and morphometric field survey and aerial photos. Landslide dataset, geological and pedological information layers can be asked to Dr. C. Puglisi, [email protected]. Other data on landslide inducing factors can be asked to Dr. E. Arnone, [email protected]. Authors would like to thank two anonymous reviewers who contributed to improve the manuscript.

References (54)

  • F. Guzzetti et al.

    Landslide hazard evaluation: a review of current techniques and their application in a multi-scale study, Central Italy

    Geomorphology

    (1999)
  • C. Melchiorre et al.

    Artificial neural networks and cluster analysis in landslide susceptibility zonation

    Geomorphology

    (2008)
  • M. Pogson et al.

    Effect of spatial data resolution on uncertainty

    Environ. Model. Softw.

    (2015)
  • B. Pradhan et al.

    Landslide susceptibility assessment and factor effect analysis: backpropagation artificial neural networks and their comparison with frequency ratio and bivariate logistic regression modelling

    Environ. Model. Softw.

    (2010)
  • D. Pumo et al.

    Annual flow duration curves assessment in ephemeral small basins

    J. Hydrol.

    (2014)
  • J. Vaze et al.

    Impact of DEM accuracy and resolution on topographic indices

    Environ. Model. Softw.

    (2010)
  • S. Wu et al.

    Modeling the effects of elevation data resolution on the performance of topography-based watershed runoff simulation

    Environ. Model. Softw.

    (2007)
  • P. Yang et al.

    What is the effect of LiDAR-derived DEM resolution on large-scale watershed model results?

    Environ. Model. Softw.

    (2014)
  • E. Yesilnacar et al.

    Landslide susceptibility mapping: a comparison of logistic regression and neural networks methods in a medium scale study, Hendek region (Turkey)

    Eng. Geol.

    (2005)
  • APAT

    La realizzazione in Italia del progetto Corine Land Cover2000

    (2005)
  • E. Arnone et al.

    Accounting for soils parameter uncertainty in a physically-based and distributed approach for rainfall-triggered landslides

    Hydrol. Process.

    (2016)
  • E. Arnone et al.

    Strategies investigation in using artificial neural network for landslide susceptibility mapping: application to a Sicilian catchment

    J. Hydroinformatics

    (2014)
  • E. Arnone et al.

    Effect of DEM Resolution on Rainfall-triggered Landslide Modeling within a Triangulated Network-based Model a Case Study in the Luquillo Forest, Puerto Rico

    (2013)
  • M. Cama et al.

    Exploring relationships between grid cell size and accuracy for debris-flow susceptibility models: a test in the Giampilieri catchment (Sicily, Italy)

    Environ. Earth Sci.

    (2016)
  • D. Caniani et al.

    Neural networks and landslide susceptibility: a case study of the urban area of Potenza

    Nat. Hazards

    (2008)
  • G. Capparelli et al.

    Analysis of landslide triggering conditions in the Sarno area using a physically based model

    Hydrol. Earth Syst. Sci.

    (2014)
  • C.F. Chung et al.

    Multivariate regression analysis for landslide hazard zonation

  • Cited by (99)

    • An open-source and QGIS-integrated physically based model for Spatial Prediction of Rainfall-Induced Shallow Landslides (SPRIn-SL)

      2023, Environmental Modelling and Software
      Citation Excerpt :

      From the forecasting point of view, this type of model output is preferable than that showing unstable pixels randomly scattered over terrain units free from landslides. In this regard, as observed in literature, the simple comparison of terrain units occupied by known landslides with model outcomes may be affected by inherent inaccuracies and errors of the landslide inventory (Guzzetti et al., 2012) or by issues coming from algorithms for polygon rasterization (Arnone et al., 2016b). This evidence reveals that the SPRIn-SL package adequately fulfils one of the main requirements of landslide zoning models, namely minimizing the prediction of unstable zones where no landslides occurred (i.e., false alarms) (Godt et al., 2008; Corominas et al., 2014).

    View all citing articles on Scopus
    View full text