Automatic procedure for selecting flood events and identifying flood characteristics from daily streamflow data

https://doi.org/10.1016/j.envsoft.2021.105180Get rights and content

Highlights

  • We propose a generic standardized method for identifying the flood characteristics.

  • An automatic procedure to select flood events and identify flood characteristics is constructed.

  • We developed a graphical user interface (GUI) for this automatic procedure in MATLAB.

  • The validation results show that our method has good applicability to watersheds with diverse characteristics.

Abstract

The selection of flood events and determination of flood characteristics (e.g. start and end dates, peak discharge, volume, and duration) are the first critical steps for flood analyses. To obtain the key flood information accurately, we used the automatic peak over threshold (POT) model for flood sampling and proposed an automatic approach to determine flood characteristics using the master recession curve analysis (MRC) method. We further developed a graphical user interface (GUI) and toolbox for this procedure in MATLAB. Model parameter estimation experiment (MOPEX) data from 423 stations were used to evaluate the proposed method. Our results suggest that the proposed procedure performs well for watersheds with diverse characteristics. The developed toolbox can be conveniently applied to other watersheds for flood sampling and the characterisation of flood events, thus helping reduce the uncertainty in subsequent flood analyses, such as multivariable flood frequency and trend analyses.

Introduction

Floods are serious natural hazards, causing significant damage and affecting millions of people worldwide (Kauffeldt et al., 2016; Bruijn et al., 2019; Qiu et al., 2017). Moreover, with the increase in atmospheric water-holding capacity due to global warming, extreme weather events, especially flood events, occur more frequently (Blöschl et al., 2019; Adhikari et al., 2010; Tanoue et al., 2016). To reduce flood damage and economic losses, it is essential to accurately investigate the variability of flood events and assess flood risks (Zeng et al., 2020). It is well known that the precise selection of flood events and identification of flood characteristics from daily streamflow data is the most critical steps for subsequent flood related research (e.g. multivariable flood frequency analysis, and trend analysis) (Karahacane et al., 2020). However, such work yet lacks comprehensive and feasible approaches (Karahacane et al., 2020), thus highlighting the importance of developing a framework for selecting flood events and identifying flood characteristics.

Flooding is a multivariate stochastic phenomenon generally described by variables such as flood peak, flood volume, and flood duration (Mediero et al., 2010). The determination of flood characteristics has not yet been comprehensively studied, although various simplified methods have been used to obtain them in previous studies (Vittal et al., 2015; Jeong et al., 2014; Nadarajah and Shiau 2005). Three main challenges should be considered for the accurate and objective identification of flood characteristics: the selection of suitable flood samples, accurate identification of the start and end dates of flood events, and extension of the recession process to separate flood events from the observed discharge.

The annual maximum series (AMS) approach and the POT method are widely used for flood sampling (S. Solari and Losada, 2012). The POT method, which can capture more information about flood processes extracted from the daily streamflow data and reduce the uncertainty of flood frequency analysis (Lang et al., 1999), has been widely used for flood risk estimation in the past few decades (Durocher et al., 2018; Durocher et al., 2019; Aissia et al., 2012). However, the flood samples obtained using the POT model largely depended on the selection of the threshold. Reliable threshold selection methods, such as the fixed quantile (Jeong et al., 2014; S. Solari and Losada, 2012), mean number of over-threshold events (Brunner et al., 2018, 2019), mean exceedance above the threshold (Davison and Smith, 1990; Lang et al., 1999), and the automatic threshold selection method (Liang et al., 2019; S. Solari and Losada, 2012) can result in a suitable flood sample. Automatic procedures, which determine the threshold automatically according to the degree of fit between the hypothetical distribution and the flood sample, have proven to be effective (Solari et al., 2017; Durocher et al., 2018; S. Solari and Losada, 2012). Durocher et al. (2018) compared the efficiency of different threshold selection methods and indicated that automatic threshold selection based on the goodness-of-fit (GOF) test can obtain a reasonable optimal threshold more objectively.

The determination of the start and end dates of flood events is the key to characterise the flood process. One of the widely used ways to address this issue is the conceptual graphical approach, in which the start date is usually marked by an abrupt increase in the hydrograph, and the end date can be determined by the flattening of the hydrograph's recession limb (Tosunoglu et al., 2020; Y. R. Liu et al., 2020; Sheng Yue, 2000). However, there is a certain degree of subjectivity involved in identifying the start and end dates. To reduce the influence of human decisions, a large number of relevant studies have used a simplified approach (Vittal et al., 2015; Brunner et al., 2019; Mediero et al., 2010). For example, Vittal et al. (2015) suggested that the intersections of the threshold line and flow hydrograph correspond to the start and end points of a flood event. To date, there is no objective method for addressing this problem without human intervention due to the complexity of flooding events.

Furthermore, to determine the flood volume and hydrograph, flood events need to be extracted from the flow hydrograph using baseflow separation methods (Smakhtin 2001; Sujono et al., 2004; Tallaksen 1995). Generally, simplified methods can be used to separate flood hydrographs. For example, the flood volume was simply estimated using a straight line to separate direct runoff from baseflow (Tosunoglu et al., 2020; Aissia et al., 2012; Yue, 2000). In addition, the master recession curve, a graphical method, has been widely used to describe the discharge-storage relationship of watersheds, and some researchers have proposed different functional models for obtaining the master recession curve, such as linear (Sujono et al., 2004), and power function relation (Carlotto and Chaffe 2019). Furthermore, this method allows the extraction of multiple flood events over a long period by extending the recession process (Beven et al., 2011; Lamb and Keith, 1997; Sujono et al., 2004); however, it only finds scarce use in characterising flood events (such as calculating flood volume) due to its complexity.

Both flood sampling and identification of flood characteristics are often performed manually, which is tedious and requires expertise. It also gives rise to large uncertainties in subsequent flood analyses. Moreover, the manual method would be very inefficient when applied to a large amount of data (Solari et al., 2017; Carlotto and Chaffe 2019; Arciniega-Esparza et al., 2017; Durocher et al., 2018). Therefore, it is necessary to develop an automatic generic procedure for selecting flood events and identifying flood characteristics with minimal human intervention that can be applied efficiently to extensive data.

Hence, we developed an automatic generic procedure to objectively select the threshold for flood sampling based on the POT model, and identify flood characteristics according to the master recession curve method, which is a generalised standardised approach that can be applied to all watersheds with diverse characteristics. We also construct a graphical user interface (GUI) for this procedure in MATLAB. We tested and validated the proposed procedure using MOPEX data (Duan et al., 2006). We expect to improve the selection of flood events and the determination of flood characteristics by using our automatic procedure, thereby reducing uncertainties in subsequent flood analyses, such as multivariable flood frequency and flood trend analyses.

Section snippets

Methods

A flowchart of the automatic generic procedure for selecting flood events and identifying flood characteristics is shown in Fig. 1. The details of each method are described in subsequent sections.

Development of the GUI

In this study, a MATLAB toolbox (i.e. SFE_IFC) was developed to perform the automatic procedure, and the App Designer embedded in MATLAB was used to build the GUI. The GUI for selecting flood events and identifying the flood characteristics is shown in Fig. 3. The functions and procedures of the SFE_IFC toolbox are presented in detail in this section.

The SFE_IFC toolbox contains four main components: selecting flood event panels, determining flood characteristics panels, graph windows, and

Materials and validation methods

The model parameter estimation experiment (MOPEX) dataset (https://www.nws.noaa.gov/oh/mopex/mo_datasets.htm) (Duan et al., 2006) was employed to evaluate the proposed method described in Section 2. Daily streamflow data are available for 438 catchments, ranging from 67 to 10329 km2 across the United States. We selected 423 stations with 15 years of data to validate the method.

In order to verify the adequacy of this procedure for basins with diverse characteristics, the 423 stations were

Discussion

As the POT model can capture more information about flood processes than the AMS method, it is widely used in flood sampling (Lang et al., 1999). In this study, the PPY values fluctuated around 2, and exhibited smaller variance for the large basins, which is consistent with the results from previous literature (Claps and Laio 2003). As for the dispersion index, the value of moist basins was closer to 1 and lay within the 95% confidence intervals for almost all moist basins, indicating that the

Conclusions

In this study, we constructed a generic framework of automatic procedures for selecting flood events and identifying flood characteristics using an automatic-threshold-based POT model for flood sampling. More importantly, we proposed a generic automatic approach to determine flood characteristics using the MRC analysis methods. Furthermore, we developed a GUI (i.e. the SFE_IFC toolbox) based on the MATLAB App Designer. We validated the proposed method using 423 MOPEX dataset stations. We

Software and data availability

Toolbox name: SFE_IFC (select flood events and identify flood characteristics).

Software required: MATLAB R2019a and above.

Program language: MATLAB.

Contact email: [email protected].

Trial version link: https://github.com/Zhang-Qin-0925/SFE_IFC-Toolbox/find/main.

Validation data: the Model Parameter Estimation Experiment (MOPEX) dataset (https://www.nws.noaa.gov/oh/mopex/mo_datasets.htm).

Declaration of competing interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgements

This study was supported by the National Key Research and Development Program of China (No.2017YFA0603704), Major projects of National Natural Science Foundation of China (No.41890824), the Strategic Priority Research Program of the Chinese Academy of Sciences (Grant No. XDA23040103), the Strategic Priority Research Program of the Chinese Academy of Sciences (Grant No. XDA23040500), the National Natural Science Foundation of China (No.51809008).

References (63)

  • Linyao Qiu et al.

    An integrated flood management system based on linking environmental models and disaster-related data

    Environ. Model. Software

    (2017)
  • V.U. Smakhtin

    Low flow hydrology: a review

    J. Hydrol.

    (2001)
  • L. Tallaksen

    A review of baseflow recession analysis

    J. Hydrol.

    (1995)
  • H. Vittal et al.

    A framework for multivariate data-based at-site flood frequency analysis: essentiality of the conjugal application of parametric and nonparametric approaches

    J. Hydrol.

    (2015)
  • S. Yue et al.

    The gumbel mixed model for flood frequency analysis

    J. Hydrol.

    (1999)
  • Sidong Zeng et al.

    Development of an interface-oriented add-in modeling framework for integrated water system simulation and its application

    Environ. Model. Software

    (2020)
  • Pradeep Adhikari et al.

    A digitized global flood inventory (1998-2008): compilation and preliminary results

    Nat. Hazards

    (2010)
  • M.-A. Ben Aissia et al.

    Multivariate Analysis of flood characteristics in a climate change context of the watershed of the Baskatong Reservoir, Province of Québec, Canada

    Hydrol. Process.

    (2012)
  • P. Bernardara et al.

    A two-step framework for over-threshold modelling of environmental extremes

    Nat. Hazards Earth Syst. Sci.

    (2014)
  • K. Beven et al.

    On the colour and spin of epistemic error (and what we might do about it)

    Hydrol. Earth Syst. Sci.

    (2011)
  • Günter Blöschl et al.

    Changing climate both increases and decreases European river floods

    Nature

    (2019)
  • Jens A. de Bruijn et al.

    A Global Database of Historic and Real-Time Flood Events Based on Social Media

    (2019)
  • Manuela I. Brunner et al.

    Synthetic design hydrographs for ungauged catchments: a comparison of regionalization methods

    Stoch. Environ. Res. Risk Assess.

    (2018)
  • Manuela I. Brunner et al.

    “Future trends in the interdependence between flood peaks and volumes: hydro‐climatological drivers and uncertainty

    Water Resour. Res.

    (2019)
  • V. Choulakian et al.

    Goodness-of-Fit tests for the generalized Pareto distribution

    Technometrics

    (2001)
  • P. Claps et al.

    Can continuous streamflow data support flood frequency analysis? An alternative to the partial duration series approach

    Water Resour. Res.

    (2003)
  • Aimé. Coutagne

    Météorologie et Hydrologie - Etude Générale Des Débits et Des Facteurs Qui Les Conditionnent

    La Houille Blanche

    (1948)
  • C. Cunnane

    A note on the Poisson assumption in partial duration series models

    Water Resour. Res.

    (1979)
  • A.C. Davison et al.

    Models for exceedances over high thresholds

    J. Roy. Stat. Soc. B

    (1990)
  • Q. Duan et al.

    Model parameter estimation experiment (MOPEX): an overview of science strategy and major results from the second and third workshops

    In Journal of Hydrology

    (2006)
  • Martin Durocher et al.

    Comparison of automatic procedures for selecting flood peaks over threshold based on goodness-of-fit tests

    Hydrol. Process.

    (2018)
  • Cited by (17)

    View all citing articles on Scopus
    View full text