Automatic procedure for selecting flood events and identifying flood characteristics from daily streamflow data

doi:10.1016/j.envsoft.2021.105180

Environmental Modelling & Software

Volume 145, November 2021, 105180

https://doi.org/10.1016/j.envsoft.2021.105180

Highlights

• We propose a generic standardized method for identifying the flood characteristics.
• An automatic procedure to select flood events and identify flood characteristics is constructed.
• We developed a graphical user interface (GUI) for this automatic procedure in MATLAB.
• The validation results show that our method has good applicability to watersheds with diverse characteristics.

Abstract

The selection of flood events and determination of flood characteristics (e.g. start and end dates, peak discharge, volume, and duration) are the first critical steps for flood analyses. To obtain the key flood information accurately, we used the automatic peak over threshold (POT) model for flood sampling and proposed an automatic approach to determine flood characteristics using the master recession curve analysis (MRC) method. We further developed a graphical user interface (GUI) and toolbox for this procedure in MATLAB. Model parameter estimation experiment (MOPEX) data from 423 stations were used to evaluate the proposed method. Our results suggest that the proposed procedure performs well for watersheds with diverse characteristics. The developed toolbox can be conveniently applied to other watersheds for flood sampling and the characterisation of flood events, thus helping reduce the uncertainty in subsequent flood analyses, such as multivariable flood frequency and trend analyses.

Introduction

Floods are serious natural hazards, causing significant damage and affecting millions of people worldwide (Kauffeldt et al., 2016; Bruijn et al., 2019; Qiu et al., 2017). Moreover, with the increase in atmospheric water-holding capacity due to global warming, extreme weather events, especially flood events, occur more frequently (Blöschl et al., 2019; Adhikari et al., 2010; Tanoue et al., 2016). To reduce flood damage and economic losses, it is essential to accurately investigate the variability of flood events and assess flood risks (Zeng et al., 2020). The precise selection of flood events and identification of flood characteristics from daily streamflow data are the most critical steps for subsequent flood-related research (e.g. multivariable flood frequency analysis and trend analysis) (Karahacane et al., 2020). However, such work still lacks comprehensive and feasible approaches (Karahacane et al., 2020), which highlights the importance of developing a framework for selecting flood events and identifying flood characteristics.

Flooding is a multivariate stochastic phenomenon generally described by variables such as flood peak, flood volume, and flood duration (Mediero et al., 2010). The determination of flood characteristics has not yet been comprehensively studied, although various simplified methods have been used to obtain them in previous studies (Vittal et al., 2015; Jeong et al., 2014; Nadarajah and Shiau 2005). Three main challenges should be considered for the accurate and objective identification of flood characteristics: the selection of suitable flood samples, accurate identification of the start and end dates of flood events, and extension of the recession process to separate flood events from the observed discharge.

The annual maximum series (AMS) approach and the POT method are widely used for flood sampling (S. Solari and Losada, 2012). The POT method, which can capture more information about flood processes from the daily streamflow data and reduce the uncertainty of flood frequency analysis (Lang et al., 1999), has been widely used for flood risk estimation in the past few decades (Durocher et al., 2018; Durocher et al., 2019; Aissia et al., 2012). However, the flood samples obtained using the POT model depend largely on the selection of the threshold. Reliable threshold selection methods, such as the fixed quantile (Jeong et al., 2014; S. Solari and Losada, 2012), mean number of over-threshold events (Brunner et al., 2018, 2019), mean exceedance above the threshold (Davison and Smith, 1990; Lang et al., 1999), and the automatic threshold selection method (Liang et al., 2019; S. Solari and Losada, 2012), can result in a suitable flood sample. Automatic procedures, which determine the threshold according to the degree of fit between the hypothetical distribution and the flood sample, have proven to be effective (Solari et al., 2017; Durocher et al., 2018; S. Solari and Losada, 2012). Durocher et al. (2018) compared the efficiency of different threshold selection methods and indicated that automatic threshold selection based on the goodness-of-fit (GOF) test can obtain a reasonable optimal threshold more objectively.
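To make the sampling step concrete, the following is a minimal Python sketch of POT sampling with a fixed-quantile threshold, one of the simpler options listed above. It is not the automatic GOF-based selection used in this study and not the SFE_IFC implementation. The function names, the 0.95 default quantile, and the 7-day minimum separation between peaks are illustrative assumptions.

```python
import numpy as np


def quantile_threshold(q, prob=0.95):
    """Fixed-quantile threshold of a daily discharge series (prob is an assumption)."""
    return float(np.quantile(np.asarray(q, dtype=float), prob))


def pot_peaks(q, threshold, min_gap_days=7):
    """Return indices of independent peaks over threshold in a daily series.

    Consecutive days above the threshold form one exceedance cluster, and the
    largest value in each cluster is its peak. If two peaks are closer than
    min_gap_days (a hypothetical independence criterion), only the larger
    one is kept.
    """
    q = np.asarray(q, dtype=float)
    above = q > threshold
    peaks = []
    i, n = 0, len(q)
    while i < n:
        if not above[i]:
            i += 1
            continue
        # Extend j to the last day of the current exceedance cluster.
        j = i
        while j + 1 < n and above[j + 1]:
            j += 1
        k = i + int(np.argmax(q[i:j + 1]))
        if peaks and k - peaks[-1] < min_gap_days:
            # Too close to the previous peak: keep whichever is larger.
            if q[k] > q[peaks[-1]]:
                peaks[-1] = k
        else:
            peaks.append(k)
        i = j + 1
    return peaks
```

Lowering the threshold or the minimum separation yields more, but less independent, events. This trade-off is what the threshold selection methods discussed above try to resolve objectively.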

The determination of the start and end dates of flood events is key to characterising the flood process. One widely used way to address this issue is the conceptual graphical approach, in which the start date is usually marked by an abrupt increase in the hydrograph, and the end date is determined by the flattening of the hydrograph's recession limb (Tosunoglu et al., 2020; Y. R. Liu et al., 2020; Sheng Yue, 2000). However, a certain degree of subjectivity is involved in identifying the start and end dates. To reduce the influence of human decisions, many studies have used a simplified approach (Vittal et al., 2015; Brunner et al., 2019; Mediero et al., 2010). For example, Vittal et al. (2015) suggested that the intersections of the threshold line and flow hydrograph correspond to the start and end points of a flood event. To date, there is no objective method for addressing this problem without human intervention due to the complexity of flooding events.
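The simplified threshold-intersection idea can be sketched as follows. This is an illustration only, not the MRC-based approach proposed in this study. It assumes daily data, so the intersections are approximated by the first and last days above the threshold around a given peak; the function name is hypothetical.

```python
import numpy as np


def event_bounds_by_threshold(q, peak_idx, threshold):
    """Start and end indices of the run of days above threshold around a peak.

    With daily data, the intersections of the threshold line and the
    hydrograph are approximated by the first and last day on which the
    discharge exceeds the threshold (an assumption of this sketch).
    """
    q = np.asarray(q, dtype=float)
    if not q[peak_idx] > threshold:
        raise ValueError("the peak must exceed the threshold")
    start = peak_idx
    while start > 0 and q[start - 1] > threshold:
        start -= 1
    end = peak_idx
    while end < len(q) - 1 and q[end + 1] > threshold:
        end += 1
    return start, end


# Example: flood duration in days for a small synthetic hydrograph.
start, end = event_bounds_by_threshold([1, 2, 5, 9, 6, 4, 2, 1], peak_idx=3, threshold=3)
duration_days = end - start + 1
```

Because the bounds depend entirely on the chosen threshold, this approach tends to cut off the rising limb and the recession tail below the threshold, which is one reason a more objective procedure is sought.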

Furthermore, to determine the flood volume and hydrograph, flood events need to be extracted from the flow hydrograph using baseflow separation methods (Smakhtin 2001; Sujono et al., 2004; Tallaksen 1995). Generally, simplified methods can be used to separate flood hydrographs. For example, the flood volume has been estimated simply by using a straight line to separate direct runoff from baseflow (Tosunoglu et al., 2020; Aissia et al., 2012; Yue, 2000). In addition, the master recession curve, a graphical method, has been widely used to describe the discharge-storage relationship of watersheds, and researchers have proposed different functional models for obtaining it, such as linear (Sujono et al., 2004) and power-function (Carlotto and Chaffe 2019) relations. This method also allows the extraction of multiple flood events over a long period by extending the recession process (Beven et al., 2011; Lamb and Keith, 1997; Sujono et al., 2004); however, owing to its complexity, it is rarely used to characterise flood events (for example, to calculate flood volume).
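The two ideas in this paragraph can be illustrated with a short Python sketch under stated assumptions. The first function estimates the flood volume with the straight-line separation, assuming daily mean discharge in m³/s. The second fits the recession constant of a linear storage-discharge relation, for which the recession is exponential, Q(t) = Q0 exp(-t/k). Both function names are hypothetical, and neither reproduces the MRC procedure of the SFE_IFC toolbox.

```python
import numpy as np

SECONDS_PER_DAY = 86400.0


def flood_volume_straight_line(q, start, end):
    """Direct-runoff volume (m^3) between two inclusive day indices.

    q is assumed to be daily mean discharge in m^3/s. Baseflow is
    approximated by a straight line joining q[start] and q[end];
    days on which discharge falls below that line contribute zero.
    """
    q = np.asarray(q, dtype=float)
    seg = q[start:end + 1]
    baseflow = np.linspace(seg[0], seg[-1], len(seg))
    direct = np.clip(seg - baseflow, 0.0, None)
    return float(direct.sum() * SECONDS_PER_DAY)


def fit_recession_constant(recession):
    """Least-squares estimate of k (days) in Q(t) = Q0 * exp(-t / k).

    recession is a strictly positive, decreasing run of daily discharges.
    The fit is a straight line through log(Q) against time in days.
    """
    recession = np.asarray(recession, dtype=float)
    t = np.arange(len(recession))
    slope, _intercept = np.polyfit(t, np.log(recession), 1)
    return float(-1.0 / slope)
```

With a fitted k, a recession limb that is interrupted by the next event could be extended as Q0 exp(-t/k) until it reaches the baseflow level, which is the sense in which extending the recession process separates overlapping events.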

Both flood sampling and identification of flood characteristics are often performed manually, which is tedious and requires expertise. It also gives rise to large uncertainties in subsequent flood analyses. Moreover, the manual method would be very inefficient when applied to a large amount of data (Solari et al., 2017; Carlotto and Chaffe 2019; Arciniega-Esparza et al., 2017; Durocher et al., 2018). Therefore, it is necessary to develop an automatic generic procedure for selecting flood events and identifying flood characteristics with minimal human intervention that can be applied efficiently to extensive data.

Hence, we developed an automatic generic procedure to objectively select the threshold for flood sampling based on the POT model and to identify flood characteristics according to the master recession curve method. This generalised, standardised approach can be applied to watersheds with diverse characteristics. We also constructed a graphical user interface (GUI) for this procedure in MATLAB. We tested and validated the proposed procedure using MOPEX data (Duan et al., 2006). We expect our automatic procedure to improve the selection of flood events and the determination of flood characteristics, thereby reducing uncertainties in subsequent flood analyses, such as multivariable flood frequency and flood trend analyses.

Methods

A flowchart of the automatic generic procedure for selecting flood events and identifying flood characteristics is shown in Fig. 1. The details of each method are described in subsequent sections.

Development of the GUI

In this study, a MATLAB toolbox (i.e. SFE_IFC) was developed to perform the automatic procedure, and the App Designer embedded in MATLAB was used to build the GUI. The GUI for selecting flood events and identifying the flood characteristics is shown in Fig. 3. The functions and procedures of the SFE_IFC toolbox are presented in detail in this section.

The SFE_IFC toolbox contains four main components: selecting flood event panels, determining flood characteristics panels, graph windows, and

Materials and validation methods

The model parameter estimation experiment (MOPEX) dataset (https://www.nws.noaa.gov/oh/mopex/mo_datasets.htm) (Duan et al., 2006) was employed to evaluate the proposed method described in Section 2. Daily streamflow data are available for 438 catchments across the United States, with areas ranging from 67 to 10,329 km². We selected 423 stations with 15 years of data to validate the method.

In order to verify the adequacy of this procedure for basins with diverse characteristics, the 423 stations were

Discussion

As the POT model can capture more information about flood processes than the AMS method, it is widely used in flood sampling (Lang et al., 1999). In this study, the PPY values fluctuated around 2, and exhibited smaller variance for the large basins, which is consistent with the results from previous literature (Claps and Laio 2003). As for the dispersion index, the value of moist basins was closer to 1 and lay within the 95% confidence intervals for almost all moist basins, indicating that the

Conclusions

In this study, we constructed a generic framework of automatic procedures for selecting flood events and identifying flood characteristics using an automatic-threshold-based POT model for flood sampling. More importantly, we proposed a generic automatic approach to determine flood characteristics using the MRC analysis methods. Furthermore, we developed a GUI (i.e. the SFE_IFC toolbox) based on the MATLAB App Designer. We validated the proposed method using 423 MOPEX dataset stations. We

Software and data availability

Toolbox name: SFE_IFC (select flood events and identify flood characteristics).

Software required: MATLAB R2019a and above.

Program language: MATLAB.

Contact email: [email protected].

Trial version link: https://github.com/Zhang-Qin-0925/SFE_IFC-Toolbox/find/main.

Validation data: the Model Parameter Estimation Experiment (MOPEX) dataset (https://www.nws.noaa.gov/oh/mopex/mo_datasets.htm).

Declaration of competing interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgements

This study was supported by the National Key Research and Development Program of China (No. 2017YFA0603704), the Major Projects of the National Natural Science Foundation of China (No. 41890824), the Strategic Priority Research Program of the Chinese Academy of Sciences (Grant Nos. XDA23040103 and XDA23040500), and the National Natural Science Foundation of China (No. 51809008).

References (63)

Saúl Arciniega-Esparza et al. HYDRORECESSION: A Matlab Toolbox for Streamflow Recession Analysis. Comput. Geosci. (2017)
T. Carlotto et al. Master recession curve parameterization tool (MRCPtool): different approaches to recession curve analysis. Comput. Geosci. (2019)
Alireza Daneshkhah et al. Probabilistic modeling of flood characterizations with parametric and minimum information pair-copula model. J. Hydrol. (2016)
Hugh P. Duncan. Baseflow separation – a practical approach. J. Hydrol. (2019)
A. Kauffeldt et al. Technical review of large-scale hydrological models for implementation in operational flood forecasting schemes on continental level. Environ. Model. Software (2016)
Bingchen Liang et al. An automated threshold selection method based on the characteristic of extrapolated significant wave heights. Coastal Engineering (2019)
Jianyu Liu et al. Multi-temporal clustering of continental floods and associated atmospheric circulations. J. Hydrol. (2017)
Franck Mazas et al. A multi-distribution approach to POT methods for determining extreme wave heights. Coastal Engineering (2011)
Nasser Najibi et al. Hydroclimate drivers and atmospheric teleconnections of long duration floods: an application to large reservoirs in the Missouri river basin. Adv. Water Resour. (2017)
B. Önöz et al. Effect of the occurrence process of the peaks over threshold on the flood estimates. J. Hydrol. (2001)
Linyao Qiu et al. An integrated flood management system based on linking environmental models and disaster-related data. Environ. Model. Software (2017)
V.U. Smakhtin. Low flow hydrology: a review. J. Hydrol. (2001)
L. Tallaksen. A review of baseflow recession analysis. J. Hydrol. (1995)
H. Vittal et al. A framework for multivariate data-based at-site flood frequency analysis: essentiality of the conjugal application of parametric and nonparametric approaches. J. Hydrol. (2015)
S. Yue et al. The gumbel mixed model for flood frequency analysis. J. Hydrol. (1999)
Sidong Zeng et al. Development of an interface-oriented add-in modeling framework for integrated water system simulation and its application. Environ. Model. Software (2020)
Pradeep Adhikari et al. A digitized global flood inventory (1998-2008): compilation and preliminary results. Nat. Hazards (2010)
M.-A. Ben Aissia et al. Multivariate Analysis of flood characteristics in a climate change context of the watershed of the Baskatong Reservoir, Province of Québec, Canada. Hydrol. Process. (2012)
P. Bernardara et al. A two-step framework for over-threshold modelling of environmental extremes. Nat. Hazards Earth Syst. Sci. (2014)
K. Beven et al. On the colour and spin of epistemic error (and what we might do about it). Hydrol. Earth Syst. Sci. (2011)
Günter Blöschl et al. Changing climate both increases and decreases European river floods. Nature (2019)
Jens A. de Bruijn et al. A Global Database of Historic and Real-Time Flood Events Based on Social Media. (2019)
Manuela I. Brunner et al. Synthetic design hydrographs for ungauged catchments: a comparison of regionalization methods. Stoch. Environ. Res. Risk Assess. (2018)
Manuela I. Brunner et al. Future trends in the interdependence between flood peaks and volumes: hydro‐climatological drivers and uncertainty. Water Resour. Res. (2019)
V. Choulakian et al. Goodness-of-Fit tests for the generalized Pareto distribution. Technometrics (2001)
P. Claps et al. Can continuous streamflow data support flood frequency analysis? An alternative to the partial duration series approach. Water Resour. Res. (2003)
Aimé Coutagne. Météorologie et Hydrologie - Etude Générale Des Débits et Des Facteurs Qui Les Conditionnent. La Houille Blanche (1948)
C. Cunnane. A note on the Poisson assumption in partial duration series models. Water Resour. Res. (1979)
A.C. Davison et al. Models for exceedances over high thresholds. J. Roy. Stat. Soc. B (1990)
Q. Duan et al. Model parameter estimation experiment (MOPEX): an overview of science strategy and major results from the second and third workshops. J. Hydrol. (2006)
Martin Durocher et al. Comparison of automatic procedures for selecting flood peaks over threshold based on goodness-of-fit tests. Hydrol. Process. (2018)
