A comparative study of population-based optimization algorithms for downstream river flow forecasting by a hybrid neural network model

https://doi.org/10.1016/j.engappai.2015.09.010Get rights and content

Highlights

  • Comparison of performances of population-based optimization algorithms in forecasting downstream river flow.

  • Differential evolution (DE), artificial bee colony and ant colony optimization (ACO).

  • Particle swarm optimization is included as a benchmark comparison for forecasting performances.

  • Hybrid neural network (HNN) model incorporating fuzzy pattern-recognition and continuity equation.

  • DE and ACO algorithms are considerably more adaptive in optimizing the forecasting problem for the HNN model.

Abstract

Population-based optimization algorithms have been successfully applied to hydrological forecasting recently owing to their powerful ability of global optimization. This paper investigates three algorithms, i.e. differential evolution (DE), artificial bee colony (ABC) and ant colony optimization (ACO), to determine the optimal one for forecasting downstream river flow. A hybrid neural network (HNN) model, which incorporates fuzzy pattern-recognition and a continuity equation into the artificial neural network, is proposed to forecast downstream river flow based on upstream river flows and areal precipitation. The optimization algorithm is employed to determine the premise parameters of the HNN model. Daily data from the Altamaha River basin of Georgia is applied in the forecasting analysis. Discussions on the forecasting performances, convergence speed and stability of various algorithms are presented. For completeness׳ sake, particle swarm optimization (PSO) is included as a benchmark case for the comparison of forecasting performances. Results show that the DE algorithm attains the best performance in generalization and forecasting. The forecasting accuracy of the DE algorithm is comparable to that of the PSO, and yet presents weak superiority over the ABC and ACO. The Diebold–Mariano (DM) test indicates that each pair of algorithms has no difference under the null hypothesis of equal forecasting accuracy. The DE and ACO algorithms are both favorable for searching parameters of the HNN model, including the recession coefficient and initial storage. Further analysis reveals the drawback of slow convergence and time-consumption of the ABC algorithm. The three algorithms present stability and reliability with respect to their control parameters on the whole. It can be concluded that the DE and ACO algorithms are considerably more adaptive in optimizing the forecasting problem for the HNN model.

Introduction

River flow forecasting is a prerequisite for many water resource applications such as flood warning and reservoir design. The hydrological process of river flows is so complex that simple data-driven models are not able to describe its behavior. It is therefore worth investigating suitable models for the highly-nonlinear and seasonal river flows. The reasoning ability of fuzzy-based neural networks has led to an increasing interest within the hydrology community. So far a number of neuro-fuzzy computing techniques and fuzzy set theories have already been applied in hydrological modeling (Li and Chen, 2010, Tabari et al., 2012, Rath et al., 2013). Incorporating a fuzzy concept activation function into an artificial neural network (ANN) model is a good alternative to the latter since the former can be flexibly implemented. Qiu et al. (1998) introduced a fuzzy pattern-recognition activation function (from the input layer to the hidden layer) into an ANN model to forecast annual runoff. This function classified runoff into a number of categories during wet and dry seasons, and therefore reflected nonlinear and seasonal characteristics of the river system. Zhao and Chen (2008) developed a mixed forecasting model combining neural networks with fuzzy pattern-recognition, considering the fuzziness in the concept of similar basins. This practice, in which activation functions in the ANN structure are modified with characterization, is more easily executed and also addresses the fuzzy behavior of river flows. In this paper, this type of fuzzy pattern-recognition activation function is introduced into a hybrid neural network (HNN) model, representing uncertainties of the river flow problem.

As an important issue in hydrological modeling, conceptual models have been developed for river flow forecasting over the past few decades. Rainfall-runoff models have extensively emerged in river flow forecasting by considering simplified forms of physical laws (Moore, 2007, Bhadra et al., 2010, Rezaeianzadeh et al., 2013). Another practical approach is to directly integrate physical equations into the ANN models. Yang et al. (1998) developed a hydrological modeling network (HYMN) model using the continuity equation and considering nodes in the hidden or output layers as storage reservoirs. This method has been successful applied to compute the monthly river flow at the Salford University station in the Irwell River basin using daily evaporation data and precipitation records from six upstream stations. Li and Gu (2003) further expanded the HYMN model to sediment yield forecasting. These investigations are rather limited, whereas they attain the possibility of directly integrating fundamental physical principles into an ANN structure. This paper contributes to assembling the fuzzy pattern-recognition concept and continuity equation into the ANN framework, with an explicit emphasis on the seasonal and non-stationary features of the river flows.

The proposed forecasting model is highly nonlinear and varying with time. Traditional optimization algorithms, e.g. the Gauss–Newton algorithm and gradient descent method are not applicable for this non-differentiable and multi-dimensional problem. Recently, population-based optimization algorithms have attracted the interest of many researchers (Blum and Roli, 2003, Chiong et al., 2012). They are characterized by a population consisting of possible solutions to the problem, which are modified by applying different types of operators and thus moving towards a near-optimal solution area. These algorithms are very powerful in finding a global optimum since they simultaneously search in many directions by using a population of possible solutions. Generally, there are two categories of population-based optimization algorithms: evolutionary algorithms and swarm intelligence algorithms (Blum et al., 2012). Typical methods of evolutionary algorithms are the genetic algorithm (GA) and differential evolution (DE), which attempt to simulate natural evolution. The DE algorithm was proposed by Storn and Price (1995) and has been applied extensively in hydrological modeling (e.g., Babu and Angira, 2003; Vasan and Simonovic, 2010; Liu and Pender, 2013). It shows better performances than the GA in terms of convergence characteristics and computational efficiency (Wang et al., 2008, Li et al., 2013, Song et al., 2014). In view of its ability to handle optimization problems that are non-differentiable, nonlinear, non-continuous and varying with time (Rocca et al., 2011), it is adopted in this paper for the comparative study.

The second category, swarm intelligence-based algorithms are inspired by the collective behavior of animal societies, including particle swarm optimization (PSO), artificial bee colony (ABC) and ant colony optimization (ACO). This paper focuses on ABC and ACO as stochastic global optimization algorithms. The ABC algorithm was introduced and popularized by Karaboga (2005) to solve numerical optimization problems. It has predictive capability comparable to the GA, PSO and DE algorithms on numerical test functions (Karaboga and Akay, 2009). Hybrid models that combine ABC algorithms with ANNs have been developed recently (Karaboga et al., 2007, Kisi et al., 2012). The ABC algorithm has been found to apply in many fields, e.g. HVAC systems (Zhang et al., 2013), reservoir optimum problems (Hossain and El-shafie, 2013) and protein structure optimization (Li et al., 2015). Another swarm-based optimization method ACO was derived from the food searching behavior of ants (Dorigo et al., 1996). Similar to the ABC algorithm, it is a meta-heuristic technique available to solve non-linear optimization problems with high dimensionality and inequality constraints. Coupling ACO algorithms with feed-forward neural network training has proven to be successful (Li and Chung, 2005, Shelokar et al., 2007, Socha and Blum, 2007). The potential to apply ACO to the field of river flow forecasting is clear, e.g., see its application in water resource problems (Maier et al., 2003, Jalali et al., 2006, Kumar and Reddy, 2006).

Optimization of neural networks has always been an open research. It is imperative to solve the disadvantages of traditional learning algorithms, such as poor generalization, slow convergence speed and easily plunging into local optima. The main objective of this paper is therefore to incorporate population-based optimization algorithms (i.e. DE, ABC and ACO) into a HNN model and compare their optimization ability, stability and reliability, and thereby determine the most adaptive optimization algorithm for the river flow forecasting problem.

The rest of this paper is structured in the following manner. A description of the HNN model for downstream river flow forecasting is presented in Section 2. In Section 3, a brief review of the three population-based optimization algorithms is provided. Section 4 introduces the case study site, while Section 5 summaries the computational results and comparisons of different optimization algorithms. Finally, the conclusion is reported in Section 6.

Section snippets

Hybrid neural network (HNN) model

The ANN is an efficient data-driven model for real-time forecasting. For a typical three-layer feed-forward ANN, the nodes in the input layer (input data introduced to the network) are linked with a predetermined activation function to those in the hidden layer, and then to the nodes in the output layer with a similar operation. An objective function with premise parameters (to be adjusted) is defined by comparing the difference between computed and target outputs. The fitness value of the

Differential evolution algorithm

The DE algorithm conducts mutation, crossover and selection operations based on the differences of randomly sampled pairs of solutions in the population. Mutation operation acts as a search mechanism while the crossover operator recombines the parent vector with the mutated one. Afterwards, all solutions have an equal chance of being selected as parent. The strategies are described as follows.

Firstly, define a population of D dimensional parameter vectors, with NP population size. Each

Study case and data preparation

The study area for the present study is the Altamaha River basin, situated on the Atlantic coast in the U.S. state of Georgia. The Yellow River flows southward in the basin, as a tributary of the Ocmulgee River. The locations of four river stations along the Yellow River in the watershed are shown in Fig. 2. Daily river discharge and precipitation data is taken from these four hydrological stations. The simplified letters Q and P refer to discharge and precipitation, respectively. In addition,

Results and discussion

A complete assessment of model performances is performed employing three evaluation measures. These include: the root mean square error (RMSE), the Nash–Sutcliffe efficiency coefficient (NSEC) and the accuracy (ACC). They are formulated as follows:RMSE=1Ni=1N(QiQ^i)2NSEC=1i=1N(QiQ^i)2/i=1N(QiQ¯)2ACC=11Ni=1N|QiQ^i|Qiin which Qi and Q^i are respectively observed and computed discharges, Q¯ is the averaged observed data, and N is the number of observations. The RMSE is an absolute error

Conclusions

In this work, the performances of three population-based optimization algorithms (i.e. DE, ABC and ACO algorithms) for evolving a HNN model are compared. The purpose is to determine a relatively reliable and stable optimization algorithm for downstream river forecasting. For completeness׳ sake, the PSO algorithm is included as a benchmark case for the comparison of forecasting performances. The HNN model is triggered for the forecasting, by integrating fuzzy pattern-recognition and a continuity

Acknowledgments

This research was supported by Central Research Grant of Hong Kong Polytechnic University (4-ZZAD).

References (49)

  • H. Tabari et al.

    SVM, ANFIS, regression and climate based models for reference evapotranspiration modeling using limited climatic data in a semi-arid highland environment

    Journal of Hydrology

    (2012)
  • T. Xiong et al.

    Beyond one-step-ahead forecasting: evaluation of alternative multi-step-ahead forecasting models for crude oil prices

    Energy Economics

    (2013)
  • T. Xiong et al.

    Forecasting interval time series using a fully complex-valued RBF neural network with DPSO and PSO algorithms

    Information Sciences

    (2015)
  • Babu, B.V., Angira, R., 2003. Optimization of water pumping system using differential evolution strategies. In:...
  • Y. Bao et al.

    PSO-MISMO modeling strategy for multistep-ahead time series prediction

    Cybernetics, IEEE Transactions on

    (2014)
  • A. Bhadra et al.

    Rainfall-runoff modeling: comparison of two approaches with different data requirements

    Water Resources Management

    (2010)
  • C. Blum et al.

    Evolutionary optimization

  • C. Blum et al.

    Metaheuristics in combinatorial optimization: overview and conceptual comparison

    ACM Computing Surveys

    (2003)
  • R. Chiong et al.

    Variants of evolutionary algorithms for real-world applications

    2012

    (2012)
  • F.X. Diebold et al.

    Comparing predictive accuracy

    Journal of Business & Economic Statistics

    (1995)
  • M. Dorigo et al.

    Ant system: optimization by a colony of cooperating agents

    Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on

    (1996)
  • M.S. Hossain et al.

    Application of artificial bee colony (ABC) algorithm in search of optimal release of Aswan High Dam

    Journal of Physics: Conference Series

    (2013)
  • Z. Hu et al.

    Mid-term interval load forecasting using multi-output support vector regression with a memetic algorithm for feature selection

    Energy

    (2015)
  • M.R. Jalali et al.

    Reservoir operation by ant colony optimization algorithms

    Iranian Journal of Science and Technology, Transaction B: Engineering

    (2006)
  • Cited by (159)

    View all citing articles on Scopus
    View full text