Elsevier

Neurocomputing

Volume 70, Issues 7–9, March 2007, Pages 1525-1542

A locally recurrent fuzzy neural network with application to the wind speed prediction using spatial correlation

https://doi.org/10.1016/j.neucom.2006.01.032

Abstract

In this paper, a locally feedback dynamic fuzzy neural network (LF-DFNN) is suggested for modeling temporal processes. The model is composed of dynamic TSK-type fuzzy rules where the consequent sub-models are implemented by recurrent neural networks with internal feedback paths and dynamic neuron synapses. The LF-DFNN exhibits some interesting features, such as enhanced representation power, local modeling characteristics, model parsimony, and stable learning. Training of the LF-DFNN models is achieved using an optimal on-line learning scheme, the decoupled recursive prediction error algorithm (DRPE). The method has reduced computational demands and is derived through decomposition of the weight vector into several mutually exclusive weight groups. The partial derivatives required for the implementation of the training algorithm are calculated using the adjoint model approach, adapted to the fuzzy network architecture employed here. The paper deals with wind speed prediction in wind farms, using spatial information from remote measurement stations. The LF-DFNN networks are used as advanced forecast models, providing multi-step ahead wind speed estimates from 15 min to 3 h ahead. Extensive simulation results demonstrate that our models exhibit superior performance compared to other network types suggested in the literature. Furthermore, it is shown that DRPE outperforms three gradient descent algorithms in training the recurrent forecast models.

Introduction

Recently, considerable research has been directed to developing recurrent fuzzy neural networks for modeling of temporal processes, control, communications and pattern recognition. Depending on the way dynamics is introduced, these models can be classified into two major categories. The former class includes models with external feedback [12], [11], [26], [32], [22], [10], while the latter one includes fuzzy models with internal recurrence [17], [13], [14], [15], [18], [20]. In [12], an ANFIS structure with external feedback is used as a neuro-fuzzy controller, while in [11] a recurrent fuzzy system (RFS) is suggested, composed of several TSK models with crisp outputs which are interconnected through internal variables and feedback loops. A recurrent neuro-fuzzy model with multiple external feedback is suggested in [26], where the rule sub-models are linear polynomials of the inputs. The model inputs include the current external variables as well as delayed outputs of the model. A similar architecture is suggested in [32] with single-tap delay feedback and used to model a neutralization process. In [22] a dynamic fuzzy logic system (FLS) is proposed with multiple external feedback. Finally, both feedforward and recurrent NARMAX-type prediction models are developed in [10] using a fuzzy neural network approach. An efficient algorithm is suggested for structure determination and parameter identification, with the aim of improving the predictive performance of the obtained models.

With regard to the models belonging to the second class, the internal dynamics is usually introduced in the premise part of the model. In the recurrent fuzzy neural network (RFNN) suggested in [17], single-tap delay feedbacks are devised around the membership functions. A recurrent self-organizing neural fuzzy inference system (RSONFIN) is suggested in [13], [14]. An internal feedback loop is introduced in RSONFIN by circulating the firing strengths of the rules. The feedback path comprises the context nodes and the associated feedback nodes. The outputs of the context nodes provide the internal variables, used to memorize the temporal history paths of the network. The model is composed of Mamdani-type rules, implementing a dynamic fuzzy inference reasoning mechanism. Following a similar approach, a TSK recurrent fuzzy network is proposed in [15]. The model is composed of dynamic TSK fuzzy rules and is used for dynamic system modeling and control. In [18] a recurrent compensatory neuro-fuzzy system is proposed where feedback connections are introduced in the term nodes, acting as memory units.

In this paper, we suggest a novel recurrent fuzzy neural network, called the locally feedback dynamic fuzzy neural network (LF-DFNN), with the model dynamics introduced in the consequent part. The model is composed of recurrent fuzzy rules of TSK type. To improve the representation capabilities of the model, the rule sub-models are implemented by means of locally feedback multi-layered perceptron networks (LF-MLP) [2], [27], including dynamic neuron units. The neuron models embedded in the consequent level are realized by linear infinite impulse response (IIR) synaptic filters. The LF-DFNN model exhibits a number of interesting qualities, such as local modeling decomposition, rich temporal representation capabilities, model parsimony with reduced complexity, and stability monitoring throughout the learning process. Our approach extends the NARA [25] and the CANFIS [21] models, which employ static rule models, to the realm of recurrent networks. In [20], a DFNN structure is presented with similar characteristics, where single-layered recurrent neural networks are considered in the consequent part, including Frasconi–Gori–Soda neurons [9] with output feedback architecture. The present work is also an extension of the above model, with regard to the following aspects. The recurrent sub-models are extended to multi-layered architectures, thus enriching the model's structure and improving its representation power. Additionally, as opposed to the batch-type learning performed in [20], we focus on developing effective on-line learning schemes to address the wind-forecasting problem.

Training of recurrent networks is usually accomplished through the use of gradient descent algorithms, such as real-time recurrent learning (RTRL) [31] and backpropagation through time (BPTT) [30]. In the RTRL method, the gradients are calculated forwards as the network runs in time, through the use of the so-called sensitivity models, while in BPTT the gradient computations are performed in a backward direction, opposite to the signal flows. To account for the complexity of the LF-DFNN suggested here, we resort to the adjoint model approach [24], which provides a systematic framework for error backpropagation through arbitrary network structures. This is achieved by unfolding the network in time, which allows apportioning the errors among the weights at different time stages in the past. In contrast to RTRL learning, the adjoint approach is advantageous since it employs a unique model for calculating the error gradients with respect to all the adjustable parameters. The weight update is completed in a single backward run, thus reducing considerably the computational and storage demands. Nevertheless, the gradient descent methods exhibit long convergence times due to the small learning rates required, and often become trapped in local minima of the error function. In order to overcome the above shortcomings and to cope with the temporal complexities of our application, we employ an on-line decoupled recursive prediction error (DRPE) algorithm for training of the LF-DFNN models. The DRPE scheme is derived by dividing the weight set into several weight groups. A group is used for the premise parameters while the remaining ones are used for grouping of the adjustable weights of the rule sub-models, decomposed at the neuron level. Owing to the second-order information of the error covariance matrices, improved learning qualities are attained with regard to the speed of convergence and solution accuracy.

A real-world application is tackled in this paper: wind speed forecasting at a site located on the Gulf of Thessaloniki, Northern Greece. The objective is to determine multi-step wind predictions at the park site for up to 3 h ahead, using wind data measured at neighboring sites up to 30 km away (spatial correlation). The availability of efficient wind forecasts allows planning the connection or disconnection of wind turbines or conventional generators, thus attaining low spinning reserve and optimal operational costs. In the past, considerable efforts have been mainly focused on analyzing the wind speed time series of the site under investigation [8]. Nevertheless, the typical statistical properties of wind speed, such as non-stationarity, the gradually decreasing autocorrelation curve and the weak diurnal variation, are not helpful enough. Therefore, research activity was directed to spatial correlation studies, not always leading to satisfactory models. In [5], the spatial correlation of wind turbulence is considered for distances of 700 m–15 km and for time scales of 4, 10 and 30 min. It is concluded that the correlation coefficients strongly depend on the direction of the wind, terrain roughness and height above the ground. For distances of 20–100 km, a significant correlation of the hourly or daily average wind speeds has been recognized. This correlation decreases with distance [6] and with the topographical elevation difference [3]. Furthermore, a decrease of the correlation factor is noticed when the wind direction axis differs considerably from the distance vector connecting the measurement sites [23].

The wind forecast models suggested previously are static. To cope with the severe dynamics of the problem, we use the LF-DFNN networks as advanced forecast models producing multi-step wind speed estimates. The spatial and temporal locality of the forecast model, along with the richness of the network's dynamics, make it suitable for the identification of the temporal dependencies underlying the process. Additionally, to improve the prediction performance, we employ the DRPE algorithm for model training. The experimental setup includes comparisons of the LF-DFNN forecast models with other recurrent and static neuro-fuzzy network types, and with other prediction models used on the same application, previously reported in the literature. To demonstrate the effectiveness of DRPE, comparisons are performed with three conventional gradient descent methods. We first develop the so-called truncated BPTT (T-BPTT) algorithm, adapted to the LF-DFNN models suggested here. Derivation of the algorithm requires the generation of the corresponding adjoint model, used in the error backpropagation and the weight updates. The obtained adjoint model is also employed, with slight modifications, for the computation of the ordered partial derivatives of the model's output with respect to its weights, required by DRPE.
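
As a rough illustration of the spatial-correlation input arrangement, the sketch below assembles lagged wind-speed samples from the reference site and a single remote station into a regression matrix for a chosen prediction horizon. The lag count, the single remote station, and the function itself are assumptions for illustration, not the paper's actual input configuration.

```python
import numpy as np

def make_spatial_inputs(local, remote, n_lags=3, horizon=1):
    """Build (X, y) pairs for multi-step-ahead prediction: each input row
    holds the last n_lags samples of the local site and of one remote
    station; the target is the local speed `horizon` steps ahead.
    (Illustrative arrangement only, not the paper's configuration.)"""
    X, y = [], []
    T = len(local)
    for t in range(n_lags - 1, T - horizon):
        lags_local = local[t - n_lags + 1 : t + 1]
        lags_remote = remote[t - n_lags + 1 : t + 1]
        X.append(np.concatenate([lags_local, lags_remote]))
        y.append(local[t + horizon])
    return np.array(X), np.array(y)
```

With 15-min samples, horizon=1 would correspond to a 15-min-ahead forecast and horizon=12 to 3 h ahead.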

The rest of the paper is organized as follows. In Section 2, the architecture of the LF-DFNN model is presented along with the respective notation. In Section 3, the derivation of the T-BPTT algorithm is given, including the adjoint model building and parameter update formulas. Section 4 details the development of the DRPE learning algorithm. In Section 5, simulation results are given and a comparative analysis is carried out for the wind-speed forecasting problem. The paper concludes by summarizing the important features of our approach in Section 6.

Section snippets

Structure of the LF-DFNN Recurrent Network

The structure of the LF-DFNN is shown in Fig. 1. For the sake of simplicity in the presentation, a model with multiple inputs and a single output is considered. The suggested network is a five-layered architecture, implemented by means of recurrent fuzzy rules performing a TSK-type of reasoning:

R^(r): IF u_1[t] is A_1^r AND u_2[t] is A_2^r AND … AND u_{n_p}[t] is A_{n_p}^r THEN y^(r)[t] = g^(r)(u_c[t]) = IIR-MLP^(r){u_c[t]},

where R^(r), r = 1, …, R, denotes the rth fuzzy rule and R is the number of rules. u_p[t] = [u_{p,1}[t], u_{p,2}[t], …, u_{p,n_p}[t]]^T
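
As a concrete, deliberately simplified illustration of such a dynamic TSK rule, the sketch below implements a one-input, two-rule model whose consequents are first-order IIR synaptic filters. The Gaussian membership functions, the first-order filter, and the weighted-average defuzzification are assumptions chosen for illustration; the actual LF-DFNN consequents are multi-layered IIR-MLP networks.

```python
import numpy as np

class IIRSynapse:
    """First-order IIR synaptic filter: x[t] = b0*u[t] + b1*u[t-1] + a1*x[t-1].
    (Order and state handling are illustrative; the paper allows
    general-order IIR filters inside the consequent networks.)"""
    def __init__(self, b0, b1, a1):
        self.b0, self.b1, self.a1 = b0, b1, a1
        self.u_prev = 0.0  # delayed input
        self.x_prev = 0.0  # delayed output (internal feedback)
    def step(self, u):
        x = self.b0 * u + self.b1 * self.u_prev + self.a1 * self.x_prev
        self.u_prev, self.x_prev = u, x
        return x

def gaussian_mf(u, c, s):
    """Gaussian membership function with center c and spread s."""
    return np.exp(-((u - c) ** 2) / (2 * s ** 2))

def tsk_dynamic_inference(u, rules):
    """One inference step of a dynamic TSK model: the premise gives the
    firing strength, the consequent is an IIR filter driven by u, and the
    output is the firing-strength-weighted average of the consequents."""
    strengths = np.array([gaussian_mf(u, r["c"], r["s"]) for r in rules])
    outputs = np.array([r["filter"].step(u) for r in rules])
    return float(strengths @ outputs / strengths.sum())
```

Because each rule's filter keeps its own delayed input and output, repeated calls make the model output depend on the whole input history, not just the current sample.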

The truncated BPTT algorithm

In this section, the derivation of the T-BPTT algorithm is presented, a major part of which includes description of the error backwards flow. The gradient calculations are also used for the implementation of an efficient training algorithm presented in Section 4. Since we are dealing with on-line training schemes, consider the instantaneous squared error E[t] determined by

E[t] = (d[t] − y[t])^2 = e^2[t],

where y[t] and d[t] are the model outputs and the desired values, respectively, at each time t. The
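
To make the truncated unfolding concrete, the toy sketch below applies T-BPTT to a single linear recurrent neuron y[t] = w·u[t] + a·y[t−1], a stand-in for one IIR synapse (the neuron, its parameters, and the truncation depth are assumptions for illustration). Unfolding h steps back in time gives dy[t]/da ≈ Σ_{k=1..h} a^(k−1) y[t−k], from which the error gradient follows as dE[t]/da = −2 e[t] · dy[t]/da.

```python
import numpy as np

def run_neuron(u, w, a):
    """Run the linear recurrent neuron y[t] = w*u[t] + a*y[t-1]
    over the whole input sequence u (zero initial state)."""
    y = np.zeros(len(u))
    for t in range(len(u)):
        y[t] = w * u[t] + a * (y[t - 1] if t > 0 else 0.0)
    return y

def tbptt_grad_a(y, a, t, h):
    """Truncated-BPTT estimate of dy[t]/da: unfold the recurrence
    h steps back and sum the contributions a**(k-1) * y[t-k]."""
    return sum(a ** (k - 1) * y[t - k] for k in range(1, min(h, t) + 1))
```

For this neuron the truncated sum becomes exact when h reaches t, which can be checked against a finite-difference estimate; smaller h trades gradient accuracy for computation, which is the point of the truncation.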

Weight grouping

In an attempt to improve the learning performance of the gradient-descent training methods and enhance the quality of the attained solutions, we propose a decoupled version of the recursive prediction error (RPE) algorithm [19] for training of the LF-DFNN model. Let ϑ(t) denote a composite (W×1) vector including the premise and consequent weights of LF-DFNN. The real process, y(t), to be modeled is obtained by

y(t) = ŷ(t|ϑ) + ε(t, ϑ) = h(ϑ, u(t), ϕ(t, ϑ)) + ε(t, ϑ),

where ŷ(t|ϑ) is a nonlinear recurrent
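
A minimal sketch of one decoupled recursive-prediction-error update for a single weight group is given below. The forgetting factor, the initialization, and the toy two-group linear example in the test are assumptions for illustration; in the actual algorithm, psi would hold the ordered derivatives of the model output with respect to that group's weights, obtained from the adjoint model.

```python
import numpy as np

def drpe_step(theta, P, psi, err, lam=0.99):
    """One recursive prediction error update for a single weight group:
    a Kalman-like recursion on the group's own covariance matrix P,
    with forgetting factor lam and shared prediction error err."""
    Ppsi = P @ psi
    denom = lam + psi @ Ppsi
    P_new = (P - np.outer(Ppsi, Ppsi) / denom) / lam  # covariance update
    theta_new = theta + P_new @ psi * err             # gain = P_new @ psi
    return theta_new, P_new
```

Running this update independently for each group, with the same prediction error e shared across groups, is the decoupling idea: each group maintains its own small covariance matrix instead of one full W×W matrix, which is where the computational savings come from.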

Statement of the problem

In our application, we consider a wind park located at a reference site S0 in the area around the Gulf of Thessaloniki, in Northern Greece, shown in Fig. 6. The area has an adequately smooth terrain, almost at sea level. Low winds due to sea breeze are of no particular interest to us. On the contrary, we are primarily focused on the modeling of the strong local winds of N–NW direction, considered as the prevailing winds in the area. Therefore, two meteorological stations are installed at the

Conclusions

In this paper, a recurrent fuzzy neural network, the LF-DFNN is suggested with internal dynamics. The network includes TSK fuzzy rules where the consequent sub-models are implemented by recurrent neural networks of IIR-MLP type, having local feedback paths. Training of the LF-DFNN models is achieved using the DRPE method, an efficient algorithm with enhanced learning qualities. The required ordered derivatives of the model's output with regard to the adjustable weights are calculated by means

Thanasis G. Barbounis was born in Lamia, Greece, in January 1977. He graduated as an Electrical Engineer from the Aristotle University of Thessaloniki in 1999. He received his Ph.D. degree in Electrical and Computer Engineering from the same university, in 2005. His research interests lie in artificial neural networks, fuzzy logic systems and modelling of non-linear systems.

References (32)

  • P. Frasconi et al., Local feedback multilayered networks, Neural Comput. (1992)
  • V. Gorrini et al., Recurrent fuzzy systems
  • J.-S.R. Jang, Self-learning fuzzy controllers based on temporal back propagation, IEEE Trans. Neural Networks (1992)
  • C.-F. Juang et al., A recurrent self-organizing neural fuzzy inference network, IEEE Trans. Neural Networks (1999)
  • C.F. Juang et al., Noisy signal processing by recurrently adaptive fuzzy filters, IEEE Trans. Fuzzy Systems (2001)
  • C.F. Juang, A TSK-type recurrent fuzzy network for dynamic systems processing by neural network and genetic algorithms, IEEE Trans. Fuzzy Systems (2002)


John B. Theocharis graduated as an Electrical Engineer from the Aristotle University of Thessaloniki in 1980. From 1980 to 1985 he was a member of the scientific staff of the Department of Electrical and Computer Engineering, where he received his Ph.D. degree in 1985. He is now working as an Associate Professor at the Department of Electronic and Computer Engineering in the same university. His research activities include fuzzy systems, neural networks, adaptive control and modelling of complex non-linear systems.
