Neurocomputing

Volume 261, 25 October 2017, Pages 144-152

Online sequential ELM algorithm with forgetting factor for real applications

https://doi.org/10.1016/j.neucom.2016.09.121

Abstract

Sequential learning algorithms are a good choice for learning data one-by-one or chunk-by-chunk. Liang et al. proposed the OS-ELM algorithm based on the ordinary ELM algorithm; it produces better generalization performance than other well-known sequential learning algorithms. One deficiency of OS-ELM is that all observations are weighted equally regardless of their acquisition time. However, in many real industrial applications the training data have timeliness. In this paper, we propose a modified online sequential learning algorithm with a forgetting factor (named the WOS-ELM algorithm) that weights new observations more heavily. A convergence analysis is then presented to show that the estimate of the output weights converges at an exponential rate as new observations arrive. The value of the forgetting factor changes automatically with the forecast error, which avoids excessive human intervention. In the simulation part we employ several applications, including time-series prediction, time-variant system identification and a weather forecast problem. The simulation results show that WOS-ELM is more accurate and robust than other sequential learning algorithms.

Introduction

Extreme learning machine (ELM), proposed by Huang in 2006, is a fast machine learning algorithm based on generalized single-hidden-layer feedforward networks (SLFNs) [1]. The key advantage of ELM compared with other well-known neural network algorithms is that the learning parameters of the hidden nodes are generated randomly, without human tuning or iterative optimization [2], [3]. The output weights are then determined by the method of least squares (LS). ELM has been widely used in many real applications, covering both regression and classification problems [4], [5], [6], [7].

In many real applications, data are obtained one by one or chunk by chunk. Online sequential machine learning is a model of induction that learns one instance or a few instances at a time [8], [9]. Liang et al. proposed a fast and accurate online sequential learning algorithm (OS-ELM) for SLFNs based on the ELM network with additive or radial basis function (RBF) hidden nodes [10]. In OS-ELM, newly arriving observations can be trained one-by-one or chunk-by-chunk with fixed or varying chunk size, while the output weights are updated analytically at the same time. Many modified OS-ELM algorithms have since been proposed, such as EOS-ELM [11], OS-ELMK [12] and OL-ELM-TV [13]. However, the online sequential learning methods listed above do not take the timeliness of the data into consideration. Timeliness problems exist extensively in daily life, for example in weather and stock forecasting [14], [15]. As time passes, the distribution of the data changes and exhibits strongly nonstationary behavior. In such cases, old data should contribute less and less, so that the model represents the most recent behavior [16]. Broadly speaking, when training an ELM model we should allocate high weights to new data and low weights to old data.
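
For reference, the analytic update in OS-ELM is the recursive least-squares recursion of Liang et al. [10]. When the (k+1)-th data chunk arrives, with hidden-layer output matrix $H_{k+1}$ and target matrix $T_{k+1}$, the output weights are updated as

$$P_{k+1}=P_k-P_kH_{k+1}^{\top}\left(I+H_{k+1}P_kH_{k+1}^{\top}\right)^{-1}H_{k+1}P_k,$$

$$\beta^{(k+1)}=\beta^{(k)}+P_{k+1}H_{k+1}^{\top}\left(T_{k+1}-H_{k+1}\beta^{(k)}\right),$$

with $P_0=(H_0^{\top}H_0)^{-1}$ computed from an initialization chunk. Note that every past chunk enters this recursion with equal weight, which is precisely the limitation addressed in this paper.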

There are many ELM-related online learning algorithms aimed at nonstationary applications. FOS-ELM learns sequential data with timeliness, employing a sliding window to limit the active region of the data during acquisition [17]. Zhou employed the same forgetting mechanism in regularized and kernelized ELM algorithms [18]. In addition, Wang proposed the OS-ELMK algorithm and combined it with a sliding window for nonstationary time-series prediction [19]. As new observations arrive, the sliding window moves forward in order to forget the 'old' samples. Another strategy for dealing with nonstationary data is the introduction of a forgetting factor; in the limit, the sliding-window method can be seen as a special case of the forgetting-factor method. Matias introduced the forgetting factor into the OS-ELM algorithm [20]; however, the authors did not present a method for choosing an appropriate value for it. Lim presented a relatively complex mechanism for determining the value of the forgetting factor based on gradient descent [21]. Its additional computational cost grows rapidly with the number of hidden nodes, which is time-consuming and cannot meet the needs of online implementation.
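
To make the contrast concrete, the standard exponentially weighted least-squares objective (shown here as a generic sketch; the paper's exact formulation is not reproduced in this excerpt) down-weights a sample acquired at time $i$ by $\lambda^{k-i}$ at the current time $k$:

$$J_k(\beta)=\sum_{i=1}^{k}\lambda^{k-i}\left\|t_i-h(x_i)^{\top}\beta\right\|^2,\qquad 0<\lambda\le 1,$$

where $h(x_i)$ is the hidden-layer output vector for sample $x_i$. Setting $\lambda=1$ recovers the equal weighting of ordinary OS-ELM, while a sliding window of length $W$ corresponds to the limiting weighting that assigns 1 to the last $W$ samples and 0 to all earlier ones.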

In this paper, we propose a novel modified online sequential ELM algorithm named WOS-ELM. The WOS-ELM algorithm introduces a forgetting factor into the performance index: during online sequential learning, old data are gradually forgotten, while newly arriving data receive more emphasis. We then present a convergence theorem which ensures that the estimate of the output weights converges to the true value at an exponential rate as new observations arrive. In addition, the forgetting factor can be made variable, adapting automatically to the output prediction error so that the model keeps the output error fluctuating around a set point. This automatic updating strategy is simple and cheap to implement, so it does not compromise the rapid training speed of the ELM algorithm. Finally, inspired by the introduction of the forgetting factor, we present a mechanism for dealing with contaminated industrial data. More details are discussed in Section 4.

In the simulation section, we carry out three experiments to verify the performance of the WOS-ELM algorithm. First, WOS-ELM is applied to a time-series prediction problem (the Mackey-Glass time-series application) alongside two other well-known online learning algorithms (GGAP-RBF [22] and MRAN [23]). Second, we employ the WOS-ELM algorithm in a time-variant system identification problem. Finally, a real-world application, weather forecasting, is considered, where WOS-ELM is employed to forecast the next day's average temperature. The simulations show that the WOS-ELM algorithm produces more accurate and robust results, and its fast training speed can satisfy the demands of real online implementation.

The paper is organized as follows: Section 2 reviews the ordinary ELM and OS-ELM algorithms. The basic theory of the WOS-ELM algorithm is presented in Section 3, and some discussions are given in Section 4. Sections 5 and 6 present the simulation results and conclusions, respectively.

Section snippets

ELM

The ELM algorithm was originally proposed by Huang for generalized single-hidden-layer feedforward networks. ELM avoids human tuning by randomly initializing the SLFN learning parameters; the output weights can then be determined by the least-squares method [1], [24].

Given a training set consisting of $N$ arbitrary distinct samples $S=\{(x_i,t_i)\mid x_i\in\mathbb{R}^n,\ t_i\in\mathbb{R}^m,\ i=1,2,\ldots,N\}$, the SLFN network function with $\tilde{N}$ hidden nodes can be formulated as
$$f_{\tilde{N}}(x_i)=\sum_{j=1}^{\tilde{N}}\beta_j G(a_j,b_j,x_i)=t_i,\qquad i=1,2,\ldots,N,$$
where $a_j$ and $b_j$ are the randomly generated hidden-node learning parameters and $\beta_j$ is the output weight of the $j$-th hidden node.
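
As a concrete illustration, here is a minimal batch-ELM sketch in Python/NumPy. This is our illustration rather than the paper's code; the function names and the choice of a sigmoid activation are assumptions.

```python
import numpy as np

def elm_train(X, T, n_hidden, seed=0):
    """Basic ELM training: random hidden parameters, least-squares output weights.

    X : (N, n) input matrix, T : (N, m) target matrix.
    Returns (A, b, beta): hidden weights a_j, biases b_j, output weights beta_j.
    """
    rng = np.random.default_rng(seed)
    A = rng.standard_normal((X.shape[1], n_hidden))  # random input weights a_j
    b = rng.standard_normal(n_hidden)                # random biases b_j
    H = 1.0 / (1.0 + np.exp(-(X @ A + b)))           # hidden-layer output matrix
    beta = np.linalg.pinv(H) @ T                     # LS solution via pseudoinverse
    return A, b, beta

def elm_predict(X, A, b, beta):
    H = 1.0 / (1.0 + np.exp(-(X @ A + b)))
    return H @ beta
```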

WOS-ELM

In this section, we present a brief introduction to the proposed WOS-ELM algorithm. WOS-ELM aims at the following three aspects (a minimal sketch of the resulting sequential update is given after the list):

  1.

    Place more emphasis on new observations and gradually forget old ones. Generally, as new observations arrive, the contribution of old samples to the model becomes smaller and smaller, so the trained model can closely track changes in the data distribution.

  2.

    Improve computational efficiency and reduce training time and manual intervention.
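
As referenced above, here is a minimal sketch of a WOS-ELM-style sequential update. It assumes the standard exponentially weighted RLS recursion applied to the ELM hidden-layer outputs; the paper's exact derivation is not reproduced in this snippet, and the class and parameter names are ours.

```python
import numpy as np

class WOSELM:
    """Sketch of a forgetting-factor sequential ELM update (assumed standard
    exponentially weighted RLS on the hidden-layer outputs; illustrative only)."""

    def __init__(self, A, b, beta0, P0, lam=0.98):
        self.A, self.b = A, b   # fixed random hidden-node parameters
        self.beta = beta0       # initial output weights, e.g. from a small batch
        self.P = P0             # inverse correlation matrix estimate
        self.lam = lam          # forgetting factor, 0 < lam <= 1

    def _hidden(self, x):
        return 1.0 / (1.0 + np.exp(-(x @ self.A + self.b)))

    def update(self, x, t):
        h = self._hidden(x)                             # hidden output, (n_hidden,)
        Ph = self.P @ h
        k = Ph / (self.lam + h @ Ph)                    # gain vector
        e = t - h @ self.beta                           # a-priori prediction error
        self.beta = self.beta + np.outer(k, e)          # correct output weights
        self.P = (self.P - np.outer(k, Ph)) / self.lam  # inflate P: forget old data
        return e
```

Dividing P by lam at every step is what keeps the weight of old samples decaying geometrically, so the estimator stays responsive to changes in the data distribution.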

Further discussion

A) In WOS-ELM, the forgetting factor plays a significant role in the online learning network. One should set a suitable value for the forgetting factor before training the model. Here we make the forgetting factor vary with the forecast error ef. When the norm of the forecast error ‖ef‖ becomes smaller than the setpoint error εfe, the trained model is adapting well to the new data; consequently, the forgetting factor λ should tend to 1 along a special curve. On the contrary, larger
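
The snippet above is cut off mid-sentence, but the described behavior can be summarized as follows: λ is pushed toward 1 when the forecast error is below the setpoint and reduced when it is above. One illustrative realization of such a rule (an assumption on our part; the 'special curve' mentioned in the text is not given in this excerpt):

```python
import numpy as np

def update_lambda(lam, e_f, eps_fe, lam_min=0.90, gamma=0.05):
    """Illustrative error-driven adaptation of the forgetting factor.

    lam    : current forgetting factor
    e_f    : current forecast error vector
    eps_fe : setpoint error threshold
    """
    if np.linalg.norm(e_f) < eps_fe:
        lam += gamma * (1.0 - lam)       # model tracks well: relax toward 1
    else:
        lam -= gamma * (lam - lam_min)   # large error: forget old data faster
    return lam
```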

Simulation results

The performance of the proposed WOS-ELM algorithm is evaluated in three applications in this section. First, WOS-ELM is verified on the Mackey-Glass chaotic time-series problem. Second, we employ WOS-ELM in the Narendra system identification problem. In these two experiments, some measures have been taken to make the data series time-variant and nonstationary. Finally, WOS-ELM is applied to a real-world application: weather forecasting. The sigmoidal additive activation
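
For reference, the Mackey-Glass benchmark is generated by the delay differential equation $\dot{x}(t)=a\,x(t-\tau)/(1+x(t-\tau)^{10})-b\,x(t)$. A simple Euler-discretized generator is sketched below; the parameter values are the common defaults and not necessarily those used in the paper.

```python
import numpy as np

def mackey_glass(n_steps, tau=17.0, a=0.2, b=0.1, dt=1.0, x0=1.2):
    """Generate a Mackey-Glass time series by Euler integration.

    dx/dt = a*x(t - tau) / (1 + x(t - tau)**10) - b*x(t)
    """
    history = int(tau / dt)             # number of delayed steps
    x = np.full(n_steps + history, x0)  # constant initial history
    for t in range(history, n_steps + history - 1):
        x_tau = x[t - history]
        x[t + 1] = x[t] + dt * (a * x_tau / (1.0 + x_tau**10) - b * x[t])
    return x[history:]
```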

Conclusions

This paper introduces the forgetting factor into the OS-ELM algorithm and proposes the WOS-ELM algorithm. Compared with ordinary online sequential ELM learning algorithms, the WOS-ELM algorithm can be employed effectively for nonstationary processes in the real world. In order to eliminate human interference, the forgetting factor varies automatically based on the output prediction error. We employ several datasets, including time-series prediction, time-variant system identification and a


References (35)

  • H.G. Zhang et al.

    An improved ELM algorithm for the measurement of hot metal temperature in blast furnace

    Neurocomputing

    (2016)
  • G.B. Huang et al.

    Extreme learning machine for regression and multiclass classification

    IEEE Trans. Syst. Man Cybern. Part B Cybern.

    (2012)
  • G.B. Huang

    An insight into extreme learning machines: random neurons, random features and kernels

    Cognit. Comput.

    (2014)
  • J. Yu et al.

    An enhanced online sequential extreme learning machine algorithm

    Proceedings of the 2008 China Control and Decision Conference, Shandong, China, 2–4 July

    (2008)
  • N. Liu et al.

    Ensemble based extreme learning machine

    IEEE Signal Process. Lett.

    (2010)
  • N.Y. Liang et al.

    A fast and accurate online sequential learning algorithm for feedforward networks

    IEEE Trans. Neural Netw.

    (2006)
  • C. Cingolani et al.

    An extreme learning machine approach for training time variant neural networks

    Proceedings of the 2008 IEEE Asia Pacific Conference on Circuits and Systems, Macao, 30 November–3 December

    (2008)

Haigang Zhang received the B.S. degree from the School of Electronic and Information Engineering, University of Science and Technology Liaoning, in 2012. He is now a Ph.D. candidate in control science and engineering at the University of Science and Technology Beijing. His research interests include machine learning and its applications to control.

Sen Zhang received the Ph.D. degree in Electrical Engineering from Nanyang Technological University in 2005. She worked as a postdoctoral research fellow at the National University of Singapore and as a lecturer-in-charge at Singapore Polytechnic. She is currently an associate professor in the School of Automation and Electrical Engineering at the University of Science and Technology Beijing. Her research interests include ELM, target tracking and estimation theory.

Yixin Yin received the Ph.D. degree in Electrical Engineering from the University of Science and Technology Beijing in 2002. He is a full professor in the School of Automation and Electrical Engineering at the University of Science and Technology Beijing. His research interests include control theory and its applications.

This work has been supported by the National Natural Science Foundation of China (NSFC grant nos. 61333002, 61673056, 61673055 and 61671054).
