Elsevier

Neurocomputing

Volume 174, Part A, 22 January 2016, Pages 168-178

A semi-supervised online sequential extreme learning machine method

https://doi.org/10.1016/j.neucom.2015.04.102

Abstract

This paper proposes a learning algorithm called Semi-supervised Online Sequential ELM, denoted SOS-ELM. It aims to provide a solution for streaming-data applications by learning from just the newly arrived observations, called a chunk. In addition, SOS-ELM can utilize both labeled and unlabeled training data by combining the advantages of two existing algorithms: Online Sequential ELM (OS-ELM) and Semi-Supervised ELM (SS-ELM). The rationale behind our algorithm is to exploit the optimality condition that balances empirical risk and structural risk, as used by SS-ELM, in combination with block matrix calculations similar to OS-ELM. Efficient implementation of the SOS-ELM algorithm is made viable by the additional assumption that there is negligible structural relationship between chunks from different times. Experiments have been performed on standard benchmark problems for regression, balanced binary classification, unbalanced binary classification and multi-class classification, comparing the performance of the proposed SOS-ELM with OS-ELM and SS-ELM. The experimental results show that SOS-ELM outperforms OS-ELM in generalization performance at a similar training speed, and also outperforms SS-ELM with much lower supervision overheads.

Introduction

In recent years, the Extreme Learning Machine (ELM), proposed by Huang et al. [1], [2], [3], [4], has attracted more and more attention because of its outstanding training speed, prediction accuracy and generalization ability [5], [6], [7], [8], [9], [10]. In particular, it has been shown that ELM tends to outperform the support vector machine (SVM) in both regression and classification applications, with much easier implementation [11]. However, batch ELM is still time consuming, although much faster than traditional learning algorithms including SVM. The time cost arises mainly in two aspects: (1) matrix inversion, whose computational complexity is between quadratic and cubic in the training data size [12]; (2) output weight updating, which consumes considerable time because batch ELM must recompute over the old data combined with the new data whenever a new chunk of training data is received. Online Sequential ELM (OS-ELM) [13] adapted batch ELM to overcome these problems in practical applications. OS-ELM learns the training data chunk by chunk and updates the output weights using only the new training data. Thus OS-ELM not only saves storage but also decreases the computational complexity. In addition, unlike other sequential learning algorithms that have many control parameters to tune, OS-ELM only requires the number of hidden nodes to be specified. In short, OS-ELM offers clear advantages over many traditional sequential learning algorithms, such as SGBP [14], RAN [15], RANEKF [16], MRAN [17], [18], GAP-RBF [19] and GGAP-RBF [20].
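The chunk-by-chunk weight update that lets OS-ELM avoid full recomputation can be sketched as a recursive least-squares step. The sketch below is our own illustration with toy sizes and variable names, not code from the paper; it assumes a sigmoid hidden layer with randomly fixed input weights, as is standard for ELM.

```python
import numpy as np

rng = np.random.default_rng(0)
n_in, n_hidden = 5, 10
W = rng.standard_normal((n_in, n_hidden))   # random input weights, never retrained
b = rng.standard_normal(n_hidden)           # random hidden biases

def hidden(X):
    """Sigmoid hidden-layer output matrix H."""
    return 1.0 / (1.0 + np.exp(-(X @ W + b)))

# Initialization phase: batch least squares on the first chunk.
X0 = rng.standard_normal((40, n_in))
T0 = rng.standard_normal((40, 1))
H0 = hidden(X0)
P = np.linalg.inv(H0.T @ H0)        # needs at least n_hidden initial samples
beta = P @ H0.T @ T0

# Sequential phase: fold in a new chunk without storing or revisiting old data.
X1 = rng.standard_normal((15, n_in))
T1 = rng.standard_normal((15, 1))
H1 = hidden(X1)
G = np.linalg.inv(np.eye(len(X1)) + H1 @ P @ H1.T)
P = P - P @ H1.T @ G @ H1 @ P                  # Woodbury-style covariance update
beta = beta + P @ H1.T @ (T1 - H1 @ beta)      # weights updated from the new chunk only
```

Because the recursion is exact, the sequentially updated `beta` reproduces the batch least-squares fit over all data seen so far, which is why OS-ELM matches batch ELM in accuracy while touching each chunk once.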

Although OS-ELM has advantages in generalization performance and learning speed, it still cannot avoid a dependency on a large amount of labeled data, which usually involves high cost in labor and time. Compared with OS-ELM, semi-supervised learning methods benefit from utilizing unlabeled data, reducing the need for labeled data. Compared with labeled data, unlabeled data are much easier and cheaper to acquire. Semi-supervised learning therefore provides an effective solution for problems with only a small amount of labeled samples in various classification and regression tasks. To exploit unlabeled data, several semi-supervised ELM variants [21], [22], [23], [24] have been proposed. As typical examples, [22], [24] propose semi-supervised ELMs based on manifold regularization, so that the learning system can balance the empirical risk against the complexity of the learned function f; [22] improves on [24] in terms of semi-supervised ELM and achieves good prediction accuracy. However, the semi-supervised ELMs mentioned above learn in a batch manner, so their training time grows rapidly as the sample size increases.
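For concreteness, the manifold-regularized training used by this family of semi-supervised ELMs reduces to a single regularized least-squares solve. The sketch below is a simplified illustration of ours, not the exact formulation of [22] or [24]: the Gaussian-similarity graph, the sizes and the penalty layout are all assumptions made for the example.

```python
import numpy as np

rng = np.random.default_rng(1)
n, n_labeled, n_in, n_hidden, lam = 60, 12, 4, 10, 0.1

X = rng.standard_normal((n, n_in))          # labeled rows first, then unlabeled
T = np.zeros((n, 1))
T[:n_labeled] = rng.standard_normal((n_labeled, 1))

W = rng.standard_normal((n_in, n_hidden))   # random, fixed input weights
b = rng.standard_normal(n_hidden)
H = 1.0 / (1.0 + np.exp(-(X @ W + b)))      # hidden-layer outputs for all samples

# Graph Laplacian L = D - S over ALL samples: unlabeled data enters only here.
d2 = ((X[:, None, :] - X[None, :, :]) ** 2).sum(-1)
S = np.exp(-d2 / (2.0 * d2.mean()))
L = np.diag(S.sum(axis=1)) - S

# Empirical-risk penalty applies to the labeled rows only.
C = np.zeros((n, n))
C[:n_labeled, :n_labeled] = np.eye(n_labeled)

# Output weights balance empirical risk (C term) against manifold smoothness (L term).
beta = np.linalg.solve(np.eye(n_hidden) + H.T @ C @ H + lam * H.T @ L @ H,
                       H.T @ C @ T)
```

The H.T @ L @ H term is also what makes batch training expensive: the Laplacian couples every pair of samples, so the cost grows quickly with the dataset size, which motivates the sequential variant proposed in this paper.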

To solve the manual labeling cost problem and meet the demand for sequential learning in many real applications, we propose a new type of online sequential ELM, which we name the semi-supervised online sequential ELM (SOS-ELM). It inherits not only the training and testing speed of OS-ELM but also the prediction accuracy of SS-ELM. The experimental results show that, using the same number of labeled samples, our proposed SOS-ELM achieves higher prediction accuracy than OS-ELM and much faster training than SS-ELM.

The details of our proposed SOS-ELM are elaborated in the remainder of the paper, which is organized as follows. Section 2 gives a brief review of batch ELM, OS-ELM and SS-ELM. Section 3 presents the derivation of SOS-ELM. Section 4 presents the experimental results and discussion on benchmark problems in regression and classification. Conclusions and future work are given in Section 5.

Section snippets

Review of ELM, SS-ELM and OS-ELM

This section briefly reviews batch ELM, OS-ELM and SS-ELM, which are the foundations of our extended algorithm, SOS-ELM. It covers their motivation, modeling and algorithm steps.

Proposed SOS-ELM

Both SS-ELM and OS-ELM improve the performance of basic ELM from different points of view. However, SS-ELM trains with both labeled and unlabeled data in a batch manner, whilst OS-ELM utilizes only labeled data. In practical applications it is clearly desirable both to process sequential data and to make use of unlabeled data. Therefore, this paper proposes to integrate both advantages by modifying the SS-ELM algorithm to suit sequential learning; we refer to the new algorithm as SOS-ELM.
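Under the simplifying assumption stated in the abstract (negligible structural relationship between chunks from different times), the combined update can be sketched as a per-chunk accumulation of both risk terms. This is our own illustrative reconstruction, not the paper's published algorithm: the per-chunk Gaussian-similarity Laplacian, the chunk sizes and the label layout are placeholders.

```python
import numpy as np

def chunk_laplacian(X):
    """Gaussian-similarity graph Laplacian built within one chunk only."""
    d2 = ((X[:, None, :] - X[None, :, :]) ** 2).sum(-1)
    S = np.exp(-d2 / (2.0 * d2.mean()))
    return np.diag(S.sum(axis=1)) - S

rng = np.random.default_rng(2)
n_in, n_hidden, lam = 3, 8, 0.1
W = rng.standard_normal((n_in, n_hidden))   # random, fixed input weights
b = rng.standard_normal(n_hidden)
hidden = lambda X: 1.0 / (1.0 + np.exp(-(X @ W + b)))

K = np.eye(n_hidden)            # accumulated regularized Gram matrix (ridge term included)
q = np.zeros((n_hidden, 1))     # accumulated right-hand side, kept here for checking
beta = np.zeros((n_hidden, 1))

for _ in range(3):              # three arriving chunks, each only partly labeled
    X = rng.standard_normal((20, n_in))
    T = np.zeros((20, 1))
    T[:5] = rng.standard_normal((5, 1))      # only 5 rows of each chunk carry labels
    C = np.zeros((20, 20))
    C[:5, :5] = np.eye(5)                    # empirical risk on labeled rows only
    H = hidden(X)
    # Chunk's combined empirical + structure risk; no cross-chunk Laplacian terms,
    # which is exactly what the negligible-inter-chunk-structure assumption buys.
    A = H.T @ C @ H + lam * H.T @ chunk_laplacian(X) @ H
    K = K + A
    q = q + H.T @ C @ T
    beta = beta + np.linalg.solve(K, H.T @ C @ T - A @ beta)
```

A short induction shows the incremental update keeps `beta` equal to the direct solution `K⁻¹q` of the accumulated system after every chunk, so old chunks never need to be stored, mirroring the storage and speed benefits of OS-ELM while retaining the SS-ELM-style structure-risk term.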

Introduction of datasets used for performance evaluation

In this section, we systematically evaluate the performance of our proposed SOS-ELM on regression and classification benchmark problems, comparing it with SS-ELM and OS-ELM. The benchmark datasets used in the paper are listed in Table 1, together with the numbers of labeled, unlabeled and testing samples and other specifications used in our experiments: the number of attributes representing the input and, for classification problems, the number of classes.

Conclusion and future work

In this paper, a new algorithm in the ELM family called semi-supervised OS-ELM (SOS-ELM) is proposed. This algorithm not only handles data arriving chunk by chunk, like OS-ELM, but also reduces the requirement for labeled data and improves performance by utilizing unlabeled data. The performance of SOS-ELM is evaluated against OS-ELM and SS-ELM on real-world benchmark datasets for regression and classification problems. The results demonstrate that the proposed SOS-ELM outperforms OS-ELM in generalization performance at a similar training speed, and outperforms SS-ELM with much lower supervision overheads.

Acknowledgments

This research is partly supported by the Natural Science Foundation of China (Nos. 61375059, 61175115), the Beijing Natural Science Foundation (Nos. 4122004, 4152005), the Specialized Research Fund for the Doctoral Program of Higher Education (20121103110031), the Importation and Development of High-Caliber Talents Project of Beijing Municipal Institutions (CIT&TCD201304035), and the Special training program for construction of teachers of Beijing High education – 2014


References (32)

  • G.-B. Huang et al., Extreme learning machine for regression and multiclass classification, IEEE Trans. Syst. Man Cybern. Part B: Cybern. (2012)
  • G.-B. Huang, An insight into extreme learning machines: random neurons, random features and kernels, Cognit. Comput. ...
  • Z. Bai, G.-B. Huang, D. Wang, H. Wang, M.B. Westover, Sparse extreme learning machine for ...
  • N.-Y. Liang et al., A fast and accurate online sequential learning algorithm for feedforward networks, IEEE Trans. Neural Netw. (2006)
  • Y.A. LeCun, L. Bottou, G.B. Orr, K.-R. Müller, Efficient backprop, Neural Networks: Tricks of the Trade, Springer, ...
  • J. Platt, A resource-allocating network for function interpolation, Neural Comput. (1991)

    Xibin Jia, born in 1969, received Ph.D. degree in computer science and technology from Beijing University of Technology in 2007, M.S. degree in intelligent instrument from North China Institute of Technology in 1996 and B.S. degree in wireless technology from Chongqing University in 1991. She is an Associate Professor in the College of Computing at the Beijing University of Technology in Beijing, China. Her areas of interest include visual information cognition, and multi-information fusion, especially for facial expression recognition and visual speech recognition.

    Runyuan Wang, born in 1991, received B.S. degree from University of Science and Technology, Beijing in 2012. He is currently pursuing the M.S. degree in computer science at the Beijing University of Technology. His research interests include pattern recognition and applications of computer vision to multimedia.

    Junfa Liu, born in 1973, received Ph.D. degree in Institute of Computing Technology (ICT), Chinese Academy of Sciences (CAS) in 2009. He is an Associate Researcher in ICT, CAS now, and his areas of interest include pervasive computing, virtual reality and data mining.

    David M.W. Powers is a Professor of Computer Science and Director of the AI Lab and the Centre for Knowledge and Interaction Technology (KIT) at Flinders University. He specializes in applications of unsupervised learning to language and speech processing. Dr Powers undertook his Ph.D. in this area, as well as co-founding ACL's SIGNLL and CoNLL. He is also a trader and holds a Diploma in Technical Analysis, the study of how to find and exploit an edge in the financial markets.
