Abstract
Uncertain or missing data may occur in many practical applications. A principled strategy for handling this problem would therefore be very useful. We consider two-class and multi-class classification problems where the mean and covariance of each class are assumed to be known. With simple structure, fast speed and good performance, extreme learning machine (ELM) has been an important technology in machine learning. In this work, from the viewpoint of probability, we present a robust ELM framework (RELM) for missing data classification. Applying the Chebyshev–Cantelli inequality, the proposed RELM is reformulated as a second-order cone programming with global optimal solution. The proposed RELM only relates to the second moments of input samples and makes no assumption about the data probability distribution. Expectation maximization algorithm is used to fill in missing values and then obtain complete data. Numerical experiments are simulated in various datasets from UCI database and a practical application database. Experimental results show that the proposed method can achieve better performance than traditional methods. These results illustrate the feasibility and effectiveness of the proposed method for missing data classification.








Similar content being viewed by others
References
Dempster A (1977) Maximum likelihood from incomplete data via the EM algorithm. J R Stat Soc 39(1):1–38
Huang GB, Zhu QY, Siew CK (2006) Extreme learning machine: theory and applications. Neurocomputing 70:489–501
Huang GB, Ding XJ, Zhou HM (2010) Optimization method based extreme learning machine for classification. Neurocomputing 74:155–163
Zhu H, Tsang ECC, Zhu J (2018) Training an extreme learning machine by localized generalization error model. Soft Comput 2:1–9
Yang LM, Zhang S (2016) A sparse extreme learning machine framework by continuous optimization algorithms and its application in pattern recognition. Eng Appl Artif Intell 53:176–189
Yu YL, Sun Z (2017) Sparse coding extreme learning machine for classification. Neurocomputing 261:50–56. https://doi.org/10.1016/j.neucom.2016.06.078
Wang Z, Wu D, Gravina R, Fortino G, Jiang Y (2017) Kernel fusion based extreme learning machine for cross-location activity recognition. Inf Fusion 37:1–9
Yu Q, Miche Y, Eirola E, Heeswijk MV, Verin E (2013) Regularized extreme learning machine for regression with missing data. Neurocomputing 102(2):45–51
Sovilj D, Eirola E, Miche Y, Björk KM, Nian R, Akusok A, Lendasse A (2016) Extreme learning machine for missing data using multiple imputations. Neurocomputing 174((PA)):220–231
Shivaswamy PK, Bhattacharyya C, Smola AJ (2006) Second order cone programming approaches for handling missing and uncertain data. J Mach Learn Res 7:1283–1314
Scholkopf B, Platt J, Hofmann T (2007) Max-margin classification of incomplete data. In: Advances in neural information processing system, pp 233–240
Garcia-Laencina PJ, Sancho-Gomez J-L, Figueiras-Vidal AR (2010) Pattern classification with missing data: a review. Neural Comput Appl 19(2):263–282
Fraley C, Hesterberg T (2009) Least angle regression and LASSO for large datasets. Stat Anal Data Min 1(4):251–259
Mclachlan GJ, Krishnan T (2008) The EM algorithm and extensions. Biometrics 382(1):154–156
Welling M, Weber M (2001) A constrained EM algorithm for independent component analysis. Neural Comput 13(3):677–689
Marshall W, Olkin I (1960) Multivariate Chebychev inequalities. Ann Math Stat 31(4):1001–1014
Lobo MS, Vandenberghe L, Boyd S, Lebret H (1998) Applications of second-order cone programming. Linear Algebra Appl 284(1–3):193–228
Alzalg BM (2012) Stochastic second-order cone programming: applications models. Appl Math Model 36(10):5122–5134
Bartlett PL, Wegkamp MH (2008) Classification with a reject option using a hinge loss. J Mach Learn Res 9(12):1823–1840
Lanckriet GRG, Ghaoui LE, Bhattacharyya C et al (2003) A robust minimax approach to classification. J Mach Learn Res 3(3):555–582
Blake CL, Merz CJ (1998) UCI Repository for Machine Learning Databases, Department of Information and Computer Sciences, University of California, Irvine http://www.ics.uci.edu/mlearn/MLRepository.html
Yang L, Dong H (2018) Support vector machine with truncated pinball loss and its application in pattern recognition. Chemom Intell Lab Syst 177:89–99
Acknowledgements
This work was supported by National Nature Science Foundation of China (11471010) and Chinese Universities Scientific Fund (2017LX003). Moreover, the authors thank very the referees and the editor for their constructive comments. Their suggestions improved the paper significantly.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Jing, S., Yang, L. A robust extreme learning machine framework for uncertain data classification. J Supercomput 76, 2390–2416 (2020). https://doi.org/10.1007/s11227-018-2430-6
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11227-018-2430-6