Abstract
Traditional classification algorithms are widely used on determinate data. However, uncertain data is ubiquitous in many real applications, which poses a great challenge to traditional classification algorithms. Extreme learning machine (ELM) is a traditional and powerful classification algorithm. However, existing ELM-based uncertain data classification algorithms can not deal with data uncertainty well. In this paper, we propose a novel ELM-based uncertain data classification algorithm, called UELM. UELM firstly employs exact probability density function (PDF) instead of expected values or sample points to model uncertain data, thus avoiding the loss of uncertain information (probability distribution information of uncertain data). Furthermore, UELM redesigns the traditional ELM algorithm by modifying the received content of input layer and the activation function of hidden layer, thus making the ELM algorithm more applicable to uncertain data. Extensive experimental results on different datasets show that our proposed UELM algorithm outperforms the baselines in accuracy and efficiency.
This work was supported by 863 project of China (No. 2015AA015403) and NSFC (No. 61632019).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Aggarwal, C.C.: On density based transforms for uncertain data mining. In: Proceedings of ICDE, pp. 866–875. IEEE (2007)
Aggarwal, C.C.: On high dimensional projected clustering of uncertain data streams. In: Proceedings of ICDE, pp. 1152–1154. IEEE (2009)
Aggarwal, C.C., Philip, S.Y.: Outlier detection with uncertain data. In: SDM, vol. 8, pp. 483–493. SIAM (2008)
Aggarwal, C.C., Yu, P.S.: A survey of uncertain data algorithms and applications. IEEE Trans. Knowl. Data Eng. 21(5), 609–623 (2009)
Angiulli, F., Fassetti, F.: Nearest neighbor-based classification of uncertain data. ACM Trans. Knowl. Disc. Data 7(1), 1–34 (2013)
Asuncion, A., Newman, D.: UCI machine learning repository (2007)
Bi, J., Zhang, T.: Support vector classification with input data uncertainty. In: Proceedings of NIPS, pp. 161–168 (2004)
Cao, K., Wang, G., Han, D., Bai, M., Li, S.: An algorithm for classification over uncertain data based on extreme learning machine. Neurocomputing 174, 194–202 (2016)
Cormode, G., Mcgregor, A.: Approximation algorithms for clustering uncertain data. In: Proceedings of PODS, pp. 191–200 (2008)
Gao, C., Wang, J.: Direct mining of discriminative patterns for classifying uncertain data. In: Proceedings of SIGKDD, pp. 861–870 (2010)
Ge, J., Xia, Y., Nadungodage, C.: UNN: a neural network for uncertain data classification. In: Zaki, M.J., Yu, J.X., Ravindran, B., Pudi, V. (eds.) PAKDD 2010. LNCS (LNAI), vol. 6118, pp. 449–460. Springer, Heidelberg (2010). doi:10.1007/978-3-642-13657-3_48
Han, J., Pei, J., Kamber, M.: Data Mining: Concepts and Techniques. Elsevier, New York (2011)
Huang, G.B., Zhu, Q.Y., Siew, C.K.: Extreme learning machine: theory and applications. Neurocomputing 70(1), 489–501 (2006)
Qin, B., Xia, Y., Prabhakar, S., Tu, Y.: A rule-based classification algorithm for uncertain data. In: Proceedings of ICDE, pp. 1633–1640. IEEE (2009)
Qin, X., Zhang, Y., Li, X., Wang, Y.: Associative classifier for uncertain data. In: Chen, L., Tang, C., Yang, J., Gao, Y. (eds.) WAIM 2010. LNCS, vol. 6184, pp. 692–703. Springer, Heidelberg (2010). doi:10.1007/978-3-642-14246-8_66
Rao, C.R., Mitra, S.K.: Generalized Inverse of Matrices and Its Applications, vol. 7. Wiley, New York (1971)
Ren, J., Lee, S.D., Chen, X., Kao, B., Cheng, R., Cheung, D.: Naive bayes classification of uncertain data. In: Proceedings of ICDM, pp. 944–949. IEEE (2009)
Sun, Y., Yuan, Y., Wang, G.: Extreme learning machine for classification over uncertain data. Neurocomputing 128, 500–506 (2014)
Tsang, S., Kao, B., Yip, K.Y., Ho, W.S., Lee, S.D.: Decision trees for uncertain data. IEEE Trans. Knowl. Data Eng. 23(1), 64–78 (2011)
Zhang, X., Liu, H., Zhang, X., Liu, X.: Novel density-based clustering algorithms for uncertain data. In: Proceedings of AAAI, pp. 2191–2197 (2014)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Zhang, X., Sun, D., Li, Y., Liu, H., Liang, W. (2017). A Novel Extreme Learning Machine-Based Classification Algorithm for Uncertain Data. In: Kang, U., Lim, EP., Yu, J., Moon, YS. (eds) Trends and Applications in Knowledge Discovery and Data Mining. PAKDD 2017. Lecture Notes in Computer Science(), vol 10526. Springer, Cham. https://doi.org/10.1007/978-3-319-67274-8_16
Download citation
DOI: https://doi.org/10.1007/978-3-319-67274-8_16
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-67273-1
Online ISBN: 978-3-319-67274-8
eBook Packages: Computer ScienceComputer Science (R0)