Abstract
With the explosion of ubiquitous continuous sensing, on-line streaming clustering continues to attract attention. The requirements are that the streaming clustering algorithm recognize and adapt clusters as the data evolves, that anomalies are detected, and that new clusters are automatically formed as incoming data dictate. In this paper, we extend an earlier approach, called Extended Robust On-Line Streaming Clustering (EROLSC), which utilizes both the Possibilistic C-Means and Gaussian Mixture Decomposition to perform this task. We show the superiority of EROLSC over traditional streaming clustering algorithms on synthetic and real data sets.
Keywords
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Silva, J.A., Faria, E.R., Barros, R.C., Hruschka, E.R., de Carvalho, A.C.P.L.F., Gama, J.: Data stream clustering: a survey. ACM Comput. Surv. 46(1), 13–31 (2013)
Moshtaghi, M., Leckie, C., Bezdek, J.C.: Online clustering of multivariate time-series. In: Proceedings of SIAM International Conference on Data Mining, pp. 360–368 (2016)
MacQueen, J.: Some methods for classification and analysis of multivariate observations. In: Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, vol. 1, no. 14, pp. 281–297 (1967)
Ibrahim, O.A., Shao, J., Keller, J.M., Popescu, M.: A temporal analysis system for early detection of health changes. In: IEEE International Conference on Fuzzy Systems, FUZZ-IEEE, pp. 186–193 (2016)
Theodoridis, S., Koutroumbas, K.: Basic Sequential Algorithmic Scheme (BSAS). Academic Press, London (1999)
Theodoridis, S., Koutroumbas, K.: Pattern Recognition. Academic Press, London (2006)
Guha, S., Meyerson, A., Mishra, N., Motwani, R., O’Callaghan, L.: Clustering data streams: theory and practice. IEEE Trans. Knowl. Data Eng. 15(3), 515–528 (2003)
Aggarwal, C.C., Han, J., Wang, J., Yu, P.S.: On clustering massive data streams: a summarization paradigm. In: Aggarwal, C.C. (ed.) data streams. Advances in Database Systems, vol. 31, pp. 9–38. Springer, Boston (2007). https://doi.org/10.1007/978-0-387-47534-9_2
Ailon, N., Jaiswal, R., Monteleoni, C.: Streaming k-means approximation. In: Neural Information Processing Systems, vol. 22, pp. 10–18 (2009)
Salehi, M., Leckie, Christopher A., Moshtaghi, M., Vaithianathan, T.: A relevance weighted ensemble model for anomaly detection in switching data streams. In: Tseng, V.S., Ho, T.B., Zhou, Z.-H., Chen, A.L.P., Kao, H.-Y. (eds.) PAKDD 2014. LNCS (LNAI), vol. 8444, pp. 461–473. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-06605-9_38
Ackerman, M., Dasgupta, S.: Incremental clustering: the case for extra clusters. In: Advances in Neural Information Processing Systems, pp. 307–315 (2014)
Angelov, P., Zhou, X.: Evolving fuzzy-rule-based classifiers from data streams. IEEE Trans. Fuzzy Syst. 16(6), 1462–1475 (2008)
Ho, S.-S.: A martingale framework for concept change detection in time-varying data streams. In: Proceedings of the International Conference on Machine Learning, pp. 321–327 (2005)
Krishnapuram, R., Keller, J.M.: A possibilistic approach to clustering. IEEE Trans. Fuzzy Syst. I(2), 98–110 (1993)
Yang, M.-S., Lai, C.-Y.: A robust automatic merging possibilistic clustering method. IEEE Trans. Fuzzy Syst. 19(1), 26–41 (2011)
SensorScope (2007). http://lcav.epfl.ch/page-86035-en.html
Lichman, M.: UCI Machine Learning Repository. School of Information and Computer Science, University of California, Irvine, CA (2013). http://archive.ics.uci.edu/ml
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG, part of Springer Nature
About this paper
Cite this paper
Ibrahim, O.A., Du, Y., Keller, J. (2018). Robust On-Line Streaming Clustering. In: Medina, J., et al. Information Processing and Management of Uncertainty in Knowledge-Based Systems. Theory and Foundations. IPMU 2018. Communications in Computer and Information Science, vol 853. Springer, Cham. https://doi.org/10.1007/978-3-319-91473-2_40
Download citation
DOI: https://doi.org/10.1007/978-3-319-91473-2_40
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-91472-5
Online ISBN: 978-3-319-91473-2
eBook Packages: Computer ScienceComputer Science (R0)