Abstract
We present Kernel-QuantTree Exponentially Weighted Moving Average (KQT-EWMA), a non-parametric change-detection algorithm that combines the Kernel-QuantTree (KQT) histogram and the EWMA statistic to monitor multivariate data streams online. The resulting monitoring scheme is very flexible, since histograms can be used to model any stationary distribution, and practical, since the distribution of test statistics does not depend on the distribution of datastream in stationary conditions (non-parametric monitoring). KQT-EWMA enables controlling false alarms by operating at a pre-determined Average Run Length (\(ARL_0\)), which measures the expected number of stationary samples to be monitored before triggering a false alarm. The latter peculiarity is in contrast with most non-parametric change-detection tests, which rarely can control the \(ARL_0\) a priori. Our experiments on synthetic and real-world datasets demonstrate that KQT-EWMA can control \(ARL_0\) while achieving detection delays comparable to or lower than state-of-the-art methods designed to work in the same conditions.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Alippi, C., Boracchi, G., Carrera, D., Roveri, M.: Change detection in multivariate datastreams: likelihood and detectability loss. In: International Joint Conference on Artificial Intelligence (IJCAI), vol. 2, pp. 1368–1374 (2016)
Boracchi, G., Carrera, D., Cervellera, C., Macciò, D.: QuantTree: histograms for change detection in multivariate data streams. In: Proceedings of the 35th International Conference on Machine Learning, vol. 80, pp. 639–648. PMLR (2018)
Carrera, D., Boracchi, G.: Generating high-dimensional datastreams for change detection. Big Data Res. 11, 11–21 (2018)
Frittoli, L., Carrera, D., Boracchi, G.: Change detection in multivariate datastreams controlling false alarms. In: Machine Learning and Knowledge Discovery in Databases. Research Track, pp. 421–436. Springer, Cham (2021)
Frittoli, L., Carrera, D., Boracchi, G.: Nonparametric and online change detection in multivariate datastreams using QuantTree. IEEE Trans. Knowl. Data Eng. 25(8), 8328–8342 (2022)
Kelly, M., Longjohn, R., Nottingham, K.: The UCI machine learning repository. https://archive.ics.uci.edu
Keriven, N., Garreau, D., Poli, I.: NEWMA: a new method for scalable model-free online change-point detection. IEEE Trans. Signal Process. 68, 3515–3528 (2020). https://doi.org/10.1109/TSP.2020.2990597
Kuncheva, L.I.: Change detection in streaming multivariate data using likelihood detectors. IEEE Trans. Knowl. Data Eng. 25(5), 1175–1180 (2013)
Li, S., Xie, Y., Dai, H., Song, L.: Scan B-statistic for kernel change-point detection. Seq. Anal. 38(4), 503–544 (2019)
Ross, G.J., Tasoulis, D.K., Adams, N.M.: Nonparametric monitoring of data streams for changes in location and scale. Technometrics 53(4), 379–389 (2011)
Souza, V.M.A., dos Reis, D.M., Maletzke, A.G., Batista, G.E.A.P.A.: Challenges in benchmarking stream learning algorithms with real-world data. Data Min. Knowl. Discov. 34, 1805–1858 (2020)
Stucchi, D., Rizzo, P., Folloni, N., Boracchi, G.: Kernel QuantTree. In: Proceedings of the 40th International Conference on Machine Learning (2023)
Vershynin, R.: How close is the sample covariance matrix to the actual covariance matrix? J. Theor. Probab. 25 (2010). https://doi.org/10.1007/s10959-010-0338-z
Wei, S., Xie, Y.: Online kernel CUSUM for change-point detection (2022). https://doi.org/10.48550/arXiv.2211.15070
Zamba, K.D., Hawkins, D.M.: A multivariate change-point model for statistical process control. Technometrics 48(4), 539–549 (2006)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2025 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Nogara Notarianni, M.O., Leveni, F., Stucchi, D., Frittoli, L., Boracchi, G. (2025). Change Detection in Multivariate Data Streams: Online Analysis with Kernel-QuantTree. In: Lemaire, V., et al. Advanced Analytics and Learning on Temporal Data. AALTD 2024. Lecture Notes in Computer Science(), vol 15433. Springer, Cham. https://doi.org/10.1007/978-3-031-77066-1_7
Download citation
DOI: https://doi.org/10.1007/978-3-031-77066-1_7
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-77065-4
Online ISBN: 978-3-031-77066-1
eBook Packages: Computer ScienceComputer Science (R0)