Abstract
In the non-stationary data stream distribution, concept drift occurs due to change in patterns with respect to time. It is necessary to identify drift in the data stream during the early stage. One way to explore the change in patterns is windowing, where two windows compare to find the difference in data distribution. In the two-window-based methods, the concept drift may occur much before the incoming window. The current window will wait to compare with a new incoming window’s data distribution for drift detection. It may lead to delay in detection, increasing misclassification error, and decreasing classification accuracy. The paper proposes DD-SCC-I and DD-KRC-I, incrementally adaptive single-window-based drift detection methods, to overcome the above issue. These methods localize the concept change by finding the correlation between attribute vectors. The proposed work deals with multi-dimensional data, binary-class classification, and multi-class classification problems. An improved two-window-based concept drift detection methods, DD-SCC-II and DD-KRC-II, are built to find drift using the same correlation. Further, the comparison is made among proposed methods in terms of the number of drift detected and drift detection times to demonstrate the behavior of methods. These proposed methods compare with state-of-the-art methods using real-time and synthetic data sets. The evaluation result shows DD-SCC-I and DD-KRC-I detect early drift with an increase in average rank of 4.18 and 4.56, respectively.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Aggarwal CC (2009) Data streams: An overview and scientific applications. Sci Data Min Knowl Discov 377–397
Khezri S, Tanha J, Ahmadi A, Sharifi A (2020) Stds: self-training data streams for mining limited labeled data in non-stationary environment. Appl Intell 1–20
Agrahari S, Srivastava S, Goswami S, Singh AK (2024) Identification of novelty and recurrent drift in the streaming environment. Multimed Tools Appl. https://doi.org/10.1007/s11042-024-20324-6
Agrahari S, Singh AK (2023) Unsupervised virtual drift detection method in streaming environment. In: Computer vision and machine intelligence: Proceedings of CVMI 2022, pp 311–322. Springer
Gonçalves PM, de Carvalho Santos SGT, Barros RSM, Vieira DCL (2014) A comparative study on concept drift detectors. Expert Syst Appl 41(18):8144–8156
Ning L, Zhang G, Jie L (2014) Concept drift detection via competence models. Artif Intell 209:11–28
Jie L, Liu A, Dong F, Feng G, Gama J, Zhang G (2018) Learning under concept drift: A review. IEEE Trans Knowl Data Eng 31(12):2346–2363
Frias-Blanco I, del Campo-Ávila J, Ramos-Jimenez G, Morales-Bueno R, Ortiz-Diaz A, Caballero-Mota Y (2014) Online and non-parametric drift detection methods based on hoeffding’s bounds. IEEE Trans Knowl Data Eng 27(3):810–823
Agrahari S, Srivastava S, Singh AK (2023) Review on novelty detection in the non-stationary environment. Knowl Inf Syst 1–26
Agrahari S, Singh AK (2021) Concept drift detection in data stream mining: A literature review. J King Saud Univ-Comput Inf Sci
de Barros RSM, González Hidalgo JI, de Lima Cabral DR (2018) Wilcoxon rank sum test drift detector. Neurocomputing 275:1954–1963
Agrahari S, Singh AK (2022) Adaptive pca-based feature drift detection using statistical measure. Clust Comput 25(6):4481–4494
Agrahari S, Singh AK (2022) Disposition-based concept drift detection and adaptation in data stream. Arab J Sci Eng 47(8):10605–10621
Lu J, Liu A, Song Y, Zhang G (2020) Data-driven decision support under concept drift in streamed big data. Complex Intell Syst 6(1):157–163
Bifet A, Gavalda R (2007) Learning from time-changing data with adaptive windowing. In Proceedings of the 2007 SIAM international conference on data mining, pp 443–448. SIAM
Gama J, Medas P, Castillo G, Rodrigues P (2004) Learning with drift detection. In: Brazilian symposium on artificial intelligence, pp 286–295. Springer
Ross GJ, Adams NM, Tasoulis DK, Hand DJ (2012) Exponentially weighted moving average charts for detecting concept drift. Pattern Recognit Lett 33(2):191–198
Song X, Wu M, Jermaine C, Ranka S (2007) Statistical change detection for multi-dimensional data. In: Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 667–676
Shao J, Ahmadi Z, Kramer S (2014) Prototype-based learning on concept-drifting data streams. In: Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining, pp 412–421
Qahtan AA, Alharbi B, Wang S, Zhang X (2015) A pca-based change detection framework for multidimensional data streams: Change detection in multidimensional data streams. In: Proceedings of the 21th ACM SIGKDD international conference on knowledge discovery and data mining, pp 935–944
Gu F, Zhang G, Lu J, Lin C-T (2016) Concept drift detection based on equal density estimation. In: 2016 International Joint Conference on Neural Networks (IJCNN), pp 24–30. IEEE
Lu N, Lu J, Zhang G, Lopez De Mantaras R (2016) A concept drift-tolerant case-base editing technique. Artif Intell 230:108–133
Li B, Alippi C, Zhao D (2016) A pdf-free change detection test based on density difference estimation. IEEE Trans Neural Netw Learn Syst 29(2):324–334
Li B, Zhao D, Alippi C (2017) An incremental change detection test based on density difference estimation. IEEE Trans Syst Man Cybern Syst 47(10):2714–2726
Page ES (1954) Continuous inspection schemes. Biometrika 41(1/2):100–115
Kifer D, Ben-David S, Gehrke J (2004) Detecting change in data streams. In: VLDB, vol 4, pp 180–191. Toronto, Canada
Liu A, Song Y, Zhang G, Lu J (2017) Regional concept drift detection and density synchronized drift adaptation. In: IJCAI international joint conference on artificial intelligence
Baena-Garcıa M, del Campo-Ávila J, Fidalgo R, Bifet A, Gavalda R, Morales-Bueno R (2006) Early drift detection method. In: Fourth international workshop on knowledge discovery from data streams, vol 6, pp 77–86
Barros RSM, Cabral DRL, Gonçalves Jr PM, Santos SGTC (2017) Rddm: Reactive drift detection method. Expert Syst Appl 90:344–355
Bifet A, Gavalda R (2009) Adaptive learning from evolving data streams. In: International symposium on intelligent data analysis, pp 249–260. Springer
Pears R, Sakthithasan S, Koh YS (2014) Detecting concept change in dynamic data streams. Mach Learn 97(3):259–293
Pesaranghader A, Viktor HL (2016) Fast hoeffding drift detection method for evolving data streams. In: Joint European conference on machine learning and knowledge discovery in databases, pp 96–111. Springer
Pesaranghader A, Viktor HL, Paquet E (2018) Mcdiarmid drift detection methods for evolving data streams. In: 2018 International Joint Conference on Neural Networks (IJCNN), pp 1–9. IEEE
Pesaranghader A, Viktor H, Paquet E (2018) Reservoir of diverse adaptive learners and stacking fast hoeffding drift detection methods for evolving data streams. Mach Learn 107(11):1711–1743
Gözüaçık Ö, Can F (2021) Concept learning using one-class classifiers for implicit drift detection in evolving data streams. Artif Intell Rev 54(5):3725–3747
Gao J, Fan W, Han J (2007) On appropriate assumptions to mine data streams: Analysis and practice. In: Seventh IEEE international conference on data mining (ICDM 2007), pp 143–152. IEEE
Hu H, Kantardzic M, Sethi TS (2020) No free lunch theorem for concept drift detection in streaming data classification: A review. Wiley Interdiscip Rev: Data Min Knowl Discov 10(2):e1327
Brzezinski D, Stefanowski J (2017) Prequential auc: properties of the area under the roc curve for data streams with concept drift. Knowl Inf Syst 52(2):531–562
Zar JH (2005) Spearman rank correlation. Encycl Biostat 7
Sedgwick P (2014) Spearman’s rank correlation coefficient. Bmj, 349
Abdi H (2007) The kendall rank correlation coefficient. Encyclopedia of Measurement and Statistics. Sage, Thousand Oaks, CA, pp 508–510
Demšar J (2006) Statistical comparisons of classifiers over multiple data sets. J Mach Learn Res 7:1–30
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Agrahari, S., Singh, A.K. Comparison based analysis of window approach for concept drift detection and adaptation. Appl Intell 55, 39 (2025). https://doi.org/10.1007/s10489-024-05890-4
Accepted:
Published:
DOI: https://doi.org/10.1007/s10489-024-05890-4