Skip to main content

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 226))

Abstract

In this paper we examine the possibilities of using popular univariate statistical tests for discovering virtual concept drift in the stream of multidimensional data. Three popular methods are evaluated with different generalization approaches both on simulated and real data and compared by the specificity and sensitivity scores.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 259.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 329.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Newman, D.J., Asuncion, A.: UCI machine learning repository (2007)

    Google Scholar 

  2. Babcock, B., Babu, S., Datar, M., Motwani, R., Widom, J.: Models and issues in data stream systems. In: Proceedings of the Twenty-First ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems, PODS 2002, pp. 1–16. ACM, New York (2002)

    Chapter  Google Scholar 

  3. Corder, G.W., Foreman, D.I.: Nonparametric Statistics for Non-Statisticians: A Step-by-Step Approach, 1st edn. Wiley (May 2009)

    Google Scholar 

  4. Dries, A., Rückert, U.: Adaptive concept drift detection. Stat. Anal. Data Min. 2(5-6), 311–327 (2009)

    Article  MathSciNet  Google Scholar 

  5. Fasano, G., Franceschini, A.: A multidimensional version of the Kolmogorov-Smirnov test. Monthy Notices of the Royal Astronomical Society 225, 155–170 (1987)

    Google Scholar 

  6. Friedman, J., Rafsky, L.: Multivariate generalizations of the wald-wolfowitz and smirnov two-sample tests. In: The Annals of Statistics, pp. 697–717 (1979)

    Google Scholar 

  7. Greiner, R., Grove, A.J., Roth, D.: Learning cost-sensitive active classifiers. Artif. Intell. 139(2), 137–174 (2002)

    Article  MathSciNet  Google Scholar 

  8. Lowry, R.: Concepts and Applications of Inferential Statistics. Online (June 2007)

    Google Scholar 

  9. Markou, M., Singh, S.: Novelty detection: a review–part 1: statistical approaches. Signal Processing 83(12), 2481–2497 (2003)

    Article  MATH  Google Scholar 

  10. Nishida, K., Yamauchi, K.: Detecting concept drift using statistical testing. In: Corruble, V., Takeda, M., Suzuki, E. (eds.) DS 2007. LNCS (LNAI), vol. 4755, pp. 264–269. Springer, Heidelberg (2007)

    Chapter  Google Scholar 

  11. Revuz, D., Yor, M.: Continuous Martingales and Brownian Motion (Grundlehren der mathematischen Wissenschaften), 3rd edn. Springer (December 2004)

    Google Scholar 

  12. Schlimmer, J.C., Granger Jr., R.H.: Incremental learning from noisy data. Mach. Learn. 1(3), 317–354 (1986)

    Google Scholar 

  13. Smirnov, N.V.: Table for estimating the goodness of fit of empirical distributions. Ann. Math. Stat. 19, 279–281 (1948)

    Article  MATH  Google Scholar 

  14. Sugiura, N.: Multisample and multivariate nonparametric tests based on u statistics and their asymptotic efficiencies. Osaka J. Math. 2, 385–426 (1965)

    MathSciNet  MATH  Google Scholar 

  15. Vreeken, J., van Leeuwen, M., Siebes, A.: Characterising the difference. In: Berkhin, P., Caruana, R., Wu, X. (eds.) KDD, pp. 765–774. ACM (2007)

    Google Scholar 

  16. Wald, A.: Sequential Tests of Statistical Hypotheses. The Annals of Mathematical Statistics 16(2), 117–186 (1945)

    Article  MathSciNet  MATH  Google Scholar 

  17. Wang, S., Schlobach, S., Klein, M.C.A.: Concept drift and how to identify it. J. Web Sem. 9(3), 247–265 (2011)

    Article  Google Scholar 

  18. Widmer, G., Kubat, M.: Effective learning in dynamic environments by explicit context tracking. In: Brazdil, P.B. (ed.) ECML 1993. LNCS, vol. 667, pp. 227–243. Springer, Heidelberg (1993)

    Chapter  Google Scholar 

  19. Wilcoxon, F.: Individual Comparisons by Ranking Methods. Biometrics Bulletin 1(6), 80–83 (1945)

    Article  Google Scholar 

  20. Wolfowitz, J.: On Waldś Proof of the Consistency of the Maximum Likelihood Estimate. The Annals of Mathematical Statistics 20, 601–602 (1949)

    Article  MathSciNet  MATH  Google Scholar 

  21. Žliobaitė, I.: Learning under Concept Drift: an Overview. Technical report, Vilnius University, Faculty of Mathematics and Informatic (2009)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Piotr Sobolewski .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer International Publishing Switzerland

About this paper

Cite this paper

Sobolewski, P., Woźniak, M. (2013). Comparable Study of Statistical Tests for Virtual Concept Drift Detection. In: Burduk, R., Jackowski, K., Kurzynski, M., Wozniak, M., Zolnierek, A. (eds) Proceedings of the 8th International Conference on Computer Recognition Systems CORES 2013. Advances in Intelligent Systems and Computing, vol 226. Springer, Heidelberg. https://doi.org/10.1007/978-3-319-00969-8_32

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-00969-8_32

  • Publisher Name: Springer, Heidelberg

  • Print ISBN: 978-3-319-00968-1

  • Online ISBN: 978-3-319-00969-8

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics