Skip to main content

Knowledge Maintenance on Data Streams with Concept Drifting

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 3314))

Abstract

Concept drifting in data streams often occurs unpredictably at any time. Currently many classification mining algorithms deal with this problem by using an incremental learning approach or ensemble classifiers approach. However, both of them can not make a prediction at any time exactly. In this paper, we propose a novel strategy for the maintenance of knowledge. Our approach stores and maintains knowledge in ambiguous decision table with current statistical indicators. With our disambiguation algorithm, a decision tree without any time problem can be synthesized on the fly efficiently. Our experiment results have shown that the accuracy rate of our approach is higher and smoother than other approaches. So, our algorithm is demonstrated to be a real anytime approach.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Schlimmer, J.C., Granger II., J.R.: Beyond incremental processing: Tracking concept drift. In: AAAI National Conference on Artificial Intelligence, Philadelphia, PA, USA, pp. 502–507. AAAI Press, Menlo Park (1986)

    Google Scholar 

  2. Hulten, G., Spencer, L., Domingos, P.: Mining time-changing data streams. In: ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, pp. 97–106 (2001)

    Google Scholar 

  3. Jin, R., Agrawal, G.: Efficient decision tree construction on streaming data. In: ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2003)

    Google Scholar 

  4. Kalles, D., Morris, T.: Efficient incremental induction of decision trees. Machine Learning 24, 231–242 (1996)

    Google Scholar 

  5. Wang, H., Fan, W., Yu, P., Han, J.: Mining concept-drifting data streams using ensemble classifiers. In: ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Washington, DC, USA (2003)

    Google Scholar 

  6. Street, W.N., Kim, Y.: A streaming ensemble algorithm (sea) for large-scale classi- fication. In: ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, pp. 377–382 (2001)

    Google Scholar 

  7. Kolter, J.Z., Maloof, M.A.: Dynamic weighted majority: A new ensemble method for tracking concept drift. In: International Conference on Data Engineering, Bangalore, India (2003)

    Google Scholar 

  8. Quinlan, J.R.: C4.5: Programs for Machine Learning. Morgan Kaufmann, San Mateo (1993)

    Google Scholar 

  9. Colomb, R.M.: Representation of propositional expert systems as partial functions. Artificial Intelligence 109, 187–209 (1999)

    Article  MATH  MathSciNet  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2004 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Natwichai, J., Li, X. (2004). Knowledge Maintenance on Data Streams with Concept Drifting. In: Zhang, J., He, JH., Fu, Y. (eds) Computational and Information Science. CIS 2004. Lecture Notes in Computer Science, vol 3314. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30497-5_110

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-30497-5_110

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-24127-0

  • Online ISBN: 978-3-540-30497-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics