Abstract
Concept drifting in data streams often occurs unpredictably at any time. Currently many classification mining algorithms deal with this problem by using an incremental learning approach or ensemble classifiers approach. However, both of them can not make a prediction at any time exactly. In this paper, we propose a novel strategy for the maintenance of knowledge. Our approach stores and maintains knowledge in ambiguous decision table with current statistical indicators. With our disambiguation algorithm, a decision tree without any time problem can be synthesized on the fly efficiently. Our experiment results have shown that the accuracy rate of our approach is higher and smoother than other approaches. So, our algorithm is demonstrated to be a real anytime approach.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Schlimmer, J.C., Granger II., J.R.: Beyond incremental processing: Tracking concept drift. In: AAAI National Conference on Artificial Intelligence, Philadelphia, PA, USA, pp. 502–507. AAAI Press, Menlo Park (1986)
Hulten, G., Spencer, L., Domingos, P.: Mining time-changing data streams. In: ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, pp. 97–106 (2001)
Jin, R., Agrawal, G.: Efficient decision tree construction on streaming data. In: ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2003)
Kalles, D., Morris, T.: Efficient incremental induction of decision trees. Machine Learning 24, 231–242 (1996)
Wang, H., Fan, W., Yu, P., Han, J.: Mining concept-drifting data streams using ensemble classifiers. In: ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Washington, DC, USA (2003)
Street, W.N., Kim, Y.: A streaming ensemble algorithm (sea) for large-scale classi- fication. In: ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, pp. 377–382 (2001)
Kolter, J.Z., Maloof, M.A.: Dynamic weighted majority: A new ensemble method for tracking concept drift. In: International Conference on Data Engineering, Bangalore, India (2003)
Quinlan, J.R.: C4.5: Programs for Machine Learning. Morgan Kaufmann, San Mateo (1993)
Colomb, R.M.: Representation of propositional expert systems as partial functions. Artificial Intelligence 109, 187–209 (1999)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Natwichai, J., Li, X. (2004). Knowledge Maintenance on Data Streams with Concept Drifting. In: Zhang, J., He, JH., Fu, Y. (eds) Computational and Information Science. CIS 2004. Lecture Notes in Computer Science, vol 3314. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30497-5_110
Download citation
DOI: https://doi.org/10.1007/978-3-540-30497-5_110
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-24127-0
Online ISBN: 978-3-540-30497-5
eBook Packages: Computer ScienceComputer Science (R0)