skip to main content
10.1145/2245276.2245431acmconferencesArticle/Chapter ViewAbstractPublication PagessacConference Proceedingsconference-collections
research-article

Very Fast Decision Rules for multi-class problems

Published:26 March 2012Publication History

ABSTRACT

Decision rules are one of the most interpretable and flexible models for data mining prediction tasks. Till now, few works presented online, any-time and one-pass algorithms for learning decision rules in the stream mining scenario. A quite recent algorithm, the Very Fast Decision Rules (VFDR), learns set of rules, where each rule discriminates one class from all the other. In this work we extend the VFDR algorithm by decomposing a multi-class problem into a set of two-class problems and inducing a set of discriminative rules for each binary problem. The proposed algorithm maintains all properties required when learning from stationary data streams: online and any-time classifiers, processing each example once. Moreover, it is able to learn ordered and unordered rule sets. The new approach is evaluated on various real and artificial datasets. The new algorithm improves the performance of the previous version and is competitive with the state-of-the-art decision tree learning method for data streams.

References

  1. M. R. Berthold, C. Borgelt, F. Hoeppner, and F. Klawonn. Guide to Intelligent Data Analysis: How to Intelligently Make Sense of Real Data, volume 42 of Texts in Computer Science. Springer-Verlag, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. A. Bifet, G. Holmes, R. Kirkby, and B. Pfahringer. Moa: Massive online analysis. Journal of Machine Learning Research (JMLR), 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. P. Clark and R. Boswell. Rule induction with cn2: Some recent improvements. pages 151--163. Springer-Verlag, 1991. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. P. Clark and T. Niblett. The CN2 induction algorithm. Machine Learning, 3: 261--283, 1989. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. W. Cohen. Fast effective rule induction. In A. Prieditis and S. Russel, editors, Machine Learning, Proceedings of the 12th International Conference. Morgan Kaufmann, 1995.Google ScholarGoogle Scholar
  6. P. Domingos. Unifying instance-based and rule-based induction. Machine Learning, 24: 141--168, 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. P. Domingos and G. Hulten. Mining high-speed data streams. In KDD, pages 71--80, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. F. Ferrer, J. Aguilar, and J. Riquelme. Incremental rule learning and border examples selection from numerical data streams. Journal of Universal Computer Science, 11(8): 1426--1439, 2005.Google ScholarGoogle Scholar
  9. A. Frank and A. Asuncion. UCI machine learning repository, 2010.Google ScholarGoogle Scholar
  10. E. Frank and I. H. Witten. Generating accurate rule sets without global optimization. In J. Shavlik, editor, Proceedings of the 15th International Conference - ICML'98, pages 144--151. Morgan Kaufmann, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. J. Fürnkranz. Round robin rule learning. In Proceedings of the 18th International Conference on Machine Learning (ICML-01): 146--153, pages 146--153. Morgan Kaufmann, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. J. Gama, R. Fernandes, and R. Rocha. Decision trees for mining data streams. Intelligent Data Analysis, 10: 23--45, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. J. Gama and P. Kosina. Learning decision rules from data streams. In Proceedings of the 22nd International Joint Conference on Artificial Intelligence, pages 1255--1260. AAAI, Menlo Park, USA, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. J. Gama, R. Rocha, and P. Medas. Accurate decision trees for mining high-speed data streams. In Proceedings of the Ninth International Conference on Knowledge Discovery and Data Mining. ACM Press, New York, NY, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. G. Hulten, L. Spencer, and P. Domingos. Mining time-changing data streams. In Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining, pages 97--106. ACM, New York, NY, USA, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. J. Z. Kolter and M. A. Maloof. Dynamic weighted majority: A new ensemble method for tracking concept drift. In Proceedings of the 3th International IEEE Conference on Data Mining, pages 123--130. IEEE Computer Society, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. M. Maloof and R. Michalski. Incremental learning with partial instance memory. Artificial Intelligence, 154: 95--126, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. J. R. Quinlan and R. M. Cameron-Jones. Induction of logic programs: Foil and related systems. New Generation Comput., 13(3&4): 287--312, 1995.Google ScholarGoogle ScholarCross RefCross Ref
  19. R. Quinlan. C4.5: Programs for Machine Learning. Morgan Kaufmann Publishers, Inc. San Mateo, CA, 1993. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. R. Rivest. Learning decision lists. Machine Learning, 2: 229--246, 1987. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. J. C. Schlimmer and R. H. Granger. Incremental learning from noisy data. Machine Learning, 1: 317--354, 1986. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. W. N. Street and Y. Kim. A streaming ensemble algorithm SEA for large-scale classification. pages 377--382. ACM Press, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. S. Weiss and N. Indurkhya. Predictive Data Mining, a practical Guide. Morgan Kaufmann Publishers, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. G. Widmer and M. Kubat. Learning in the presence of concept drift and hidden contexts. Machine Learning, 23: 69--101, 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Very Fast Decision Rules for multi-class problems

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      SAC '12: Proceedings of the 27th Annual ACM Symposium on Applied Computing
      March 2012
      2179 pages
      ISBN:9781450308571
      DOI:10.1145/2245276
      • Conference Chairs:
      • Sascha Ossowski,
      • Paola Lecca

      Copyright © 2012 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 26 March 2012

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      SAC '12 Paper Acceptance Rate270of1,056submissions,26%Overall Acceptance Rate1,650of6,669submissions,25%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader