Article

On handling conflicts between rules with numerical features

Author:
Tony Lindgren

Stockholm University and Royal Institute of Technology, Kista, Sweden

Stockholm University and Royal Institute of Technology, Kista, Sweden
View Profile

SAC '06: Proceedings of the 2006 ACM symposium on Applied computingApril 2006Pages 37–41https://doi.org/10.1145/1141277.1141284

Published:23 April 2006Publication History

SAC '06: Proceedings of the 2006 ACM symposium on Applied computing

Pages 37–41

ABSTRACT

Rule conflicts can arise in machine learning systems that utilise unordered rule sets. A rule conflict is when two or more rules cover the same example but differ in their majority classes. This conflict must be solved before a classification can be made. The standard methods for solving this type of problem are to use naive Bayes to solve the conflict or using the most frequent class (CN2). This paper studies the problem of rule conflicts in the area of numerical features. A novel family of methods, called distance based methods, for solving rule conflicts in continuous domains is presented. An empirical evaluation between a distance based method, CN2 and naive Bayes is made. It is shown that the distance based method significantly outperforms both naive Bayes and CN2.

References

P. Clark and R. Boswell. Rule induction with CN2: Some recent improvements. In Proceedings of the Fifth European Working Session on Learning, pages 151--163, Berlin, 1991. Springer-Verlag. Google ScholarDigital Library
P. Clark and T. Niblett. The CN2 Induction Algorithm. Machine Learning, 3, 261--283, 1989. Google ScholarDigital Library
James Dougherty, Ron Kohavi, and Mehran Sahami. Supervised and unsupervised discretization of continuous features. In International Conference on Machine Learning, pages 194--202, 1995.Google ScholarCross Ref
Tom Fawcett. Using rule sets to maximize roc performance. In ICDM, pages 131--138, 2001. Google ScholarDigital Library
U. M. Fayyad and K. B. Irani. On the handling of continuous-valued attributes in decision tree generation. Machine Learning, 8:87--102, 1992. Google ScholarCross Ref
J. Fürnkranz and G. Widmer. Incremental Reduced Error Pruning. In Proceedings of the 11th International Conference on Machine Learning, 1994.Google ScholarCross Ref
Johannes Fürnkranz. Separate-and-Conquer Rule Learning. Artificial Intelligence Review, 1999. Google ScholarDigital Library
R. Kohavi, B. Becker, and D. Sommerfield. Improving simple Bayes. In Proceedings of the European Conference on Machine Learning, 1997.Google Scholar
T. Lindgren and H. Boström. Resolving rule conflicts with double induction. Intelligent Data Analysis - An International Journal, Volume 8, Number 5, 2004. Google ScholarDigital Library
Tony Lindgren. Methods for Rule Conflict Resolution. In Proceedings of the 15th European Conference on Machine Learning (ECML-04), pages 262--273. Springer, 2004.Google Scholar
Tony Lindgren and Henrik Boström. Classification with Intersecting Rules. In Proceedings of the 13th International Conference on Algorithmic Learning Theory (ALT'02), pages 395--402. Springer, 2002. Google ScholarDigital Library
Huan Liu, Farhad Hussain, Chew Lim Tan, and Manoranjan Dash. Discretization: An enabling technique. Data Min. Knowl. Discov., 6(4):393--423, 2002. Google ScholarDigital Library
J. R. Quinlan. Induction of decision trees. Machine Learning, 1:81--106, 1986. Google ScholarCross Ref
RDS. Rule Discovery System (RDS) --- 1.0, Compumine AB, 2003. www.compumine.com.Google Scholar

Index Terms

On handling conflicts between rules with numerical features
1. Computing methodologies
  1. Machine learning

Recommendations

Efficient learning of large sets of locally optimal classification rules
Abstract
Conventional rule learning algorithms aim at finding a set of simple rules, where each rule covers as many examples as possible. In this paper, we argue that the rules found in this way may not be the optimal explanations for each of the examples ...
Read More
Learning semantically coherent rules
DMNLP'14: Proceedings of the 1st International Conference on Interactions between Data Mining and Natural Language Processing - Volume 1202

The capability of building a model that can be understood and interpreted by humans is one of the main selling points of symbolic machine learning algorithms, such as rule or decision tree learners. However, those algorithms are most often optimized ...
Read More
Random rules from data streams
SAC '13: Proceedings of the 28th Annual ACM Symposium on Applied Computing

Existing works suggest that random inputs and random features produce good results in classification. In this paper we study the problem of generating random rule sets from data streams. One of the most interpretable and flexible models for data stream ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
SAC '06: Proceedings of the 2006 ACM symposium on Applied computing
April 2006
1967 pages
ISBN:1595931082
DOI:10.1145/1141277
Conference Chair:
Hisham M. Haddad
Kennesaw State University, Kennesaw, Georgia
Copyright © 2006 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 23 April 2006
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
numerical features
rule conflicts
rule learning
Qualifiers
- Article
Conference

Acceptance Rates
Overall Acceptance Rate1,650of6,669submissions,25%
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 3
  Total Citations
  View Citations
- 126
  Total Downloads
- Downloads (Last 12 months)0
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

On handling conflicts between rules with numerical features

SAC '06: Proceedings of the 2006 ACM symposium on Applied computing

ABSTRACT

References

Cited By

Index Terms

Recommendations

Efficient learning of large sets of locally optimal classification rules

Learning semantically coherent rules

Random rules from data streams