Article

Mining dependence rules by finding largest itemset support quota

Author:
Alexandr Savinov

Fraunhofer Institute for Autonomous Intelligent Systems, Sankt-Augustin, Germany


SAC '04: Proceedings of the 2004 ACM symposium on Applied computing, March 2004, Pages 525–529, https://doi.org/10.1145/967900.968011

Published: 14 March 2004


ABSTRACT

This paper describes a new data mining algorithm for finding the most interesting dependence rules. Dependence rules are derived from itemsets whose support differs significantly from its expected value, which is what makes them interesting. Because such itemsets are distributed non-monotonically in the lattice of all itemsets, the support monotonicity property cannot be used to search for them. Instead, we estimate upper and lower bounds on the support in order to find itemsets with a large interval of possible support values, called the support quota. Because the support quota is known to decrease monotonically, the search space can be restricted effectively. Strongly dependent itemsets are then selected by computing their expected support with the iterative proportional fitting algorithm and comparing it with the actual itemset support.
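The two quantities named in the abstract can be illustrated with a minimal Python sketch. It is not the paper's algorithm: the function names are hypothetical, the pair bounds are the standard Fréchet bounds (the paper may use tighter bounds built from more subsets), and the expected support of a three-item set is obtained by iterative proportional fitting against its pairwise presence/absence tables only, which is an assumption made here for illustration.

```python
from itertools import combinations, product


def support(transactions, itemset):
    """Fraction of transactions that contain every item of `itemset`."""
    items = set(itemset)
    return sum(1 for t in transactions if items <= t) / len(transactions)


def pair_support_quota(s_a, s_b):
    """Frechet bounds on supp({a, b}) given only supp({a}) and supp({b}).

    Returns (lower, upper, quota), where quota = upper - lower.
    """
    lower = max(0.0, s_a + s_b - 1.0)
    upper = min(s_a, s_b)
    return lower, upper, upper - lower


def expected_support_ipf(transactions, items, sweeps=50):
    """Expected support of `items` implied by its pairwise marginals.

    Assumption of this sketch: the fitted constraints are the observed
    2x2 presence/absence tables of every pair of items.
    """
    n, total = len(items), len(transactions)
    cells = list(product((0, 1), repeat=n))   # all presence patterns
    p = {c: 1.0 / len(cells) for c in cells}  # start from the uniform table
    pairs = list(combinations(range(n), 2))

    # Observed pairwise marginals that the fitted table must reproduce.
    target = {}
    for i, j in pairs:
        for vi, vj in product((0, 1), repeat=2):
            count = sum(
                1 for t in transactions
                if (items[i] in t) == bool(vi) and (items[j] in t) == bool(vj)
            )
            target[(i, j, vi, vj)] = count / total

    # IPF: rescale the table to match one pairwise marginal at a time.
    for _ in range(sweeps):
        for i, j in pairs:
            current = {}
            for c in cells:
                key = (c[i], c[j])
                current[key] = current.get(key, 0.0) + p[c]
            for c in cells:
                cur = current[(c[i], c[j])]
                if cur > 0.0:
                    p[c] *= target[(i, j, c[i], c[j])] / cur
    return p[(1,) * n]


# Toy data (made up): item c occurs exactly when a XOR b holds.
transactions = [{"a", "c"}, {"b", "c"}, {"a", "b"}, set()]
items = ("a", "b", "c")
actual = support(transactions, items)
expected = expected_support_ipf(transactions, items)
print("actual:", actual, "expected:", expected)
```

In the toy data every pair of items looks independent, so the fitted expectation for the triple is well above zero, while the triple never actually occurs. That gap between actual and expected support is the kind of dependence the abstract refers to.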




Published in
SAC '04: Proceedings of the 2004 ACM symposium on Applied computing
March 2004
1733 pages
ISBN: 1581138121
DOI: 10.1145/967900
Conference Chair: Hisham M. Haddad (Kennesaw State University)
Program Chairs: Andrea Omicini (Università degli Studi di Bologna a Cesena), Roger L. Wainwright (University of Tulsa)
Publications Chair: Lorie M. Liebrock (New Mexico Institute of Mining and Technology)
Copyright © 2004 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Publisher
Association for Computing Machinery
New York, NY, United States
Author Tags
data mining
dependence rules
expected support
support bounding
support quota
Qualifiers
- Article
Conference

Acceptance Rates
Overall Acceptance Rate: 1,650 of 6,669 submissions, 25%