Abstract
The discovery of association relationship among the data in a huge database has been known to be useful in selective marketing, decision analysis, and business management. A significant amount of research effort has been elaborated upon the development of efficient algorithms for data mining. However, without fully considering the time-variant characteristics of items and transactions, it is noted that some discovered rules may be expired from users’ interest. In other words, some discovered knowledge may be obsolete and of little use, especially when we perform the mining schemes on a transaction database of short life cycle products. This aspect is, however, rarely addressed in prior studies. To remedy this, we broaden in this paper the horizon of frequent pattern mining by introducing a weighted model of transaction-weighted association rules in a time-variant database. Specifically, we propose an efficient Progressive Weighted Miner (abbreviatedly as PWM) algorithm to perform the mining for this problem as well as conduct the corresponding performance studies. In algorithm PWM, the importance of each transaction period is first reflected by a proper weight assigned by the user. Then, PWM partitions the time-variant database in light of weighted periods of transactions and performs weighted mining. Algorithm PWM is designed to progressively accumulate the itemset counts based on the intrinsic partitioning characteristics and employ a filtering threshold in each partition to early prune out those cumulatively infrequent 2-itemsets. With this design, algorithm PWM is able to efficiently produce weighted association rules for applications where different time periods are assigned with different weights and lead to results of more interest.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
R. Agrawal, T. Imielinski, and A. Swami. Mining Association Rules between Sets of Items in Large Databases. Proc. of ACM SIGMOD, pages 207–216, May 1993.
R. Agrawal and R. Srikant. Fast Algorithms for Mining Association Rules in Large Databases. Proc. of the 20th International Conference on Very Large Data Bases, pages 478–499, September 1994.
J. M. Ale and G. Rossi. An Approach to Discovering Temporal Association Rules. ACM Symposium on Applied Computing, 2000.
A. M. Ayad, N. M. El-Makky, and Y. Taha. Incremental mining of constrained association rules. Proc. of the First SIAM Conference on Data Mining, 2001.
C.-Y. Chang, M.-S. Chen, and C.-H. Lee. Mining General Temporal Association Rules for Items with Different Exhibition Periods. Proc. of the IEEE 2nd Intern’l Conf. on Data Mining (ICDM-2002), December.
M.-S. Chen, J. Han, and P. S. Yu. Data Mining: An Overview from Database Perspective. IEEE Transactions on Knowledge and Data Engineering, 8(6):866–883, December 1996.
M.-S. Chen, J.-S. Park, and P. S. Yu. Efficient Data Mining for Path Traversal Patterns. IEEE Transactions on Knowledge and Data Engineering, 10(2):209–221, April 1998.
X. Chen and I. Petr. Discovering Temporal Association Rules: Algorithms, Language and System. Proc. of 2000 Int. Conf. on Data Engineering, 2000.
D. Cheung, J. Han, V. Ng, and C. Y. Wong. Maintenance of Discovered Association Rules in Large Databases: An Incremental Updating Technique. Proc. of 1996 Int’l Conf. on Data Engineering, pages 106–114, February 1996.
J. Han and J. Pei. Mining Frequent Patterns by Pattern-Growth: Methodology and Implications. ACM SIGKDD Explorations (Special Issue on Scaleble Data Mining Algorithms), December 2000.
J. Hipp, U. Güntzer, and G. Nakhaeizadeh. Algorithms for association rule mining — a general survey and comparison. SIGKDD Explorations, 2(1):58–64, July 2000.
D. Kifer, C. Bucila, J. Gehrke, and W. White. Dualminer: A dual-pruning algorithm for itemsets with constraints. Proc. of the 8th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2002.
L. V. S. Lakshmanan, R. Ng, J. Han, and A. Pang. Optimization of Constrained Frequent Set Queries with 2-Variable Constraints. Proc. of 1999 ACM-SIGMOD Conf. on Management of Data, pages 157–168, June 1999.
C.-H. Lee, C.-R. Lin, and M.-S. Chen. On Mining General Temporal Association Rules in a Publication Database. Proc. of 2001 IEEE International Conference on Data Mining, November 2001.
C.-H. Lee, C.-R. Lin, and M.-S. Chen. Sliding-Window Filtering: An Efficient Algorithm for Incremental Mining. Proc. of the Tenth ACM Intern’l Conf. on Information and Knowledge Management, November 2001.
J.-L. Lin and M. H. Dunham. Mining Association Rules: Anti-Skew Algorithms. Proc. of 1998 Int’l Conf. on Data Engineering, pages 486–493, 1998.
B. Liu, W. Hsu, and Y. Ma. Mining Association Rules with Multiple Minimum Supports. Proc. of 1999 Int. Conf. on Knowledge Discovery and Data Mining, August 1999.
J.-S. Park, M.-S. Chen, and P. S. Yu. Using a Hash-Based Method with Transaction Trimming for Mining Association Rules. IEEE Transactions on Knowledge and Data Engineering, 9(5):813–825, October 1997.
J. Pei and J. Han. Can We Push More Constraints into Frequent Pattern Mining? Proc. of 2000 Int. Conf. on Knowledge Discovery and Data Mining, August 2000.
J. Pei, J. Han, and L. V. S. Lakshmanan. Mining Frequent Itemsets with Convertible Constraints. Proc. of 2001 Int. Conf. on Data Engineering, 2001.
R. Srikant and R. Agrawal. Mining Generalized Association Rules. Proc. of the 21th International Conference on Very Large Data Bases, pages 407–419, September 1995.
K. Wang, Y. He, and J. Han. Mining Frequent Itemsets Using Support Constraints. Proc. of 2000 Int. Conf. on Very Large Data Bases, September 2000.
W. Wang, J. Yang, and P. S. Yu. Efficient mining of weighted association rules (WAR). Proc. of the Seventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2000.
C. Yang, U. Fayyad, and P. Bradley. Efficient discovery of error-tolerant frequent itemsets in high dimensions. Proc. of the Seventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2001.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lee, CH., Ou, J.C., Chen, MS. (2003). Progressive Weighted Miner: An Efficient Method for Time-Constraint Mining. In: Whang, KY., Jeon, J., Shim, K., Srivastava, J. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2003. Lecture Notes in Computer Science(), vol 2637. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36175-8_45
Download citation
DOI: https://doi.org/10.1007/3-540-36175-8_45
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-04760-5
Online ISBN: 978-3-540-36175-6
eBook Packages: Springer Book Archive