Skip to main content
Log in

Maintaining discovered frequent itemsets: Cases for changeable database and support

  • Published:
Journal of Computer Science and Technology Aims and scope Submit manuscript

Abstract

Mining frequent itemsets from large databases has played an essential role in many data mining tasks. It is also important to maintain the discovered frequent itemsets for these data mining tasks when the database is updated. All algorithms proposed so far for the maintenance of discovered frequent itemsets are only performed with a fixed minimum support, which is the same as that used to obtain the discovered frequent itemsets. That is, users cannot change the minimum support even if the new results are unsatisfactory to the users. In this paper two new complementary algorithms, FMP (First Maintaining Process) and RMP (Repeated Maintaining Process), are proposed to maintain discovered frequent itemsets in the case that new transaction data are added to a transaction database. Both algorithms allow users to change the minimum support for the maintenance processes. FMP is used for the first maintaining process, and when the result derived from the FMP is unsatisfactory, RMP will be performed repeatedly until satisfactory results are obtained. The proposed algorithms re-use the previous results to cut down the cost of maintenance. Extensive experiments have been conducted to assess the performance of the algorithms. The experimental results show that the proposed algorithms are very resultful compared with the previous mining and maintenance algorithms for maintenance of discovered frequent itemsets.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Similar content being viewed by others

References

  1. Agrawal R, Srikant R. Fast algorithms for mining association rules. InVLDB'94, Santiago de Chile, Sept. 1994, pp.487–499.

  2. Agrawal R, Srikant R. Mining sequential patterns. InICDE'95, Taipei, March 1995, pp.3–14.

  3. Pei Jet al. PrefixSpan: Mining sequential patterns efficiently by prefix-projected pattern growth. InICDE'01, Heiderberg, Germany, April 2001, pp.215–224.

  4. Pei, J, Han J, Mao R. CLOSET: An efficient algorithm for mining frequent closed itemsets. InProc. ACMSIGMOD Int. Workshop Data Mining and Knowledge Discovery, Texas, USA, May 2000, pp.11–20.

  5. Bayardo R J. Efficiently mining long patterns from databases. InSIGMOD'98, pp.85–93.

  6. Liu B, Hsu W, Ma Y. Integrating classification and association rule mining. InKDD'98, New York, USA, August 1998, pp80–86.

  7. Wang K, Zhou S, Liew S C. Building hierarchical classifiers using class proximity. InVLDB'99, Edinburgh, Scotland, UK, Sept. 1999, pp.363–374.

  8. Kamber M, Han J, Chiang J Y. Metarule-guided mining of multi-dimensional association rules using data cubes. InKDD'97, California, USA, August 1997, pp.207–210.

  9. Han J, Dong G, Yin Y. Efficient mining of partial periodic patterns in time series database. InICDE'99, Sydney, Australia, March 1999, pp.106–115.

  10. Agrawal R, Gehrke J, Gunopulos D, Raghavan P. Automatic subspace clustering of high dimensional data for data mining applications. InSIGMOD'98, New York, USA, June 1998, pp.94–105.

  11. Grahne G, Lakshmanan L, Wang X. Efficient mining of constrained correlated sets. InICDE'00, California, USA, March 2000, pp.512–521.

  12. Park J S, Chen M S, Yu P S. An effective hash-based algorithm for mining association rules. InSIGMOD'95, California, USA, May 1995, pp.175–186.

  13. Srikant R, Vu Q, Agrawal R. Mining association rules with item constraints. InKDD'97, California, USA, August 1997, pp.67–73.

  14. Han J, Pei J, Yin Y. Mining frequent patterns without candidate generation. InSIGMOD'00, Texas, USA, May 2000, pp.1–12.

  15. Agrawal R C, Aggrawal C C, Prasad V V V. A tree projection algorithm for generation of frequent item sets.Journal of Parallel and Distributed Computing, 2001, 61: 350–371.

    Article  Google Scholar 

  16. Cheung D Wet al. Maintenance of discovered association rules in large databases: An incremental updating technique. InICED'96, Louisiana, USA, March 1996, pp.106–114.

  17. Cheung D W, Ng V, Tam B W. Maintenance of discovered knowledge: A case in multi-level association rules. InKDD'96, Oregon, USA, 1996, pp.307–310.

  18. Cheung D Wet al. A general incremental technique for updating discovered association rules. InProc. Int. Conf. on Database Systems for Advanced Application, Melbourne, Australia, April, 1997, pp.185–194.

  19. Lee S D, Cheung D W. Maintenance of discovered association rules: When to update? InProc. ACM-SIGMOD Workshop on Data Mining and Knowledge Discovery, Tucson, Arizona. May, 1997.

  20. Lee S D, Cheung D W, Kao B. Is sampling useful in data mining? A case in the maintenance of discovered association rules.Data Mining and Knowledge Discovery, Kluwer Academic Publishers, September, 1998, 2(3): 233–262.

    Article  Google Scholar 

  21. Du X P, Kaneko K, Makinouchi A. Fast algorithm to find frequent itemsets for mining of association rules. InProc. Int. Conf. Info. Society in the 21st Century: Emerging Technologies and New Challenges, Fukushima, Japan, Nov., 2000, pp.408–414.

  22. Du X P, Kaneko K, Makinouchi A. Two fast algorithms for repeated mining of association rules based on resource reuse. InICEIS 1999, Setubal, Portugal, Mar., 1999, pp.65–73.

  23. Du X P, Kaneko K, Makinouchi A. Efficient algorithms to repeatedly mine large itemsets with different minimum supports.Res. Rep. ISEE Kyushu University, March 2000, 5(1): 81–86.

    Google Scholar 

  24. Liu J, Yin J. Towards efficient data re-ming. InPAKDD'01, Hong Kong, Cina, April 2001, pp.406–412.

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Du XiaoPing.

Additional information

This work was supported in part by the National Grand Fundamental Research 973 Program of China under Grant No.G1999032705, and the Returned Overseas Scholar Research Foundation of the Ministry of Education of China.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Du, X., Tang, S. & Akifumi, M. Maintaining discovered frequent itemsets: Cases for changeable database and support. J. Comput. Sci. & Technol. 18, 648–658 (2003). https://doi.org/10.1007/BF02947125

Download citation

  • Received:

  • Revised:

  • Issue Date:

  • DOI: https://doi.org/10.1007/BF02947125

Keywords

Navigation