skip to main content
article

Towards long pattern generation in dense databases

Published:01 July 2001Publication History
Skip Abstract Section

Abstract

This paper discusses the problem of long pattern generation in dense databases. In recent years, there has been an increase of interest in techniques for maximal pattern generation. We present a survey of this class of methods for long pattern generation which differ considerably from the level-wise approach of traditional methods. Many of these techniques are rooted in combinatorial tricks which can be applied only when the generation of frequent patterns is not forced to be level wise. We present an overview of the different kinds of methods which can be used in order to improve the counting and search space exploration methods for long patterns.

References

  1. R. C. Agarwal, C. C. Aggarwal, V. V. V. Prasad. A Tree Projection Algorithm for generation of frequent itemsets. Journal on Parallel and Distributed Computing, Vol. 61, No. 3, pp. 350-371, March 2001.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. R. C. Agarwal, C. C. Aggarwal, V. V. V. Prasad. Depth First Generation of Long Patterns. Proceedings of the ACM SIGKDD Conference, 2000.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. R. Agrawal, T. Imielinski, A. Swami. Mining Association Rules between Sets of Items in Very Large Databases. ACM SIGMOD Conference Proceedings, pages 207-216, 1993.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. R. Agrawal, R. Srikant. Fast Algorithms for Mining Association Rules. VLDB Conference Proceedings, pages 487-499, 1994.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. R. J. Bayardo. Efficiently Mining Long Patterns from Databases. ACM SIGMOD Conference Proceedings, pages 85-93, 1998.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. R. J. Bayardo, R. Agrawal, D. Gunopulos. Constraint-Based Rule Mining in Large Dense Databases. ICDE Conference Proceedings, 1999.]]Google ScholarGoogle Scholar
  7. S. Brin, R. Motwani, J. D. Ullman, S. Tsur. Dynamic Itemset Counting and Implication Rules for Market Basket Data. ACM SIGMOD Conference Proceedings, 1997.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. D. Burdick, M. Calimlim, J. Gehrke. MAFIA: A Maximal Frequent Itemset Algorithm for Transactional Databases. Proceedings of the ICDE Conference, 2001.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. B. Dunkel, N. Soparkar. Data Organization and Access for Efficient Data Mining. ICDE Conference Proceedings, pages 522-529, 1999.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. D. Gunopulos, H. Mannila, S. Saluja. Discovering All Most Specific Sentences by Randomized Algorithms. ICDT Conference Proceedings, pages 215-229, 1997.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. J. Han, J. Pei, Y. Yin. Mining Frequent Patterns without Candidate Generation. ACM SIGMOD Conference Proceedings, pages 1-12, 2000.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. N. Pasquier, Y. Bastide, R. Taouil, L. Lakhal. Discovering Frequent Closed Itemsets for Association Rules. ICDT Conference Proceedings, 1999.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. J. Pei, J. Han, R. Mao. CLOSET: An Efficient Algorithm for Mining Frequent Closed Itemsets. DMKD, 2000.]]Google ScholarGoogle Scholar
  14. J. Han, J. Pei, B. Mortazavi, Q. Chen, U. Dayal, M.-C. Hsu. FreeSpan: Frequent Pattern-Projected Sequential Pattern Mining. Proceedings of the ACM KDD Conference, 2000.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. J. Pei, J. Han, L. Lakshmanan. Mining Frequent Itemsets with Convertible Constraints. Proceedings of the ICDE Conference, 2001.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. D. Lin, Z. M. Kedem. Pincer-Search: A New Algorithm for Discovering the Maximum Frequent Itemset. EDBT Conference Proceedings, pages 105-119, 1998.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. H. Mannila, H. Toivonen, A. I. Verkamo. Efficient algorithms for discovering association rules. AAAI Workshop on KDD, 1994.]]Google ScholarGoogle Scholar
  18. I. Rigoutsos, A. Floratos. Combinatorial Pattern Discovery in Biological Sequences. Bioinformatics, 14(1): pages 55-67, 1998.]]Google ScholarGoogle Scholar
  19. R. Rymon. Search Through Systematic Set Enumeration. International Conference on Principles of Knowledge Representation and Reasoning, 1992.]]Google ScholarGoogle Scholar
  20. A. Savasere, E. Omiecinski, S. B. Navathe. An Efficient Algorithm for Mining Association Rules in Large Databases. VLDB Conference Proceedings, pages 432-444, 1995.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. P. Shenoy et al. Turbo-charging Vertical Mining of Large Databases. ACM SIGMOD Conference Proceedings, 2000.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. H. Toivonen. Sampling Large Databases for Association Rules. VLDB Conference Proceedings, pages 134-145, 1996.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. G. I. Webb. OPUS: An efficient admissible algorithm for unordered search. Journal of Artificial Intelligence Research, 3:45-83, 1996.]]Google ScholarGoogle Scholar
  24. M. J. Zaki. Scalable Algorithms for Association Rule Mining. IEEE TKDE Journal, 12(3), pp. 372-390, May/June 2000.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. M. J. Zaki. Generating non-redundant association rules. Proceedings of the ACM SIGKDD Conference, 2000.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. M. J. Zaki, C. Hsiao. CHARM: An Efficient Algorithm for Closed Association Rule Mininf. Technical Report, RPI, 1999.]]Google ScholarGoogle Scholar
  27. M. J. Zaki, S. Parthasarathy, M. Ogihara, W. Li. New Algorithms for Fast Discovery of Association Rules. KDD Conference Proceedings, pages 283-286, 1997.]]Google ScholarGoogle Scholar

Index Terms

  1. Towards long pattern generation in dense databases
      Index terms have been assigned to the content through auto-classification.

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in

      Full Access

      • Published in

        cover image ACM SIGKDD Explorations Newsletter
        ACM SIGKDD Explorations Newsletter  Volume 3, Issue 1
        July 2001
        50 pages
        ISSN:1931-0145
        EISSN:1931-0153
        DOI:10.1145/507533
        Issue’s Table of Contents

        Copyright © 2001 Author

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 1 July 2001

        Check for updates

        Qualifiers

        • article

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader