Tight upper bounds on the number of candidate patterns

Published: 01 June 2005


In the context of mining for frequent patterns using the standard levelwise algorithm, the following question arises: given the current level and the current set of frequent patterns, what is the maximal number of candidate patterns that can be generated on the next level? We answer this question by providing tight upper bounds, derived from a combinatorial result from the sixties by Kruskal and Katona. Our result is useful for protecting existing algorithms against a combinatorial explosion in the number of candidate patterns.
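As an illustration of the kind of bound the abstract refers to, the following is a minimal sketch based on the standard cascade form of the Kruskal-Katona theorem, not the paper's own code. It assumes patterns are itemsets, that the only input is the number m of frequent patterns at level k, and that a candidate at level k+1 is any (k+1)-set all of whose k-subsets are frequent. The function names `cascade` and `max_candidates` are hypothetical.

```python
from math import comb


def cascade(m: int, k: int) -> list[tuple[int, int]]:
    """Greedy k-cascade of m: m = C(a_k, k) + C(a_{k-1}, k-1) + ...

    Returns the terms as (a_i, i) pairs with a_k > a_{k-1} > ... >= 1.
    """
    terms = []
    i = k
    while m > 0 and i >= 1:
        a = i  # C(i, i) = 1 <= m, so a = i is always a valid start
        while comb(a + 1, i) <= m:
            a += 1  # largest a with C(a, i) <= m
        terms.append((a, i))
        m -= comb(a, i)
        i -= 1
    return terms


def max_candidates(m: int, k: int) -> int:
    """Kruskal-Katona style upper bound on level-(k+1) candidates,
    given m frequent patterns of size k (assumed setting, see above)."""
    return sum(comb(a, i + 1) for a, i in cascade(m, k))


if __name__ == "__main__":
    print(max_candidates(6, 2))  # 6 frequent pairs -> at most 4 triples
    print(max_candidates(5, 2))  # 5 frequent pairs -> at most 2 triples
```

For example, 6 frequent pairs can cover all pairs over 4 items, which yields 4 candidate triples; with only 5 frequent pairs, at most 2 triples can have all of their pairs frequent.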


Author Tags

  1. Data mining
  2. frequent patterns
  3. upper bounds


