Research on the Weighting Method Based on Tf-IDF and Apriori Algorithm | IEEE Conference Publication | IEEE Xplore

Research on the Weighting Method Based on Tf-IDF and Apriori Algorithm


Abstract:

Association rule mining algorithms, such as the well-known Apriori algorithm, utilize support and confidence measures to mine interesting correlations in transactional da...Show More

Abstract:

Association rule mining algorithms, such as the well-known Apriori algorithm, utilize support and confidence measures to mine interesting correlations in transactional data sets. Traditional support mechanisms cannot differentiate between projects of varying importance and therefore ignore uncommon but relevant terms. This paper introduces the concept of combining term frequency-inverse document frequency (TF-IDF) with weighted support, preferentially discovers association rules containing rare but key terms, and designs and optimizes the complex weighted Apriori algorithm. This study pioneers a novel TF-IDF weighted support metric that combines project support with the average TF-IDF of its component projects. This weighted support has been mathematically proven to comply with downward closure properties, enabling hierarchical exploration of weighted frequent patterns. This results in a more compact, high-quality rule set than traditional Apriori methods, while revealing dependencies ignored by confidence metrics.
Date of Conference: 23-25 September 2023
Date Added to IEEE Xplore: 22 January 2024
ISBN Information:

ISSN Information:

Conference Location: Dalian, China

Contact IEEE to Subscribe

References

References is not available for this document.