Abstract
The discovery of diversity patterns from binary data is an important data mining task. This paper proposes entropy l-diversity patterns based on information theory, and develops techniques for discovering such diversity patterns. We study the properties of the entropy l-diversity patterns, and propose some pruning strategies to speed our mining algorithm. Experiments show that our mining algorithm is fast in practice. For real datesets the running time are improved by serval orders of magnitude over brute force method.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Cover, T., Thomas, J.: Elements of Information Theory. Wiley Interscience, Hoboken (1991)
Knobbe, A., Ho, E.: Maximally informative k-itemsets and their efficient discovery. In: KDD, pp. 237–244 (2006)
Machanavajjhala, A., et al.: l-Diversity: Privacy Beyond k-Anonymity. In: ICDE 2006 (2006)
Pan, F., Roberts, A., McMillan, L., de Villena, F., Threadgill, D., Wang, W.: Sample selection for maximal diversity. In: Perner, P. (ed.) ICDM 2007. LNCS, vol. 4597. Springer, Heidelberg (2007)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Sha, C., Gong, J., Zhou, A. (2009). Mining Entropy l-Diversity Patterns. In: Zhou, X., Yokota, H., Deng, K., Liu, Q. (eds) Database Systems for Advanced Applications. DASFAA 2009. Lecture Notes in Computer Science, vol 5463. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-00887-0_34
Download citation
DOI: https://doi.org/10.1007/978-3-642-00887-0_34
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-00886-3
Online ISBN: 978-3-642-00887-0
eBook Packages: Computer ScienceComputer Science (R0)