Abstract
The issue of maintaining privacy in frequent itemset mining has attracted considerable attentions. In most of those works, only distorted data are available which may bring a lot of issues in the data-mining process. Especially, in the dynamic update distorted database environment, it is nontrivial to mine frequent itemsets incrementally due to the high counting overhead to recompute support counts for itemsets. This paper investigates such a problem and develops an efficient algorithm SA-IFIM for incrementally mining frequent itemsets in update distorted databases. In this algorithm, some additional information is stored during the earlier mining process to support the efficient incremental computation. Especially, with the introduction of supporting aggregate and representing it with bit vector, the transaction database is transformed into machine oriented model to perform fast support computation. The performance studies show the efficiency of our algorithm.
Supported by the Natural Science Foundation of China (No. 60402010), Zhejiang Provincial Natural Science Foundation of China (Y105250) and the Science-Technology Progrom of Zhejiang Province of China (No. 2004C31098).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Agrawal, R., Srikant, R.: Privacy-preserving data mining. In: Proceedings of SIGMOD, pp. 439–450 (2000)
Rizvi, S., Haritsa, J.: Maintaining data privacy in association rule mining. In: Proceedings of VLDB, pp. 682–693 (2002)
Agrawal, S., Krishnan, V., Haritsa, J.: On addressing efficiency concerns in privacy-preserving mining. In: Lee, Y., Li, J., Whang, K.-Y., Lee, D. (eds.) DASFAA 2004. LNCS, vol. 2973, pp. 113–124. Springer, Heidelberg (2004)
Xu, C., Wang, J., Dan, H., Pan, Y.: An Improved EMASK Algorithm for Privacy-Preserving Frequent Pattern Mining. In: Hao, Y., Liu, J., Wang, Y.-P., Cheung, Y.-m., Yin, H., Jiao, L., Ma, J., Jiao, Y.-C. (eds.) CIS 2005. LNCS (LNAI), vol. 3801, pp. 752–757. Springer, Heidelberg (2005)
Cheung, D., Han, J., Ng, V., Wong, C.: Maintenance of discovered association rules in large databases: An incremental updating tedchnique. In: Proceedings of ICDE, pp. 104–114 (1996)
Cheung, D., Lee, S., Kao, B.: A general incremental technique for updating discovered association rules. In: Proceedings of DASFAA, pp. 106–114 (1997)
Wang, J., Xu, C., Pan, Y.: An Incremental Algorithm for Mining Privacy-Preserving Frequent Itemsets. In: Airoldi, E., Blei, D.M., Fienberg, S.E., Goldenberg, A., Xing, E.P., Zheng, A.X. (eds.) ICML 2006. LNCS, vol. 4503, Springer, Heidelberg (2007)
Agrawal, R., Srikant, R.: Fast algorithms for mining association rules. In: Proceedings of VLDB, pp. 487–499 (1994)
http://dsl.serc.iisc.ernet.in/projects/software/software.html
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wang, J., Xu, C., Dan, H., Pan, Y. (2006). SA-IFIM: Incrementally Mining Frequent Itemsets in Update Distorted Databases. In: Li, X., Zaïane, O.R., Li, Z. (eds) Advanced Data Mining and Applications. ADMA 2006. Lecture Notes in Computer Science(), vol 4093. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11811305_9
Download citation
DOI: https://doi.org/10.1007/11811305_9
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-37025-3
Online ISBN: 978-3-540-37026-0
eBook Packages: Computer ScienceComputer Science (R0)