Privacy Preserving Data Mining: A New Methodology for Data Transformation

Upadhayay, A. K.; Agarwal, Abhijat; Masand, Rachita; Gupta, Rajeev

doi:10.1007/978-81-8489-203-1_36

Abstract

Privacy preservation is today one of the major concerns in data mining. Research into privacy-preservation techniques is ongoing, but a concrete solution is still awaited. We address the privacy issue with a novel privacy-preserving data mining technique: an inverse cosine based transformation (ICT) method that protects the data before it is subjected to clustering or any other kind of analysis. We then develop a ‘privacy preserved k-clustering algorithm’ (PrivClust) by embedding the ICT method into the existing K-means clustering algorithm. This algorithm is explicitly designed with conversion to a privacy-preserving version in mind. The challenge was to meet privacy requirements while still guaranteeing valid clustering results. Simulation was carried out using Matlab. Our analysis and simulation show that the algorithm efficiently protects the intended information while yielding valid clustering results.
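The abstract does not state the ICT formula, so the following is only a minimal sketch of the general idea (transform each attribute with an inverse cosine, then cluster the transformed data), not the authors' PrivClust algorithm. It assumes, purely for illustration, that each attribute is min-max scaled to [-1, 1] before arccos is applied. The names ict_transform, transform_records and kmeans are hypothetical, and Python is used here in place of the Matlab simulation mentioned above.

```python
import math

def ict_transform(column):
    """Hypothetical ICT step: min-max scale a column to [-1, 1], then apply arccos.

    The scaling is an assumption; the abstract does not specify the formula.
    """
    lo, hi = min(column), max(column)
    span = hi - lo
    if span == 0:
        # Constant column: every value maps to the midpoint, arccos(0).
        return [math.acos(0.0) for _ in column]
    scaled = (2.0 * (v - lo) / span - 1.0 for v in column)
    # Clamp to guard against floating-point drift outside [-1, 1].
    return [math.acos(max(-1.0, min(1.0, s))) for s in scaled]

def transform_records(records):
    """Apply the assumed transformation to every attribute (column) of the records."""
    columns = list(zip(*records))
    transformed = [ict_transform(col) for col in columns]
    return [list(row) for row in zip(*transformed)]

def kmeans(points, k, iterations=20):
    """Plain K-means with deterministic initialisation (first k points as centroids)."""
    centroids = [list(p) for p in points[:k]]
    labels = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: nearest centroid by Euclidean distance.
        labels = [
            min(range(k), key=lambda j: math.dist(p, centroids[j]))
            for p in points
        ]
        # Update step: move each centroid to the mean of its members.
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                centroids[j] = [sum(dim) / len(members) for dim in zip(*members)]
    return labels

# Toy data (made up for illustration): two well-separated groups.
raw = [[1.0, 2.0], [9.0, 10.0], [1.5, 1.8], [9.5, 9.7], [1.2, 2.1], [9.1, 10.2]]
protected = transform_records(raw)

print("labels on raw data:        ", kmeans(raw, 2))
print("labels on transformed data:", kmeans(protected, 2))
```

Because arccos is monotonic on [-1, 1], this assumed transformation keeps the ordering of values within each attribute (reversed), which is why well-separated groups stay separable in the toy data. Whether the published method has the same property, and how much privacy it actually provides, cannot be judged from the abstract alone.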

Author information

Authors and Affiliations

Amity School of Engineering and Technology, Noida, U.P., India
A. K. Upadhayay, Abhijat Agarwal & Rachita Masand
Rajasthan Technical University, Kota, Rajasthan, India
Rajeev Gupta

Editor information

Editors and Affiliations

Indian Institute of Information Technology, Allahabad, India
U. S. Tiwary (Professor), Tanveer J. Siddiqui (Assistant Professor), M. Radhakrishna (Professor) & M. D. Tiwari (Director)

About this paper

Cite this paper

Upadhayay, A.K., Agarwal, A., Masand, R., Gupta, R. (2009). Privacy Preserving Data Mining: A New Methodology for Data Transformation. In: Tiwary, U.S., Siddiqui, T.J., Radhakrishna, M., Tiwari, M.D. (eds) Proceedings of the First International Conference on Intelligent Human Computer Interaction. Springer, New Delhi. https://doi.org/10.1007/978-81-8489-203-1_36

DOI: https://doi.org/10.1007/978-81-8489-203-1_36
Publisher Name: Springer, New Delhi
Print ISBN: 978-81-8489-404-2
Online ISBN: 978-81-8489-203-1
eBook Packages: Computer Science, Computer Science (R0)
