Abstract
The paper focuses on the problem of privacy preserving for classification task. This issue is quite an important subject for the machine learning approach based on distributed databases. On the basis of the study of available works devoted to privacy we propose its new definition and its taxonomy. We use this taxonomy to create several modifications of k-nearest neighbors classifier which are consistent with the proposed privacy levels. Their computational complexity are evaluated on the basis of computer experiments.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Aha, D.W., Kibler, D., Albert, M.: K Instance-Based Learning Algorithms. Machine Learning 6, 37–66 (1991)
Alpaydin, E.: Introduction to Machine Learning, 2nd edn. The MIT Press, London (2010)
Angiulli, F., Folino, G.: Distributed Nearest Neighbor Based Condensation of Very Large Datasets. IEEE Transactions on Knowledge and Data Engineering 19(12), 1593–1606 (2007)
Asuncion, A., Newman, D.J.: UCI Machine Learning Repository. School of Information and Computer Science, University of California, Irvine (2007), http://www.ics.uci.edu/~mlearn/MLRepository.html
Beygelzimer, A., Kakade, S., Langford, J.: Cover trees for nearest neighbor. In: Proc. of the 23rd International Conference on Machine Learning, Pittsburgh, PA, pp. 97–104 (2006)
Chitti, S., Liu, L., Xiong, L.: Mining Multiple Private Databases isung Privacy Preserving kNN Classifier, Technical Reports TR-2006-008, Emory University (2006)
Clifton, C., Kantarcioglu, M., Vaidya, J., Lin, X., Zhu, M.Y.: Tools for privacy preserving data mining. In: SIGKDD Explorations, pp. 28–34 (2002)
Cost, S., Salzberg, S.: A Weighted Nearest Neighbor Algorithm for Learning with Symbolic Features. Machine Learning 10(1), 57–78 (1993)
Cover, T.M., Hart, P.E.: Nearest neighbor pattern classification. IEEE Trans. on Inform. Theory 13(1), 21–27 (1967)
Dasarathy, B.: Nearest Neighbor (NN) Norms NN Pattern Classification Techniques. IEEE Computer Society Press, Los Alamitos (1991)
Devroye, L.: On the inequality of cover and hart in nearest neighbor discrimination. IEEE Trans. on Pat. Anal. and Mach. Intel. 3, 75–78 (1981)
Duda, R.O., Hart, P.E., Stork, D.G.: Pattern Classification. Wiley Interscience, NewYork (2001)
Freitas, A.A., Lavington, S.H.: Mining Very Large Databases with Parallel Processing. Kluwer Academic Publishers, Boston (1998)
Gantz, J., Reinsel, D.: As the Economy Contracts, the Digital Universe Expands. IDC Multimedia Whitepaper (2009)
Hart, P.E.: The condensed nearest neighbor rule. IEEE Trans. on Inform. Th. 14(3), 515–516 (1968)
Hacigumus, H., Iyer, B., Li, C., Mehrotra, S.: Executing sql over encrypted data in the database service provider model. In: ACM SIGMOD Conference (2002)
Han, J.: Data Mining: Concepts and Techniques. Morgan Kaufmann Publ. Inc., San Francisco (2005)
Jajodia, S., Sandhu, R.: Toward a multilevel secure relational data model. In: ACM SIGMOD Conference (1991)
Kantarcioglu, M., Clifton, C.: Privacy preserving k-nn classifier. In: IEEE International Conference on Data Engineering, ICDE (2005)
Kuncheva, L.I.: Combining pattern classifiers: Methods and algorithms. Wiley-Interscience, New Jersey (2004)
Lindell, Y., Pinkas, B.: Privacy Preserving Data Mining. Journal of Cryptology 15(3), 177–206 (2004)
Moor, J.H.: The future of computer ethics: You ain’t seen nothing yet! In: Ethics and Information Technology, vol. 3, pp. 89–91. Kluwer Academic Publishers, Dordrecht (2001)
Nissenbaum, H.: Can we Protect Privacy in Public? In: Computer Ethics Philosophical Enquiry ACM/SIGCAS Conference, Rotterdam, The Netherlands (1997)
Teng, Z., Du, W.: A hybrid multi-group privacypreserving approach for building decision trees. In: Proceedings of the 10th Pacific-Asia Conference on Knowledge Discovery and Data Mining PAKDD 2006, pp. 296–307 (2006)
Westin, A.F.: Privacy and Freedom. The Bodley Head Ltd (1970)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Krawczyk, B., Wozniak, M. (2011). Privacy Preserving Models of k-NN Algorithm. In: Burduk, R., Kurzyński, M., Woźniak, M., Żołnierek, A. (eds) Computer Recognition Systems 4. Advances in Intelligent and Soft Computing, vol 95. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-20320-6_22
Download citation
DOI: https://doi.org/10.1007/978-3-642-20320-6_22
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-20319-0
Online ISBN: 978-3-642-20320-6
eBook Packages: EngineeringEngineering (R0)