Abstract
Current advances in parallel processing technology aims at providing unmatched degree of computational power in upcoming days. Parallel computation is an efficient form of information processing which exploits the concurrency of execution. This paper investigates the use of parallel programming, when applied on k Nearest Neighbors (kNN) algorithm which is intended for classification and prediction of the large dataset. Breast cancer dataset is used for classification and prediction which consists of two labels namely, malignant and benign. kNN is a non-parametric algorithm which makes use of similarity measure to classify the dataset into different categories. The similarity between the data points is computed by using Euclidean distance formula. Multiple threads are created for parallel processing and an appropriate kNN graph is constructed, which helps in easier implementation. Finally, execution speeds for sequential and parallel programs is recorded. The results are verified by using frameworks namely, Message Passing Interface (MPI) and Compute Unified Device Architecture (CUDA) highlighting that parallel execution takes less time when compared to sequential execution.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Cheung, D.W., Lee, S.D., Xiao, Y.: Effect of data skewness and workload balance in parallel data mining. IEEE Trans. Knowl. Data Eng. 14(3) (2002)
Gavahi, M., Mirzaei, R., Nazarbeygi, A., Ahmadzadeh, A., Gorgin, S.: High performance GPU implementation of k-NN based on mahalanobis distance. In: 2015 International Symposium Computer Science and Software Engineering (CSSE), pp. 1–6 (2015)
Cover, T., Hart, P.: Nearest neighbor pattern classification. IEEE Trans. Inf Theory 13(1), 21–27 (1967)
Huang, L., Li, Z..: A novel method of parallel gpu implementation of knn used in text classification. In: 2013 Fourth International Conference on Networking and Distributed Computing (ICNDC), pp. 6–8 (2013)
Shenshen, L., et al.: CUKNN: a parallel implementation of K-nearest neighbor on CUDA-enabled GPU. In: IEEE Youth Conference on Information Computing and Telecommunication 2009, YC-ICT’09, (2009). IEEE
Wang, D., Zheng, Y., Cao, J.: Parallel construction of approximate kNN Graph. In: Proceeding of IEEE DCABES, pp. 22–26 (2012)
Despotovski, F.., Gusev, M., Zdraveski, V.: Parallel implementation of k-nearest-neighbors for face recognition. In: 26th Telecommunications Forum (TELFOR), pp. 1–4 (2018)
AparÃcio, G., Blanquer, I., Hernández, V.: A parallel implementation of the k nearest neighbours classifier in three levels: threads, MPI processes and the grid. In: Proceedings of the 7 International Conference on High Performance Computing For Computational Science, pp. 225–235 (2006)
Vajda, S., Santosh, K.C.: Fast k-nearest neighbor classifier using unsupervised clustering. In: Recent Trends in Image Processing and Pattern Recognition, pp. 185–193 (2016)
Chaudhari, P., et al.: Data augmentation for cancer classification in oncogenomics: an improved kNN based approach. In: Evolutionary Intelligence, Springer, Germany, pp. 1–10 (2019)
Bhateja, V., Misra, M., Urooj, S.:. Non-Linear polynomial filters for contrast enhancement of mammograms. In: Non-Linear Filters for Mammogram Enhancement. Springer, Singapore, pp. 123–162 (2020)
Bhateja, V., et al.: Classification of mammograms using sigmoidal transformation and SVM. In: Smart Computing and Informatics. Springer, Singapore, pp. 193–199 (2018)
Navlani, A.: KNN Classification using Scikit-learn. DataCamp Community, 2019. (Online). Available: https://www.datacamp.com/community/tutorials/k-nearest-neighbor-classification-scikit-learn. Accessed 01 Oct 2019
UCI machine learning repository: breast cancer wisconsin (Original) Data Set Archive.ics.uci.edu, (Online). Available: https://archive.ics.uci.edu/ml/datasets/breast+cancer+wisconsin+(original) (2019). Accessed: 17 Sep 2019
Christina Chun M.: Tumors: benign, premalignant, and malignant, Medical News Today, (Online). Available: https://www.medicalnewstoday.com/articles/249141.php (2019). Accessed 17 Sep 2019
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Athani, S., Joshi, S., Rao, B.A., Rai, S., Kini, N.G. (2021). Parallel Implementation of kNN Algorithm for Breast Cancer Detection. In: Bhateja, V., Peng, SL., Satapathy, S.C., Zhang, YD. (eds) Evolution in Computational Intelligence. Advances in Intelligent Systems and Computing, vol 1176. Springer, Singapore. https://doi.org/10.1007/978-981-15-5788-0_46
Download citation
DOI: https://doi.org/10.1007/978-981-15-5788-0_46
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-5787-3
Online ISBN: 978-981-15-5788-0
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)