Skip to main content

Parallel Implementation of kNN Algorithm for Breast Cancer Detection

  • Conference paper
  • First Online:
Book cover Evolution in Computational Intelligence

Abstract

Current advances in parallel processing technology aims at providing unmatched degree of computational power in upcoming days. Parallel computation is an efficient form of information processing which exploits the concurrency of execution. This paper investigates the use of parallel programming, when applied on k Nearest Neighbors (kNN) algorithm which is intended for classification and prediction of the large dataset. Breast cancer dataset is used for classification and prediction which consists of two labels namely, malignant and benign. kNN is a non-parametric algorithm which makes use of similarity measure to classify the dataset into different categories. The similarity between the data points is computed by using Euclidean distance formula. Multiple threads are created for parallel processing and an appropriate kNN graph is constructed, which helps in easier implementation. Finally, execution speeds for sequential and parallel programs is recorded. The results are verified by using frameworks namely, Message Passing Interface (MPI) and Compute Unified Device Architecture (CUDA) highlighting that parallel execution takes less time when compared to sequential execution.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Cheung, D.W., Lee, S.D., Xiao, Y.: Effect of data skewness and workload balance in parallel data mining. IEEE Trans. Knowl. Data Eng. 14(3) (2002)

    Google Scholar 

  2. Gavahi, M., Mirzaei, R., Nazarbeygi, A., Ahmadzadeh, A., Gorgin, S.: High performance GPU implementation of k-NN based on mahalanobis distance. In: 2015 International Symposium Computer Science and Software Engineering (CSSE), pp. 1–6 (2015)

    Google Scholar 

  3. Cover, T., Hart, P.: Nearest neighbor pattern classification. IEEE Trans. Inf Theory 13(1), 21–27 (1967)

    Article  Google Scholar 

  4. Huang, L., Li, Z..: A novel method of parallel gpu implementation of knn used in text classification. In: 2013 Fourth International Conference on Networking and Distributed Computing (ICNDC), pp. 6–8 (2013)

    Google Scholar 

  5. Shenshen, L., et al.: CUKNN: a parallel implementation of K-nearest neighbor on CUDA-enabled GPU. In: IEEE Youth Conference on Information Computing and Telecommunication 2009, YC-ICT’09, (2009). IEEE

    Google Scholar 

  6. Wang, D., Zheng, Y., Cao, J.: Parallel construction of approximate kNN Graph. In: Proceeding of IEEE DCABES, pp. 22–26 (2012)

    Google Scholar 

  7. Despotovski, F.., Gusev, M., Zdraveski, V.: Parallel implementation of k-nearest-neighbors for face recognition. In: 26th Telecommunications Forum (TELFOR), pp. 1–4 (2018)

    Google Scholar 

  8. Aparício, G., Blanquer, I., Hernández, V.: A parallel implementation of the k nearest neighbours classifier in three levels: threads, MPI processes and the grid. In: Proceedings of the 7 International Conference on High Performance Computing For Computational Science, pp. 225–235 (2006)

    Google Scholar 

  9. Vajda, S., Santosh, K.C.: Fast k-nearest neighbor classifier using unsupervised clustering. In: Recent Trends in Image Processing and Pattern Recognition, pp. 185–193 (2016)

    Google Scholar 

  10. Chaudhari, P., et al.: Data augmentation for cancer classification in oncogenomics: an improved kNN based approach. In: Evolutionary Intelligence, Springer, Germany, pp. 1–10 (2019)

    Google Scholar 

  11. Bhateja, V., Misra, M., Urooj, S.:. Non-Linear polynomial filters for contrast enhancement of mammograms. In: Non-Linear Filters for Mammogram Enhancement. Springer, Singapore, pp. 123–162 (2020)

    Google Scholar 

  12. Bhateja, V., et al.: Classification of mammograms using sigmoidal transformation and SVM. In: Smart Computing and Informatics. Springer, Singapore, pp. 193–199 (2018)

    Google Scholar 

  13. Navlani, A.: KNN Classification using Scikit-learn. DataCamp Community, 2019. (Online). Available: https://www.datacamp.com/community/tutorials/k-nearest-neighbor-classification-scikit-learn. Accessed 01 Oct 2019

  14. UCI machine learning repository: breast cancer wisconsin (Original) Data Set Archive.ics.uci.edu, (Online). Available: https://archive.ics.uci.edu/ml/datasets/breast+cancer+wisconsin+(original) (2019). Accessed: 17 Sep 2019

  15. Christina Chun M.: Tumors: benign, premalignant, and malignant, Medical News Today, (Online). Available: https://www.medicalnewstoday.com/articles/249141.php (2019). Accessed 17 Sep 2019

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to B. Ashwath Rao .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Athani, S., Joshi, S., Rao, B.A., Rai, S., Kini, N.G. (2021). Parallel Implementation of kNN Algorithm for Breast Cancer Detection. In: Bhateja, V., Peng, SL., Satapathy, S.C., Zhang, YD. (eds) Evolution in Computational Intelligence. Advances in Intelligent Systems and Computing, vol 1176. Springer, Singapore. https://doi.org/10.1007/978-981-15-5788-0_46

Download citation

Publish with us

Policies and ethics