The Parallelization and Optimization of K-means Algorithm Based on MGPUSim

Mo, Zhangbin; Wang, Yaobin; Zhang, Qingming; Zhang, Guangbing; Guo, Mingfeng; Zhang, Yaqing; Shen, Chao

doi:10.1007/978-3-031-15937-4_26

Zhangbin Mo¹²,
Yaobin Wang¹²,
Qingming Zhang¹²,
Guangbing Zhang¹²,
Mingfeng Guo¹²,
Yaqing Zhang¹² &
…
Chao Shen¹²

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 13532))

Included in the following conference series:

International Conference on Artificial Neural Networks

2192 Accesses

Abstract

Although the k-means algorithm has been parallelized into different platforms, it has not yet been explored on multi-GPU architecture thoroughly. This paper presents a study of parallelizing k-means on a novel MGPUSim architecture, including its parallel execution mechanism, architecture design, etc. In addition, it proposes an optimization method “O-kmeans” to initialize the selection of clustering centers by first finding the centroids of the samples and then dividing the initialized clustering centers with centroids, thus solving the problem of poor clustering effect of the k-means algorithm when the data size is large. The performance of this algorithm is tested with both real and synthetic datasets. The experimental results show that:(1) The proposed O-kmeans algorithm performs well on the MGPUSim. It can achieve a 26.74×–62.92× speedup for real data sets, which is better than the CUDA implementation of kernel k-means. (2) In synthetic datasets, by conducting controlled variable experiments at varying data sizes and data dimensions, and different clustering centers. We find that the algorithm has higher stability and good processing speed on MGPUSim.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Efficient and Scalable k‑Means on GPUs

Article 06 September 2018

CPU and GPU parallelized kernel K-means

Article 22 May 2018

Large scale K-means clustering using GPUs

Article Open access 18 October 2022

References

Zaki, M.J.: Parallel and distributed data mining: an introduction. In: Zaki, M.J., Ho, C.-T. (eds.) LSPDM 1999. LNCS (LNAI), vol. 1759, pp. 1–23. Springer, Heidelberg (2000). https://doi.org/10.1007/3-540-46502-2_1
Chapter Google Scholar
Cuomo, S., et al.: A GPU-accelerated parallel K-means algorithm. Comput. Elect. Eng. 75, 262–274 (2017)
Article Google Scholar
Wang, Z., et al.: The parallelization and optimization of k-means algorithm based on spark. In: 2020 15th International Conference on Computer Science & Education (2020)
Google Scholar
Bachem, O., et al.: Scalable k-means clustering via lightweight coresets. In: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (2018)
Google Scholar
Yang, L., et al.: High performance data clustering: a comparison analysis of performance for GPU, RASC, MPI, and OpenMP implementation. Parallel Distrib. Process. Techn. App. 70, 284–300 (2010)
Google Scholar
Hall, J., et al.: GPU Acceleration of Iterative Clustering. Siggraph Poster (2004)
Google Scholar
Li, Y., Zhao, K., Chu, X., Liu, J.: Speeding up the k-means algorithm by GPUs. J. Comput. Syst. Sci. 79(2), 216–229 (2013)
Article MathSciNet Google Scholar
Sun, Y., et al.: MGPUSim: enabling multi-GPU performance modeling and optimization. In: Proceedings of the 46th International Symposium on Computer Architecture (2019)
Google Scholar
Ausavarungnirun, R., et al.: Mosaic: a GPU memory manager with application-transparent support for multiple page sizes. In: the 50th Annual IEEE/ACM International Symposium ACM (2017)
Google Scholar
Sun, Y., et al.: Daisen: a framework for visualizing detailed GPU execution. Comput. Graph. Forum. 40(3), 1–12 (2021)
Article Google Scholar
Baydoun, M., Dawi, M., Ghaziri, H.: Enhanced parallel implementation of the K-means clustering algorithm. In: Advances in Computational Tools for Engineering Applications (ACTEA), 2016 3rd International Conference on IEEE, pp. 7–11 (2016)
Google Scholar
Bhimani, J., Leeser, M., Mi, N.: Accelerating K-means clustering with parallel implementations and GPU computing. In: High Performance Extreme Computing Conference (HPEC), pp. 1–6 (2015)
Google Scholar
Nelson, J., and Roberto, P.: Don't forget about synchronization! A case study of K-means on GPU. In: Proceedings of the 10th International Workshop on Programming Models and Applications for Multicores and Manycores- PMAM'192019, pp. 11–20 (2019)
Google Scholar
Daoudi, S., et al.: A Comparative study of parallel CPU/GPU implementations of the K-means algorithm. In: 2019 International Conference on Advanced Electrical Engineering (ICAEE). IEEE (2019)
Google Scholar
Salvatore, C., De Angelis, V., Gennaro, F., Livia, M., Gerardo, T.: A GPU-accelerated parallel K-means algorithm. Comput. Elect. Eng. 75, 262–274 (2019)
Article Google Scholar
Baydoun, M., Ghaziri, H., Al-Husseini, M.: CPU and GPU parallelized kernel K-means. J. Supercomput. 74(8), 3975–3998 (2018). https://doi.org/10.1007/s11227-018-2405-7
Article Google Scholar
Kruliš, M., Miroslav, K.: Detailed analysis and optimization of CUDA K-means algorithm. In: 49th International Conference on Parallel Processing-ICPP (2020)
Google Scholar
Sun, Y., et al.: MGSim+MGMark: a framework for multi-GPU system research. arXiv preprint arXiv:1811.02884 (2018)
Young, V., et al.: Combining HW/SW mechanisms to improve NUMA performance of multi-GPU systems. In: 2018 51st Annual IEEE/ACM International Symposium on Microarchitecture (MICRO), IEEE (2018)
Google Scholar
Lee, S., Won, W.R.: Parallel gpu architecture simulation framework exploiting architectural-level parallelism with timing error prediction. IEEE Trans. Comput. 65(4), 1253–1265 (2015)
Article MathSciNet Google Scholar
Michie, D., Spiegelhalter, D.J., Taylor, C.C.: Machine Learning, Neural and Statistical Classification. Ellis Horwood, Amsterdam (1994)
MATH Google Scholar
Blake, C., Merz, C.J.: UCI repository of machine learning databases. University of California, Irvine
Google Scholar

Download references

Acknowledgment

This work has been supported by a grant from the National Natural Science Foundation of China General Program (61672438) and the Special Project of the China Association of Higher Education (21SZYB16).

Author information

Authors and Affiliations

School of Computer Science and Technology, Key Laboratory of Testing Technology for Manufacturing Process in Ministry of Education, Southwest University of Science and Technology, Mianyang, 621010, China
Zhangbin Mo, Yaobin Wang, Qingming Zhang, Guangbing Zhang, Mingfeng Guo, Yaqing Zhang & Chao Shen

Authors

Zhangbin Mo
View author publications
You can also search for this author in PubMed Google Scholar
Yaobin Wang
View author publications
You can also search for this author in PubMed Google Scholar
Qingming Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Guangbing Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Mingfeng Guo
View author publications
You can also search for this author in PubMed Google Scholar
Yaqing Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Chao Shen
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yaobin Wang .

Editor information

Editors and Affiliations

University of the West of England, Bristol, UK
Elias Pimenidis
Lancaster University, Lancaster, UK
Plamen Angelov
Digital Innovation, Teeside University, Middlesbrough, UK
Chrisina Jayne
Democritus University of Thrace, Xanthi, Greece
Antonios Papaleonidas
The University of the West of England, Bristol, UK
Mehmet Aydin

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Mo, Z. et al. (2022). The Parallelization and Optimization of K-means Algorithm Based on MGPUSim. In: Pimenidis, E., Angelov, P., Jayne, C., Papaleonidas, A., Aydin, M. (eds) Artificial Neural Networks and Machine Learning – ICANN 2022. ICANN 2022. Lecture Notes in Computer Science, vol 13532. Springer, Cham. https://doi.org/10.1007/978-3-031-15937-4_26

Download citation

DOI: https://doi.org/10.1007/978-3-031-15937-4_26
Published: 07 September 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-15936-7
Online ISBN: 978-3-031-15937-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

The Parallelization and Optimization of K-means Algorithm Based on MGPUSim

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Efficient and Scalable k‑Means on GPUs

CPU and GPU parallelized kernel K-means

Large scale K-means clustering using GPUs

References

Acknowledgment

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

The Parallelization and Optimization of K-means Algorithm Based on MGPUSim

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Efficient and Scalable k‑Means on GPUs

CPU and GPU parallelized kernel K-means

Large scale K-means clustering using GPUs

References

Acknowledgment

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation