Parallel implementing improved k-means applied for image retrieval and anomaly detection

Yin, Chunyong; Zhang, Sun

doi:10.1007/s11042-016-3638-1

Parallel implementing improved k-means applied for image retrieval and anomaly detection

Published: 29 May 2016

Volume 76, pages 16911–16927, (2017)
Cite this article

Multimedia Tools and Applications Aims and scope Submit manuscript

Chunyong Yin^1,2 &
Sun Zhang¹

592 Accesses
17 Citations
Explore all metrics

Abstract

Anomaly detection based on data mining is one of the key technologies to be applied to intelligent detection. K-means is a classic clustering algorithm which is efficient for anomaly detection. Traditional K-means is sensitive to the selection of initial clustering centers. Different initial value can cause different clustering results. We combine improved DD algorithm with information entropy to improve the performance of K-means. Improved K-means can optimize the selection of initial clustering centers; automatically decide the number of clusters and output stable clustering results. After the pretreatment of PCA, the adaptability of improved K-means has a distinct progress. To solve the problem of massive data processing time, we adopt the technology of cloud computing and modify the algorithm for parallel processing. We analyze the performance of improved K-means by using different data sets, KDD Cup99 and public mobile malware data set (i.e. MalGenome). The experimental results illustrate that improved K-means has accurate results and can be applied to anomaly detection in mobile networks. This improved K-means also can be applied for image retrieval by calculating the similarity between each image.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Anomaly Detection Algorithm Based on Cluster of Entropy

Data Categorization Using Hadoop MapReduce-Based Parallel K-Means Clustering

Article 25 February 2019

Performance of the K-means and fuzzy C-means algorithms in big data analytics

Article 31 October 2023

References

Anagnostopoulos M, Kambourakis G, Gritzalis S (2015) New facets of mobile botnet: architecture and evaluation. Int J Inf Secur 2015:1–19
Google Scholar
Gu B, Sheng VS, Tay KY, Romano W, Li S (2014) Incremental support vector learning for ordinal regression. Ieee T Neur Net Learn 26(7):1403–1416
Article MathSciNet Google Scholar
Gu B, Sheng VS, Wang Z, Ho D, Osman S, Li S (2015) Incremental learning for ν-support vector regression. Neural Netw 67:140–150
Article Google Scholar
Laxman S, Sastry PS (2006) A survey of temporal data mining. Sadhana Acad P Eng S 31(2):173–198
Article MathSciNet MATH Google Scholar
Leea S, Kimb G, Kimc S (2011) Self-adaptive and dynamic clustering for online anomaly detection. Exp Syst Appl 38(12):14891–14898
Article Google Scholar
Narudin FA, Feizollah A, Anuar NB, Gani A (2016) Evaluation of machine learning classifiers for mobile malware detection. Soft Comput 20(1):343–357
Article Google Scholar
Pandeeswari N, Kumar G (2015) Anomaly detection system in cloud environment using fuzzy clustering based ANN. Mob Netw Appl 2015:1–12
Google Scholar
Shamir O, Tishby N (2010) Stability and model selection in k-means clustering. Mach Learn 80(2):213–243
Article MathSciNet Google Scholar
Tong XJ, Meng FR, Wang ZX (2011) Optimization to k-means initial cluster centers. Comput Eng Des 32(8):2721–2723
Google Scholar
Villalba SD, Cunningham P (2007) An evaluation of dimension reduction techniques for one-class classification. Artif Intell Rev 27(4):273–294
Article Google Scholar
Yin C (2014) Towards accurate node-based detection of P2P Botnets. Sci World J 2014:425–491
Google Scholar
Yin C, Feng L, Ma L (2015) An improved Hoeffding-ID data-stream classification algorithm. J Supercomput 2015:1–12
Google Scholar
Yin C, Ma L, Feng L (2016) A feature selection method for improved clonal algorithm towards intrusion detection. Int J Pattern Recognit Artif Intell 30(5):1–13
Article Google Scholar
Yin C, Zou M, Iko D, Wang J (2013) Botnet detection based on correlation of malicious behaviors. Int J Hybrid Inf Technol 6(6):291–300
Article Google Scholar
Yuan FY, Zhang XC, Luo SB (2011) Accurate property weighted K- means clustering algorithm based on information entropy. J Comput Appl 31(6):1675–1677
Google Scholar
Zhou Y, Jiang X (2012) Dissecting android malware: characterization and evolution. 2012 I.E. Symp Secur Priv 59:95–109
Article Google Scholar

Download references

Acknowledgments

Foundation item: This work was funded by the National Natural Science Foundation of China (No.61373134). It was also supported by the Priority Academic Program Development of Jiangsu Higher Education Institutions (PAPD), Jiangsu Key Laboratory of Meteorological Observation and Information Processing (No.KDXS1105) and Jiangsu Collaborative Innovation Center on Atmospheric Environment and Equipment Technology (CICAEET).

Author information

Authors and Affiliations

School of Computer and Software, Jiangsu Engineering Center of Network Monitoring, Nanjing University of Information Science & Technology, Nanjing, 210044, China
Chunyong Yin & Sun Zhang
Jiangsu Collaborative Innovation Center of Atmospheric Environment and Equipment Technology, Nanjing University of Information Science and Technology, Nanjing, Jiangsu, 210044, China
Chunyong Yin

Authors

Chunyong Yin
View author publications
You can also search for this author in PubMed Google Scholar
Sun Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Chunyong Yin.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Yin, C., Zhang, S. Parallel implementing improved k-means applied for image retrieval and anomaly detection. Multimed Tools Appl 76, 16911–16927 (2017). https://doi.org/10.1007/s11042-016-3638-1

Download citation

Received: 16 February 2016
Revised: 06 May 2016
Accepted: 24 May 2016
Published: 29 May 2016
Issue Date: August 2017
DOI: https://doi.org/10.1007/s11042-016-3638-1

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Parallel implementing improved k-means applied for image retrieval and anomaly detection

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Anomaly Detection Algorithm Based on Cluster of Entropy

Data Categorization Using Hadoop MapReduce-Based Parallel K-Means Clustering

Performance of the K-means and fuzzy C-means algorithms in big data analytics

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now