An Improved Genetic Algorithm for Document Clustering on the Cloud

Ruksana Akter, Yoojin Chung
Copyright: © 2018 | Volume: 8 | Issue: 4 | Pages: 9
ISSN: 2156-1834 | EISSN: 2156-1826 | EISBN13: 9781522546368 | DOI: 10.4018/IJCAC.2018100102
MLA

Akter, Ruksana, and Yoojin Chung. "An Improved Genetic Algorithm for Document Clustering on the Cloud." IJCAC, vol. 8, no. 4, 2018, pp. 20-28. http://doi.org/10.4018/IJCAC.2018100102

APA

Akter, R. & Chung, Y. (2018). An Improved Genetic Algorithm for Document Clustering on the Cloud. International Journal of Cloud Applications and Computing (IJCAC), 8(4), 20-28. http://doi.org/10.4018/IJCAC.2018100102

Chicago

Akter, Ruksana, and Yoojin Chung. "An Improved Genetic Algorithm for Document Clustering on the Cloud." International Journal of Cloud Applications and Computing (IJCAC) 8, no. 4 (2018): 20-28. http://doi.org/10.4018/IJCAC.2018100102

Abstract

This article presents a modified genetic algorithm for text document clustering on the cloud. Traditional genetic-algorithm approaches to document clustering represent chromosomes based on cluster centroids and do not split a cluster centroid during crossover. This limits the variation the algorithm can introduce into the population and leaves it prone to being trapped in local minima. In the proposed approach, a crossover point may fall at a position inside a cluster centroid, so crossover can modify some cluster centroids. This helps the algorithm escape local minima and find better solutions than the traditional approaches. Moreover, instead of running a single genetic algorithm, as the traditional approaches do, the article partitions the population and runs a genetic algorithm on each partition. This makes it possible to run different parts of the algorithm simultaneously on different virtual machines in a cloud environment. Experimental results demonstrate that the accuracy of the proposed approach is at least 4% higher than that of the other approaches.
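
The difference between the two crossover styles described above can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the chromosome encoding, so the sketch assumes a chromosome is the concatenation of k real-valued centroids of equal dimension, and every function name (flatten, unflatten, single_point_crossover, choose_point, partition) is hypothetical. The partition helper only shows one simple way to split a population into sub-populations that could each be evolved separately, for example on different virtual machines.

```python
import random


def flatten(centroids):
    """Concatenate k centroids (lists of d numbers) into one chromosome."""
    return [gene for centroid in centroids for gene in centroid]


def unflatten(chromosome, dim):
    """Split a flat chromosome back into centroids of length dim."""
    return [chromosome[i:i + dim] for i in range(0, len(chromosome), dim)]


def single_point_crossover(parent_a, parent_b, point):
    """Swap the tails of two equal-length chromosomes at the given point."""
    child_a = parent_a[:point] + parent_b[point:]
    child_b = parent_b[:point] + parent_a[point:]
    return child_a, child_b


def choose_point(num_clusters, dim, inside_centroids, rng):
    """Pick a crossover point (assumes num_clusters >= 2).

    inside_centroids=False: only centroid boundaries are eligible, so whole
    centroids are exchanged (the traditional scheme in the abstract).
    inside_centroids=True: any interior gene position is eligible, so the
    point may fall inside a centroid and produce a new, mixed centroid.
    """
    if inside_centroids:
        return rng.randrange(1, num_clusters * dim)
    return dim * rng.randrange(1, num_clusters)


def partition(population, num_parts):
    """Split a population round-robin into num_parts sub-populations."""
    return [population[i::num_parts] for i in range(num_parts)]


if __name__ == "__main__":
    dim = 3
    parent_a = flatten([[1, 1, 1], [2, 2, 2]])
    parent_b = flatten([[7, 7, 7], [8, 8, 8]])

    # Boundary cut (point 3): children only recombine existing centroids.
    child, _ = single_point_crossover(parent_a, parent_b, 3)
    print(unflatten(child, dim))  # [[1, 1, 1], [8, 8, 8]]

    # Interior cut (point 4): the second centroid is a new mixture.
    child, _ = single_point_crossover(parent_a, parent_b, 4)
    print(unflatten(child, dim))  # [[1, 1, 1], [2, 8, 8]]

    rng = random.Random(0)
    print(choose_point(2, dim, inside_centroids=True, rng=rng))
```

With a boundary cut, every centroid in a child already existed in one of the parents; with an interior cut, the centroid that contains the cut point is a mixture of two parent centroids, which is the extra source of variation the abstract refers to.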
