Loading [a11y]/accessibility-menu.js
A new partitioning based algorithm for document clustering | IEEE Conference Publication | IEEE Xplore

A new partitioning based algorithm for document clustering


Abstract:

Document clustering is one of the key problems in text mining and information retrieval area. It groups text documents in a way that maximizes the similarity within clust...Show More

Abstract:

Document clustering is one of the key problems in text mining and information retrieval area. It groups text documents in a way that maximizes the similarity within clusters and minimizes the similarity between different clusters. Most partitioning based algorithms are sensitive to the initial centroids, the clustering result greatly depends on the initial centroids. This paper first uses unsupervised feature selection method to reduce the dimension of document feature space and then proposes a novel partitioning based algorithm which select initial cluster centriods in the process of clustering by the size and density of cluster in the datasets. The experiments on several text datasets show that the proposed approach effectively improves the quality of clustering.
Date of Conference: 26-28 July 2011
Date Added to IEEE Xplore: 15 September 2011
ISBN Information:
Conference Location: Shanghai, China

Contact IEEE to Subscribe

References

References is not available for this document.