A novel density peak based semi-supervised clustering algorithm | IEEE Conference Publication | IEEE Xplore

A novel density peak based semi-supervised clustering algorithm


Abstract:

With the rapid development of technology, acquiring and storing big data from various fields is no longer a problem. Instead, how to utilize the data becomes an important...Show More

Abstract:

With the rapid development of technology, acquiring and storing big data from various fields is no longer a problem. Instead, how to utilize the data becomes an important and hot research topic. Clustering is one of the important tasks for big data utility. However, there exists one well-known challenge for the task, i.e. it is difficult to incorporate prior information into the clustering results. In this paper, we proposed a density peak based semi-supervised clustering algorithm, which is able to leverage label information of some seed objects for obtaining a better clustering result. Specifically, we first adopted a density based clustering algorithm to identify density peaks as the possible cluster centers for a dataset, and then proposed a graph-based algorithm to assign each center a class label by utilizing some given seed objects. Finally, we leveraged the label information of seed objects and identified centers to generate must-link and cannot-link constraints for clustering. Extensive experiments have been conducted on various publicly available data sets to verify the effectiveness of the proposed method, and the results showed that the proposed density-peak based semi-supervised algorithm outperforms the existing methods substantially.
Date of Conference: 10-13 July 2016
Date Added to IEEE Xplore: 23 February 2017
ISBN Information:
Electronic ISSN: 2160-1348
Conference Location: Jeju, Korea (South)

Contact IEEE to Subscribe

References

References is not available for this document.