Elsevier

Fuzzy Sets and Systems

Volume 93, Issue 1, 1 January 1998, Pages 49-56
Fuzzy Sets and Systems

Short communication
Fast fuzzy clustering

https://doi.org/10.1016/S0165-0114(96)00232-1Get rights and content

Abstract

This paper presents a multistage random sampling fuzzy c-means-based clustering algorithm, which significantly reduces the computation time required to partition a data set into c classes. A series of subsets of the full data set are used to create initial cluster centers in order to provide an approximation to the final cluster centers. The quality of the final partitions is equivalent to those created by fuzzy c-means. The speed-up is normally a factor of 2–3 times, which is especially significant for high-dimensional spaces and large data sets. Examples of the improved speed of the algorithm in two multi-spectral domains, magnetic resonance image segmentation and satellite image segmentation, are given. The results are compared with fuzzy c-means in terms of both the time required and the final resulting partition. Significant speedup is shown in each example presented in the paper. Further, the convergence properties of fuzzy c-means are preserved.

References (11)

  • J.C. Bezdek

    Pattern Recognition with Fuzzy Objective Function Algorithm

    (1981)
  • J.C. Bezdek et al.

    Review of MR image segmentation techniques using pattern recognition

    Med. Phys.

    (1993)
  • R.L. Cannon et al.

    Efficient implementation of the fuzzy c-means clustering algorithms

    IEEE Trans. Pattern Anal. Machine Intelligence

    (1986)
  • R.L. Cannon et al.

    Efficient implementation of the fuzzy c-means clustering algorithms

    IEEE Trans. Pattern Anal. Machine Intelligence

    (1986)
  • T.W. Cheng et al.

    Fast clustering with application to fuzzy rule generation

There are more references available in the full text version of this article.

Cited by (116)

  • IBRIDIA: A hybrid solution for processing big logistics data

    2019, Future Generation Computer Systems
    Citation Excerpt :

    Although the algorithm has the disadvantage of needing an expert intervention to specify many parameters before it works, its performance is better than HPStream algorithm [27]. In [28], a multi-level unordered sampling technique was suggested to boost the time performance of fuzzy C-means. The technique is double phased.

  • Approximate spectral clustering with utilized similarity information using geodesic based hybrid distance measures

    2015, Pattern Recognition
    Citation Excerpt :

    We briefly explain these three methods below. Various sampling methods have been employed for clustering of large datasets to make spectral clustering feasible [12,16,17], to speed-up the fuzzy c-means [29], or to make support vector machines scalable [30]. Among them, random sampling is the fastest due to its non-parametric straightforward approach in expense of relatively high error rates.

  • Generalization rules for the suppressed fuzzy c-means clustering algorithm

    2014, Neurocomputing
    Citation Excerpt :

    The first accelerated FCM algorithms [5,25] used integer computation only. Cheng et al. [8] proposed data reduction based on random sampling, leading to a fast approximative FCM clustering. Higher speed has been also reached via data reduction.

View all citing articles on Scopus
View full text