Abstract
Data clustering, also called unsupervised learning, is a fundamental issue in data mining that is used to understand and mine the structure of an untagged assemblage of data into separate groups based on their similarity. Recent studies have shown that clustering techniques that optimize a single objective may not provide satisfactory result because no single validity measure works well on different kinds of data sets. Moreover, the performance of clustering algorithms degrades with more and more overlaps among clusters in a data set. These facts have motivated us to develop a fuzzy multi-objective particle swarm optimization framework in an innovative fashion for data clustering, termed as FMOPSO, which is able to deliver more effective results than state-of-the-art clustering algorithms. The key challenge in designing FMOPSO framework for data clustering is how to resolve cluster assignments confusion with such points in the data set which have significant belongingness to more than one cluster. The proposed framework addresses this problem by identification of points having significant membership to multiple classes, excluding them, and re-classifying them into single class assignments. To ascertain the superiority of the proposed algorithm, statistical tests have been performed on a variety of numerical and categorical real life data sets. Our empirical study shows that the performance of the proposed framework (in both terms of efficiency and effectiveness) significantly outperforms the state-of-the-art data clustering algorithms.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Bandyopadhyay S, Maulik U, Mukhopadhyay A (2007) Multiobjective genetic clustering for pixel classification in remote sensing imagery. IEEE Trans Geosci Remote Sens 45(5): 1506–1511
Ben-Hur A, Guyon I (2003) Detecting stable clusters using principal component analysis. In: Brownstein MJ, Khodursky A (eds) Methods in molecular biology. Humana press, Clifton,, pp 159–182
Coello CAC, Lamont GB, Van Veldhuizen DA (2007) Evolutionary algorithms for solving multi-objective problems, 2nd edn. Springer, Berlin
Coello CAC, Salazar-Lechuga M (2002) MOPSO: a proposal for multiple objective particle swarm optimization. In: Cong on Evol Comput (CEC’2002), Piscataway, vol 2, New Jersey, IEEE Press, pp 1051–1056
Coello CAC, Toscano-Pulido G (2005) Multiobjective structural optimization using a micro-genetic algorithm. Struct Multidiscip Optim 30(5): 388–403
Coello CAC, Toscano-Pulido G, Salazar-Lechuga M (2004) Handling multiple objectives with particle swarm optimization. IEEE Trans Evol Comput 8(3): 256–279
Coello CAC, Van Veldhuizen DA, Lamont GB (2002) Evolutionary algorithms for solving multi-objective problems. Kluwer, Dordrecht
Das S, Abraham A, Konar A (2009) Metaheuristic clustering. SCI 178. Springer, Berlin
Deb K (2001) Multi-objective optimization using evolutionary algorithms. Wiley, New York
Deb K, Agrawal S, Pratap A, Meyarivan T (2000) A fast elitist non-dominated sorting genetic algorithm for multi-objective optimization:NSGA-II. In: Proceedings of the parallel problem solving from nature VI conference, 16–20 September, vol 1917. Paris, France, pp 849–858
Deb K, Pratap A, Agrawal S, Meyarivan T (2002) A fast and elitist multiobjective genetic algorithm: NSGA II. IEEE Trans Evol Comput 6(2): 182–197
Eberhart RC, Shi Y (1998) Comparison between genetic algorithms and particle swarm optimization. In: Porto VW, Saravanan N, Waagen D, Eiben AE (eds) Proc 7th Ann Conf Evol Program, vol 1447. Springer, Berlin, pp 611–619
Hassan R, Cohanim B, de Weck O (2005) A comparison of particle swarm optimization and the genetic algorithm. In: 46th AIAA/ASME/ASCE/AHS/ASC Structures, Struct Dyn Mater Conf, AIAA 2005–1897
Horn J, Nafpliotis N, Goldberg DE (1993) Multiobjective optimization using the niched pareto genetic algorithm. Technical Report IlliGAL Report 93005, University of Illinois at Urbana-Champaign, Urbana, IL, USA
Ishibuchi H, Narukawa K, Nojima Y (2005) Handling of overlapping objective vectors in evolutionary multiobjective optimization. Int J Comput Intell Res 1(1): 1–18
Ishibuchi H, Hitotsuyanagi Y, Tsukamoto N, Nojima Y (2009) Implementation of multiobjective memetic algorithms for combinatorial optimization problems: a knapsack problem case study. In: Gah C-K, Ong Y-S, Tan KC (eds) Multi-objective memetic algorithms. SCI 171. Springer, Berlin, pp 27–49
Knowles JD, Corne DW (2000) Approximating the nondominated front using the pareto archived evolution strategy. Evol Comput 8(2): 149–172
Krishnapuram R, Joshi A, Yi L (1999) A Fuzzy relative of the k-medoids algorithm with application to web document and snippet clustering. In: Proc IEEE Intl Conf Fuzzy Syst, Korea, pp 1281–1286
Mariano CE, Morales E (1999) MOAQ an Ant-Q algorithm for multiple objective optimization problems. In: Banzhaf W, Daida J, Eiben AE, Garzon MH, Honavar V, Jakiela M, Smith RE (eds) Proc Genet Evol Comput Conf (GECCO 99), Morgan Kaufmann, vol 1. Orlando, FL, pp 894–901
Maulik U, Mukhopadhyay A, Bandyopadhyay S (2006) Efficient clustering with multi-class point identification. J Three Dimensional Images 20(1): 35–40
Mukhopadhyay A, Bandyopadhyay S, Maulik U (2006) Clustering using multi-objective genetic algorithm and its application to image segmentation. IEEE Int Conf Syst Man Cybern 2678–2683
Pal NR, Bezdek JC (1995) On cluster validity for the fuzzy c-means model. IEEE Trans Fuzzy Syst 3: 370–379
Reyes-Sierra M, Coello CAC (2006) Multi-objective particle swarm optimizers: a survey of the state-of-the-art. Int J Comput Intell Res 2(3): 287–308
Saha I, Mukhopadhyay A (2008) An improved crisp and fuzzy based clustering technique for categorical data. Int J Comput Sci Eng 2(4): 184–193
Salazar-Lechuga M, Rowe JE (2005) Particle swarm optimization and fitness sharing to solve multi-objective optimization problems In: Proc 2005 IEEE Congr Evol Comput (CEC 2005), vol 2. Edinburgh, Scotland, UK, pp 1204–1211
Toscano-Pulido G (2005) On the use of self-adaptation and elitism for multiobjective particle swarm optimization. Dissertation, Center for research and advanced studies of the national polytechnic institute of Mixico
Wang Y, Li B (2010) Multi-strategy ensemble evolutionary algorithm for dynamic multi-objective optimization. Memetic Comp 2: 3–24
Xie XL, Beni G (1991) A validity measure for fuzzy clustering. IEEE Trans Pattern Anal Mach Intell 13(8): 841–847
Xu R, Wunsch D (2008) Clustering. Wiley-IEEE Press, New York
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Attea, B.A. A fuzzy multi-objective particle swarm optimization for effective data clustering. Memetic Comp. 2, 305–312 (2010). https://doi.org/10.1007/s12293-010-0047-2
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12293-010-0047-2