Skip to main content

Advertisement

Log in

A fuzzy multi-objective particle swarm optimization for effective data clustering

  • Regular Research Paper
  • Published:
Memetic Computing Aims and scope Submit manuscript

Abstract

Data clustering, also called unsupervised learning, is a fundamental issue in data mining that is used to understand and mine the structure of an untagged assemblage of data into separate groups based on their similarity. Recent studies have shown that clustering techniques that optimize a single objective may not provide satisfactory result because no single validity measure works well on different kinds of data sets. Moreover, the performance of clustering algorithms degrades with more and more overlaps among clusters in a data set. These facts have motivated us to develop a fuzzy multi-objective particle swarm optimization framework in an innovative fashion for data clustering, termed as FMOPSO, which is able to deliver more effective results than state-of-the-art clustering algorithms. The key challenge in designing FMOPSO framework for data clustering is how to resolve cluster assignments confusion with such points in the data set which have significant belongingness to more than one cluster. The proposed framework addresses this problem by identification of points having significant membership to multiple classes, excluding them, and re-classifying them into single class assignments. To ascertain the superiority of the proposed algorithm, statistical tests have been performed on a variety of numerical and categorical real life data sets. Our empirical study shows that the performance of the proposed framework (in both terms of efficiency and effectiveness) significantly outperforms the state-of-the-art data clustering algorithms.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

Explore related subjects

Discover the latest articles, news and stories from top researchers in related subjects.

References

  1. Bandyopadhyay S, Maulik U, Mukhopadhyay A (2007) Multiobjective genetic clustering for pixel classification in remote sensing imagery. IEEE Trans Geosci Remote Sens 45(5): 1506–1511

    Article  Google Scholar 

  2. Ben-Hur A, Guyon I (2003) Detecting stable clusters using principal component analysis. In: Brownstein MJ, Khodursky A (eds) Methods in molecular biology. Humana press, Clifton,, pp 159–182

    Google Scholar 

  3. Coello CAC, Lamont GB, Van Veldhuizen DA (2007) Evolutionary algorithms for solving multi-objective problems, 2nd edn. Springer, Berlin

    MATH  Google Scholar 

  4. Coello CAC, Salazar-Lechuga M (2002) MOPSO: a proposal for multiple objective particle swarm optimization. In: Cong on Evol Comput (CEC’2002), Piscataway, vol 2, New Jersey, IEEE Press, pp 1051–1056

  5. Coello CAC, Toscano-Pulido G (2005) Multiobjective structural optimization using a micro-genetic algorithm. Struct Multidiscip Optim 30(5): 388–403

    Article  Google Scholar 

  6. Coello CAC, Toscano-Pulido G, Salazar-Lechuga M (2004) Handling multiple objectives with particle swarm optimization. IEEE Trans Evol Comput 8(3): 256–279

    Article  Google Scholar 

  7. Coello CAC, Van Veldhuizen DA, Lamont GB (2002) Evolutionary algorithms for solving multi-objective problems. Kluwer, Dordrecht

    MATH  Google Scholar 

  8. Das S, Abraham A, Konar A (2009) Metaheuristic clustering. SCI 178. Springer, Berlin

    Google Scholar 

  9. Deb K (2001) Multi-objective optimization using evolutionary algorithms. Wiley, New York

    MATH  Google Scholar 

  10. Deb K, Agrawal S, Pratap A, Meyarivan T (2000) A fast elitist non-dominated sorting genetic algorithm for multi-objective optimization:NSGA-II. In: Proceedings of the parallel problem solving from nature VI conference, 16–20 September, vol 1917. Paris, France, pp 849–858

  11. Deb K, Pratap A, Agrawal S, Meyarivan T (2002) A fast and elitist multiobjective genetic algorithm: NSGA II. IEEE Trans Evol Comput 6(2): 182–197

    Article  Google Scholar 

  12. Eberhart RC, Shi Y (1998) Comparison between genetic algorithms and particle swarm optimization. In: Porto VW, Saravanan N, Waagen D, Eiben AE (eds) Proc 7th Ann Conf Evol Program, vol 1447. Springer, Berlin, pp 611–619

    Chapter  Google Scholar 

  13. Hassan R, Cohanim B, de Weck O (2005) A comparison of particle swarm optimization and the genetic algorithm. In: 46th AIAA/ASME/ASCE/AHS/ASC Structures, Struct Dyn Mater Conf, AIAA 2005–1897

  14. Horn J, Nafpliotis N, Goldberg DE (1993) Multiobjective optimization using the niched pareto genetic algorithm. Technical Report IlliGAL Report 93005, University of Illinois at Urbana-Champaign, Urbana, IL, USA

  15. Ishibuchi H, Narukawa K, Nojima Y (2005) Handling of overlapping objective vectors in evolutionary multiobjective optimization. Int J Comput Intell Res 1(1): 1–18

    Google Scholar 

  16. Ishibuchi H, Hitotsuyanagi Y, Tsukamoto N, Nojima Y (2009) Implementation of multiobjective memetic algorithms for combinatorial optimization problems: a knapsack problem case study. In: Gah C-K, Ong Y-S, Tan KC (eds) Multi-objective memetic algorithms. SCI 171. Springer, Berlin, pp 27–49

    Chapter  Google Scholar 

  17. Knowles JD, Corne DW (2000) Approximating the nondominated front using the pareto archived evolution strategy. Evol Comput 8(2): 149–172

    Article  Google Scholar 

  18. Krishnapuram R, Joshi A, Yi L (1999) A Fuzzy relative of the k-medoids algorithm with application to web document and snippet clustering. In: Proc IEEE Intl Conf Fuzzy Syst, Korea, pp 1281–1286

  19. Mariano CE, Morales E (1999) MOAQ an Ant-Q algorithm for multiple objective optimization problems. In: Banzhaf W, Daida J, Eiben AE, Garzon MH, Honavar V, Jakiela M, Smith RE (eds) Proc Genet Evol Comput Conf (GECCO 99), Morgan Kaufmann, vol 1. Orlando, FL, pp 894–901

    Google Scholar 

  20. Maulik U, Mukhopadhyay A, Bandyopadhyay S (2006) Efficient clustering with multi-class point identification. J Three Dimensional Images 20(1): 35–40

    Google Scholar 

  21. Mukhopadhyay A, Bandyopadhyay S, Maulik U (2006) Clustering using multi-objective genetic algorithm and its application to image segmentation. IEEE Int Conf Syst Man Cybern 2678–2683

  22. Pal NR, Bezdek JC (1995) On cluster validity for the fuzzy c-means model. IEEE Trans Fuzzy Syst 3: 370–379

    Article  Google Scholar 

  23. Reyes-Sierra M, Coello CAC (2006) Multi-objective particle swarm optimizers: a survey of the state-of-the-art. Int J Comput Intell Res 2(3): 287–308

    MathSciNet  Google Scholar 

  24. Saha I, Mukhopadhyay A (2008) An improved crisp and fuzzy based clustering technique for categorical data. Int J Comput Sci Eng 2(4): 184–193

    Google Scholar 

  25. Salazar-Lechuga M, Rowe JE (2005) Particle swarm optimization and fitness sharing to solve multi-objective optimization problems In: Proc 2005 IEEE Congr Evol Comput (CEC 2005), vol 2. Edinburgh, Scotland, UK, pp 1204–1211

  26. Toscano-Pulido G (2005) On the use of self-adaptation and elitism for multiobjective particle swarm optimization. Dissertation, Center for research and advanced studies of the national polytechnic institute of Mixico

  27. Wang Y, Li B (2010) Multi-strategy ensemble evolutionary algorithm for dynamic multi-objective optimization. Memetic Comp 2: 3–24

    Article  Google Scholar 

  28. Xie XL, Beni G (1991) A validity measure for fuzzy clustering. IEEE Trans Pattern Anal Mach Intell 13(8): 841–847

    Article  Google Scholar 

  29. Xu R, Wunsch D (2008) Clustering. Wiley-IEEE Press, New York

    Book  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Bara’a Ali Attea.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Attea, B.A. A fuzzy multi-objective particle swarm optimization for effective data clustering. Memetic Comp. 2, 305–312 (2010). https://doi.org/10.1007/s12293-010-0047-2

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s12293-010-0047-2

Keywords