Skip to main content
Log in

Fuzzy clustering based on nonconvex optimisation approaches using difference of convex (DC) functions algorithms

  • Regular Article
  • Published:
Advances in Data Analysis and Classification Aims and scope Submit manuscript

Abstract

We present a fast and robust nonconvex optimization approach for Fuzzy C-Means (FCM) clustering model. Our approach is based on DC (Difference of Convex functions) programming and DCA (DC Algorithms) that have been successfully applied in various fields of applied sciences, including Machine Learning. The FCM model is reformulated in the form of three equivalent DC programs for which different DCA schemes are investigated. For accelerating the DCA, an alternative FCM-DCA procedure is developed. Experimental results on several real world problems that include microarray data illustrate the effectiveness of the proposed algorithms and their superiority over the standard FCM algorithm, with respect to both running-time and accuracy of solutions.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Alon N, Spencer JH (1991) The probabilistic method. Wiley, New York

    Google Scholar 

  • Arora S, Kannan R (2001) Learning mixtures of arbitrary Gaussians. In: Proceedings of 33rd annual ACM symposium on theory of computing, pp 247–257

  • Bradley BS, Mangasarian OL (1998). Feature selection via concave minimization and support vector machines. In: Shavlik J (eds) Machine learning Proceedings of the 15th international conferences (ICML 1998). Morgan Kaufman, San Francisco, pp. 82–90

  • Bezdek JC (1981) Pattern recognition with fuzzy objective function algorithm. Plenum Press, New York

    Google Scholar 

  • Dhilon IS, Korgan J, Nicholas C (2003). Feature selection and document clustering. In: Berry MW (ed) A comprehensive survey of text mining. Springer, Berlin, pp. 73–100

  • Dembélé D, Kastner P (2003) Fuzzy c-means clustering method for clustering microarray data. Bioinformatics 19(8):573–580

    Article  Google Scholar 

  • Duda RO, Hart PE (1972) Pattern classification and scene analysis. Wiley, New York

    Google Scholar 

  • Feder T, Greene D (1988) Optimal algorithms for approximate clustering. Proc STOC Chicago, Illinois, pp 434–444, ISBN: 0-89791-264-0

  • Fisher D (1987) Knowledge acquisition via incremental conceptual clustering. Mach Learn 2:139–172

    Google Scholar 

  • Fukunaga K (1990) Statistical pattern recognition. Academic, New York

    MATH  Google Scholar 

  • Krause N, Singer Y (2004) Leveraging the margin more carefully. In: Proceedings of International conference on machine learning ICML, V Banff, Alberta, Canada pp. 63–70, ISBN: 1-58113-828-5

  • Klawonn F, Höppner F (2003) What is fuzzy about fuzzy clustering? Understanding and improving the concept of the fuzzifier. In: Berthold MR, Lenz H-J, Bradley E, Kruse R, Borgelt C (eds) Advances in intelligent data analysis. Springer, Berlin, pp 254–264

  • Le Thi HA (1997) Contribution à l’optimisation non convexe et l’optimisation globale: théorie, algorithmes et applications. Habilitation à Diriger des Recherches, Université de Rouen, France

  • Le Thi HA, Pham Dinh T (1997) Solving a class of linearly constrained indefinite quadratic problems by DC algorithms. J Global Optim 11(3):253–285

    Article  MATH  Google Scholar 

  • Le Thi HA, Pham Dinh T (2005) The DC (difference of convex functions) programming and DCA revisited with DC models of real world nonconvex optimization problems. Ann Oper Res 133:23–46

    Article  MATH  Google Scholar 

  • Le Thi HA, Belghiti T, Pham Dinh T (2007) A new efficient algorithm based on DC programming and DCA for Clustering. J Global Optim 37:593–608

    Article  Google Scholar 

  • Le Thi HA, Le Hoai M, Pham Dinh T (2007) Optimization based DC programming and DCA for Hierarchical Clustering. Eur J Oper Res 183:1067–1085

    Article  Google Scholar 

  • Liu Y, Shen X, Doss H (2005) Multicategory ψ-learning and support vector machine: computational tools. J Comput Graph Stat 14:219–236

    Article  Google Scholar 

  • Liu Y, Shen X (2006) Multicategoryψ-learning. J Am Stat Assoc 101:500–509

    Article  Google Scholar 

  • Mangasarian OL (1997) Mathematical programming in data mining. Data Min Knowl Discov 1:183–201

    Article  Google Scholar 

  • MacQueen JB (1967) Some methods for classification and analysis of multivariate observations. In: Proceedings of the 5th Berkeley symposium on mathematical statistics and probability. University of California Press, Berkeley 1:281–297

  • Neumann J, Schnörr C, Steidl G (2004) SVM-based feature selection by direct objective minimisation. Pattern Recognition. In: Proceedings of the 26th DAGM symposium, pp 212–219

  • Pham Dinh T, Le Thi HA (1998) DC optimization algorithms for solving the trust region subproblem. SIAM J Optim 8:476–505

    Article  MATH  Google Scholar 

  • Polyak B (1987) Introduction to optimization. Optimization Software, Inc., Publication Division, New York

  • Rajapakse JC, Giedd JN, Rapoport JL (2004) Statistical approach to segmentation of single-channel cerebral MR images. IEEE Trans Medical Imaging 16: 176–186

    Article  Google Scholar 

  • Rockfellar RT (1970) Convex Analysis. Princeton: Princeton University

    Google Scholar 

  • Ronan C, Fabian S, Jason W, Léon B (2006) Trading convexity for scalability. In: International conference on machine learning (ICML), Pittsburgh, Pennsylvania, pp. 201–208, ISBN: 1-59593-383-2

  • Shen X, Tseng GC, Zhang X, Wong WH (2003) ψ-learning. J Am Stat Assoc 98:724–734

    Article  MATH  Google Scholar 

  • Weber S, Schüle T, Schnörr C (2005) Prior learning and convex–concave regularization of binary tomography. Electron Notes Discr Math 20:313–327

    Article  Google Scholar 

  • Yuille AL, Rangarajan A (2003) The convex conCave procedure (CCCP). Neural Comput 15:915-936

    Article  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hoai An Le Thi.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Thi, H.A.L., Le, H.M. & Dinh, T.P. Fuzzy clustering based on nonconvex optimisation approaches using difference of convex (DC) functions algorithms. ADAC 1, 85–104 (2007). https://doi.org/10.1007/s11634-007-0011-2

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11634-007-0011-2

Keywords

JEL Classification

Navigation