Abstract
In this paper, a Gaussian Kernel version of the Minimum Sum-of-Squares Clustering \((G\mathcal{K}MSSC)\) is studied. The problem is formulated as a DC (Difference of Convex functions) program for which a new algorithm based on DC programming and DCA (DC Algorithm) is developed. The related DCA is original and very inexpensive. Numerical simulations show the efficiency of DCA and its superiority with respect to K-mean, a standard method for clustering.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Arora, S., Kannan, R.: Learning mixtures of arbitrary Gaussians. In: Proceedings of the 33rd Annual ACM Symposium on Theory of Computing, pp. 247–257 (2001)
Jain, A.K., Murty, M.N., Flynn, P.J.: Data clustering: a review. ACM Comput. Surv. 31(3), 264–323 (1999)
An, L.T.H., Belghiti, T., Tao, P.D.: A new efficient algorithm based on DC programming and DCA for Clustering. Journal of Global Optimization 37, 593–608 (2007)
An, L.T.H., Minh, L.H., Tao, P.D.: Optimization based DC programming and DCA for Hierarchical Clustering. European Journal of Operational Research 183, 1067–1085 (2007)
An, L.T.H., Minh, L.H., Tao, P.D.: Fuzzy clustering based on nonconvex optimisation approaches using difference of convex (DC) functions algorithms. Journal of Advances in Data Analysis and Classification (2), 1–20 (2007)
An, L.T.H., Tao, P.D.: Solving a class of linearly constrained indefinite quadratic problems by DC algorithms. Journal of Global Optimization 11(3), 253–285 (1997)
An, L.T.H., Tao, P.D.: The DC (difference of convex functions) Programming and DCA revisited with DC models of real world nonconvex optimization problems. Annals of Operations Research 133, 23–46 (2005)
Tao, P.D., An, L.T.H.: Convex analysis approach to d.c. programming: Theory, Algorithms and Applications. Acta Mathematica Vietnamica, Dedicated to Professor Hoang Tuy on the Occasion of his 70th Birthday 22(1), 289–355 (1997)
Tao, P.D., An, L.T.H.: DC optimization algorithm for solving the trust region subproblem. SIAM J. Optimization 8, 476–505 (1998)
An, L.T.H., Ngai, N.V., Tao, P.D.: Exact Penalty and Error Bounds in DC Programming. Submitted to Journal of Global Optimization Dedicated to Reiner Horst (2011)
Bradley, B.S., Mangasarian, O.L.: Feature selection via concave minimization and support vector machines. In: Proceedings of the 15th International Conferences on Machine Learning (ICML 1998), San Francisco, California, pp. 82–90 (1998)
Brusco, M.J.: A repetitive branch-and-bound procedure for minimum within-cluster sum of squares partitioning. Psychometrika 71, 347–363 (2006)
Peng, J., Xiay, Y.: A Cutting Algorithm for the Minimum Sum-of-Squared Error Clustering. In: Proceedings of the SIAM International Data Mining Conference (2005)
Hansen, P., Jaumard, B.: Cluster analysis and mathematical programming. Mathematical Programming 79, 191–215 (1997)
Sherali, H.D., Desai, J.: A global optimization RLT-based approach for solving the hard clustering problem. Journal of Global Optimization 32, 281–306 (2005)
Fisher, D.: Knowledge acquisition via incremental conceptual clustering. Machine Learning 2, 139–172 (1987)
Hartigan, J.A.: Clustering Algorithms. Wiley, New York (1975)
Vinod, H.D.: Integer programming and the theory of grouping. J. Amer. Stat. Assoc. 64, 506–519 (1969)
Mladenović, N., Hansen, P.: Variable neighborhood search. Comput. Oper. Res. 24, 1097–1100 (1997)
Herbrich, R.: Herbrich, Learning kernel classifiers. MIT Press (2002)
Filippone, M., Camastra, F., Masulli, F., Rovetta, S.: A survey of kernel and spectral methods for clustering. Pattern Recognition 41, 176–190 (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Minh, L.H., An, L.T.H., Tao, P.D. (2012). Gaussian Kernel Minimum Sum-of-Squares Clustering and Solution Method Based on DCA. In: Pan, JS., Chen, SM., Nguyen, N.T. (eds) Intelligent Information and Database Systems. ACIIDS 2012. Lecture Notes in Computer Science(), vol 7197. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-28490-8_35
Download citation
DOI: https://doi.org/10.1007/978-3-642-28490-8_35
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-28489-2
Online ISBN: 978-3-642-28490-8
eBook Packages: Computer ScienceComputer Science (R0)