Abstract
k-Plane clustering (kPC) and k-proximal plane clustering (kPPC) cluster data points to the center plane, instead of clustering data points to cluster center in k-means. However, the cluster center plane constructed by kPC and kPPC is infinitely extending, which will affect the clustering performance. In this paper, we propose a local k-proximal plane clustering (LkPPC) by bringing k-means into kPPC which will force the data points to center around some prototypes and thus localize the representations of the cluster center plane. The contributions of our LkPPC are as follows: (1) LkPPC introduces localized representation of each cluster center plane to avoid the infinitely confusion. (2) Different from kPPC, our LkPPC constructs cluster center plane that makes the data points of the same cluster close to both the same center plane and the prototype, and meanwhile far away from the other clusters to some extent, which leads to solve eigenvalue problems. (3) Instead of randomly selecting the initial data points, a Laplace graph strategy is established to initialize the data points. (4) The experimental results on several artificial datasets and benchmark datasets show the effectiveness of our LkPPC.
References
Han J, Kamber M (2006) Data mining concepts and techniques. Morgan Kaufmann, San Francisco
Anderberg M (1973) Cluster analysis for applications. Academic Press, New York
Aldenderfer M, Blashfield R (1985) Cluster analysis. Sage, Los Angeles
Jain A, Murty M, Flynn P (1999) Data clustering: a review. ACM Comput Surv (CSUR) 31(3):264–323
Cai W, Chen S, Zhang D (2007) Fast and robust fuzzy c-means clustering algorithms incorporating local information for image segmentation. Pattern Recognition 40(3):825–838
Wu Z, Leahy R (1993) An optimal graph theoretic approach to data clustering: theory and its application to image segmentation. IEEE Trans Pattern Anal Mac Intell 15(11):1101–1113
Saha S, Bandyopadhyay S (2011) Automatic MR brain image segmentation using a multiseed based multiobjective clustering approach. Appl Intell 35(3):411–427
Berry M (2004) Survey of text mining I: clustering, classification, and retrieval, vol 1. Springer, Berlin
Hotho A, Nrnberger A, Paab G (2005) A brief survey of text mining. Ldv Forum 20(1):19–62
Shi K, Li L (2013) High performance genetic algorithm based text clustering using parts of speech and outlier elimination. Appl Intell 38(4):511–519
Yu Z, Wong H, Wang H (2007) Graph-based consensus clustering for class discovery from gene expression data. Bioinformatics 23(21):2888–2896
Bandyopadhyay S, Mukhopadhyay A, Maulik U (2007) An improved algorithm for clustering gene expression data. Bioinformatics 23(21):2859–2865
Li C, Xia M, Peng W, Yu X, Mitsuru I (2012) Mandarin emotion recognition combining acoustic and emotional point information. Appl Intell 37(4):602–612
Joseph K, Samy B (2009) Automatic speech and speaker recognition: large margin and kernel methods. Wiley Online Library, Hoboken
Bradley P, Mangasarian O (1997) Clustering via concave minimization. Adv Neural Inf Proces Syst 9:368–374
Dembele D, Kastner P (2003) Fuzzy c-means method for clustering microarray data. Bioinformatics 19(8):973–980
Bradley P, Mangasarian O (2000) k-plane clustering. J Glob Optim 16(1):23–32
Tseng P (2000) Nearest q-flat to m points. J Optim Theory Appl 105(1):249–252
Wang Y, Jiang Y, Wu Y, Zhou Z (2011) Localized k-flats. In: Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence, pp 525–530
Zhang T, Szlam A, Wang Y, Lerman G (2010) Randomized hybrid linear modeling by local best-fit flats. In: In CVPR, pp 1927–1934
Shao Y, Bai L, Wang Z, Hua X, Deng N (2013) Proximal plane clustering via eigenvalues. Proc Comput Sci 17:41–47
Shao Y, Guo Y, Wang Z, Deng N (2014) k-proximal plane clustering. Neurocomputing (submitted)
Mangasarian O, Wild E (2006) Multisurface proximal support vector classification via generalize eigenvalues. IEEE Trans Pattern Anal Mach Intell 28(1):69–74
Shao Y, Deng N, Chen W, Wang Z (2013) Improved generalized eigenvalue proximal support vector machine. IEEE Signal Process Lett 20(3):213–216
Shao Y, Zhang C, Wang X, Deng N (2011) Improvements on twin support vector machines. IEEE Trans Neural Netw 22(6):962–968
Shao Y, Deng N, Yang Z, Chen W, Wang Z (2012) Probabilistic outputs for twin support vector machines. Knowl-Based Syst 33:145–151
Qi Z, Tian Y, Shi Y (2012) Twin support vector machine with universum data. Neural Netw 36:112–119
Shao Y, Deng N (2012) A coordinate descent margin based-twin support vector machine for classification. Neural Netw 25:114–121
Qi Z, Tian Y, Shi Y (2012) Laplacian twin support vector machine for semi-supervised classification. Neural Netw 35:46–53
Balasundaram S, Tanveer M (2013) On lagrangian twin support vector regression. Neural Comput Appl 22(1):257–267
Tanveer M (2014) Robust and sparse linear programming twin support vector machines. Cogn Comput 6:1866–9956
Qi Z, Tian Y, Shi Y (2013) Robust twin support vector machine for pattern classification. Pattern Recognit 46(1):305–316
Qi Z, Tian Y, Shi Y (2013) Structural twin support vector machine for classification. Knowl-Based Syst 43:74–81
Scarborough J (1958) Numerical mathematical analysis, 4th edn. Johns Hopkins Press, New York
Deng N, Tian Y, Zhang C (2013) Support vector machines: optimization based theory, algorithms, and extensions. CRC Press, Boca Raton
Naldi M, Campello R (2014) Evolutionary k-means for distributed datasets. Neurocomputing 127:30–42
Bradley P, Fayyad U (1998) Refining initial points for k-means clustering. In: Proceedings of the 15 International Conference on Machine Learning (ICML98), pp 91–99
Fayyad U, Reina C, Bradley P (1998) Initialization of iterative refinement clustering algorithms. In: Proceedings of 14th International Conference on Machine Learning (ICML), pp 194–198
Luxburg U (2007) A tutorial on spectral clustering. Stat Comput 17(4):395–416
Blake CL, Merz CJ (1998) UCI repository for machine learning databases. http://www.ics.uci.edu/mlearn/MLRepository.html
Matlab, User’s Guide, The MathWorks Inc. http://www.mathworks.com (1994–2001)
Halkidi M, Batistakis Y, Vazirgiannis M (2001) On clustering validation techniques. Intell Inf Syst J 17:107–145
Demsar J (2006) Statistical comparisons of classifiers over multiple data sets. J Mac Learn Res 7:1–30
Garcia S, Herrera F (2008) An extension on “statistical comparisons of classifiers over multiple data sets” for all pairwise comparisons. J Mac Learn Res 9:2677–2694
Garcia S, Fernandez A, Luengo J, Herrera F (2010) Advanced nonparametric tests for multiple comparisons in the design of experiments in computational intelligence and data mining: experimental analysis of power. Inf Sci 9:2044–2064
Yang B, Chen S (2010) Sample-dependent graph construction with application to dimensionality reduction. Neurocomputing 74:301–314
Tian Y, Shi Y, Liu X (2012) Recent advances on support vector machines research. Technol Econ Dev Econ 18(1):5–33
Shao Y, Deng N, Yang Z (2012) Least squares recursive projection twin support vector machine for classification. Pattern Recognit 45(6):2299–2307
Shao YH, Wang Z, Chen WJ, Deng NY (2013) Least squares twin parametric-margin support vector machine for classification. Appl Intell 39(3):1–14
Ferraro MB, Guarracino MR (2014) From separating to proximal plane classifiers: a review, clusters, orders, and trees: methods and applications. Springer Optim Appl 92:167–180
Tian Y, Qi Z, Ju X, Shi Y, Liu X (2014) Nonparallel support vector machines for pattern classification. Cybern IEEE Trans 44(7):1067–1079
Acknowledgments
This work is supported by the National Natural Science Foundation of China (Nos. 11201426, 11371365, and 10971223), the Zhejiang Provincial Natural Science Foundation of China (Nos. LQ12A01020, LQ13F030010, and LQ14G010004), the Ministry of Education, Humanities and Social Sciences Research Project of China (No. 13YJC910011), and the Scientific Research Fund of Zhejiang Provincial Education Department (No. Y201432746).
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Yang, ZM., Guo, YR., Li, CN. et al. Local k-proximal plane clustering. Neural Comput & Applic 26, 199–211 (2015). https://doi.org/10.1007/s00521-014-1707-9
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00521-014-1707-9