ABSTRACT
Cluster analysis is one of the hot issues in the field of data mining and it has extensive applications in many aspects. The label propagation algorithm is easy to implement. At the same time, it has a low time complexity which has been recognized by scholars. Because the algorithm needs to specify the category labels of the data set, the accuracy and adaptability of the algorithm are affected. In view of the above problems, this paper proposes a new clustering algorithm that combines the advantages of density-based and label propagation. The algorithm adaptively determines the label of the data points through local density and reducing the effect of noise on the results. Experimental results show that the proposed algorithm has better adaptability while improving the accuracy of clustering results.
- Zhou Weixing, Liao Huan. Image Region Segmentation Based on K-Means Clustering and Probabilistic Relaxation{J}. Computer Technology and Development, 2010, 20(2):68 -70.Google Scholar
- Nisha, Kaur P J. Cluster quality based performance evaluation of hierarchical clustering method{C}// International Conference on Next Generation Computing Technologies. IEEE, 2016:649--653.Google Scholar
- R.Yogita, Dr.R.Harish, A Study of Hierarchical Clustering Algorithm, International Journal of Information and Computation Technology, vol.3, p. 1225--1232, Nov 2013.Google Scholar
- Zhang T, Ramakrishnan R, Livny M. BIRCH: an efficient data clustering method for very large databases{C}// ACM SIGMOD International Conference on Management of Data. ACM, 1996:103--114. Google ScholarDigital Library
- Macqueen J. Some Methods for Classification and Analysis of MultiVariate Observations{C}// Proc. of, Berkeley Symposium on Mathematical Statistics and Probability. 1967:281--297.Google Scholar
- NEster M, Kriegel H P, Xu X. A density-based algorithm for discovering clusters in large spatial databases with noise{C}// International Conference on Knowledge Discovery and Data Mining. AAAI Press, 1996:226--231. Google ScholarDigital Library
- Patwary M M A, Satish N, Sundaram N, et al. Pardicle: Parallel Approximate Density-Based Clustering{C}// High Performance Computing, Networking, Storage and Analysis, SC14: International Conference for. IEEE, 2014:560--571. Google ScholarDigital Library
- Wang W, Yang J, Muntz R R. STING: A Statistical Information Grid Approach to Spatial Data Mining{C}// International Conference on Very Large Data Bases. Morgan Kaufmann Publishers Inc. 1997:186--195. Google ScholarDigital Library
- Chen Xinquan, Zhou Lingjing, Liu Yaozhong. A Survey of Clustering Algorithms{J}. Integrated Technology, 2017(3):41--49.Google Scholar
- Fisher D. Improving inference through conceptual clustering{C}// National Conference on Artificial Intelligence. Seattle, Wa, July. DBLP, 1987:461--465. Google ScholarDigital Library
- Zhu X, Ghahramani Z. Learning from labeled and unlabeled data with label propagation{R}. Technical Report CMU-CALD-02--107{R}.Pittsburghers:Carnegie Mellon University, 2002.Google Scholar
- Zhang Junli, Chang Yanli, Shi Wen. A Survey of Tag Propagation Algorithm Theory and Its Application{J}. Journal of Computer Applications, 2013, 30(1):21--25.Google Scholar
- Strehl A, Ghosh J. Cluster ensembles: a knowledge reuse framework for combining partitionings{C}// Eighteenth national conference on Artificial intelligence. American Association for Artificial Intelligence, 2002:93--98. Google ScholarDigital Library
- C.J.Veenman, M.J.T.Reinders, and E. Backer, A maximum variance cluster algorithm. IEEE Trans. Pattern Analysis and Machine Intelligence, 2002. 24(9): p. 1273--1280. Google ScholarDigital Library
- A. Gionis, H.Mannila, and P.Tsaparas, Clustering aggregation. ACM Transactions on Knowledge Discovery from Data (TKDD), 2007. 1(1): p. 1--30. Google ScholarDigital Library
- L. Fu and E. Medico, FLAME, a novel fuzzy clustering method for the analysis of DNA microarray data. BMC bioinformatics, 2007. 8(1): p. 3.Google ScholarCross Ref
Index Terms
- A Label Propagation Algorithm Based on Local Density of Data Points
Recommendations
Semi-supervised partial label learning algorithm via reliable label propagation
AbstractPartial label learning (PLL) is a weakly supervised learning method that is able to predict one label as the correct answer from a given candidate label set. In PLL, when all possible candidate labels are as signed to real-world training examples, ...
Transductive Multilabel Learning via Label Set Propagation
The problem of multilabel classification has attracted great interest in the last decade, where each instance can be assigned with a set of multiple class labels simultaneously. It has a wide variety of real-world applications, e.g., automatic image ...
LSSLP - Local structure sensitive label propagation
Label propagation is an approach to iteratively spread the prior state of label confidence associated with each of samples to its neighbors until achieving a global convergence state. Such process has been shown to hold close connection with a general ...
Comments