Abstract
A good approach in data mining is density based clustering in which the clusters are constructed based on the density of shape regions. The prominent algorithm proposed in density based clustering family is DBSCAN [1] that uses two global density parameters, namely minimum number of points for a dense region and epsilon indicating the neighborhood distance. Among others, one of the weaknesses of this algorithm is its un-suitability for multi-density data sets where different regions have various densities so the same epsilon does not work. In this paper, a new density based clustering algorithm, MSDBSCAN, is proposed. MSDBSCAN uses a new definition for core point and dense region. The MSDBSCAN can find clusters in multi-variant density data sets. Also this algorithm benefits scale independency. The results obtained on data sets show that the MSDBSCAN is very effective in multi-variant environment.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Kriegel, H.P., Sander, J., Xu, X., Ester, M.: A density-based algorithm for discovering clusters in large spatial databases with noise. In: Proceedings of the Second International Conference on Knowledge Discovery and Data Mining, Menlo Park, CA, pp. 226–231 (1996)
Breunig, M.M., Kriegel, H.P., Sander, J., Ankerst, M.: OPTICS: Ordering Points To Identify The Clustering Structure. In: Proceedings of the 1999 ACM SIGMOD International Conference on Management of Data, Philadelphia, USA, pp. 49–60 (1999)
Han, J., Kamber, M., Pei, J.: Data Mining: Concepts and Techniques, 2nd edn., Diane Cerra, USA (2006)
MacQueen, J.B.: Some methods for classification and analysis of multivariate observations. In: Proceeding of the Fifth Berekely Symposium on Mathematics, Statistics and Probabilities, vol. 1, pp. 281–297 (1967)
Kaufman, L., Rousseeuw, P.J.: Finding Groups in Data: An Introduction to Cluster Analysis. Wiley Interscience, Hoboken (2005)
Zhang, T., Ramakrishnan, R., Livny, M.: BIRCH: an efficient data clustering method for very large databases. SIGMOD Record 25, 103–114 (1996)
Karypis, G., Han, E., Kumar, V.: Chameleon: Hierarchical Clustering Using Dynamic Modeling. Computer 32, 68–75 (1999)
Duan, L., Xu, L., Guo, F., Lee, J., Yan, B.: A local-density based spatial clustering algorithm with noise. Information Systems 32, 978–986 (2007)
Breunig, M.M., Kriegel, H.P., Ng, R.T., Sander, J.: LOF: identifying density-based local outliers. SIGMOD Record 29, 93–104 (2000)
Karypis, G.: Karypis Lab (2010), http://glaros.dtc.umn.edu/gkhome/cluto/cluto/download
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Esfandani, G., Abolhassani, H. (2010). MSDBSCAN: Multi-density Scale-Independent Clustering Algorithm Based on DBSCAN. In: Cao, L., Feng, Y., Zhong, J. (eds) Advanced Data Mining and Applications. ADMA 2010. Lecture Notes in Computer Science(), vol 6440. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-17316-5_19
Download citation
DOI: https://doi.org/10.1007/978-3-642-17316-5_19
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-17315-8
Online ISBN: 978-3-642-17316-5
eBook Packages: Computer ScienceComputer Science (R0)