Abstract
This paper presents a distributed Grid-based Density Clustering using Triangle-subdivision (DGDCT), capable of identifying arbitrary shaped embedded clusters as well as multi density clusters over large spatial datasets. Experimental results are presented to establish the superiority of the technique in terms of scale-up, speedup as well as cluster quality.
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Han, J., Kamber, M.: Data Mining: Concepts and Techniques. Morgan Kaufmann Publishers, San Francisco (2004)
Ester, M., Kriegel, H.P., et al.: A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise. In: KDD 1996, Portland, pp. 226–231 (1996)
Wang, W., et al.: STING: A Statistical Information Grid Approach to Spatial data Mining. In: VLDB 1997, Athens, pp. 186–195 (1997)
Sheikholeslami, G., et al.: Wavecluster: A Multi- resolution Clustering approach for very large spatial database. In: SIGMOD 1998, Seattle (1998)
Agrawal, R., et al.: Automatic subspace clustering of high dimensional data for data mining applications. In: SIGMOD 1998, pp. 94–105 (1998)
Nagesh, H.S., et al.: A scalable parallel subspace clustering algorithm for massive data sets. In: Proc. of Intnl. Conf. on Parallel Processing, p. 477 (2000)
Ertoz, L., et al.: Finding Clusters of Different Sizes, Shapes, and Densities in Noisy, High Dimensional Data. In: SDM 2003. SIAM Intnl. Conf. on Data Mining (2003)
Karypis, G., et al.: CHAMELEON: A hierarchical clustering algorithm using dynamic modeling. IEEE Computer 32(8), 68–75 (1999)
Yan-chang, Z.: Clustering Datasets Containing Clusters of Various Densities. Journal of Beijing University of Posts and Telecomm 26(2), 42–47 (2003)
Kim, H.S., et al.: DGCL: An Efficient Density and Grid Based Clustering Algorithm for Large Spatial Database. In: Yu, J.X., Kitsuregawa, M., Leong, H.V. (eds.) WAIM 2006. LNCS, vol. 4016, pp. 362–371. Springer, Heidelberg (2006)
Ankerst, M., et al.: OPTICS: Ordering Points To Identify the Clustering Structure. In: ACM-SIGMOD 1999, pp. 49–60 (1999)
Roy, S., Bhattacharyya, D.K.: An Approach to Find Embedded Clusters Using Density Based Techniques. In: Chakraborty, G. (ed.) ICDCIT 2005. LNCS, vol. 3816, pp. 523–535. Springer, Heidelberg (2005)
Borah, B., et al.: A Parallel Density-Based Data Clustering Technique on Distributed Memory Multicomputers. In: ADCOM 2004, Ahmedabad (2004)
Januzaj, E., et al.: Towards Effective and Efficient Distributed Clustering. In: ICDM 2003, Florida (2003)
Retrieved from, http://steve.hollasch.net/cgindex/math/barycentric.html
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Sarmah, S., Das, R., Bhattacharyya, D.K. (2007). DGDCT: A Distributed Grid-Density Based Algorithm for Intrinsic Cluster Detection over Massive Spatial Data. In: Rao, S., Chatterjee, M., Jayanti, P., Murthy, C.S.R., Saha, S.K. (eds) Distributed Computing and Networking. ICDCN 2008. Lecture Notes in Computer Science, vol 4904. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-77444-0_22
Download citation
DOI: https://doi.org/10.1007/978-3-540-77444-0_22
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-77443-3
Online ISBN: 978-3-540-77444-0
eBook Packages: Computer ScienceComputer Science (R0)