D-GridMST: Clustering Large Distributed Spatial Databases

Zhang, Ji; Liu, Han

doi:10.1007/11011620_5

Part of the book series: Studies in Computational Intelligence (SCI, volume 4)

Abstract

In this paper, we propose a novel distributable clustering algorithm, Distributed-GridMST (D-GridMST for short), for large distributed spatial databases. D-GridMST partitions the data space with a grid and uses density criteria to extract representative points from the spatial databases, over which a global minimum spanning tree (MST) of representatives is constructed. This MST is then partitioned according to the user's clustering specification and used to label the data points in each distributed spatial database. D-GridMST is characterized by fast speed, low space requirements, and small network transfer overhead. Experimental results show that D-GridMST is effective: it produces exactly the same clustering result as the centralized paradigm, making it a promising tool for clustering large distributed spatial databases.
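
The pipeline outlined in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation, and every concrete choice in it is an assumption made for illustration: a "representative" is taken to be the index of a grid cell whose global point count reaches a hypothetical threshold `min_pts`, the "clustering specification" is taken to be a target number of clusters `k`, and points in sparse cells are labeled `-1`. All function and parameter names are hypothetical.

```python
import math
from collections import Counter

def cell_of(point, cell_size):
    """Grid cell index (assumed 2-D) that contains the point."""
    x, y = point
    return (math.floor(x / cell_size), math.floor(y / cell_size))

def local_cell_counts(points, cell_size):
    """Run at each site: only per-cell counts leave the site, not the points."""
    return Counter(cell_of(p, cell_size) for p in points)

def representatives(site_counts, min_pts):
    """Run centrally: sum the per-site counts and keep the dense cells."""
    total = Counter()
    for counts in site_counts:
        total.update(counts)
    return sorted(c for c, n in total.items() if n >= min_pts)

def mst_edges(reps):
    """Prim's algorithm on the complete graph over representatives, O(n^2)."""
    if not reps:
        return []
    # best[i] = (distance to the tree, tree node that achieves it)
    best = {i: (math.dist(reps[0], reps[i]), 0) for i in range(1, len(reps))}
    edges = []
    while best:
        i = min(best, key=lambda j: best[j][0])
        w, parent = best.pop(i)
        edges.append((w, parent, i))
        for j in best:
            d = math.dist(reps[i], reps[j])
            if d < best[j][0]:
                best[j] = (d, i)
    return edges

def cluster_cells(reps, k):
    """Cut the k-1 longest MST edges; return {cell: cluster_id}."""
    edges = sorted(mst_edges(reps))
    kept = edges[:max(len(edges) - (k - 1), 0)]
    parent = list(range(len(reps)))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    for _, a, b in kept:
        parent[find(a)] = find(b)
    ids = {}
    return {reps[i]: ids.setdefault(find(i), len(ids)) for i in range(len(reps))}

def label_site(points, cell_labels, cell_size):
    """Run at each site: label local points by cell lookup (-1 = sparse cell)."""
    return [cell_labels.get(cell_of(p, cell_size), -1) for p in points]
```

In this sketch the only data sent over the network are per-cell counts and the final cell-to-cluster map. Because summing per-site counts yields the same totals as counting over the pooled data, the labels match those of a centralized run of the same procedure, which mirrors the equivalence claimed in the abstract.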

Author information

Authors and Affiliations

Department of Computer Science, University of Toronto, M5S3G4, Toronto, Ontario, Canada
Ji Zhang & Han Liu

Editor information

Saman K. Halgamuge, Lipo Wang

About this chapter

Cite this chapter

Zhang, J., Liu, H. D-GridMST: Clustering Large Distributed Spatial Databases. In: Halgamuge, S.K., Wang, L. (eds) Classification and Clustering for Knowledge Discovery. Studies in Computational Intelligence, vol 4. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11011620_5

DOI: https://doi.org/10.1007/11011620_5
Published: 22 August 2005
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-26073-8
Online ISBN: 978-3-540-32404-1
eBook Packages: Engineering, Engineering (R0)
