Abstract
This paper proposes a novel distributable clustering algorithm, Distributed-GridMST (D-GridMST for short), for large distributed spatial databases. D-GridMST partitions the data space with a grid and applies density criteria to extract representative points from each spatial database; a global minimum spanning tree (MST) is then constructed over these representatives. The MST is partitioned according to the user's clustering specification, and the resulting partition is used to label the data points in each distributed spatial database. D-GridMST is characterized by fast execution, low space requirements, and low network transfer overhead. Experimental results show that D-GridMST is effective: it produces exactly the same clustering result as the centralized approach, which makes it a promising tool for clustering large distributed spatial databases.
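The pipeline summarized in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes that each site sends only per-cell counts and coordinate sums to a coordinator, that a cell is "dense" when its merged count reaches a hypothetical threshold `min_pts`, that each dense cell is represented by its centroid, and that the clustering specification is a target number of clusters `k` obtained by cutting the `k - 1` longest MST edges. All function and parameter names are illustrative.

```python
import math
from collections import defaultdict

def local_cell_summary(points, cell_size):
    """Per site: map each grid cell to [count, sum_x, sum_y] (assumed summary format)."""
    summary = defaultdict(lambda: [0, 0.0, 0.0])
    for x, y in points:
        cell = (math.floor(x / cell_size), math.floor(y / cell_size))
        summary[cell][0] += 1
        summary[cell][1] += x
        summary[cell][2] += y
    return summary

def merge_summaries(summaries):
    """Coordinator: add up the per-cell statistics received from all sites."""
    merged = defaultdict(lambda: [0, 0.0, 0.0])
    for summary in summaries:
        for cell, (n, sx, sy) in summary.items():
            merged[cell][0] += n
            merged[cell][1] += sx
            merged[cell][2] += sy
    return merged

def representatives(merged, min_pts):
    """Keep the centroid of every cell holding at least min_pts points."""
    return [(sx / n, sy / n) for n, sx, sy in merged.values() if n >= min_pts]

def mst_edges(nodes):
    """Prim's algorithm on the complete Euclidean graph; returns (weight, i, j) tuples."""
    n = len(nodes)
    in_tree = [False] * n
    best = [(math.inf, -1)] * n  # (distance to tree, tree neighbour)
    if n:
        best[0] = (0.0, -1)
    edges = []
    for _ in range(n):
        u = min((i for i in range(n) if not in_tree[i]), key=lambda i: best[i][0])
        in_tree[u] = True
        if best[u][1] >= 0:
            edges.append((best[u][0], best[u][1], u))
        for v in range(n):
            if not in_tree[v]:
                d = math.dist(nodes[u], nodes[v])
                if d < best[v][0]:
                    best[v] = (d, u)
    return edges

def cluster_representatives(reps, k):
    """Cut the k-1 longest MST edges and label the remaining connected components."""
    edges = sorted(mst_edges(reps))
    kept = edges[:max(0, len(edges) - (k - 1))]
    parent = list(range(len(reps)))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for _, i, j in kept:
        parent[find(i)] = find(j)
    ids = {}
    return [ids.setdefault(find(i), len(ids)) for i in range(len(reps))]

def label_points(points, reps, rep_labels):
    """Per site: give each point the cluster of its nearest representative (-1 if none)."""
    if not reps:
        return [-1] * len(points)
    return [rep_labels[min(range(len(reps)), key=lambda r: math.dist(p, reps[r]))]
            for p in points]
```

In this sketch only the cell summaries and the labelled representatives cross the network, which is one way to read the low transfer overhead claimed in the abstract; the actual density criteria and labelling rule of D-GridMST may differ.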
About this chapter
Cite this chapter
Zhang, J., Liu, H. D-GridMST: Clustering Large Distributed Spatial Databases. In: Halgamuge, S.K., Wang, L. (eds) Classification and Clustering for Knowledge Discovery. Studies in Computational Intelligence, vol 4. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11011620_5
DOI: https://doi.org/10.1007/11011620_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-26073-8
Online ISBN: 978-3-540-32404-1
eBook Packages: Engineering, Engineering (R0)