Abstract
Proximity and density information modeling of 2D point-data by Delaunay Diagrams has delivered a powerful exploratory and argument-free clustering algorithm [6] for geographical data mining [13]. The algorithm obtains cluster boundaries using a Short-Long criterion and detects non-convex clusters, high and low density clusters, clusters inside clusters and many other robust results. Moreover, its computation is linear in the size of the graph used. This paper demonstrates that the criterion remains effective for exploratory analysis and spatial data mining where other proximity graphs are used. It also establishes a hierarchy of the modeling power of several proximity graphs and presents how the argument free characteristic of the original algorithm can be traded for argument tuning. This enables higher than 2 dimensions by using linear size proximity graphs like k-nearest neighbors.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
M. Ankerst, M.M. Breunig, H.P. Kriegel, and J. Sander. OPTICS: Ordering Points to Identify the Clustering Structure. In Proc. ACM-SIGMOD-99, pages 49–60, 1999.
C. Eldershaw and M. Hegland. Cluster Analysis using Triangulation. In CTAC97, pages 201–208. World Scientific, Singapore, 1997.
M. Ester, H.P. Kriegel, J. Sander, and X. Xu. A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise. In Proc. 2nd Int. Conf. KDDM, pages 226–231, 1996.
V. Estivill-Castro and M.E. Houle. Robust Clustering of Large Geo-referenced Data Sets. In Proc. 3rd PAKDD, pages 327–337, 1999.
V. Estivill-Castro and I. Lee. AMOEBA: Hierarchical Clustering Based on Spatial Proximity Using Delaunay Diagram. In Proc. 9th Int. SDH, pages 7a.26–7a.41, 2000.
V. Estivill-Castro and I. Lee. AUTOCLUST: Automatic Clustering via Boundary Extraction for Mining Massive Point-Data Sets. In Proceedings of GeoComputation 2000, 2000.
V. Estivill-Castro, I. Lee, and A.T. Murray. Spatial Clustering Analysis with Proximity Graphs Based on Cluster Boundary Characteristics. Technical Report 2000-07, http://www.cs.newcastle.edu.au, Department of CS & SE, University of Newcastle, 2000.
I. Kang, T. Kim, and K. Li. A Spatial Data Mining Method by Delaunay Triangulation. In Proc. 5th Int. ACM-Workshop on Advances in GIS, pages 35–39, 1997.
G. Karypis, E. Han, and V. Kumar. CHAMELEON: A Hierarchical Clustering Algorithm Using Dynamic Modeling. IEEE Computer, 32(8):68–75, 1999.
D.W. Matula and R.R. Sokal. Properties of Gabriel Graphs Relevant to Geographic Variation Research and the Clustering of Points in the Plane. Geographical Analysis, 12:205–222, 1980.
R.T. Ng and J. Han. Efficient and Effective Clustering Method for Spatial Data Mining. In 20th VLDB, pages 144–155, 1994.
A. Okabe, B.N. Boots, and K. Sugihara. Spatial Tessellations: Concepts and Applications of Voronoi Diagrams. John Wiley & Sons, West Sussex, 1992.
S. Openshaw. Geographical data mining: key design issues. In Proceedings of GeoComputation 99, 1999.
W. Wang, J. Yang, and R. Muntz. STING+: An Approach to Active Spatial Data Mining. In Proc. ICDE, pages 116–125, 1999.
T. Zhang, R. Ramakrishnan, and M. Livny. BIRCH: An Efficient Data Clustering Method for Very Large Databases. In Proc. ACM SIGMOD, pages 103–114, 1996.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2001 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Estivill-Castro, V., Lee, I., Murray, A.T. (2001). Criteria on Proximity Graphs for Boundary Extraction and Spatial Clustering. In: Cheung, D., Williams, G.J., Li, Q. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2001. Lecture Notes in Computer Science(), vol 2035. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45357-1_37
Download citation
DOI: https://doi.org/10.1007/3-540-45357-1_37
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-41910-5
Online ISBN: 978-3-540-45357-4
eBook Packages: Springer Book Archive