Skip to main content

Discovering associations in spatial data — An efficient medoid based approach

  • Papers
  • Conference paper
  • First Online:
Research and Development in Knowledge Discovery and Data Mining (PAKDD 1998)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 1394))

Included in the following conference series:

Abstract

Spatial data mining is the discovery of novel and interesting relationships and characteristics that may exist implicitly in spatial databases. The identification of clusters coupled with Geographical Information System provides a means of information generalization. A variety of clustering approaches exists. A non-hierarchical method in data mining applications is the medoid approach. Many heuristics have been developed for this approach. This paper carefully analyses the complexity of hill-climbing heuristics for medoid based spatial clustering. Improvements to recently suggested heuristics like CLARANS are identified. We propose a novel idea, the stopping early of the heuristic search, and demonstrate that this provides large savings in computational time while the quality of the partition remains unaffected.

This research was supported in part by a grant from the Australian Research Council.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Y. Cai, N. Cercone, and J. Han. Attribute-oriented induction in relational databases. In G. Piatetsky-Shapiro and W.J. Frawley, Knowledge Discovery in Databases, 213–228, Menlo Park, CA. USA, 1991. AAAI Press.

    Google Scholar 

  2. M. Ester, H.P. Kriegel, and J. Sander. Spatial data mining: A database approach. In A. School and A. Voisrad, Advances in Spatial Databases, 5th Int. Symp., SDD97, 47–66, Berlin, Germany, 1997. Springer-Verlag LNCS 1262.

    Chapter  Google Scholar 

  3. M. Ester, H.P. Kriegel, S. Sander, and X. Xu. A density-based algorithm for discovering clusters in large spatial databases with noise. In E. Simoudis, J. Han, and U. Fayyad, Proc. of the 2nd Int. Conf. on Knowledge Discovery and Data Mining (KDD-96), 226–231, Menlo Park, CA, 1996. AAAI, AAAI Press.

    Google Scholar 

  4. V. Estivill-Castro and A.T. Murray. Mining spatial data via clustering. Technical Report #5/97, FIT, QUT, Brisbane 4000, Australia, 1997.

    Google Scholar 

  5. A. Ghozeil and D.B. Fogel. Discovering patterns in spatial data using evolutionary programming. In J.R. Koza, Genetic Programming: Proc. of the First Annual Conf., 521–527, Cambridge, MA, 1996. MIT Press.

    Google Scholar 

  6. M Goodchild and V. Noronha. Location-allocation for small computers. Monograph 8, University of Iowa, 1983.

    Google Scholar 

  7. K. Han, J. Koperski and N. Stefanovic. GeoMiner: A system prototype for spatial data mining. SIGMOD Record, 26(2):553–556, 1997.

    Article  Google Scholar 

  8. D.S. Johnson, C.H. Papadimitrou, and M. Yanakakis. How easy is local search? Journal of Computer System Sciences, 37:79–100, 1988.

    Article  MathSciNet  Google Scholar 

  9. L. Kaufman and P.J. Rousseuw. Finding Groups in Data: An Introduction to Cluster Analysis. John Wiley & Sons, NY, US, 1990.

    Book  Google Scholar 

  10. A.T. Murray and R.L. Church. Applying simulated annealing to location-planning models. Journal of Heuristics, 2:31–53, 1996.

    Article  Google Scholar 

  11. A.T. Murray and V. Estivill-Castro. Cluster discovery techniques for exploratory spatial data analysis. Int. J. of GIS (to appear).

    Google Scholar 

  12. R.T. Ng and J. Han. Efficient and effective clustering methods for spatial data mining. In J. Bocca, M. Jarke, and C. Zaniolo, Proc. of the 20th Conf. on Very Large Data Bases (VLDB), 144–155, San Francisco, CA, 1994. Morgan Kaufmann.

    Google Scholar 

  13. D. Rolland, E. Schilling and J. Current. An efficient tabu search procedure for the p-median problem. European Journal of Operations Research, 96:329–342, 1996.

    Article  MATH  Google Scholar 

  14. P. Sorensen. Analysis and design of heuristics for the p-median location-allocation problem. Master's thesis, Dep. of Geography, U. California, Santa Barbara, 1994.

    Google Scholar 

  15. T. Zhang, R. Ramakrishnan, and M. Livny. BIRCH:an efficient data clustering method for very large databases. SIGMOD Record, 25(2):103–114, June 1996.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 1998 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Estivill-Castrol, V., Murray, A.T. (1998). Discovering associations in spatial data — An efficient medoid based approach. In: Wu, X., Kotagiri, R., Korb, K.B. (eds) Research and Development in Knowledge Discovery and Data Mining. PAKDD 1998. Lecture Notes in Computer Science, vol 1394. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-64383-4_10

Download citation

  • DOI: https://doi.org/10.1007/3-540-64383-4_10

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-64383-8

  • Online ISBN: 978-3-540-69768-8

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics