Efficient Spatial Clustering Algorithm Using Binary Tree

Ali, Mohsin; Li, Xue; Dong, Zhao Yang

doi:10.1007/11508069_39

Mohsin Ali¹⁹,
Xue Li¹⁹ &
Zhao Yang Dong¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 3578))

Included in the following conference series:

International Conference on Intelligent Data Engineering and Automated Learning

1344 Accesses

Abstract

In this paper we present an efficient k-Means clustering algorithm for two dimensional data. The proposed algorithm re-organizes dataset into a form of nested binary tree. Data items are compared at each node with only two nearest means with respect to each dimension and assigned to the one that has the closer mean. The main intuition of our research is as follows: We build the nested binary tree. Then we scan the data in raster order by in-order traversal of the tree. Lastly we compare data item at each node to the only two nearest means to assign the value to the intendant cluster. In this way we are able to save the computational cost significantly by reducing the number of comparisons with means and also by the least use to Euclidian distance formula. Our results showed that our method can perform clustering operation much faster than the classical ones.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Han, J., Kamber, M.: Data Mining Basic and Advanced Technigues. Morgan Kaufmann Publishers, San Francisco (2001)
Google Scholar
Alsabti, K., Ranka, S., Singh, V.: An Efficient k-Means Clustering Algorithm. In: 11th International Parallel Processing Symposium (1998)
Google Scholar
Wirth, N.: Algorithms and Data Structures. Prentice Hall, Inc., Englewood cliffs (1986)
MATH Google Scholar
Weiss, M.A.: Data Structures and Algorithm Analysis. The Benjamin/Cummings Publishing Company, Inc., Redwood City (1992)
Google Scholar
Aho, A.V., Hopcroft, J.E., Ullman, J.D.: Data structures and algorithms. Addison-Wesley, Reading (1983)
MATH Google Scholar
Karyapis, G., Han, E.H., Kumar, V.: CHAMELEON: A Hierarchical Clustering Algorithm Using Dynamic Modeling. IEEE Computer (1999) (special Issue on Data Analysis and Mining)
Google Scholar
Han, J., Kamber, M., Tung, A.: Spatial Clustering Methods in Data Mining: A Survey. In: Miller, H., Han, J. (eds.) Geographic Data Mining and Knowledge Discovery, p. 21. Taylor and Francis, Abington (2001)
Google Scholar
Halkidi, M., Batistakis, Y., Vazirgiannis, M.: Cluster Validity Methods: Part 1. ACM SIGMOD Record 31(2) (June 2002)
Google Scholar
Kanungo, T., David, M.M., Nathan, S.N., Piatko, C.D.: A Local Search Approximation Algorithm for k-Means Clustering. ACM Press, New York (2002)
Google Scholar
Kanungo, T., Mount, D.M., Netanyahu, N., Piatko, C., Silverman, R., Wu, A.Y.: An efficient k-means clustering algorithm: analysis and implementation. IEEE Trans. Pattern Analysis and Machine Intelligence 24, 881–892 (2002)
Article Google Scholar
Estivill-Castro, V., Fast, J.Y.: Robust General Purpose clustering algorithms. Data Mining and knowledge discovery 8, 127–150 (2004)
Article MathSciNet Google Scholar
Grade, V., Oliver: Multidimensional Access Methods. ACM Computing Surveys 30(2) (June 1998)
Google Scholar
Guttman, A.: R-Trees: Adynamic Index Structure for Spatial Searching. In: SIGMOD Conference 1984. ACM, Boston (1984)
Google Scholar
Oehler, K.L., Gray, R.M.: Combining Image Compression and Classification Using Vector Quantization. Ieee Transactions On Pattern Analysis And Machine Intelligence 17(5) (May 1995)
Google Scholar

Download references

Author information

Authors and Affiliations

School of Information Technology and Electrical Engineering, The University of Queensland, Brisbane, QLD 4072, Australia
Mohsin Ali, Xue Li & Zhao Yang Dong

Authors

Mohsin Ali
View author publications
You can also search for this author in PubMed Google Scholar
Xue Li
View author publications
You can also search for this author in PubMed Google Scholar
Zhao Yang Dong
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Information Technology and Electrical Engineering, University of Queensland, 4072, Australia
Marcus Gallagher
, POB 30031, FL 32503-1031, Pensacola
James P. Hogan
Faculty of Information Technology, Queensland University of Technology, Box 2434, Q 4001, Brisbane, Australia
Frederic Maire

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Ali, M., Li, X., Dong, Z.Y. (2005). Efficient Spatial Clustering Algorithm Using Binary Tree. In: Gallagher, M., Hogan, J.P., Maire, F. (eds) Intelligent Data Engineering and Automated Learning - IDEAL 2005. IDEAL 2005. Lecture Notes in Computer Science, vol 3578. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11508069_39

Download citation

DOI: https://doi.org/10.1007/11508069_39
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-26972-4
Online ISBN: 978-3-540-31693-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics