An Approach to Reshaping Clusters for Nearest Neighbor Search

Shi, Yong; Graham, Brian

doi:10.1007/978-3-642-32639-4_8

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 7435)

Included in the following conference series:

International Conference on Intelligent Data Engineering and Automated Learning

Abstract

In this paper, we present our research on similarity search and clustering problems. Similarity search defines a distance between each data point and a given query point Q, and aims to select, efficiently and effectively, the data points closest to Q. Clustering algorithms partition data points into groups such that points in the same group are highly similar and points in different groups are dissimilar. We explore the meaning of clusters from a new perspective and propose an approach that reshapes clusters based on K nearest neighbor search results. The reshaped clusters can help improve the performance of subsequent K nearest neighbor searches.
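To make the K nearest neighbor problem stated in the abstract concrete, here is a minimal brute-force sketch. It is not the reshaping approach proposed in the paper, which the abstract does not detail. The function name `knn_search`, the sample points, and the choice of Euclidean distance are illustrative assumptions; the abstract does not specify a distance measure.

```python
import heapq
import math
from typing import List, Sequence, Tuple


def knn_search(
    points: Sequence[Sequence[float]],
    query: Sequence[float],
    k: int,
) -> List[Tuple[float, int]]:
    """Return the k points closest to `query` as (distance, index) pairs.

    Assumption: Euclidean distance; any other distance function
    could be substituted for math.dist.
    """
    if k <= 0:
        return []
    # Compute the distance from every data point to the query point Q.
    distances = ((math.dist(p, query), i) for i, p in enumerate(points))
    # Keep only the k smallest distances (ties broken by index).
    return heapq.nsmallest(k, distances)


# Hypothetical data, used only to exercise the function.
points = [(0.0, 0.0), (1.0, 1.0), (5.0, 5.0), (0.5, 0.0)]
neighbors = knn_search(points, query=(0.0, 0.1), k=2)
nearest_indices = [index for _, index in neighbors]
```

This scan visits every data point for every query. Cluster-based methods such as the one the abstract describes aim to reduce that cost by restricting the search to promising groups of points.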

Author information

Authors and Affiliations

Department of Computer Science and Information Systems, Kennesaw State University, 1000 Chastain Road, Kennesaw, GA, 30144, USA
Yong Shi & Brian Graham

Editor information

Editors and Affiliations

School of Electrical and Electronic Engineering, The University of Manchester, M13 9PL, Manchester, UK
Hujun Yin
Department of Electrical Engineering, Federal University of Rio Grande do Norte, Lagoa Nova, 59072-970, Natal, RN, Brazil
José A. F. Costa
Department of Teleinformatics Engineering, Federal University of Ceará, Campus of Pici, CP 6005, 60455-760, Fortaleza, CE, Brazil
Guilherme Barreto

About this paper

Cite this paper

Shi, Y., Graham, B. (2012). An Approach to Reshaping Clusters for Nearest Neighbor Search. In: Yin, H., Costa, J.A.F., Barreto, G. (eds) Intelligent Data Engineering and Automated Learning - IDEAL 2012. IDEAL 2012. Lecture Notes in Computer Science, vol 7435. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-32639-4_8

DOI: https://doi.org/10.1007/978-3-642-32639-4_8
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-32638-7
Online ISBN: 978-3-642-32639-4
eBook Packages: Computer Science, Computer Science (R0)
