Neighborhood graphs for indexing and retrieving multi-dimensional data

Hacid, Hakim; Yoshida, Tetsuya

doi:10.1007/s10844-009-0081-z

Neighborhood graphs for indexing and retrieving multi-dimensional data

Published: 10 March 2009

Volume 34, pages 93–111, (2010)
Cite this article

Journal of Intelligent Information Systems Aims and scope Submit manuscript

Hakim Hacid¹ &
Tetsuya Yoshida²

237 Accesses
5 Citations
3 Altmetric
Explore all metrics

Abstract

We propose a methodology based on a structure called neighborhood graphs for indexing and retrieving multi-dimensional data. In accordance with the increase of the quantity of data, it gets more and more important to process multi-dimensional data. Processing of data includes various tasks, for instance, mining, classifying, clustering, to name a few. However, to enable the effective processing of such multi-dimensional data, it is often necessary to locate each data precisely in the multi-dimensional space where the data reside so that each data can be effectively retrieved for processing. This amounts to solving the point location problem (neighborhood search) for multi-dimensional space. In this paper, in order to utilize the structure of neighborhood graphs as an indexing structure for multi-dimensional data, we propose the following: i) a local insertion and deletion method, and ii) an incremental neighborhood graph construction method. The first method enables to cope with the problem incurred from the updating of the graph. The second method realizes fast neighborhood graph construction from scratch, through the recursive application of the first method. Several experiments are conducted to evaluate the proposed approach, and the results indicate the effectiveness of our approach.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Grid-R-tree: a data structure for efficient neighborhood and nearest neighbor queries in data mining

Article 03 April 2020

A Novel High-Dimensional Index Method Based on the Mathematical Features

Application of the nonlinear dimensionality reduction method with the use of reference nodes for solving the problem of multidimensional data search

Article 01 July 2015

Notes

This structure is the foundation of the other multi-dimensional topological models.
Relations are calculated in the original space and the illustration is given in the bi dimensional space.

References

Anderson, E. (1935). The irises of the gaspé peninsula. Bulletin of the American Iris Society, 59, 2–5.
Google Scholar
Baeza-Yates, R., & Ribeiro-Neto, B. (1999). Modern information retrieval. Reading: Addison Wesley.
Google Scholar
Beckmann, N., Kriegel, H.-P., Schneider, R., & Seeger, B. (1990). The r*-tree: An efficient and robust access method for points and rectangles. In SIGMOD conference (pp. 322–331).
Bei, C.-D., Gray, R. M. (1985). An improvement of the minimum distortion encoding algorithm for vector quantization. IEEE Transactions on Communications, 33, 1132–1133.
Article Google Scholar
Bentley, J. L. (1975). Multidimensional binary search trees used for associative searching. Communications of the ACM, 18(9), 509–517.
Article MATH MathSciNet Google Scholar
Berchtold, S., Böhm, C., Keim, D. A., & Kriegel, H.-P. (1997). A cost model for nearest neighbor search in high-dimensional data space. In PODS ’97: Proceedings of the sixteenth ACM SIGACT-SIGMOD-SIGART symposium on principles of database systems (pp. 78–86). New York: ACM.
Chapter Google Scholar
Böhm, C., Berchtold, S., & Keim, D. A. (2001). Searching in high-dimensional spaces: Index structures for improving the performance of multimedia databases. ACM Computing Surveys, 33(3), 322–373.
Article Google Scholar
Breiman, L., Friedman, J. H., Olshen, R. A., & Stone, C. J. (1984). Classification and regression trees (pp. 43–49). Belmont, California: Wadsworth International Group.
MATH Google Scholar
Fisher, R. (1936). The use of multiple measurements in taxonomic problems. Annals of Eugenics, 7, 179–188.
Google Scholar
Friedman, J. H., Baskett, F., & Shustek, L. J. (1975). An algorithm for finding nearest neighbors. IEEE Transactions on Computers, 24(10), 1000–1006.
Article MATH Google Scholar
Gabriel, K. R., & Sokal, R. R. (1969). A new statistical approach to geographic variation analysis. Systematic Zoology, 18, 259–278.
Article Google Scholar
Gaede, V., & Günther, O. (1998a). Multidimensional access methods. ACM Computing Surveys, 30(2), 170–231.
Article Google Scholar
Gaede, V., & Günther, O. (1998b). Multidimensional access methods. ACM Computing Surveys, 30(2), 170–231.
Article Google Scholar
Guan, L., & Kamel, M. (1992). Equal-average hyperplane partitioning method for vector quantization of image data. Pattern Recognition Letters, 13(10), 693–699.
Article Google Scholar
Guttman, A. (1984). R-trees: A dynamic index structure for spatial searching. In SIGMOD conference (pp. 47–57).
Hacid, H. (2007). Neighborhood graphs for semi-automatic annotation of large image databases. In MMM (1) (pp. 586–595).
Hacid, H., & Zighed, D. (2006). Content-based image retrieval in large image databases. In IEEE international conference on granular computing (GrC 2006) (pp. 498–501). Atlanta, USA.
Hacid, H., & Zighed, D. A. (2005). An effective method for locally neighborhood graphs updating. In DEXA (pp. 930–939).
Henrich, A., Six, H.-W., & Widmayer, P. (1989). The lsd-tree: Spatial access to multidimensional point and nonpoint objects. In Proceedings 15th international conference on very large data bases. Amsterdam.
Hettich, S., Blake, C., & Merz, C. (1998). Uci repository of machine learning databases. http://www.ics.uci.edu/ mlearn/MLRepository.html.
Jaromczyk, J., & Toussaint, G. (1992). Relative neighborhood graphs and their relatives. P-IEEE, 80, 1502–1517.
Article Google Scholar
Katajainen, J. (1988). The region approach for computing relative neighborhood graphs in the lp metric. Computing, 40, 147–161.
Article MATH MathSciNet Google Scholar
Katayama, N., & Satoh, S. (1997). The sr-tree: An index structure for high-dimensional nearest neighbor queries. In SIGMOD conference (pp. 369–380).
Lee, C.-H., & Chen, L. H. (1994). Fast closest codeword search algorithm for vector quantisation. IEE Proceedings Vision, Image, and Signal Process, 141, 143–148.
Article Google Scholar
Lomet, D. B. (1992). A review of recent work on multi-attribute access methods. SIGMOD Record, 21(3), 56–63.
Article Google Scholar
Lomet, D. B., & Salzberg, B. (1989). A robust multi-attribute search structure. In ICDE (pp. 296–304).
MacQueen, J. B. (1967). Some methods for classification and analysis of multivariate observations. In Proceedings of 5-th Berkeley symposium on mathematical statistics and probability (Vol. 1, pp. 281–297). Berkeley: University of California Press.
Google Scholar
Preparata, F., & Shamos, M. I. (1985). Computationnal geometry-introduction. New York: Springer.
Google Scholar
Robinson, J. T. (1981). The k-d-b-tree: A search structure for large multidimensional dynamic indexes. In SIGMOD conference (pp. 10–18).
Samet, H. (1984). The quadtree and related hierarchical data structures. ACM Computing Surveys, 16(2), 187–260.
Article MathSciNet Google Scholar
Scuturici, M., Clech, J., Scuturici, V.-M., & Zighed, D. A. (2005). Topological representation model for image database query. Journal of Experimental and Theoretical Artificial Intelligence, 17(1–2), 145–160.
Article Google Scholar
Smith, W. D. (1989). Studies in computational geometry motivated by mesh generation. PhD thesis, Princeton University.
Somervuo, P., & Kohonen, T. (1999). Self-organizing maps and learning vector quantization for feature sequences. Neural Processing Letters, 10(2), 151–159.
Article Google Scholar
Toussaint, G. T. (1980). The relative neighborhood graphs in a finite planar set. Pattern Recognition, 12, 261–268.
Article MATH MathSciNet Google Scholar
Toussaint, G. T. (1991). Some insolved problems on proximity graphs. In D. W. Dearholt, & F. Harrary (Eds.), Proc. of the first workshop on proximity graphs. Memoranda in computer and cognitive science MCCS-91-224. Computing research lab. Las Cruces: State University Las Cruces.
Google Scholar
White, D. A., & Jain, R. (1996). Similarity indexing: Algorithms and performance. In Stor. and retr. for image and video DBs(SPIE) (pp. 62–73).
Wojna, A. (2005). Analogy-based reasoning in classifier construction, transactions on. In Rough sets IV, subseries of lecture notes in computer science, LNCS 3700 (pp. 277–374). New York: Springer.
Google Scholar
Yao, Y. Y. (2006). Neighborhood systems and approximate retrieval. Information Science, 176(23), 3431–3452.
Article MATH Google Scholar
Zighed, D., & Hacid, H. (2006). Proximity graphs and separability of classes. In The 11th international conference on information processing and management of uncertainty in knowledge-based systems (IPMU) (pp. 1488–1495). Paris, France.

Download references

Acknowledgements

This work was partially supported by Région Rhône Alpes under grant EMERGENCE 2004 and the grant-in-aid for scientific research (No. 20500123) funded by MEXT, Japan. The authors would like to thank the anonymous reviewers for their valuable comments that improved the paper.

Author information

Authors and Affiliations

Alcatel-Lucent Bell Labs, Centre de Villarceaux, Route de Villejust, 91620, Nozay, France
Hakim Hacid
Grad. School of Information Science and Technology, Hokkaido University, N-14 W-9, Sapporo, 060-0814, Japan
Tetsuya Yoshida

Authors

Hakim Hacid
View author publications
You can also search for this author in PubMed Google Scholar
Tetsuya Yoshida
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Hakim Hacid.

Additional information

This work has been mainly done when Hakim Hacid was a PhD student at the University of Lyon, France.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Hacid, H., Yoshida, T. Neighborhood graphs for indexing and retrieving multi-dimensional data. J Intell Inf Syst 34, 93–111 (2010). https://doi.org/10.1007/s10844-009-0081-z

Download citation

Received: 02 June 2008
Revised: 12 February 2009
Accepted: 13 February 2009
Published: 10 March 2009
Issue Date: February 2010
DOI: https://doi.org/10.1007/s10844-009-0081-z

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Neighborhood graphs for indexing and retrieving multi-dimensional data

Abstract

Access this article

Similar content being viewed by others

Grid-R-tree: a data structure for efficient neighborhood and nearest neighbor queries in data mining

A Novel High-Dimensional Index Method Based on the Mathematical Features

Application of the nonlinear dimensionality reduction method with the use of reference nodes for solving the problem of multidimensional data search

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Neighborhood graphs for indexing and retrieving multi-dimensional data

Abstract

Access this article

Similar content being viewed by others

Grid-R-tree: a data structure for efficient neighborhood and nearest neighbor queries in data mining

A Novel High-Dimensional Index Method Based on the Mathematical Features

Application of the nonlinear dimensionality reduction method with the use of reference nodes for solving the problem of multidimensional data search

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation