Compressing the Index - A Simple and yet Efficient Approximation Approach to High-Dimensional Indexing

Wang, Shuguang; Yu, Cui; Ooi, Beng Chin

doi:10.1007/3-540-47714-4_27

Shuguang Wang⁷,
Cui Yu⁷ &
Beng Chin Ooi⁷

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2118))

Included in the following conference series:

International Conference on Web-Age Information Management

324 Accesses

Abstract

An efficient tunable high-dimensional indexing scheme called the iMinMax(θ) was proposed to map high-dimensional data points into single dimension value based on the minimum or maximum values among all dimensions [7]. Unfortunately, the number of leaf nodes needs to be scanned remains large. To reduce the number of leaf nodes, we propose to use the compression technique proposed in the Vector Approximation File (VA-file) [10] to represent vectors. We call the hybrid method, the iMinMax(θ)*. While the marriage is straight forward, the gain in performance is significant. In our extensive performance study, the results clearly indicate that iMinMax(θ)* outperforms the original iMinMax(θ) index scheme and the VA-file. iMinMax(θ)* is also attractive from a practical view point for its implementation cost is only slightly higher than that of the original iMinMax(θ). The approximation concept that is incorporated in iMinMax(θ)* can be integrated in other high-dimensional index structures without much difficulty.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

S. Berchtold, D.A. Keim, H.P. Kriegel:The X-tree: an index structure for high-dimensional data. Proc. Very Large Data Bases VLDB’96 (1996) 23–27.
Google Scholar
N. Beckmann, H-P. Kriegel R. Schneider, B. Seeger: The R*-tree, An efficient and robust access method for points and rectangles. Proc. ACM SIGMOD Int. Conf. On Management of Data SIGMOD’90 (1990) 322–331.
Google Scholar
R. Finkel and J. Bentley: Quad-trees: A data structure for retrieval on composite keys. ACTA Information (1974) 1–9.
Google Scholar
A. Guttman:R-tree: A dynamic index structure for spatial searching. Proc. ACM SIGMOD Int. Conf. On Management of Data SIGMOD’84 (1984) 47–54.
Google Scholar
J. Nievergelt, H. Hinterberger, and K. Sevcik:The grid file: An adaptable symmetric multikey file structure. ACM Transactions on Database Systems 1984 38–71.
Google Scholar
B. C. Ooi: Efficient query processing in geographical information system. Lecture Notes in Computer Science #471, Springer-Verlag, 1990.
Google Scholar
B.C. Ooi, K.L. Tan, C. Yu, and S. Bressan:Indexing the Edges-A simple and yet efficient approach to high-dimensional indexing. Proc. ACM SIGMOD-SIGACT-SIGART 19th Symposium on Principles of Database Systems PODS’2000 (2000) 166–174.
Google Scholar
J. Robinson: The k-d-b tree: A search structure for large multidimensional dynamic indexes. Proc. ACM SIGMOD Int. Conf. On Management of Data (1981) 10–18.
Google Scholar
R. Weber and S. Blott. An approximation based data structure for similarity search. Technical Report 24, ESPRIT project HERMES (no. 9141) (1997)
Google Scholar
R. Weber, Hans-J. Schek, and S. Blott: A quantitative analysis and performance study for similarity-search methods in high-dimensional spaces. Proc. Int. Conf. Very Large Data Bases VLDB’98 (1998) 194–205.
Google Scholar
C. Yu: High-dimensional indexing. PhD Thesis. National University of Singapore (2001)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, National University of Singapore, 3 Science Drive 2, Singapore, 117543
Shuguang Wang, Cui Yu & Beng Chin Ooi

Authors

Shuguang Wang
View author publications
You can also search for this author in PubMed Google Scholar
Cui Yu
View author publications
You can also search for this author in PubMed Google Scholar
Beng Chin Ooi
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Information and Software Engineering, George Mason University, Fairfax, VA, 22030-4444, USA
X. Sean Wang
Department of Computer Science and Engineering, Northeastern University, Shenyang, 110004, China
Ge Yu
Department of Computer Science, Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong, China
Hongjun Lu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Wang, S., Yu, C., Ooi, B.C. (2001). Compressing the Index - A Simple and yet Efficient Approximation Approach to High-Dimensional Indexing. In: Wang, X.S., Yu, G., Lu, H. (eds) Advances in Web-Age Information Management. WAIM 2001. Lecture Notes in Computer Science, vol 2118. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-47714-4_27

Download citation

DOI: https://doi.org/10.1007/3-540-47714-4_27
Published: 28 June 2001
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-42298-3
Online ISBN: 978-3-540-47714-3
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics