Skip to main content

Fast Search in Large-Scale Image Database Using Vector Quantization

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2728))

Abstract

Practical content-based image retrieval systems require efficient indexing schemes for fast searches. Researchers have proposed many methods using space and data partitioning for exact similarity searches. However, traditional indexing methods perform poorly and will degrade to simple sequential scans at high dimensionality — that is so-called “curse of dimensionality”. Recently, several filtering approaches based on vector approximation (VA) were proposed and showed promising performance. In fact, existing VA-based methods assume independent distribution of dataset and utilize scalar quantizer to partition each dimension of data space. In real databases, however, images are from different categories and often clustered. In this paper, a novel indexing method using vector quantization is proposed. This approach introduces a vector quantizer to partition data space. It assumes a Gaussian mixture distribution and estimates this distribution through Expectation-Maximization (EM) method. Experiments on a large database of 275,465 images demonstrated a remarkable improvement of retrieval efficiency.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Y. Rui, T. Huang, S. Chang: Image Retrieval: Current Techniques, Promising Directions and Open Issues. J. of Visual Communication and Image Representation, vol. 10, (1999) 1–23

    Article  Google Scholar 

  2. Y. Rui, T. S. Huang, M. Ortega, et al.: Relevance feedback: A power tool for interactive content-based image retrieval. IEEE Trans. on CSVT, no. 5, (1998) 644–655

    Google Scholar 

  3. J. Nievergelt, H. Hinterberger, K. Sevcik: The gridfile: An adaptable symmetric multikey file structure. ACM Transactions on Database Systems, vol. 9, no. 1, (1984) 38–71

    Article  Google Scholar 

  4. J. Robinson: The k-d-b-tree: A search structure for large multidimensional dynamic indexes. Proc. of the ACM SIGMOD ICMD (1981) 10–18

    Google Scholar 

  5. N. Beckmann, H. P. Kriegel, R. Schneider, et al.: The R*-tree: An efficient and robust access method for points and rectangles. Proc. ACM SIGMOD ICMD (1990) 322–331

    Google Scholar 

  6. N. Katayama, S. Satoh: The SR-tree: An index structure for high-dimensional nearest neighbor queries. Proc. ACM SIGMOD Int. Conf. Management of Data (1997) 369–380

    Google Scholar 

  7. R. Weber, H. Schek, S. Blott: A quantitative analysis and performance study for simi-larity-search methods in high-dimensional spaces. Proc. ACM VLDB (1998)

    Google Scholar 

  8. K. Beyer, J. Goldstein, R. Ramakrishnan: When Is ‘Nearest Neighbor’ Meaningful?. Proc. of the 7th International Conference on Database Theory, Jerusalem (1999) 217–235

    Google Scholar 

  9. D. W. Scott, Density Estimation, Wiley, New York (1992)

    MATH  Google Scholar 

  10. H. Ferhatosmanoglu, E. Tuncel, D. Agrawal: Vector Approximation based Indexing for Non-uniform High Dimensional Data Sets. ACM CKIM, McLean, (2000)

    Google Scholar 

  11. P. Wu, B. Manjunath, S. Chandrasekaran: An adaptive index structure for highdimensional similarity search. Proc. PCM, Beijing, China, (2001) 71–77

    Google Scholar 

  12. G.-H. Cha, X. Zhu, D. Petkovic, et al: An efficient indexing method for nearest neighbor searches in high-dimensional image databases. IEEE Trans. Multimedia, vol. 4, no. 1, (2002) 76–87

    Article  Google Scholar 

  13. A. Gersho, R. M. Gray: Vector Quantization and Signal Compression. Kluwer Academic (1992)

    Google Scholar 

  14. T. D. Lookabaugh, R. M. Gray: High-resolution Theory and the Vector Quantizer Advantage. IEEE Trans. On Information Theory, no. 35, (1989) 1020–1033

    Article  MathSciNet  Google Scholar 

  15. E. Forgy: Cluster analysis of multivariate data: Efficiency vs. interpretability of classifica-tions. Biometrics, vol. 21, no. 768, (1965)

    Google Scholar 

  16. A. P. Dempster, N. M. Laird, D. B. Rubin: Maximum likelihood from incomplete data via the EM algorithm. J. of the Royal Statistical Society B, vol. 39, no. 1, (1977) 1–38

    MATH  MathSciNet  Google Scholar 

  17. B. S. Manjunath, Aerial photo image database, http://vision.ece.ucsb.edu/datasets/

    Google Scholar 

  18. B. S. Manjunath, W. Y. Ma: Texture features for browsing and retrieval of image data. IEEE PAMI, vol. 18, no. 8, (1996) 837–842

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2003 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Ye, H., Xu, G. (2003). Fast Search in Large-Scale Image Database Using Vector Quantization. In: Bakker, E.M., Lew, M.S., Huang, T.S., Sebe, N., Zhou, X.S. (eds) Image and Video Retrieval. CIVR 2003. Lecture Notes in Computer Science, vol 2728. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45113-7_47

Download citation

  • DOI: https://doi.org/10.1007/3-540-45113-7_47

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-40634-1

  • Online ISBN: 978-3-540-45113-6

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics