Skip to main content

Online Optimized Product Quantization for ANN Queries over Dynamic Database using SVD-Updating

  • Chapter
  • First Online:
Transactions on Large-Scale Data- and Knowledge-Centered Systems LII

Part of the book series: Lecture Notes in Computer Science ((TLDKS,volume 13470))

  • 197 Accesses

Abstract

Approximate nearest neighbor (ANN) search allows us to perform similarity search over massive vectors with less memory and computation. Optimized Product Quantization (OPQ) is one of the state-of-the-art methods for ANN where data vectors are represented as combinations of codewords by taking into account the data distribution. However, it suffers from degradation in accuracy when the database is frequently updated with incoming data whose distribution is different. An existing work, Online OPQ, addressed this problem, but the computational cost is high because it requires to perform of costly singular value decomposition for updating the codewords. To this problem, we propose a method for updating the rotation matrix using SVD-Updating, which can dynamically update the singular matrix using low-rank approximation. Using SVD-Updating, instead of performing multiple singular value decomposition on a high-rank matrix, we can update the rotation matrix by performing only one singular value decomposition on a low-rank matrix. In the experiments, we prove that the proposed method shows a better trade-off between update time and retrieval accuracy than the comparative methods.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Babenko, A., Lempitsky, V.: Additive quantization for extreme vector compression. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 931–938 (2014)

    Google Scholar 

  2. Babenko, A., Lempitsky, V.: Tree quantization for large-scale similarity search and classification. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4240–4248 (2015)

    Google Scholar 

  3. Berry, M.W., Dumais, S.T., O’Brien, G.W.: Using linear algebra for intelligent information retrieval. SIAM Rev. 37(4), 573–595 (1995)

    Article  MathSciNet  Google Scholar 

  4. Datar, M., Immorlica, N., Indyk, P., Mirrokni, V.S.: Locality-sensitive hashing scheme based on p-stable distributions. In: Proceedings of the Twentieth Annual Symposium on Computational Geometry, pp. 253–262 (2004)

    Google Scholar 

  5. Dong, A., Bhanu, B.: Concept learning and transplantation for dynamic image databases. In: Proceedings of 2003 International Conference on Multimedia and Expo, ICME 2003, (Cat. No. 03TH8698), vol. 1, pp. I-765. IEEE (2003)

    Google Scholar 

  6. Fei-Fei, L., Fergus, R., Perona, P.: Learning generative visual models from few training examples: an incremental bayesian approach tested on 101 object categories. In: 2004 Conference on Computer Vision and Pattern Recognition Workshop, pp. 178–178. IEEE (2004)

    Google Scholar 

  7. Ge, T., He, K., Ke, Q., Sun, J.: Optimized product quantization. IEEE Trans. Pattern Anal. Mach. intell. 36(4), 744–755 (2013)

    Article  Google Scholar 

  8. Gray, R.: Vector quantization. IEEE Assp Mag. 1(2), 4–29 (1984)

    Article  Google Scholar 

  9. Jegou, H., Douze, M., Schmid, C.: Product quantization for nearest neighbor search. IEEE Trans. Pattern Anal. Mach. Intell. 33(1), 117–128 (2010)

    Article  Google Scholar 

  10. Kalantidis, Y., Avrithis, Y.: Locally optimized product quantization for approximate nearest neighbor search. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2321–2328 (2014)

    Google Scholar 

  11. Lang, K.: Newsweeder: learning to filter netnews. In: Machine Learning Proceedings 1995, pp. 331–339. Elsevier (1995)

    Google Scholar 

  12. Liu, C., Lian, D., Nie, M., Xia, H.: Online optimized product quantization. In: 2020 IEEE International Conference on Data Mining (ICDM), pp. 362–371. IEEE (2020)

    Google Scholar 

  13. Moffat, A., Zobel, J., Sharman, N.: Text compression for dynamic document databases. IEEE Trans. Knowl. Data Eng. 9(2), 302–313 (1997)

    Article  Google Scholar 

  14. Norouzi, M., Fleet, D.J.: Cartesian k-means. In: Proceedings of the IEEE Conference on computer Vision and Pattern Recognition, pp. 3017–3024 (2013)

    Google Scholar 

  15. Schönemann, P.H.: A generalized solution of the orthogonal procrustes problem. Psychometrika 31(1), 1–10 (1966)

    Article  MathSciNet  Google Scholar 

  16. Silpa-Anan, C., Hartley, R.: Optimised kd-trees for fast image descriptor matching. In: 2008 IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–8. IEEE (2008)

    Google Scholar 

  17. Wang, J., Wang, J., Song, J., Xu, X.S., Shen, H.T., Li, S.: Optimized cartesian k-means. IEEE Trans. Knowl. Data Eng. 27(1), 180–192 (2014)

    Article  Google Scholar 

  18. Wang, X., Zhang, T., Qi, G.J., Tang, J., Wang, J.: Supervised quantization for similarity search. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2018–2026 (2016)

    Google Scholar 

  19. Xiao, J., Hays, J., Ehinger, K.A., Oliva, A., Torralba, A.: Sun database: large-scale scene recognition from abbey to zoo. In: 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 3485–3492. IEEE (2010)

    Google Scholar 

  20. Xu, D., Tsang, I.W., Zhang, Y.: Online product quantization. IEEE Trans. Knowl. Data Eng. 30(11), 2185–2198 (2018)

    Google Scholar 

  21. Zhang, T., Du, C., Wang, J.: Composite quantization for approximate nearest neighbor search. In: International Conference on Machine Learning, pp. 838–846. PMLR (2014)

    Google Scholar 

  22. Zhang, T., Qi, G.J., Tang, J., Wang, J.: Sparse composite quantization. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4548–4556 (2015)

    Google Scholar 

  23. Zhang, T., Wang, J.: Collaborative quantization for cross-modal similarity search. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2036–2045 (2016)

    Google Scholar 

Download references

Acknowledgments

This paper was supported by Japan Society for the Promotion of Science (JSPS) KAKENHI under Grant Number JP22H03694 and the New Energy and Industrial Technology Development Organization (NEDO) Grant Number JPNP20006.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Toshiyuki Amagasa .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 Springer-Verlag GmbH Germany, part of Springer Nature

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Yukawa, K., Amagasa, T. (2022). Online Optimized Product Quantization for ANN Queries over Dynamic Database using SVD-Updating. In: Hameurlain, A., Tjoa, A.M. (eds) Transactions on Large-Scale Data- and Knowledge-Centered Systems LII. Lecture Notes in Computer Science(), vol 13470. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-66146-8_4

Download citation

  • DOI: https://doi.org/10.1007/978-3-662-66146-8_4

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-662-66145-1

  • Online ISBN: 978-3-662-66146-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics