Abstract
Approximate nearest neighbor (ANN) search allows us to perform similarity search over massive vectors with less memory and computation. Optimized Product Quantization (OPQ) is one of the state-of-the-art methods for ANN where data vectors are represented as combinations of codewords by taking into account the data distribution. However, it suffers from degradation in accuracy when the database is frequently updated with incoming data whose distribution is different. An existing work, Online OPQ, addressed this problem, but the computational cost is high because it requires to perform of costly singular value decomposition for updating the codewords. To this problem, we propose a method for updating the rotation matrix using SVD-Updating, which can dynamically update the singular matrix using low-rank approximation. Using SVD-Updating, instead of performing multiple singular value decomposition on a high-rank matrix, we can update the rotation matrix by performing only one singular value decomposition on a low-rank matrix. In the experiments, we prove that the proposed method shows a better trade-off between update time and retrieval accuracy than the comparative methods.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Babenko, A., Lempitsky, V.: Additive quantization for extreme vector compression. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 931–938 (2014)
Babenko, A., Lempitsky, V.: Tree quantization for large-scale similarity search and classification. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4240–4248 (2015)
Berry, M.W., Dumais, S.T., O’Brien, G.W.: Using linear algebra for intelligent information retrieval. SIAM Rev. 37(4), 573–595 (1995)
Datar, M., Immorlica, N., Indyk, P., Mirrokni, V.S.: Locality-sensitive hashing scheme based on p-stable distributions. In: Proceedings of the Twentieth Annual Symposium on Computational Geometry, pp. 253–262 (2004)
Dong, A., Bhanu, B.: Concept learning and transplantation for dynamic image databases. In: Proceedings of 2003 International Conference on Multimedia and Expo, ICME 2003, (Cat. No. 03TH8698), vol. 1, pp. I-765. IEEE (2003)
Fei-Fei, L., Fergus, R., Perona, P.: Learning generative visual models from few training examples: an incremental bayesian approach tested on 101 object categories. In: 2004 Conference on Computer Vision and Pattern Recognition Workshop, pp. 178–178. IEEE (2004)
Ge, T., He, K., Ke, Q., Sun, J.: Optimized product quantization. IEEE Trans. Pattern Anal. Mach. intell. 36(4), 744–755 (2013)
Gray, R.: Vector quantization. IEEE Assp Mag. 1(2), 4–29 (1984)
Jegou, H., Douze, M., Schmid, C.: Product quantization for nearest neighbor search. IEEE Trans. Pattern Anal. Mach. Intell. 33(1), 117–128 (2010)
Kalantidis, Y., Avrithis, Y.: Locally optimized product quantization for approximate nearest neighbor search. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2321–2328 (2014)
Lang, K.: Newsweeder: learning to filter netnews. In: Machine Learning Proceedings 1995, pp. 331–339. Elsevier (1995)
Liu, C., Lian, D., Nie, M., Xia, H.: Online optimized product quantization. In: 2020 IEEE International Conference on Data Mining (ICDM), pp. 362–371. IEEE (2020)
Moffat, A., Zobel, J., Sharman, N.: Text compression for dynamic document databases. IEEE Trans. Knowl. Data Eng. 9(2), 302–313 (1997)
Norouzi, M., Fleet, D.J.: Cartesian k-means. In: Proceedings of the IEEE Conference on computer Vision and Pattern Recognition, pp. 3017–3024 (2013)
Schönemann, P.H.: A generalized solution of the orthogonal procrustes problem. Psychometrika 31(1), 1–10 (1966)
Silpa-Anan, C., Hartley, R.: Optimised kd-trees for fast image descriptor matching. In: 2008 IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–8. IEEE (2008)
Wang, J., Wang, J., Song, J., Xu, X.S., Shen, H.T., Li, S.: Optimized cartesian k-means. IEEE Trans. Knowl. Data Eng. 27(1), 180–192 (2014)
Wang, X., Zhang, T., Qi, G.J., Tang, J., Wang, J.: Supervised quantization for similarity search. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2018–2026 (2016)
Xiao, J., Hays, J., Ehinger, K.A., Oliva, A., Torralba, A.: Sun database: large-scale scene recognition from abbey to zoo. In: 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 3485–3492. IEEE (2010)
Xu, D., Tsang, I.W., Zhang, Y.: Online product quantization. IEEE Trans. Knowl. Data Eng. 30(11), 2185–2198 (2018)
Zhang, T., Du, C., Wang, J.: Composite quantization for approximate nearest neighbor search. In: International Conference on Machine Learning, pp. 838–846. PMLR (2014)
Zhang, T., Qi, G.J., Tang, J., Wang, J.: Sparse composite quantization. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4548–4556 (2015)
Zhang, T., Wang, J.: Collaborative quantization for cross-modal similarity search. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2036–2045 (2016)
Acknowledgments
This paper was supported by Japan Society for the Promotion of Science (JSPS) KAKENHI under Grant Number JP22H03694 and the New Energy and Industrial Technology Development Organization (NEDO) Grant Number JPNP20006.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 Springer-Verlag GmbH Germany, part of Springer Nature
About this chapter
Cite this chapter
Yukawa, K., Amagasa, T. (2022). Online Optimized Product Quantization for ANN Queries over Dynamic Database using SVD-Updating. In: Hameurlain, A., Tjoa, A.M. (eds) Transactions on Large-Scale Data- and Knowledge-Centered Systems LII. Lecture Notes in Computer Science(), vol 13470. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-66146-8_4
Download citation
DOI: https://doi.org/10.1007/978-3-662-66146-8_4
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-66145-1
Online ISBN: 978-3-662-66146-8
eBook Packages: Computer ScienceComputer Science (R0)