Abstract
Product quantization is a widely used lossy compression technique that represents a very large number of quantization levels with a compact set of codebooks. It has been applied to cluster-based index structures, an approach termed product quantization-based indexing. In this paper, we propose a novel product quantization-based indexing method for approximate nearest neighbor search. Inspired by work on learning to rank, we present a ranking scheme that learns the relative weights of query-dependent features. The clusters in an index table are then ranked by relevance scores computed from these weighted features with respect to the query. We further present an approximate nearest neighbor search algorithm that integrates the proposed ranking scheme with the product quantization-based index structure. Experimental results on billion-scale datasets demonstrate the effectiveness of the proposed method and its superiority over several state-of-the-art methods.
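To make the first claim of the abstract concrete, the following is a minimal sketch of plain product quantization, not the indexing method proposed in the paper. It shows how M sub-codebooks of k codewords each yield k^M quantization levels while storing only M·k codewords, and how a query can be compared against a compressed vector through a per-subspace lookup table (asymmetric distance). The toy codebooks, the vectors, and all function names are illustrative assumptions; in practice the codebooks would be learned, for example by k-means on each subspace.

```python
def split(vec, m):
    """Split vec into m equal-length sub-vectors."""
    d = len(vec) // m
    return [vec[i * d:(i + 1) * d] for i in range(m)]


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length sequences."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def encode(vec, codebooks):
    """Return the PQ code: the nearest codeword index in each subspace."""
    subs = split(vec, len(codebooks))
    return [min(range(len(cb)), key=lambda j: sq_dist(s, cb[j]))
            for s, cb in zip(subs, codebooks)]


def decode(code, codebooks):
    """Reconstruct the quantized vector by concatenating codewords."""
    out = []
    for idx, cb in zip(code, codebooks):
        out.extend(cb[idx])
    return out


def adc_table(query, codebooks):
    """Per-subspace squared distances from the query to every codeword."""
    subs = split(query, len(codebooks))
    return [[sq_dist(s, c) for c in cb] for s, cb in zip(subs, codebooks)]


def adc_distance(code, table):
    """Asymmetric distance: sum one table entry per subspace."""
    return sum(table[i][idx] for i, idx in enumerate(code))


def capacity(m, k):
    """(number of quantization levels, number of stored codewords)."""
    return k ** m, m * k


# Toy, hand-written codebooks (assumption): 2 subspaces, 2 codewords each.
codebooks = [
    [[0.0, 0.0], [1.0, 1.0]],
    [[0.0, 1.0], [1.0, 0.0]],
]

vec = [0.9, 1.2, 0.1, 0.8]
code = encode(vec, codebooks)          # compact representation of vec
approx = decode(code, codebooks)       # lossy reconstruction

query = [1.0, 1.0, 0.0, 0.0]
table = adc_table(query, codebooks)    # built once per query
dist = adc_distance(code, table)       # no decoding needed at search time
```

The lookup table is computed once per query, so the distance to each compressed database vector costs only M table reads and additions. This is the property that makes product quantization attractive inside cluster-based index structures.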
Cite this article
Chiu, CY., Chiu, JS., Markchit, S. et al. Effective product quantization-based indexing for nearest neighbor search. Multimed Tools Appl 78, 2877–2895 (2019). https://doi.org/10.1007/s11042-018-6059-5
DOI: https://doi.org/10.1007/s11042-018-6059-5