
Encoding Spatial Context for Large-Scale Partial-Duplicate Web Image Retrieval

  • Regular Paper
Journal of Computer Science and Technology

Abstract

Many recent state-of-the-art image retrieval approaches are based on the Bag-of-Visual-Words model, which represents an image as a set of visual words obtained by quantizing local SIFT (scale-invariant feature transform) features. Feature quantization reduces the discriminative power of local features and unavoidably causes many false local matches between images, which degrades retrieval accuracy. To filter out these false matches, the geometric context among visual words has been widely explored for verifying geometric consistency. However, existing approaches based on global or local geometric verification are either computationally expensive or achieve limited accuracy. To address this issue, we focus on partial-duplicate Web image retrieval and propose a scheme that encodes the spatial context for visual matching verification. An efficient affine enhancement scheme is also proposed to refine the verification results. Experiments on partial-duplicate Web image search, using a database of one million images, demonstrate the effectiveness and efficiency of the proposed approach. Evaluation on a 10-million-image database further demonstrates the scalability of our approach.
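The quantization step mentioned in the abstract, and the false matches it can cause, can be illustrated with a minimal sketch. This is not the authors' implementation: the toy 2-D descriptors, the two-word codebook, and the helper names `quantize` and `word_matches` are all assumptions made for illustration (real SIFT descriptors are 128-dimensional and codebooks are far larger).

```python
import numpy as np

def quantize(descriptors, codebook):
    """Assign each descriptor to the index of its nearest codebook centre."""
    # Pairwise squared Euclidean distances, shape (n_descriptors, n_words).
    d2 = ((descriptors[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=2)
    return d2.argmin(axis=1)

def word_matches(words_a, words_b):
    """Pair up features from two images that share a visual word."""
    return [(i, j)
            for i, wa in enumerate(words_a)
            for j, wb in enumerate(words_b)
            if wa == wb]

# Hypothetical toy data: 2-D "descriptors" and a 2-word codebook.
codebook = np.array([[0.0, 0.0], [10.0, 10.0]])
image_a = np.array([[0.5, 0.2], [9.0, 9.5]])
image_b = np.array([[3.0, 3.0], [10.5, 9.8]])

words_a = quantize(image_a, codebook)
words_b = quantize(image_b, codebook)
matches = word_matches(words_a, words_b)
print(matches)
```

In this toy setup the descriptors `[0.5, 0.2]` and `[3.0, 3.0]` are not especially close, yet both fall into word 0 and are therefore reported as a match. Matches of this kind are what a geometric verification stage is meant to filter out.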


Author information

Corresponding author

Correspondence to Wen-Gang Zhou.

Additional information

Dr. Wen-Gang Zhou was supported in part by the Fundamental Research Funds for the Central Universities of China under Grant Nos. WK2100060014 and WK2100060011, the Start-Up Funding from the University of Science and Technology of China under Grant No. KY2100000036, the Open Project of Beijing Multimedia and Intelligent Software Key Laboratory in Beijing University of Technology, and sponsorship from the Intel ICRI MNC project. Dr. Hou-Qiang Li was supported in part by the National Natural Science Foundation of China (NSFC) under Grant Nos. 61325009, 61390514, and 61272316. Dr. Yijuan Lu was supported in part by the Army Research Office (ARO) of USA under Grant No. W911NF-12-1-0057 and the National Science Foundation of USA under Grant No. CRI 1305302. Dr. Qi Tian was supported in part by ARO under Grant No. W911NF-12-1-0057 and the Faculty Research Award by NEC Laboratories of America. This work was also supported in part by NSFC under Grant No. 61128007.

Electronic supplementary material

Below is the link to the electronic supplementary material.

ESM 1 (PDF 129 kb)

About this article

Cite this article

Zhou, WG., Li, HQ., Lu, Y. et al. Encoding Spatial Context for Large-Scale Partial-Duplicate Web Image Retrieval. J. Comput. Sci. Technol. 29, 837–848 (2014). https://doi.org/10.1007/s11390-014-1472-3

  • DOI: https://doi.org/10.1007/s11390-014-1472-3
