Skip to main content
Log in

Efficient near-duplicate image detection with a local-based binary representation

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Efficient near-duplicate image detection is important for several applications that feature extraction and matching need to be taken online. Most image representations targeting at conventional image retrieval problems are either computationally expensive to extract and match, or limited in robustness. Aiming at this problem, in this paper, we propose an effective and efficient local-based representation method to encode an image as a binary vector, which is called Local-based Binary Representation (LBR). Local regions are extracted densely from the image, and each region is converted to a simple and effective feature describing its texture. A statistical histogram can be calculated over all the local features, and then it is encoded to a binary vector as the holistic image representation. The proposed binary representation jointly utilizes the local region texture and global visual distribution of the image, based on which a similarity measure can be applied to detect near-duplicate image effectively. The binary encoding scheme can not only greatly speed up the online computation, but also reduce memory cost in real applications. In experiments the precision and recall, as well as computational time of the proposed method are compared with other state-of-the-art image representations and LBR shows clear advantages on online near-duplicate image detection and video keyframe detection tasks.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11

Similar content being viewed by others

References

  1. Bao BK, Zhu G, Shen J, Yan S (2013) General subspace learning with corrupted training data via graph embedding. IEEE Trans Image Process 22(11):4380–4393

    Article  MathSciNet  Google Scholar 

  2. Bao BK, Zhu G, Shen J, Yan S (2013) Robust image analysis with sparse representation on quantized visual features. IEEE Trans Image Process 22(3):860–871

    Article  MathSciNet  Google Scholar 

  3. Bay H, Ess A, Tuytelaars T, Van Gool L (2008) Speeded-up robust features (surf). Comp Vision Image Underst 110(3):346–359

    Article  Google Scholar 

  4. Chang EY, Wang JZ, Li C, Wiederhold G (1998) Rime: A replicated image detector for the world wide web. In: Photonics East (ISAM, VVDC, IEMB), pp 58–67. International Society for Optics and Photonics

  5. Choi Y, Park C, Lee JY, Kweon IS (2014) Robust binary feature using intensity order. In: The 12th Asian Conference on Computer Vision (ACCV)

  6. Chum O, Philbin J, Zisserman A (2008) Near duplicate image detection: min-hash and tf-idf weighting. In: BMVC, vol 810, pp 812–815

  7. Crow FC (1984) Summed-area tables for texture mapping. ACM SIGGRAPH Comput Graph 18(3):207–212

    Article  Google Scholar 

  8. Hamming RW (1950) Error detecting and error correcting codes. Bell Syst Tech J 29(2):147–160

    Article  MathSciNet  Google Scholar 

  9. Ke Y, Sukthankar R, Huston L (2004) An efficient parts-based near-duplicate and sub-image retrieval system. In: Proceedings of the 12th annual ACM international conference on multimedia, pp 869–876. ACM

  10. Kim C (2003) Content-based image copy detection. Signal Process Image Commun 18(3):169–184

    Article  Google Scholar 

  11. Krawetz N (2011) Image indexing. http://www.hackerfactor.com/blog/index.php?/archives/432-Looks-Like-It.html

  12. Law-To J, Chen L, Joly A, Laptev I, Buisson O, Gouet-Brunet V, Boujemaa N, Stentiford F (2007) Video copy detection: a comparative study. In: Proceedings of the 6th ACM international conference on image and video retrieval, pp 371–378. ACM

  13. Leutenegger S, Chli M, Siegwart RY (2011) Brisk: binary robust invariant scalable keypoints. In: IEEE international conference on computer vision (ICCV), 2011, pp 2548–2555. IEEE

  14. Li T, Nian F, Wu X, Gao Q, Lu Y (2014) Efficient video copy detection using multi-modality and dynamic path search. Multimedia Systems 1–11

  15. Li Z, Feng X (2013) Near duplicate image detecting algorithm based on bag of visual word model. J Multimedia 8(5):557–564

    Google Scholar 

  16. Liu B, Li Z, Wang M (2010) Efficient video duplicate detection via compact curve matching. In: IEEE international conference on multimedia and expo (ICME), 2010, pp 100–105. IEEE

  17. Lowe DG (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vis 60(2):91–110

    Article  Google Scholar 

  18. Meng Y, Chang E, Li B (2003) Enhancing dpf for near-replica image recognition. In: Proceedings of 2003 IEEE computer society conference on computer vision and pattern recognition, 2003, vol 2, pp II–416. IEEE

  19. Mishra P, Sonam M, Vijayalakshmi MS (2014) Content based image retrieval using clustering technique: a survey. Int J Res Comput Eng Electron 3(2)

  20. Nister D, Stewenius H (2006) Scalable recognition with a vocabulary tree. In: IEEE computer society conference on computer vision and pattern recognition, 2006, vol 2, pp 2161–2168. IEEE

  21. Ojala T, Pietikainen M, Maenpaa T (2002) Multiresolution gray-scale and rotation invariant texture classification with local binary patterns. IEEE Trans Pattern Anal Mach Intell 24(7):971–987

    Article  MATH  Google Scholar 

  22. Philbin J, Chum O, Isard M, Sivic J, Zisserman A (2007) Object retrieval with large vocabularies and fast spatial matching. In: Proceedings of the IEEE conference on computer vision and pattern recognition

  23. Rublee E, Rabaud V, Konolige K, Bradski G (2011) Orb: an efficient alternative to sift or surf. In: IEEE international conference on computer vision (ICCV), 2011, pp 2564–2571. IEEE

  24. Shang L, Yang L, Wang F, Chan K.P, Hua X.S (2010) Real-time large scale near-duplicate web video retrieval. In: Proceedings of the international conference on multimedia, pp 531–540. ACM

  25. Sivic J, Zisserman A (2003) Video google: a text retrieval approach to object matching in videos. In: Proceedings of 9th IEEE international conference on computer vision, 2003, pp 1470–1477. IEEE

  26. Swain M.J, Ballard D.H (1991) Color indexing. Int J Comput Vis 7(1):11–32

    Article  Google Scholar 

  27. Takala V, Ahonen T, Pietikäinen M (2005) Block-based methods for image retrieval using local binary patterns. In: Image analysis, pp 882–891. Springer

  28. Thomee B, Huiskes M.J, Bakker E, Lew M.S (2008) Large scale image copy detection evaluation. In: Proceedings of the 1st ACM international conference on multimedia information retrieval, pp 59–66. ACM

  29. Wang M, Li H, Tao D, Lu K, Wu X (2012) Multimodal graph-based reranking for web image search. IEEE Trans Image Process 21(11):4649–4661

    Article  MathSciNet  Google Scholar 

  30. Wang M, Ni B, Hua X.S, Chua T.S (2012) Assistive tagging: a survey of multimedia tagging with human-computer joint exploration. ACM Comput Surv (CSUR) 44(4):25

    Article  Google Scholar 

  31. Wu X, Hauptmann A.G, Ngo C.W (2007) Practical elimination of near-duplicates from web video search. In: Proceedings of the 15th international conference on multimedia, pp 218–227. ACM

  32. Wu X, Ngo C.W, Hauptmann A.G, Tan H.K (2009) Real-time near-duplicate elimination for web video search with content and context. IEEE Trans Multimedia 11(2):196–207

    Article  Google Scholar 

  33. Xin Y, Qiang Z, Kwang-Ting C (2009) Near-duplicate detection for images and videos. In: In 1st ACM workshop on large scale multimedia retrieval and mining, pp 73–80. ACM

  34. Yang C, Peng J, Fan J (2013) Speed-up multi-modal near duplicate image detection. Open J Appl Sci 3:16–21

    Article  Google Scholar 

  35. Yang J, Jiang Y.G, Hauptmann A.G, Ngo C.W (2007) Evaluating bag-of-visual-words representations in scene classification. In: Proceedings of ACM SIGMM workshop on multimedia information retrieval

  36. Zhang S, Tian Q, Huang Q, Gao W, Rui Y (2014) Usb: Ultra short binary descriptor for fast visual matching and retrieval. Image Process

Download references

Acknowledgments

This work is supported by the National Natural Science Foundation (NSF) of China (No. 61300056), the Ph.D. Programs Foundation of Ministry of Education of China (No. 20133401120005), the Anhui Provincial Natural Science Foundation of China (No. 1408085QF118), the Open Project Program of the National Laboratory of Pattern Recognition (NLPR) (No. 201306282) and a grant from Shenzhen Science and Technology Project (No. ZDS Y20120617113312191).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Teng Li.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Nian, F., Li, T., Wu, X. et al. Efficient near-duplicate image detection with a local-based binary representation. Multimed Tools Appl 75, 2435–2452 (2016). https://doi.org/10.1007/s11042-015-2472-1

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-015-2472-1

Keywords

Navigation