skip to main content
10.1145/1282280.1282359acmconferencesArticle/Chapter ViewAbstractPublication PagescivrConference Proceedingsconference-collections
Article

Scalable near identical image and shot detection

Published:09 July 2007Publication History

ABSTRACT

This paper proposes and compares two novel schemes for near duplicate image and video-shot detection. The first approach is based on global hierarchical colour histograms, using Locality Sensitive Hashing for fast retrieval. The second approach uses local feature descriptors (SIFT) and for retrieval exploits techniques used in the information retrieval community to compute approximate set intersections between documents using a min-Hash algorithm.

The requirements for near-duplicate images vary according to the application, and we address two types of near duplicate definition: (i) being perceptually identical (e.g. up to noise, discretization effects, small photometric distortions etc); and (ii) being images of the same 3D scene (so allowing for viewpoint changes and partial occlusion). We define two shots to be near-duplicates if they share a large percentage of near-duplicate frames.

We focus primarily on scalability to very large image and video databases, where fast query processing is necessary. Both methods are designed so that only a small amount of data need be stored for each image. In the case of near-duplicate shot detection it is shown that a weak approximation to histogram matching, consuming substantially less storage, is sufficient for good results. We demonstrate our methods on the TRECVID 2006 data set which contains approximately 165 hours of video (about 17.8M frames with 146K key frames), and also on feature films and pop videos.

References

  1. M. Bertini, A. D. Bimbo, and W. Nunziati. Video clip matching using mpeg-7 descriptors and edit distance. In CIVR, pages 133--142, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. A. Broder. On the resemblance and containment of documents. In SEQS: Sequences '91, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. M. Datar, N. Immorlica, P. Indyk, and V. Mirrokni. Locality-sensitive hashing scheme based on p-stable distributions. In SCG, pages 253--262, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. J. Geusebroek, R. van den Boomgaard, A. Smeulders, and H. Geerts. Color invariance. PAMI, 23(12):1338--1350, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. M. Henzinger. Finding near-duplicate web pages: a large-scale evaluation of algorithms. In SIGIR '06, pages 284--291, New York, NY, USA, 2006. ACM Press. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. T. C. Hoad and J. Zobel. Fast video matching with signature alignment. In MIR, pages 262--269, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. P. Indyk. Stable distributions, pseudorandom generators, embeddings and data stream computation. In IEEE Symposium on Foundations of CS, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. A. Joly, O. Buisson, and C. Frélicot. Content-based copy detection using distortion-based probabilistic similarity search. IEEE Transactions on Multimedia, to appear, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. A. Joly, C. Frelicot, and O. Buisson. Robust content-based video copy identification in a large reference database. In Proc. CIVR, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Y. Ke, R. Sukthankar, and L. Huston. Efficient near-duplicate detection and sub-image retrieval. In ACM Multimedia, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. D. Lowe. Distinctive image features from scale-invariant keypoints. IJCV, 60(2):91--110, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. K. Mikolajczyk and C. Schmid. An affine invariant interest point detector. In Proc. ECCV. Springer-Verlag, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. K. Mikolajczyk and C. Schmid. A performance evaluation of local descriptors. In Proc. CVPR, 2003.Google ScholarGoogle ScholarCross RefCross Ref
  14. K. Mikolajczyk, T. Tuytelaars, C. Schmid, A. Zisserman, J. Matas, F. Schaffalitzky, T. Kadir, and L. Van Gool. A comparison of affine region detectors. IJCV, 65(1/2):43--72, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. D. Nister and H. Stewenius. Scalable recognition with a vocabulary tree. In Proc. CVPR, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. T. Quack, V. Ferrari, and L. Van Gool. Video mining with frequent itemset configurations. In Proc. CIVR, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. F. Schaffalitzky and A. Zisserman. Multi-view matching for unordered image sets, or "How do I organize my holiday snaps?". In Proc. ECCV, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. F. Schaffalitzky and A. Zisserman. Automated location matching in movies. CVIU, 92:236--264, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. J. Sivic and A. Zisserman. Video Google: A text retrieval approach to object matching in videos. In Proc. ICCV, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. TRECVID. http://trecvid.nist.gov/.Google ScholarGoogle Scholar
  21. Wikipedia. Come into my world. http://en.wikipedia.org/wiki/Come_Into_My_World.Google ScholarGoogle Scholar
  22. YouTube. http://www.youtube.com/.Google ScholarGoogle Scholar
  23. D. Zhang and S. Chang. Detecting image near-duplicate by stochastic attributed relational graph matching with learning. In ACM Multimedia, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. J. Zhou and X.-P. Zhang. Automatic identification of digital video based on shot-level sequence matching. In ACM MM, pages 515--518, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Scalable near identical image and shot detection

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in
          • Published in

            cover image ACM Conferences
            CIVR '07: Proceedings of the 6th ACM international conference on Image and video retrieval
            July 2007
            655 pages
            ISBN:9781595937339
            DOI:10.1145/1282280

            Copyright © 2007 ACM

            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 9 July 2007

            Permissions

            Request permissions about this article.

            Request Permissions

            Check for updates

            Qualifiers

            • Article

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader