Skip to main content

Automated Scene Matching in Movies

  • Conference paper
  • First Online:
Image and Video Retrieval (CIVR 2002)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2383))

Included in the following conference series:

Abstract

We describe progress in matching shots which are images of the same 3D scene in a film. The problem is hard because the camera viewpoint may change substantially between shots, with consequent changes in the imaged appearance of the scene due to foreshortening, scale changes and partial occlusion.

We demonstrate that wide baseline matching techniques can be successfully employed for this task by matching key frames between shots. The wide baseline method represents each frame by a set of viewpoint invariant local feature vectors. The local spatial support of the features means that segmentation of the frame (e.g. into foreground/background) is not required, and partial occlusion is tolerated.

Results of matching shots for a number of different scene types are illustrated on a commercial film.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. A. Baumberg. Reliable feature matching across widely separated views. In Proc. CVPR, pages 774–781, 2000.

    Google Scholar 

  2. M. Gelgon and P. Bouthemy. Determining a structured spatio-temporal representation of video content for efficient visualization and indexing. In Proc. ECCV, pages 595–609, 1998.

    Google Scholar 

  3. R. I. Hartley and A. Zisserman. Multiple View Geometry in Computer Vision. Cambridge University Press, ISBN: 0521623049, 2000.

    Google Scholar 

  4. H. Lee, A. Smeaton, N. Murphy, N. O’Conner, and S. Marlow. User interface design for keyframe-based browsing of digital video. In Workshop on Image Analysis for Multimedia Interactive Services. Tampere, Finland, 16-17 May, 2001.

    Google Scholar 

  5. R. Lienhart. Reliable transition detection in videos: A survey and practitioner’s guide. International Journal of Image and Graphics, Aug 2001.

    Google Scholar 

  6. T. Lindeberg and J. Gårding. Shape-adapted smoothing in estimation of 3-d depth cues from affine distortions of local 2-d brightness structure. In Proc. ECCV, LNCS 800, pages 389–400, May 1994.

    Google Scholar 

  7. J. Matas, J. Burianek, and J. Kittler. Object recognition using the invariant pixel-set signature. In Proc. BMVC., pages 606–615, 2000.

    Google Scholar 

  8. J. Matas, O. Chum, M. Urban, and T. Pajdla. Distinguished regions for wide-baseline stereo. Research Report CTU-CMP-2001-33, Center for Machine Perception, K333 FEE Czech Technical University, Prague, Czech Republic, November 2001.

    Google Scholar 

  9. J. Matas, M. Urban, and T. Pajdla. Unifying view for wide-baseline stereo. In B Likar, editor, Proc. Computer Vision Winter Workshop, pages 214–222, Ljubljana, Sloveni, February 2001. Slovenian Pattern Recorgnition Society.

    Google Scholar 

  10. K. Mikolajczyk and C. Schmid. Indexing based on scale invariant interest points. In Proc. ICCV, 2001.

    Google Scholar 

  11. K. Mikolajczyk and C. Schmid. An affine invariant interest point detector. In Proc. ECCV. Springer-Verlag, 2002.

    Google Scholar 

  12. P. Pritchett and A. Zisserman. Matching and reconstruction from widely separated views. In R. Koch and L. Van Gool, editors, 3D Structure from Multiple Images of Large-Scale Environments, LNCS 1506, pages 78–92. Springer-Verlag, Jun 1998.

    Chapter  Google Scholar 

  13. P. Pritchett and A. Zisserman. Wide baseline stereo matching. In Proc. ICCV, pages 754–760, Jan 1998.

    Google Scholar 

  14. F. Schaffalitzky and A. Zisserman. Viewpoint invariant texture matching and wide baseline stereo. In Proc. ICCV, Jul 2001.

    Google Scholar 

  15. F. Schaffalitzky and A. Zisserman. Multi-view matching for unordered image sets, or “How do I organize my holiday snaps?”. In Proc. ECCV. Springer-Verlag, 2002.

    Google Scholar 

  16. C. Schmid and R. Mohr. Local greyvalue invariants for image retrieval. IEEE PAMI, 19(5):530–534, May 1997.

    Google Scholar 

  17. D. Tell and S. Carlsson. Wide baseline point matching using affine invariants computed from intensity profiles. In Proc. ECCV, LNCS 1842-1843, pages 814–828. Springer-Verlag, Jun 2000.

    Google Scholar 

  18. P. H. S. Torr and D. W. Murray. The development and comparison of robust methods for estimating the fundamental matrix. IJCV, 24(3):271–300, 1997.

    Article  Google Scholar 

  19. T. Tuytelaars and L. Van Gool. Content-based image retrieval based on local affinely invariant regions. In Int. Conf. on Visual Information Systems, pages 493–500, 1999.

    Google Scholar 

  20. T. Tuytelaars and L. Van Gool. Wide baseline stereo matching based on local, affinely invariant regions. In Proc. BMVC., pages 412–425, 2000.

    Google Scholar 

  21. Z. Zhang, R. Deriche, O. D. Faugeras, and Q.-T. Luong. A robust technique for matching two uncalibrated images through the recovery of the unknown epipolar geometry. Artificial Intelligence, 78:87–119, 1995.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2002 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Schaffalitzky, F., Zisserman, A. (2002). Automated Scene Matching in Movies. In: Lew, M.S., Sebe, N., Eakins, J.P. (eds) Image and Video Retrieval. CIVR 2002. Lecture Notes in Computer Science, vol 2383. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45479-9_20

Download citation

  • DOI: https://doi.org/10.1007/3-540-45479-9_20

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-43899-1

  • Online ISBN: 978-3-540-45479-3

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics