Abstract
We describe progress in matching shots which are images of the same 3D scene in a film. The problem is hard because the camera viewpoint may change substantially between shots, with consequent changes in the imaged appearance of the scene due to foreshortening, scale changes and partial occlusion.
We demonstrate that wide baseline matching techniques can be successfully employed for this task by matching key frames between shots. The wide baseline method represents each frame by a set of viewpoint invariant local feature vectors. The local spatial support of the features means that segmentation of the frame (e.g. into foreground/background) is not required, and partial occlusion is tolerated.
Results of matching shots for a number of different scene types are illustrated on a commercial film.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
A. Baumberg. Reliable feature matching across widely separated views. In Proc. CVPR, pages 774–781, 2000.
M. Gelgon and P. Bouthemy. Determining a structured spatio-temporal representation of video content for efficient visualization and indexing. In Proc. ECCV, pages 595–609, 1998.
R. I. Hartley and A. Zisserman. Multiple View Geometry in Computer Vision. Cambridge University Press, ISBN: 0521623049, 2000.
H. Lee, A. Smeaton, N. Murphy, N. O’Conner, and S. Marlow. User interface design for keyframe-based browsing of digital video. In Workshop on Image Analysis for Multimedia Interactive Services. Tampere, Finland, 16-17 May, 2001.
R. Lienhart. Reliable transition detection in videos: A survey and practitioner’s guide. International Journal of Image and Graphics, Aug 2001.
T. Lindeberg and J. Gårding. Shape-adapted smoothing in estimation of 3-d depth cues from affine distortions of local 2-d brightness structure. In Proc. ECCV, LNCS 800, pages 389–400, May 1994.
J. Matas, J. Burianek, and J. Kittler. Object recognition using the invariant pixel-set signature. In Proc. BMVC., pages 606–615, 2000.
J. Matas, O. Chum, M. Urban, and T. Pajdla. Distinguished regions for wide-baseline stereo. Research Report CTU-CMP-2001-33, Center for Machine Perception, K333 FEE Czech Technical University, Prague, Czech Republic, November 2001.
J. Matas, M. Urban, and T. Pajdla. Unifying view for wide-baseline stereo. In B Likar, editor, Proc. Computer Vision Winter Workshop, pages 214–222, Ljubljana, Sloveni, February 2001. Slovenian Pattern Recorgnition Society.
K. Mikolajczyk and C. Schmid. Indexing based on scale invariant interest points. In Proc. ICCV, 2001.
K. Mikolajczyk and C. Schmid. An affine invariant interest point detector. In Proc. ECCV. Springer-Verlag, 2002.
P. Pritchett and A. Zisserman. Matching and reconstruction from widely separated views. In R. Koch and L. Van Gool, editors, 3D Structure from Multiple Images of Large-Scale Environments, LNCS 1506, pages 78–92. Springer-Verlag, Jun 1998.
P. Pritchett and A. Zisserman. Wide baseline stereo matching. In Proc. ICCV, pages 754–760, Jan 1998.
F. Schaffalitzky and A. Zisserman. Viewpoint invariant texture matching and wide baseline stereo. In Proc. ICCV, Jul 2001.
F. Schaffalitzky and A. Zisserman. Multi-view matching for unordered image sets, or “How do I organize my holiday snaps?”. In Proc. ECCV. Springer-Verlag, 2002.
C. Schmid and R. Mohr. Local greyvalue invariants for image retrieval. IEEE PAMI, 19(5):530–534, May 1997.
D. Tell and S. Carlsson. Wide baseline point matching using affine invariants computed from intensity profiles. In Proc. ECCV, LNCS 1842-1843, pages 814–828. Springer-Verlag, Jun 2000.
P. H. S. Torr and D. W. Murray. The development and comparison of robust methods for estimating the fundamental matrix. IJCV, 24(3):271–300, 1997.
T. Tuytelaars and L. Van Gool. Content-based image retrieval based on local affinely invariant regions. In Int. Conf. on Visual Information Systems, pages 493–500, 1999.
T. Tuytelaars and L. Van Gool. Wide baseline stereo matching based on local, affinely invariant regions. In Proc. BMVC., pages 412–425, 2000.
Z. Zhang, R. Deriche, O. D. Faugeras, and Q.-T. Luong. A robust technique for matching two uncalibrated images through the recovery of the unknown epipolar geometry. Artificial Intelligence, 78:87–119, 1995.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Schaffalitzky, F., Zisserman, A. (2002). Automated Scene Matching in Movies. In: Lew, M.S., Sebe, N., Eakins, J.P. (eds) Image and Video Retrieval. CIVR 2002. Lecture Notes in Computer Science, vol 2383. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45479-9_20
Download citation
DOI: https://doi.org/10.1007/3-540-45479-9_20
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-43899-1
Online ISBN: 978-3-540-45479-3
eBook Packages: Springer Book Archive