Interactive Retrieval of Video Sequences from Local Feature Dynamics

Moënne-Loccoz, Nicolas; Bruno, Eric; Marchand-Maillet, Stéphane

doi:10.1007/11670834_11

Nicolas Moënne-Loccoz²⁰,
Eric Bruno²⁰ &
Stéphane Marchand-Maillet²⁰

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 3877))

Included in the following conference series:

International Workshop on Adaptive Multimedia Retrieval

265 Accesses
2 Citations

Abstract

This paper addresses the problem of retrieving video sequences that contain a spatio-temporal pattern queried by a user. To achieve this, the visual content of each video sequence is first decomposed through the analysis of its local feature dynamics. Camera motion of the sequence, background and objects present in the captured scene and events occurring within it are represented respectively by the parameters of the estimated global motion model, the appearance of the extracted local features and their trajectories. At query-time, a probabilistic model of the visual pattern is estimated from the user interaction, captured through a relevance-feedback loop. We show that the method permits to efficiently retrieve video sequences that share, even partially, a spatio-temporal pattern.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Baeza-Yates, R.A., Ribeiro-Neto, B.A.: Modern Information Retrieval. ACM Press / Addison-Wesley (1999)
Google Scholar
Bruno, E., Moënne-Loccoz, N., Marchand-Maillet, S.: Learning user queries in multimodal dissimilarity spaces. In: Detyniecki, M., Jose, J.M., Nürnberger, A., van Rijsbergen, C.J.K. (eds.) AMR 2005. LNCS, vol. 3877, pp. 168–179. Springer, Heidelberg (2006)
Chapter Google Scholar
Fergus, R., Perona, P., Zisserman, A.: Object class recognition by unsupervised scale-invariant learning. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (June 2003)
Google Scholar
Fisher, M.A., Bolles, R.C.: Random sample consensus: A paradigm for model fitting with applications to image analysis and automated cartography. Communications of the ACM, 381–395 (1981)
Google Scholar
Förstner, W.: A feature-based correspondence algorithm for image matching. Int. Arch. Photogrammetry and Remote Sensing 26, 150–166 (1986)
Google Scholar
Harris, C., Stephens, M.: A combined corner and edge detector. In: 4th Alvey Vision Conference, pp. 189–192 (1988)
Google Scholar
Janvier, B., Bruno, E., Marchand-Maillet, S., Pun, T.: Information-theoretic framework for the joint temporal partioning and representation of video data. In: Proceedings of the European Conference on Content-based Multimedia Indexing, CBMI 2003 (September 2003)
Google Scholar
Kadir, T., Zisserman, A., Brady, M.: An affine invariant salient region detector. In: Proceedings of the 8th European Conference on Computer Vision, Prague, Czech Republic (May 2004)
Google Scholar
Kuhn, H.W.: The hungarian method for the assignment problem. Naval Research Logistics Quaterly 2, 83–97 (1955)
Article MathSciNet MATH Google Scholar
Lazebnik, S., Schmid, C., Ponce, J.: Learning local affine-invariant part models for object class recognition. In: Workshop on Learning, Snowbird, Utah (2004)
Google Scholar
Li, F., Fergus, R., Perona, P.: A bayesian approach to unsupervised one-shot learning of object categories. In: Ninth IEEE International Conference on Computer Vision (ICCV), vol. 2, p. 1134 (2003)
Google Scholar
Lindeberg, T.: Feature detection with automatic scale selection. International Journal of Computer Vision 30(2), 77–116 (1998)
Google Scholar
Lowe, D.G.: Object recognition from local scale-invariant features. In: Proc. of the International Conference on Computer Vision ICCV, Corfu., pp. 1150–1157 (1999)
Google Scholar
Mikolajczyk, K., Schmid, C.: Indexing based on scale invariant interest points. In: 8th Internationnal Conference on Computer Vision, pp. 525–531 (2001)
Google Scholar
Mikolajczyk, K., Schmid, C.: An affine invariant interest point detector. In: European Conference on Computer Vision, Copenhagen, pp. 128–142. Springer, Heidelberg (2002)
Google Scholar
Mikolajczyk, K., Schmid, C.: A performance evaluation of local descriptors. In: IEEE Conference on Computer Vision and Pattern Recognition (2003)
Google Scholar
Moënne-Loccoz, N., Janvier, B., Marchand-Maillet, S., Bruno, E.: Managing video collections at large. In: Proceedings of the First Workshop on Computer Vision Meets Databases, CVDB 2004, Paris, France (2004)
Google Scholar
Rothganger, F., Lazebnik, S., Schmid, C., Ponce, J.: Segmenting, modeling and matching video clips containing multiple moving objects. In: IEEE Conference on Computer Vision, vol. 2, pp. 914–921 (2004)
Google Scholar
Shi, J., Tomasi, C.: Good features to track. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR 1994), Seattle (June 1994)
Google Scholar
Sivic, J., Zisserman, A.: Video Google: A text retrieval approach to object matching in videos. In: Proceedings of the International Conference on Computer Vision (October 2003)
Google Scholar
Tian, Q., Sebe, N., Lew, M.S., Loupias, E., Huang, T.S.: Image retrieval using wavelet-based salient points. Journal of Electronic Imaging, Special Issue on Storage and Retrieval of Digital Media, 835–849 (2001)
Google Scholar
Torr, P.H.S., Zisserman, A.: Feature based methods for structure and motion estimation. In: Triggs, B., Zisserman, A., Szeliski, R. (eds.) ICCV-WS 1999. LNCS, vol. 1883, pp. 278–294. Springer, Heidelberg (2000)
Chapter Google Scholar
Zhang, T., Ramakrishnan, R., Livny, M.: BIRCH: an efficient data clustering method for very large databases. In: Proceedings of the 1996 ACM SIGMOD International Conference on Management of Data, pp. 103–114 (1996)
Google Scholar

Download references

Author information

Authors and Affiliations

Viper Group, Computer Vision and Multimedia Lab, University of Geneva, 24, rue du Général Dufour, 1211, Geneva 4, Switzerland
Nicolas Moënne-Loccoz, Eric Bruno & Stéphane Marchand-Maillet

Authors

Nicolas Moënne-Loccoz
View author publications
You can also search for this author in PubMed Google Scholar
Eric Bruno
View author publications
You can also search for this author in PubMed Google Scholar
Stéphane Marchand-Maillet
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Laboratoire d’Informatique de Paris 6, France
Marcin Detyniecki
Department of Computer Science, University of Glasgow, 17 Lilybank Gardens, G12 8QQ, Glasgow, UK
Joemon M. Jose
Fakultät für Informatik, Otto-von-Guericke Universität Madgeburg, Universitätsplatz 2, 39106, Germany
Andreas Nürnberger
Department of Computing Science, University of Glasgow, G12 8QQ, Glasgow, UK
C. J. van Rijsbergen

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Moënne-Loccoz, N., Bruno, E., Marchand-Maillet, S. (2006). Interactive Retrieval of Video Sequences from Local Feature Dynamics. In: Detyniecki, M., Jose, J.M., Nürnberger, A., van Rijsbergen, C.J. (eds) Adaptive Multimedia Retrieval: User, Context, and Feedback. AMR 2005. Lecture Notes in Computer Science, vol 3877. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11670834_11

Download citation

DOI: https://doi.org/10.1007/11670834_11
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-32174-3
Online ISBN: 978-3-540-32175-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics