Skip to main content

Linking Identities and Viewpoints in Home Movies Based on Robust Feature Matching

  • Conference paper
Advances in Multimedia Modeling (MMM 2007)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4351))

Included in the following conference series:

  • 850 Accesses

Abstract

The identification of useful structures in home video is difficult because this class of video is distinguished from other video sources by its unrestricted, non edited content and the absence of regulated storyline. In addition, home videos contain a lot of motion and erratic camera movements, with shots of the same character being captured from various angles and viewpoints. In this paper, we present a solution to the challenging problem of clustering shots and faces in home videos, based on the use of SIFT features. SIFT features have been known to be robust for object recognition; however, in dealing with the complexities of home video setting, the matching process needs to be augmented and adapted. This paper describes various techniques that can improve the number of matches returned as well as the correctness of matches. For example, existing methods for verification of matches are inadequate for cases when a small number of matches are returned, a common situation in home videos. We address this by constructing a robust classifier that works on matching sets instead of individual matches, allowing the exploitation of the geometric constraints between matches. Finally, we propose techniques for robustly extracting target clusters from individual feature matches.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Rui, Y., Huang, T.S., S.M.: Constructing table-of-content for videos. ACM Multimedia System Journal: Special Issue in Multimedia Systems on Video Libraries 7, 359–368 (1999)

    Google Scholar 

  2. Veneau, E., Ronfard, R., Bouthemy, P.: From video shot clustering to sequence segmentation. In: ICPR 2000, Barcelona, vol. 4, pp. 254–257 (2000)

    Google Scholar 

  3. Yeung, M., Yeo, B.L., Liu, B.: Segmentation of video by clustering and graph analysis. Computer Vision and Image Understanding 7, 94–109 (1998)

    Article  Google Scholar 

  4. Zhao, L., Qi, W., Yang, S., Zhang, H.: Video shot grouping using best-first model merging. In: Proc. 13th SPIE Symposium on Electronic Imaging - Storage and Retrieval for Image and Video Databases, San Jose, pp. 262–267 (2001)

    Google Scholar 

  5. Gatica-Perez, D., Loui, A., Sun, M.T.: Finding structure in home videos by probabilistic hierarchical clustering. IEEE Transactions on Circuits and Systems for Video Technology 13, 539–548 (2003)

    Article  Google Scholar 

  6. Truong, B.T., Venkatesh, S., Dorai, C.: Application of computational media aesthetics methodology to extracting color semantics in film. In: ACM Multimedia (ACMMM 2002), France Les Pins, pp. 339–342 (2002)

    Google Scholar 

  7. Satoh, S.: News video analysis based on identical shot detection. In: Multimedia and Expo. In: Proceedings. 2002 IEEE International Conference on ICME 2002, vol. 1, pp. 69–72 (2002)

    Google Scholar 

  8. Truong, B.T., Venkatesh, S., Dorai, C.: Identifying film takes for cinematic analysis. Multimedia Tools and Applications 26, 277–298 (2005)

    Article  Google Scholar 

  9. Schaffalitzky, F., Zisserman, A.: Automated location matching in movies. Computer Vision and Image Understanding 92, 236–264 (2003)

    Article  Google Scholar 

  10. Truong, B.T., Venkatesh, S.: Sift feature for home video analysis. Technical report, IMPCA - Curtin University of Technology (2006)

    Google Scholar 

  11. Lowe, D.G.: Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision 60, 91–110 (2004)

    Article  Google Scholar 

  12. Mikolajczyk, K., Schmid, C.: A performance evaluation of local descriptors. IEEE Transactions on Pattern Analysis and Machine Intelligence 27, 1615–1630 (2005)

    Article  Google Scholar 

  13. Fischler, M.A., Bolles, R.C.: Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography. Commun. ACM 24, 381–395 (1981)

    Article  MathSciNet  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Truong, B.T., Venkatesh, S. (2006). Linking Identities and Viewpoints in Home Movies Based on Robust Feature Matching. In: Cham, TJ., Cai, J., Dorai, C., Rajan, D., Chua, TS., Chia, LT. (eds) Advances in Multimedia Modeling. MMM 2007. Lecture Notes in Computer Science, vol 4351. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-69423-6_62

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-69423-6_62

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-69421-2

  • Online ISBN: 978-3-540-69423-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics