Linking Identities and Viewpoints in Home Movies Based on Robust Feature Matching

Truong, Ba Tu; Venkatesh, Svetha

doi:10.1007/978-3-540-69423-6_62


Part of the book series: Lecture Notes in Computer Science (LNISA, volume 4351)

Included in the following conference series:

International Conference on Multimedia Modeling


Abstract

The identification of useful structures in home video is difficult because this class of video is distinguished from other video sources by its unrestricted, unedited content and the absence of a regulated storyline. In addition, home videos contain a lot of motion and erratic camera movement, with shots of the same character captured from various angles and viewpoints. In this paper, we present a solution to the challenging problem of clustering shots and faces in home videos, based on the use of SIFT features. SIFT features are known to be robust for object recognition; however, to deal with the complexities of the home video setting, the matching process needs to be augmented and adapted. This paper describes various techniques that improve both the number of matches returned and their correctness. For example, existing methods for verifying matches are inadequate when only a small number of matches is returned, a common situation in home videos. We address this by constructing a robust classifier that works on matching sets instead of individual matches, allowing the exploitation of the geometric constraints between matches. Finally, we propose techniques for robustly extracting target clusters from individual feature matches.
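
To make the pipeline in the abstract more concrete, the following is a minimal sketch of the two generic building blocks it refers to: descriptor matching with a nearest-neighbour ratio test, and grouping shots from pairwise matches. It is not the authors' method. In particular, it does not implement their set-level classifier or any geometric verification. The function names `ratio_test_matches` and `cluster_shots`, the thresholds `ratio=0.8` and `min_matches=2`, and the toy 2-D descriptors are all assumptions made for illustration (real SIFT descriptors are 128-dimensional).

```python
from itertools import combinations
from math import dist


def ratio_test_matches(desc_a, desc_b, ratio=0.8):
    """Return (i, j) pairs where desc_a[i] is matched to desc_b[j].

    A match is kept only if the nearest neighbour in desc_b is clearly
    closer than the second nearest (a nearest-neighbour ratio test).
    """
    matches = []
    if len(desc_b) < 2:
        return matches  # the ratio test needs two candidates
    for i, d in enumerate(desc_a):
        ranked = sorted(range(len(desc_b)), key=lambda j: dist(d, desc_b[j]))
        best, second = ranked[0], ranked[1]
        if dist(d, desc_b[best]) < ratio * dist(d, desc_b[second]):
            matches.append((i, best))
    return matches


def cluster_shots(shots, min_matches=2, ratio=0.8):
    """Group shots whose pairwise match count reaches min_matches.

    `shots` maps a shot id to its list of descriptors. Linked shots are
    merged with union-find, so clusters are connected components.
    """
    parent = {s: s for s in shots}

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for a, b in combinations(shots, 2):
        if len(ratio_test_matches(shots[a], shots[b], ratio)) >= min_matches:
            parent[find(a)] = find(b)

    clusters = {}
    for s in shots:
        clusters.setdefault(find(s), set()).add(s)
    return list(clusters.values())


# Toy 2-D "descriptors" standing in for SIFT descriptors (hypothetical data).
shots = {
    "shot1": [(0.0, 0.0), (10.0, 10.0), (5.0, 5.0)],
    "shot2": [(0.1, 0.0), (10.0, 10.1), (50.0, 50.0)],
    "shot3": [(100.0, 100.0), (120.0, 90.0), (90.0, 130.0)],
}
clusters = cluster_shots(shots)
```

In this toy data the third descriptor of `shot1` is almost equidistant from two descriptors of `shot2`, so the ratio test rejects it as ambiguous. With only two surviving matches, the pair sits exactly at the linking threshold, which illustrates the small-match-count situation that the abstract identifies as hard to verify match by match.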



Author information

Authors and Affiliations

Department of Computing, Curtin University of Technology, Perth, Western Australia
Ba Tu Truong & Svetha Venkatesh


Editor information

Editors and Affiliations

School of Computer Engineering, Nanyang Technological University, Block N4, Nanyang Avenue, 639798, Singapore
Tat-Jen Cham & Deepu Rajan
School of Computer Engineering, Nanyang Technological University, 639798, Singapore
Jianfei Cai
IBM T.J. Watson Research Center, Yorktown Heights, P.O. Box 704, 10598, New York, USA
Chitra Dorai
National University of Singapore, 3 Science Dr, 117543, Singapore
Tat-Seng Chua
Center for Multimedia and Network Technology, School of Computer Engineering, Nanyang Technological University, 639798, Singapore
Liang-Tien Chia


About this paper

Cite this paper

Truong, B.T., Venkatesh, S. (2006). Linking Identities and Viewpoints in Home Movies Based on Robust Feature Matching. In: Cham, TJ., Cai, J., Dorai, C., Rajan, D., Chua, TS., Chia, LT. (eds) Advances in Multimedia Modeling. MMM 2007. Lecture Notes in Computer Science, vol 4351. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-69423-6_62


DOI: https://doi.org/10.1007/978-3-540-69423-6_62
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-69421-2
Online ISBN: 978-3-540-69423-6
eBook Packages: Computer Science (R0)
