Skip to main content

Face Retrieval in Broadcasting News Video by Fusing Temporal and Intensity Information

  • Conference paper
Image and Video Retrieval (CIVR 2006)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4071))

Included in the following conference series:

Abstract

Human faces play an important role in efficiently indexing and accessing video contents, especially broadcasting news video. However, face appearance in real environments exhibits many variations such as pose changes, facial expressions, aging, illumination changes, low resolution and occlusion, making it difficult for current state of the art face recognition techniques to obtain reasonable retrieval results. To handle this problem, this paper proposes an efficient retrieval method by integrating temporal information into facial intensity information. First, representative faces are quickly generated by using facial intensities to organize the face dataset into clusters. Next, temporal information is introduced to reorganize cluster memberships so as to improve overall retrieval performance. For scalability and efficiency, the clustering is based on a recently-proposed model involving correlations among relevant sets (neighborhoods) of data items. Neighborhood queries are handled using an approximate search index. Experiments on the 2005 TRECVID dataset show promising results.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Zhao, W., Chellappa, R., Phillips, P.J., Rosenfeld, A.: Face recognition: A literature survey. ACM Computing Surveys 35(4), 399–458 (2003)

    Article  Google Scholar 

  2. Yang, J., Chen, M., Hauptmann, A.: Finding person x: Correlating names with visual appearances. In: Proc. Int. Conf. on Image and Video Retrieval (CIVR), pp. 270–278 (2004)

    Google Scholar 

  3. Weber, R., Schek, H.J., Blott, S.: A quantitative analysis and performance study for similarity-search methods in high-dimensional spaces. In: Proc. Intl. Conf. on Very Large Data Bases (VLDB), pp. 194–205 (1998)

    Google Scholar 

  4. Fitzgibbon, A., Zisserman, A.: On Affine Invariant Clustering and Automatic Cast Listing in Movies. In: Heyden, A., Sparr, G., Nielsen, M., Johansen, P. (eds.) ECCV 2002. LNCS, vol. 2352, pp. 304–320. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  5. Fitzgibbon, A.,, Z.: Joint manifold distance: a new approach to appearance based clustering. In: Proc. Intl. Conf. on Computer Vision and Pattern Recognition (CVPR), vol. 1, pp. 26–36 (2003)

    Google Scholar 

  6. Arandjelovic, O., Zisserman, A.: Automatic face recognition for film character retrieval in feature-length films. In: Proc. Intl. Conf. on Computer Vision and Pattern Recognition (CVPR), vol. 1, pp. 860–867 (2005)

    Google Scholar 

  7. Satoh, S., Kanade, T.: Name-it: Association of face and name in video. In: Proc. Intl. Conf. on Computer Vision and Pattern Recognition (CVPR), pp. 368–373 (1997)

    Google Scholar 

  8. Yang, J., Hauptmann, A.: Naming every individual in news video monologues. In: Proc. ACM International Conference on Multimedia (MM), pp. 580–587 (2004)

    Google Scholar 

  9. Yang, J., Yan, R., Hauptmann, A.: Multiple instance learning for labeling faces in broadcasting news video. In: Proc. ACM International Conference on Multimedia (MM), pp. 31–40 (2005)

    Google Scholar 

  10. Kaufman, L., Rousseeuw, P.J.: Finding Groups in Data: An Introduction to Cluster Analysis. John Wiley & Sons, Chichester (1990)

    Google Scholar 

  11. Houle, M.E.: A generic query-based model for scalable clustering. Technical Report NII-2006-008E, National Institute of Informatics (2006)

    Google Scholar 

  12. Houle, M.E., Sakuma, J.: Fast approximate similarity search in extremely high-dimensional data sets. In: Proc. Int. Conf. on Data Engineering (ICDE), pp. 619–630 (2005)

    Google Scholar 

  13. http://www-nlpir.nist.gov/projects/trecvid/

  14. Le, D.D., Satoh, S.: Multi-stage approach to fast face detection. In: Proc. British Machine Vison Conf. (BMVC), vol. 2, pp. 769–778 (2005)

    Google Scholar 

  15. Le, D.D., Satoh, S.: Fusion of local and global features for efficient object detection. In: Proc. SPIE, Applications of Neural Networks and Machine Learning in Image Processing IX, vol. 5673, pp. 106–116 (2005)

    Google Scholar 

  16. Rowley, H., Baluja, S., Kanade, T.: Neural network-based face detection. IEEE Transactions on Pattern Analysis and Machine Intelligence 20(1), 23–38 (1998)

    Article  Google Scholar 

  17. Turk, M., Pentland, A.: Face recognition using eigenfaces. In: Proc. Intl. Conf. on Computer Vision and Pattern Recognition (CVPR) (1991)

    Google Scholar 

  18. Phillips, P., Moon, H., Rizvi, S., Rauss, P.: The feret evaluation methodology for face recognition algorithms. IEEE Transactions on Pattern Analysis and Machine Intelligence 22(10), 1094–1104 (2002)

    Google Scholar 

  19. Houle, M.E.: Navigating massive data sets via local clustering. In: Proc. ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining (SIGKDD), pp. 547–552 (2003)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Le, DD., Satoh, S., Houle, M.E. (2006). Face Retrieval in Broadcasting News Video by Fusing Temporal and Intensity Information. In: Sundaram, H., Naphade, M., Smith, J.R., Rui, Y. (eds) Image and Video Retrieval. CIVR 2006. Lecture Notes in Computer Science, vol 4071. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11788034_40

Download citation

  • DOI: https://doi.org/10.1007/11788034_40

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-36018-6

  • Online ISBN: 978-3-540-36019-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics