skip to main content
10.1145/2187836.2187962acmotherconferencesArticle/Chapter ViewAbstractPublication PagesthewebconfConference Proceedingsconference-collections
research-article

Lightweight automatic face annotation in media pages

Published: 16 April 2012 Publication History

Abstract

Labeling human faces in images contained in Web media stories enables enriching the user experience offered by media sites. We propose a lightweight framework for automatic image annotation that exploits named entities mentioned in the article to significantly boost the accuracy of face recognition. While previous works in the area labor to train comprehensive offline visual models for a pre-defined universe of candidates, our approach models the people mentioned in a given story on the y, using a standard Web image search engine as an image sampling mechanism. We overcome multiple sources of noise introduced by this ad-hoc process, to build a fast and robust end-to-end system from off-the-shelf error-prone text analysis and machine vision components. In experiments conducted on approximately 900 faces depicted in 500 stories from a major celebrity news website, we were able to correctly label 81.5% of the faces while mislabeling 14.8% of them.

References

[1]
J. Allan.Introduction to topic detection and tracking. Kluwer Academic Publishers Norwell, 2002.
[2]
R. A. Baeza-Yates and B. A. Ribeiro-Neto.Modern Information Retrieval. ACM Press / Addison Wesley, New York, NY, 1999.
[3]
P. N. Bellhumer, J. Hespanha, and D. Kriegman. Eigenfaces vs. fisherfaces: Recognition using class specific linear projection. PAMI, 17(7):711--720, 1997.
[4]
T. L. Berg, A. C. Berg, J. Edwards, M. Maire, R. White, Y. W. Teh, E. Learned-Miller, and D. A. Forsyth. Names and faces in the news. In CVPR, June 2004.
[5]
K. Crammer and Y. Singer. On the Algorithmic Implementation of Multi-class SVMs. JMLR, 2001.
[6]
M. Everingham, J. Sivic, and A. Zisserman. Hello! my name is... buffy - automatic naming of characters in tv video. In BMVC, 2006.
[7]
D. A. Forsyth and J. Ponce.Computer Vision: A Modern Approach. Prentice Hall, 2002.
[8]
A. Gallagher and T. Chen. Estimating age, gender, and identity using first name priors. In CVPR, 2008.
[9]
G. B. Huang and V. Jain. Unsupervised joint alignment of complex images. In ICCV, 2007.
[10]
G. B. Huang, M. Ramesh, T. Berg, and E. Learned-Miller. Labeled faces in the wild: A database for studying face recognition in unconstrained environments. Technical Report 07-49, University of Massachusetts, Amherst, October 2007.
[11]
H. W. Kuhn. The hungarian method for the assignment problem.Naval Research Logistics Quarterly, 2:83--97, 1955.
[12]
D. Le and S. Satoh. Unsupervised face annotation by mining the web. In IEEE ICDM, pages 383--392, 2008.
[13]
C. Liu, S. Jiang, and Q. Huang. Naming faces in broadcast news video by image google. In ACM Multimedia, pages 717--720, 2008.
[14]
D. Ozkan and P. Duygulu. A graph based approach for naming faces in news photos. In CVPR, 2006.
[15]
P. Phillips, H. Moon, and S. R. P. Rauss. The feret evaluation methodology for face recognition algorithms.PAMI, 22:1090--1104, 2000.
[16]
R. Sandler and M. Lindenbaum. Nonnegative Matrix Factorization with Earth Mover's Distance Metric for Image Analysis. PAMI, 33:1590--1602, 2011.
[17]
S. Shirdhonkar and D. Jacobs. Approximate earth mover's distance in linear time. In CVPR, 2008.
[18]
R. Song, H. Liu, J. Wen, and W. Ma. Learning block importance models for web pages. In WWW, 2004.
[19]
M. Turk and A. Pentland. Eigenfaces for recognition. Journal of Cognitive Neuroscience, 3 (1):71--86, 1991.
[20]
S. Vadrevu and E. Velipasaoglu. Identifying primary content from web pages and its application to web search ranking. In WWW, 2011.
[21]
P. Viola and M. Jones. Rapid object detection using a boosted cascade of simple features. In CVPR, 2001.
[22]
L. Wiskott, J.-M. Fellous, N. Kruger, and C. V. D. Malsburg. Face recognition by elastic bunch graph matching.PAMI, 19:775--779, 1997.
[23]
L. Wolf, T. Hassner, and Y. Taigman. Descriptor based methods in the wild. In Faces in Real-Life Images Workshop in ECCV, 2008.
[24]
M. Zhao, J. Yagnik, H. Adam, and D. Bau. Large scale learning and recognition of faces in web videos. In CVPR, 2008.

Cited By

View all
  • (2018)Extracting semantic knowledge from web context for multimedia IRMultimedia Tools and Applications10.1007/s11042-017-4997-y77:11(13853-13889)Online publication date: 1-Jun-2018
  • (2018)Photo annotationMultimedia Tools and Applications10.1007/s11042-016-4281-677:1(423-457)Online publication date: 1-Jan-2018

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
WWW '12: Proceedings of the 21st international conference on World Wide Web
April 2012
1078 pages
ISBN:9781450312295
DOI:10.1145/2187836
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

  • Univ. de Lyon: Universite de Lyon

In-Cooperation

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 16 April 2012

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. face recognition
  2. machine learning
  3. text analysis
  4. web search

Qualifiers

  • Research-article

Conference

WWW 2012
Sponsor:
  • Univ. de Lyon
WWW 2012: 21st World Wide Web Conference 2012
April 16 - 20, 2012
Lyon, France

Acceptance Rates

Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)3
  • Downloads (Last 6 weeks)0
Reflects downloads up to 07 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2018)Extracting semantic knowledge from web context for multimedia IRMultimedia Tools and Applications10.1007/s11042-017-4997-y77:11(13853-13889)Online publication date: 1-Jun-2018
  • (2018)Photo annotationMultimedia Tools and Applications10.1007/s11042-016-4281-677:1(423-457)Online publication date: 1-Jan-2018

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media