In this study, we propose a method for labeling faces with names in a large collection of captioned news images. Previous work has labeled faces with names using facial similarities, which are sensitive to intra-person appearance variations; captions, in turn, offer cues about the correlations among candidate names. Our method combines textual similarity derived from image captions with visual similarity between the face collections of candidate names to recognize celebrities automatically. The method requires no supervisory input and consists of two main steps. First, we build a name semantic network based on textual and visual similarity. Second, we apply the name semantic network to label face images with names. We perform experiments on a data set of approximately half a million news images from Yahoo News. The experimental results show that our method outperforms existing algorithms.
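The first step, building the name semantic network, can be pictured with a minimal sketch. This is an illustration under stated assumptions, not the authors' algorithm: the abstract does not specify the similarity measures or how they are combined, so the Jaccard overlap of captions, the cosine similarity of mean face vectors, the mixing weight `alpha`, and the function name `build_name_network` are all hypothetical choices. The labeling step is not sketched because the abstract gives no detail on it.

```python
from itertools import combinations
from math import sqrt


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = sqrt(sum(a * a for a in u)) * sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def centroid(vectors):
    """Component-wise mean of a non-empty list of vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]


def build_name_network(captions, faces_by_name, alpha=0.5):
    """Return {(name_a, name_b): weight} for every pair of names.

    captions: list of sets of names, one set per image caption.
    faces_by_name: name -> list of face feature vectors (candidate faces).
    alpha: hypothetical mixing weight between textual and visual similarity.
    """
    # Record the images whose captions mention each name.
    images = {}
    for idx, names in enumerate(captions):
        for name in names:
            images.setdefault(name, set()).add(idx)

    centers = {name: centroid(vs) for name, vs in faces_by_name.items()}
    network = {}
    for a, b in combinations(sorted(images), 2):
        # Textual similarity (assumed): Jaccard overlap of mentioning captions.
        textual = len(images[a] & images[b]) / len(images[a] | images[b])
        # Visual similarity (assumed): cosine between mean face vectors.
        if a in centers and b in centers:
            visual = cosine(centers[a], centers[b])
        else:
            visual = 0.0
        network[(a, b)] = alpha * textual + (1 - alpha) * visual
    return network


# Toy data: three captions and one 2-D face vector per name.
captions = [{"A", "B"}, {"A"}, {"B", "C"}]
faces_by_name = {"A": [[1.0, 0.0]], "B": [[1.0, 0.0]], "C": [[0.0, 1.0]]}
network = build_name_network(captions, faces_by_name)
```

In this toy input, names that share captions and have similar face collections receive a larger edge weight than names that share neither. A real system would replace the 2-D vectors with proper face descriptors and would likely tune how the two similarities are weighted.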

This work is supported by the Doctorate Foundation of Northwestern Polytechnical University under Grant CX201114, the Ministry of Education Fund for Doctoral Students Newcomer Awards of China, the National Natural Science Foundation of China under Grants 61075014, 61272285, and 61103062, the Research Fund for the Doctoral Program of Higher Education under Grants 20106102110028, 20116102110027, 20116102120031, and 20126101110022, the Science and Technology Project of Shaanxi Province under Grant 2013 K06-29, and the NPU Basic Research Foundation under Grant JC201249.
Cite this article
Su, X., Peng, J., Feng, X. et al. Labeling faces with names based on the name semantic network. Multimed Tools Appl 75, 6445–6462 (2016). https://doi.org/10.1007/s11042-015-2581-x