research-article

Robust Face Recognition with Deep Multi-View Representation Learning

Authors:

Jianshu Li,

Jian Zhao,

Fang Zhao,

Hao Liu,

Jing Li,

Shengmei Shen,

Jiashi Feng,

Terence SimAuthors Info & Claims

MM '16: Proceedings of the 24th ACM international conference on Multimedia

Pages 1068 - 1072

https://doi.org/10.1145/2964284.2984061

Published: 01 October 2016 Publication History

Get Access

Abstract

This paper describes our proposed method targeting at the MSR Image Recognition Challenge MS-Celeb-1M. The challenge is to recognize one million celebrities from their face images captured in the real world. The challenge provides a large scale dataset crawled from the Web, which contains a large number of celebrities with many images for each subject. Given a new testing image, the challenge requires an identify for the image and the corresponding confidence score. To complete the challenge, we propose a two-stage approach consisting of data cleaning and multi-view deep representation learning. The data cleaning can effectively reduce the noise level of training data and thus improves the performance of deep learning based face recognition models. The multi-view representation learning enables the learned face representations to be more specific and discriminative. Thus the difficulties of recognizing faces out of a huge number of subjects are substantially relieved. Our proposed method achieves a coverage of 46.1% at 95% precision on the random set and a coverage of 33.0% at 95% precision on the hard set of this challenge.

References

[1]

J. M. Cimbala. Outliers. Technical report, Penn State University, September 2011.

Google Scholar

[2]

K. He, X. Zhang, S. Ren, and J. Sun. Deep residual learning for image recognition. arXiv preprint arXiv:1512.03385, 2015.

Google Scholar

[3]

G. B. Huang, M. Ramesh, T. Berg, and E. Learned-Miller. Labeled faces in the wild: A database for studying face recognition in unconstrained environments. Technical Report 07--49, University of Massachusetts, Amherst, October 2007.

Google Scholar

[4]

A. Lucas, R. Van Dijk, and T. Kloek. Outlier robust gmm estimation of leverage determinants in linear dynamic panel data models. Available at SSRN 20611, 1997.

Google Scholar

[5]

F. Schroff, D. Kalenichenko, and J. Philbin. Facenet: A unified embedding for face recognition and clustering. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 815--823, 2015.

Crossref

Google Scholar

[6]

N. Srivastava, G. E. Hinton, A. Krizhevsky, I. Sutskever, and R. Salakhutdinov. Dropout: a simple way to prevent neural networks from overfitting. Journal of Machine Learning Research, 15(1):1929--1958, 2014.

Digital Library

Google Scholar

[7]

Y. Sun, D. Liang, X. Wang, and X. Tang. Deepid3: Face recognition with very deep neural networks. arXiv preprint arXiv:1502.00873, 2015.

Google Scholar

[8]

C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed, D. Anguelov, D. Erhan, V. Vanhoucke, and A. Rabinovich. Going deeper with convolutions. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 1--9, 2015.

Crossref

Google Scholar

[9]

Y. Taigman, M. Yang, M. Ranzato, and L. Wolf. Deepface: Closing the gap to human-level performance in face verification. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 1701--1708, 2014.

Digital Library

Google Scholar

[10]

D. Yi, Z. Lei, S. Liao, and S. Z. Li. Learning face representation from scratch. arXiv preprint arXiv:1411.7923, 2014.

Google Scholar

Cited By

View all

Liu YChen JLi YWu TWen H(2024)Joint face normalization and representation learning for face recognitionPattern Analysis and Applications10.1007/s10044-024-01255-227:2Online publication date: 17-May-2024
https://doi.org/10.1007/s10044-024-01255-2
Dong WSun S(2023)Multi-View Deep Gaussian Processes for Supervised LearningIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2023.331667145:12(15137-15153)Online publication date: Dec-2023
https://doi.org/10.1109/TPAMI.2023.3316671
Jia XJing XSun QChen SDu BZhang D(2023)Human Collective Intelligence Inspired Multi-View Representation Learning — Enabling View Communication by Simulating Human Communication MechanismIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2022.321860545:6(7412-7429)Online publication date: 1-Jun-2023
https://doi.org/10.1109/TPAMI.2022.3218605
Show More Cited By

Index Terms

Robust Face Recognition with Deep Multi-View Representation Learning
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision problems
        Object recognition
      2. Computer vision representations
        Image representations
  2. Machine learning
    1. Learning paradigms
      1. Supervised learning
        Supervised learning by classification
    2. Machine learning approaches
      1. Neural networks
2. Security and privacy
  1. Security services
    1. Authentication
      1. Biometrics

Recommendations

Robust face recognition based on dynamic rank representation

Robust face recognition is an active topic in computer vision, while face occlusion is one of the most challenging problems for robust face recognition algorithm. The latest research on low-rank representation demonstrated its high efficiency to ...
2D Pose-Invariant Face Recognition Using Single Frontal-View Face Database
Abstract
Personal identification systems that use face recognition work well for test images with frontal view face, but often fail when the input face is a pose view. Most face databases come from picture ID sources such as passports or driver’s licenses. ...
Robust Deep Auto-encoder for Occluded Face Recognition
MM '15: Proceedings of the 23rd ACM international conference on Multimedia

Occlusions by sunglasses, scarf, hats, beard, shadow etc, can significantly reduce the performance of face recognition systems. Although there exists a rich literature of researches focusing on face recognition with illuminations, poses and facial ...

Comments

Information & Contributors

Information

Published In

MM '16: Proceedings of the 24th ACM international conference on Multimedia

October 2016

1542 pages

ISBN:9781450336031

DOI:10.1145/2964284

General Chairs:
Alan Hanjalic
Delft University of Technology
,
Cees Snoek
Qualcomm Research Netherlands / University of Amsterdam
,
Marcel Worring
University of Amsterdam
,
Moderator:
Dick Bulterman
CWI / VU University Amsterdam
,
Program Chairs:
Benoit Huet
EURECOM
,
Aisling Kelliher
Virginia Tech
,
Yiannis Kompatsiaris
CERTH-ITI
,
Jin Li
Microsoft

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 October 2016

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Conference

MM '16

Sponsor:

SIGMM

MM '16: ACM Multimedia Conference

October 15 - 19, 2016

Amsterdam, The Netherlands

Acceptance Rates

MM '16 Paper Acceptance Rate 52 of 237 submissions, 22%;

Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

39
Total Citations
View Citations
614
Total Downloads

Downloads (Last 12 months)19
Downloads (Last 6 weeks)5

Reflects downloads up to 02 Mar 2025

Other Metrics

View Author Metrics

Citations

Cited By

View all

Liu YChen JLi YWu TWen H(2024)Joint face normalization and representation learning for face recognitionPattern Analysis and Applications10.1007/s10044-024-01255-227:2Online publication date: 17-May-2024
https://doi.org/10.1007/s10044-024-01255-2
Dong WSun S(2023)Multi-View Deep Gaussian Processes for Supervised LearningIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2023.331667145:12(15137-15153)Online publication date: Dec-2023
https://doi.org/10.1109/TPAMI.2023.3316671
Jia XJing XSun QChen SDu BZhang D(2023)Human Collective Intelligence Inspired Multi-View Representation Learning — Enabling View Communication by Simulating Human Communication MechanismIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2022.321860545:6(7412-7429)Online publication date: 1-Jun-2023
https://doi.org/10.1109/TPAMI.2022.3218605
Chen JHuang AGao WNiu YZhao T(2023)Joint Shared-and-Specific Information for Deep Multi-View ClusteringIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2023.327828533:12(7224-7235)Online publication date: Dec-2023
https://doi.org/10.1109/TCSVT.2023.3278285
Qiusha MZiyi LWenhao L(2023)Automatic Classification of Instructional Video Based on Different Presentation Forms2023 IEEE 12th International Conference on Educational and Information Technology (ICEIT)10.1109/ICEIT57125.2023.10107851(353-357)Online publication date: 16-Mar-2023
https://doi.org/10.1109/ICEIT57125.2023.10107851
Ding JFang XJia LJiang YLi R(2023)Diversity Multi-View Clustering With Subspace and NMF-Based Manifold LearningIEEE Access10.1109/ACCESS.2023.326483711(37041-37051)Online publication date: 2023
https://doi.org/10.1109/ACCESS.2023.3264837
Hao YJing XChen RLiu W(2023)Learning enhanced specific representations for multi-view feature learningKnowledge-Based Systems10.1016/j.knosys.2023.110590272(110590)Online publication date: Jul-2023
https://doi.org/10.1016/j.knosys.2023.110590
Karmali TAtrishi AHarsha SAgrawal SJampani VBabu R(2022)LEAD: Self-Supervised Landmark Estimation by Aligning Distributions of Feature Similarity2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)10.1109/WACV51458.2022.00310(3046-3055)Online publication date: Jan-2022
https://doi.org/10.1109/WACV51458.2022.00310
Zhao JYan SFeng J(2022)Towards Age-Invariant Face RecognitionIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2020.301142644:1(474-487)Online publication date: 1-Jan-2022
https://doi.org/10.1109/TPAMI.2020.3011426
Gao GYu YYang JQi GYang M(2022)Hierarchical Deep CNN Feature Set-Based Representation Learning for Robust Cross-Resolution Face RecognitionIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2020.304217832:5(2550-2560)Online publication date: May-2022
https://doi.org/10.1109/TCSVT.2020.3042178
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Abstract

References

Cited By

Index Terms

Recommendations

Robust face recognition based on dynamic rank representation

2D Pose-Invariant Face Recognition Using Single Frontal-View Face Database

Robust Deep Auto-encoder for Occluded Face Recognition

Comments

Information

Published In

Sponsors

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

Cited By

Login options

Full Access

View options

PDF

eReader

Share

Share this Publication link

Share on social media

Affiliations