skip to main content
10.1145/2964284.2984061acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

Robust Face Recognition with Deep Multi-View Representation Learning

Published: 01 October 2016 Publication History

Abstract

This paper describes our proposed method targeting at the MSR Image Recognition Challenge MS-Celeb-1M. The challenge is to recognize one million celebrities from their face images captured in the real world. The challenge provides a large scale dataset crawled from the Web, which contains a large number of celebrities with many images for each subject. Given a new testing image, the challenge requires an identify for the image and the corresponding confidence score. To complete the challenge, we propose a two-stage approach consisting of data cleaning and multi-view deep representation learning. The data cleaning can effectively reduce the noise level of training data and thus improves the performance of deep learning based face recognition models. The multi-view representation learning enables the learned face representations to be more specific and discriminative. Thus the difficulties of recognizing faces out of a huge number of subjects are substantially relieved. Our proposed method achieves a coverage of 46.1% at 95% precision on the random set and a coverage of 33.0% at 95% precision on the hard set of this challenge.

References

[1]
J. M. Cimbala. Outliers. Technical report, Penn State University, September 2011.
[2]
K. He, X. Zhang, S. Ren, and J. Sun. Deep residual learning for image recognition. arXiv preprint arXiv:1512.03385, 2015.
[3]
G. B. Huang, M. Ramesh, T. Berg, and E. Learned-Miller. Labeled faces in the wild: A database for studying face recognition in unconstrained environments. Technical Report 07--49, University of Massachusetts, Amherst, October 2007.
[4]
A. Lucas, R. Van Dijk, and T. Kloek. Outlier robust gmm estimation of leverage determinants in linear dynamic panel data models. Available at SSRN 20611, 1997.
[5]
F. Schroff, D. Kalenichenko, and J. Philbin. Facenet: A unified embedding for face recognition and clustering. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 815--823, 2015.
[6]
N. Srivastava, G. E. Hinton, A. Krizhevsky, I. Sutskever, and R. Salakhutdinov. Dropout: a simple way to prevent neural networks from overfitting. Journal of Machine Learning Research, 15(1):1929--1958, 2014.
[7]
Y. Sun, D. Liang, X. Wang, and X. Tang. Deepid3: Face recognition with very deep neural networks. arXiv preprint arXiv:1502.00873, 2015.
[8]
C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed, D. Anguelov, D. Erhan, V. Vanhoucke, and A. Rabinovich. Going deeper with convolutions. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 1--9, 2015.
[9]
Y. Taigman, M. Yang, M. Ranzato, and L. Wolf. Deepface: Closing the gap to human-level performance in face verification. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 1701--1708, 2014.
[10]
D. Yi, Z. Lei, S. Liao, and S. Z. Li. Learning face representation from scratch. arXiv preprint arXiv:1411.7923, 2014.

Cited By

View all
  • (2024)Joint face normalization and representation learning for face recognitionPattern Analysis and Applications10.1007/s10044-024-01255-227:2Online publication date: 17-May-2024
  • (2023)Multi-View Deep Gaussian Processes for Supervised LearningIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2023.331667145:12(15137-15153)Online publication date: Dec-2023
  • (2023)Human Collective Intelligence Inspired Multi-View Representation Learning — Enabling View Communication by Simulating Human Communication MechanismIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2022.321860545:6(7412-7429)Online publication date: 1-Jun-2023
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
MM '16: Proceedings of the 24th ACM international conference on Multimedia
October 2016
1542 pages
ISBN:9781450336031
DOI:10.1145/2964284
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 October 2016

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. deep learning
  2. face recognition
  3. model ensemble
  4. multi-view feature representation

Qualifiers

  • Research-article

Conference

MM '16
Sponsor:
MM '16: ACM Multimedia Conference
October 15 - 19, 2016
Amsterdam, The Netherlands

Acceptance Rates

MM '16 Paper Acceptance Rate 52 of 237 submissions, 22%;
Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)19
  • Downloads (Last 6 weeks)5
Reflects downloads up to 02 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Joint face normalization and representation learning for face recognitionPattern Analysis and Applications10.1007/s10044-024-01255-227:2Online publication date: 17-May-2024
  • (2023)Multi-View Deep Gaussian Processes for Supervised LearningIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2023.331667145:12(15137-15153)Online publication date: Dec-2023
  • (2023)Human Collective Intelligence Inspired Multi-View Representation Learning — Enabling View Communication by Simulating Human Communication MechanismIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2022.321860545:6(7412-7429)Online publication date: 1-Jun-2023
  • (2023)Joint Shared-and-Specific Information for Deep Multi-View ClusteringIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2023.327828533:12(7224-7235)Online publication date: Dec-2023
  • (2023)Automatic Classification of Instructional Video Based on Different Presentation Forms2023 IEEE 12th International Conference on Educational and Information Technology (ICEIT)10.1109/ICEIT57125.2023.10107851(353-357)Online publication date: 16-Mar-2023
  • (2023)Diversity Multi-View Clustering With Subspace and NMF-Based Manifold LearningIEEE Access10.1109/ACCESS.2023.326483711(37041-37051)Online publication date: 2023
  • (2023)Learning enhanced specific representations for multi-view feature learningKnowledge-Based Systems10.1016/j.knosys.2023.110590272(110590)Online publication date: Jul-2023
  • (2022)LEAD: Self-Supervised Landmark Estimation by Aligning Distributions of Feature Similarity2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)10.1109/WACV51458.2022.00310(3046-3055)Online publication date: Jan-2022
  • (2022)Towards Age-Invariant Face RecognitionIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2020.301142644:1(474-487)Online publication date: 1-Jan-2022
  • (2022)Hierarchical Deep CNN Feature Set-Based Representation Learning for Robust Cross-Resolution Face RecognitionIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2020.304217832:5(2550-2560)Online publication date: May-2022
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media