skip to main content
10.1145/3425577.3425586acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicccvConference Proceedingsconference-collections
research-article

MTCNN Based on Gaussian Mixture Image Pyramid

Authors Info & Claims
Published:23 January 2021Publication History

ABSTRACT

As the first two steps of face recognition system, face detection and key point location are of great significance to the accuracy of face recognition system. The Multi-task Cascaded Convolutional Network(MTCNN) uses a single network model for the first time to complete the task of face detection and key point location. It is easy to find the problem of false detection in the practice of people's face detection. Therefore, this paper introduces multi-scale and multi template data preprocessing in MTCNN model. By increasing the diversity of data, the accuracy of MTCNN model is enhanced. The results show that compared with the original MTCNN model, the recall of the hybrid test set composed of WIDER Face and Celeba is increased by 2% after introducing multi-scale and multi template data preprocessing.

References

  1. Milborrow S, Nicolls F. Locating facial features with an extended active shape model[C]. European conference on computer vision. Springer, Berlin, Heidelberg, 2008: 504-513.Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Wan Kwok-Wai, Lam K M, Ng K C. An accurate active shape model for facial feature extraction [J]. Pattern recognition letters, 2005, 26 (15): 2409-2423.Google ScholarGoogle Scholar
  3. Cootes T F, Edwards G J, Taylor C J. Active appearance models [J]. IEEE Transactions on Pattern Analysis & Machine Intelligence, 2001 (6): 681- 685Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Kahraman F, Gokmen M, Darkner S, An active illumination and appearance (AIA) model for face alignment[C]. IEEE Conference on Computer Vision and Pattern Recognition, 2007: 1-7.Google ScholarGoogle Scholar
  5. Burgos-Artizzu X P, Perona P, Dollár P. Robust face landmark estimation under occlusion[C]. IEEE International Conference on Computer Vision, 2013: 1513-1520Google ScholarGoogle Scholar
  6. Chen Dong, Ren Shaoqing, Wei Yichen, Joint cascade face detection and alignment[C]. European Conference on Computer Vision. Springer, Cham, 2014: 109-122Google ScholarGoogle Scholar
  7. SUN Y, WANG X G, TANG X O. Deep convolutional network cascade for facial point detection[C]. IEEE conference on computer vision and pattern recognition, 2013: 3476 − 3483.Google ScholarGoogle Scholar
  8. ZHOU E J, FAN H Q, CAO Z M, Extensive facial landmark localization with coarse-to-fine convolutional network cascade[C]. IEEE international conference on computer vision, 2013: 386 – 391.Google ScholarGoogle Scholar
  9. Zhang, Kaipeng, Joint face detection and alignment using multitask cascaded convolution function and GLCM. Autex Research Journal, 2015: 226-232.Google ScholarGoogle Scholar
  10. S. Yang, P. Luo, C. C. Loy, and X. Tang. WIDER FACE: A Face Detection Benchmark. arXiv preprint arXiv:1511.06523Google ScholarGoogle Scholar
  11. M.G.P. Rose, S.M. Palmer, and X. Tang. Deep learning face attributes in the world[C]. IEEE International Conference on Computer Vision, 2015: 3730-3738Google ScholarGoogle Scholar
  12. Y. Sun, X. Wang, and X. Tang. Deep Convolutional NetWork Cascade for Facial Point Detection[C]. IEEE Conference on Computer Vision and Pattern Recognition, 2013: 2570-2576Google ScholarGoogle Scholar
  13. GROSSR, MATTHEWSI, COHNJ, etal. Multi-PIE[J]. Image and Vision Computing, 2010, 28(5), 807-813Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Ketkar N.Introduction to PyTorch [M]. Deep Learning with Python. 2017Google ScholarGoogle Scholar
  15. Lv Jiangjing, Shao Xiaohu, Xing Junliang, A deep regression architecture with two-stage re-initialization for high performance facial landmark detection[C]. IEEE Conference on Computer Vision and Pattern Recognition, 2017: 3317-3326Google ScholarGoogle Scholar

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in
  • Published in

    cover image ACM Other conferences
    ICCCV '20: Proceedings of the 3rd International Conference on Control and Computer Vision
    August 2020
    114 pages
    ISBN:9781450388023
    DOI:10.1145/3425577

    Copyright © 2020 ACM

    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    • Published: 23 January 2021

    Permissions

    Request permissions about this article.

    Request Permissions

    Check for updates

    Qualifiers

    • research-article
    • Research
    • Refereed limited
  • Article Metrics

    • Downloads (Last 12 months)9
    • Downloads (Last 6 weeks)0

    Other Metrics

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format .

View HTML Format