Abstract
Cascade has been widely used in face detection where classifier with low computational cost can be firstly used to shrink most of the background while keeping the recall. In this paper, a new cascaded convolutional neural network method consisting of two main steps is proposed. During the first stage, low-pixel candidate window is used as an input such that the shallow convolutional neural network quickly extracts the candidate window. In the second stage, the window from the former stage is resized and used as an input to the corresponding network layer respectively. During the training period, joint online training is conducted for hard samples and the soft non-maximum suppression algorithm is used to test on the dataset. The whole network achieves improved performance on the FDDB and PASCAL face datasets.













Similar content being viewed by others
References
Bourdev L, Brandt J (2005) Robust Object Detection via Soft Cascade, Computer Vision and Pattern Recognition, 236–243
Chen D, Ren S, Wei Y, Cao X, Sun J (2014) Joint cascade face detection and alignment, in European Conference on Computer Vision, 109–122
Dollar P, Tu Z, Perona P, Belongie S (2009) Integral channel features, in BMVA
Dollár P, Appel R, Belongie S, Perona P (2014) Fast feature pyramids for object detection. IEEE Trans Pattern Anal Mach Intell 36(8):1532–1545
Farfade SS, Saberian M, Li L, Multi-view face detection using deep convolutional neural networks, ICMR2015
Girshick R, Fast R-CNN, ICCV2015
Girshick R, Donahue J, Darrell T, Malik J, Rich feature hierarchies for accurate object detection and semantic segmentation, IEEE CVPR2014
He K, Zhang X, Ren S et al (2015) Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition. IEEE Trans Pattern Anal Mach Intell 37(9):1904
Huang L, Yang Y, Deng Y, Yu Y (2015) DenseBox: Unifying Landmark Localization with End to End Object Detection arXiv:1509.04874
Jain V, Learned-Miller E (2010) FDDB: A benchmark for face detection in unconstrained settings, Tech. Rep. UM-CS-2010-009, University of Massachusetts. In: Amherst
Krizhevsky A, Sutskever I, Hinton G (2012) Imagenet classification with deep convolutional neural networks. NIPS 1097–1105
LeCun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521:436–444
Li H, Lin Z, Shen X, Brandt J, Hua G (2015) A convolutional neural network cascade for face detection computer vision and pattern recognition
Li J, Lu K, Huang Z, Zhu L, Shen HT Transfer independently together: a generalized framework for domain adaption. IEEE Trans Cybern, Digit Object Identifier. https://doi.org/10.1109/TCYB.2018.2820174
Liu L, Ouyang W, Wang X, Fieguth P, Chen J, Liu X, Pietikäinen M (2018) Deep learning for generic object detection: a survery, arXiv:1809.02165v1 [cs.CV] 6 Sep
Najibi M, Samangouei P, Chellapa R, Davis LS, SSH: single stage headless face detector, ICCV2007
Nie L, Wang X, Zhang J, He X, Zhang H, Hong R, Tian Q, Enhancing mircro-video understanding by harnessing external sounds, ACMM2017
Peiyun H, Ramanan D (2017) Finding tiny faces, CVPR
Ren S, He K, Girshick R, Sun J, (2016) Faster R-CNN: Towards real-Time object detection with region proposal networks, IEEE CVPR 1137–1149
Shelhamer E, Long J, Darrell T (2014) Fully Convolutional Networks for Semantic Segmentation. IEEE Trans Pattern Anal Mach Intell 39(4):640
Shen F, Xu Y, Liu L, Yang Y, Huang Z, Shen HT, Unsupervised Deep Hashing with Similarity-Adaptive and Discrete Optimization, IEEE Transactions on Pattern Analysis and Machine Intelligence. https://doi.org/10.1109/TPAMI.2018.2789887
Song X, Feng F, Han X, Yang X, Liu W, Nie L Neural compatibility modeling with attentive knowledge distillation, SIGIR2018
Tang X, Du DK, He Z, Liu J, (2018) PyramidBox: A Context-assisted Single Shot Face Detector. arXiv preprint arXiv:1803.07737
Viola P, Jones M (2001) Rapid object detection using a boosted cascade of simple features, in Proceedings of the 19th Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2001, pp. 511–518. IEEE
Wang X, Han TX, Yan S (2009) An hog-lbp human detector with partial occlusion handling, IEEE ICCV
Wang H, Li Z, Ji X, Wang Y, Face R-CNN (2017) arXiv preprint arXiv:1706.01061
Xie L, Shen J, Han J, Zhu L, Shao L, Dynamic multi-view hashing for online image retrieval, IJCAI2017
Yan J, Lei Z, Wen L, Li S (2014) “The fastest deformable part model for object detection,” in IEEE Conference on Computer Vision and Pattern Recognition, 2497–2504
Yan J, Zhang X, Lei Z, Li SZ (2014) Face detection by structural models. Image Vis Comput 32(10):790–799
Yang MH, Kriegman D, Ahuja N (2002) Detecting faces in images: A survey, IEEE Trans. PAMI
Yang S, Luo P, Loy CC, Tang X (2015) From facial parts responses to face detection: A deep learning approach, in IEEE International Conference on Computer Vision, 3676–3684
Yang S, Luo P, Loy CC, Tang X (2018) Faceness-Net: face detection through deep facial part response. IEEE Trans Pattern Anal Mach Intell 40(8):1845–1859
Zafeiriou S, Zhang C, Zhang Z (2015) A survey on face detection in the wild: past, present and future. Comput Vis Image Underst 138:1–24
Zhan K, Zhang Z, Li Z, Qiao Y (2016) Joint face detection and alignment using multi-task cascade convolutional Networks. IEEE Signal process lett 23(10):1499–1503
Zheng R, Yao C, Jin H, Zhou L, Zhang Q, Dong W (2015) Parallel key frame extraction for surveillance video service in a smart city. PLoS One 10(8):e0135694
Zhu X, Ramanan D (2012) “Face detection, pose estimation, and landmark localization in the wild,” in IEEE Conference on Computer Vision and Pattern Recognition 2879–2886
Zhu Q, Yeh MC, Cheng KT, Avidan S (2006) Fast human detection using a cascade of histograms of oriented gradients, IEEE CVPR
Zhu L, Huang Z, Chang X, Song J, Shen HT, Exploring consistent preferences: discrete hashing with pair-exemplar for scalable landmark search, ACMM2017
Zhu L, Huang Z, Li Z, Xie L, Shen HT (2018) Exploring auxiliary context: discrete semantic transfer hashing for scalable image retrieval. IEEE Trans NNLS 29(11):5264–5276
Acknowledgements
This work is supported by National Natural Science Foundation (NNSF) of China under Grant No. 61473086, 61603080 and 61773117. Jiangsu key R & D plan (No.BE2017157).
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Yang, W., Zhou, L., Li, T. et al. A Face Detection Method Based on Cascade Convolutional Neural Network. Multimed Tools Appl 78, 24373–24390 (2019). https://doi.org/10.1007/s11042-018-6995-0
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-018-6995-0