Abstract
We propose a novel multi-view face detector that operates accurately and fast in challenging environments. It consists of four consecutive functional components: background rejector, pose classifier, pose-specific face detectors, and face validator. The background rejector removes non-face patches quickly, the pose classifier estimates poses of the surviving patches, one or more selected pose-specific face detectors according to their estimated pose labels determine that a given patch is a face by using winner take all (WTA) strategy, and the face validator checks whether the face-like patch is really a face. For achieving strong discrimination power with low computing overhead, we devise several types of order relation features (ORF) that encode the order relation among feature elements as a unique code. The devised ORFs are placed in functional components appropriately to ensure fast operation of the multi-view face detector. For accurate classification, we propose a doubly domain-partitioning (DDP) classifier that consists of a coarse domain-partitioning weak classifier followed by a fine bin-partitioning weighted linear discriminant analysis (wLDA) classifier. For fast classification, we devise a feature sharing method that shares identical features between the background rejector and the pose classifier, and among all classes in the pose classifier. We evaluated the proposed multi-view face detector using the FDDB, AFW, and PASCAL face datasets. The experimental results show that the proposed multi-view face detector outperforms other state-of-the-art methods in terms of detection accuracy and execution time.


















Similar content being viewed by others
References
Bay, H., Tuytelaars, T., Van Gool, L.: Surf: speeded up robust features. In: European Conference on Computer Vision, pp. 404–417 (2006)
Chen, D., Ren, S., Wei, Y., Cao, X., Sun, J.: Joint cascade face detection and alignment. In: European Conference on Computer Vision, pp. 109–122 (2014)
Dubout, C, Fleuret, F.: Exact acceleration of linear object detectors. In: European Conference on Computer Vision, pp. 301–311. Springer (2012)
Farfade, S.S., Saberian, M., Li, L.-J.: Multi-view face detection using deep convolutional neural networks. In: International Conference on Multimedia Retrieval, pp. 643–650 (2015)
Fröba, B., Ernst, A.: Face detection with the modified census transform. In: IEEE International Conference on Automatic Face and Gesture Recognition, pp. 91–96 (2004)
Gross, R., Matthews, I., Cohn, J., Kanade, T., Baker, S.: Multi-pie. In: IEEE International Conference on Automatic Face and Gesture Recognition, pp. 807–813. IEEE Computer Society (2008)
Huang, C., Ai, H., Wu, B., Lao, S.: Boosting nested cascade detector for multi-view face detection. In: International Conference on Pattern Recognition, pp. 415–418 (2004)
Jain, V., Learned-Miller, E.: FDDB: A Benchmark for Face Detection in Unconstrained Settings. Technical Report UM-CS-2010-009, University of Massachusetts, Amherst (2010)
Jones, M., Viola, P.: Fast Multi-view Face Detection. In: Technical Report TR2003-96, MERL - Mitsubishi Electric Research Laboratories, Cambridge, MA 02139, August (2003)
Jun, B., Kim, D.: Robust face detection using local gradient patterns and evidence accumulation. Pattern Recognit. 45(9), 3304–3316 (2012)
Jun, B., Choi, I., Kim, D.: Local transform features and hybridization for accurate face and human detection. IEEE Trans. Pattern Anal. Mach. Intell. 35(6), 1423–1436 (2013)
Koestinger, M., Wohlhart, P., Roth, P.M., Bischof, H.: Annotated facial landmarks in the wild: a large-scale, real-world database for facial landmark localization. In: International Workshop on Benchmarking Facial Image Analysis Technologies (2011)
Li, H., Lin, Z., Shen, X., Brandt, J., Hua, G.: A convolutional neural network cascade for face detection. In: Computer Vision and Pattern Recognition, pp. 5325–5334 (2015)
Li, J., Wang, T., Zhang, Y.: Face detection using surf cascade. In: International Conference on Computer Vision Workshops, pp. 2183–2190 (2011)
Liao, S., Jain, A.K., Li, S.Z.: A fast and accurate unconstrained face detector. IEEE Trans. Pattern Anal. Mach. Intell. 38(2), 211–223 (2016)
Yang, B., Yan, J., Lei, Z., Li, S. Z.: Convolutional channel features. In: IEEE International Conference on Computer Vision, pp. 82–90 (2015)
Matas, J., Sochman, J.: Waldboost—learning for time constrained sequential detection. In: Computer Vision and Pattern Recognition, pp. 150–156 (2005)
Mathias, M., Benenson, R., Pedersoli, M., Van Gool, L.: Face detection without bells and whistles. In: European Conference on Computer Vision, pp. 720–735 (2014)
Ojala, T., Pietikäinen, M., Harwood, D.: A comparative study of texture measures with classification based on featured distributions. Pattern Recognit. 29(1), 51–59 (1996)
Özuysal, M., Calonder, M., Lepetit, V., Fua, P.: Fast keypoint recognition using random ferns. IEEE Trans. Pattern Anal. Mach. Intell. 32(3), 448–461 (2010)
Ranjan, R., Patel, V.M., Chellappa, R.: A deep pyramid deformable part model for face detection. In: IEEE International Conference on Biometrics Theory, Applications and Systems (2015)
Ren, S., Cao, X., Wei, Y., Sun, J.: Face alignment at 3000 fps via regressing local binary features (LBFs). In: Computer Vision and Pattern Recognition, pp. 1685–1692 (2014)
Shen, X., Lin, Z., Brandt, J., Wu, Y.: Detecting and aligning faces by image retrieval. In: Computer Vision and Pattern Recognition, pp. 3460–3467 (2013)
Tuzel, O., Porikli, F., Meer, P.: Region covariance: A fast descriptor for detection and classification. In: European Conference on Computer Vision, pp. 589–600 (2006)
Viola, P., Jones, M.: Rapid object detection using a boosted cascade of simple features. In: Computer Vision and Pattern Recognition, pp. 511–518 (2001)
Yan, J., Zhang, X., Lei, Z., Li, S.Z.: Face detection by structural models. Image Vis. Comput. 32(10), 790–799 (2014)
Yang, S., Luo, P., Loy, C., Tang, X.: From facial parts responses to face detection: a deep learning approach. In: IEEE International Conference on Computer Vision, pp. 3676–3684 (2015)
Yang, B., Yan, J., Lei, Z., Li, S.Z.: Aggregate channel features for multi-view face detection. In: IEEE International Joint Conference on Biometrics (2014)
Zhu, X., Ramanan, D.: Face detection, pose estimation, and landmark localization in the wild. In: Computer Vision and Pattern Recognition, pp. 2879–2886 (2012)
Acknowledgements
This research was supported by the Ministry of Science and ICT (MSIT), Korea, under the SW Starlab support program (IITP-2017-0-00897) supervised by the Institute for Information and Communications Technology Promotion (IITP). Also, this research was supported by Institute for Information and Communications Technology Promotion (IITP) grant funded by the Ministry of Science and ICT (MSIT) (IITP-2014-0-00059, Development of Predictive Visual Intelligence Technology).
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Yoon, J., Kim, D. An accurate and real-time multi-view face detector using ORFs and doubly domain-partitioning classifier. J Real-Time Image Proc 16, 2425–2440 (2019). https://doi.org/10.1007/s11554-018-0751-6
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11554-018-0751-6