Abstract
In an uncontrolled environment, many of the face detection algorithms lack robustness due to their design. The present research on unconstrained face detection is focused on handcrafted and visual features in isolation. We propose a novel approach to use handcrafted as well as visual features together for improvement in face detection to achieve robustness. The algorithm uses a side-view face detector, which divides the problem space into two: side view face detection and frontal face detection. For frontal faces, Discrete Wavelet Transform (DWT) followed by the encoding of Eyes like landmarks (ELL) pixels is proposed in this work. A Human trait that helps to make decisions even better when they are taken together with the help of more than one decision makers is modeled in this work. To achieve this, a combination of handcrafted and visual features is used. Further to improve classification and to provide a better decision, a faster second stage classification scheme is introduced. The result shows an improvement when handcrafted and visual features are combined instead of using them separately.












Similar content being viewed by others
References
Agrawal AK, Singh YN (2017) An efficient approach for face recognition in uncontrolled environment. Multimed Tools Appl 76(3):3751–3760
Al-Tairi ZH, Rahmat RW, Saripan MI, Sulaiman PS (2014) Skin segmentation using YUV and RGB color spaces. JIPS 10(2):283–299
Bagchi P, Bhattacharjee D, Nasipuri M (2016) A robust analysis, detection and recognition of facial features in 2.5 d images. Multimed Tools Appl 75(18):11059–11096
Berg TL, Berg AC, Edwards J, Forsyth DA (2005) Who’s in the picture. In: Advances in neural information processing systems, pp 137–144
Candemir S, Borovikov E, Santosh K, Antani S, Thoma G (2015) Rsilc: rotation-and scale-invariant, line-based color-aware descriptor. Image Vis Comput 42:1–12
Chen D, Ren S, Wei Y, Cao X, Sun J (2014) Joint cascade face detection and alignment. In: European conference on computer vision. Springer, pp 109–122
Everingham M, Eslami SA, Van Gool L, Williams CK, Winn J, Zisserman A (2015) The pascal visual object classes challenge: a retrospective. Int J Comput Vis 111(1):98–136
Fan W, Bouguila N (2015) Face detection and facial expression recognition using simultaneous clustering and feature selection via an expectation propagation statistical learning framework. Multimed Tools Appl 74(12):4303–4327
Farfade SS, Saberian MJ, Li LJ (2015) Multi-view face detection using deep convolutional neural networks. In: Proceedings of the 5th ACM on International Conference on Multimedia Retrieval. ACM, pp 643–650
Ghiasi G, Fowlkes CC (2014) Occlusion coherence: Localizing occluded faces with a hierarchical deformable part model. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 2385–2392
Gini C (1971) Variability and mutability, contribution to the study of statistical distributions and relations. J Amer Stat Assoc 66:534–544
Girshick R, Donahue J, Darrell T, Malik J (2014) Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 580–587
Haar A (1910) Zur theorie der orthogonalen funktionensysteme. Math Ann 69 (3):331–371
Hao Z, Liu Y, Qin H, Yan J, Li X, Hu X (2017) Scale-aware face detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 6186–6195
He Q, He B, Zhang Y, Fang H (2019) Multimedia based fast face recognition algorithm of speed up robust features. Multimedia Tools and Applications 78(17):24035–24045
Hong SH, Lee JW, Lama RK, Kwon GR (2016) Real-time face detection and phone-to-face distance measuring for speech recognition for multi-modal interface in mobile device. Multimed Tools Appl 75(12):6717–6735
Huang C, Ai H, Li Y, Lao S (2007) High-performance rotation invariant multiview face detection. IEEE Trans Pattern Anal Mach Intell 29(4):671–686
Hussain MF, Wang H, Santosh K (2018) Gray level face recognition using spatial features. In: International conference on recent trends in image processing and pattern recognition. Springer, pp 216–229
Jain V, Learned-Miller E (2010) FDDB: A benchmark for face detection in unconstrained settings. University of Massachusetts. Amherst Tech Rep UM-CS-2010-009 2(7):8
Jones M, Viola P (2003) Fast multi-view face detection. Mitsubishi Electric Res TR-20003-96 3(14):2
Koestinger M, Wohlhart P, Roth PM, Bischof H (2011) Annotated facial landmarks in the wild: a large-scale, real-world database for facial landmark localization. In: 2011 IEEE international conference on Computer vision workshops (ICCV workshops). IEEE, pp 2144–2151
Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks. In: Advances in neural information processing systems, pp 1097–1105
Li SZ, Zhu L, Zhang Z, Blake A, Zhang H, Shum H (2002) Statistical learning of multi-view face detection. In: European conference on computer vision. Springer, pp 67–81
Li H, Hua G, Lin Z, Brandt J, Yang J (2013) Probabilistic elastic part model for unsupervised face detector adaptation. In: 2013 IEEE international conference on Computer vision (ICCV). IEEE, pp 793–800
Li H, Lin Z, Shen X, Brandt J, Hua G (2015) A convolutional neural network cascade for face detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 5325–5334
Li Y, Sun B, Wu T, Wang Y (2016) Face detection with end-to-end integration of a convnet and a 3D model. In: European conference on computer vision. Springer, pp 420–436
Liao S, Jain AK, Li SZ (2016) A fast and accurate unconstrained face detector. IEEE Trans Pattern Anal Mach Intell 38(2):211–223
Lienhart R, Maydt J (2002) An extended set of haar-like features for rapid object detection. In: International conference on image processing, proceedings, vol 1. IEEE, pp 1–4
Marciniak T, Chmielewska A, Weychan R, Parzych M, Dabrowski A (2015) Influence of low resolution of images on reliability of face detection and recognition. Multimed Tools Appl 74(12):4329–4349
Oyster CW, Haver N (1999) The human eye: structure and function, vol 766. Sinauer Associates, Sunderland
Qin H, Yan J, Li X, Hu X (2016) Joint training of cascaded CNN for face detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 3456–3465
Ranjan R, Patel VM, Chellappa R (2015) A deep pyramid deformable part model for face detection. In: IEEE 7th international conference on Biometrics theory, applications and systems (BTAS), pp 1–8
Ranjan R, Patel VM, Chellappa R (2019) Hyperface: a deep multi-task learning framework for face detection, landmark localization, pose estimation, and gender recognition. IEEE Trans Pattern Anal Mach Intell 41(1):121–135
Sawat DD, Hegadi RS, Hegadi RS (2018) Eye like landmarks extraction and patching for face detection using deep neural network. In: International conference on recent trends in image processing and pattern recognition. Springer, pp 395–405
Schroff F, Kalenichenko D, Philbin J (2015) Facenet: a unified embedding for face recognition and clustering. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 815–823
Viola P, Jones M, et al. (2001) Rapid object detection using a boosted cascade of simple features. CVPR (1) 1:511–518
Viola P, Jones MJ (2004) Robust real-time face detection. Int J Comput Vis 57(2):137–154
Wang H, Hussain MF, Mukherjee H, Obaidullah SM, Hegadi RS, Roy K, Santosh K (2018) An empirical study: Elm in face matching. In: International conference on recent trends in image processing and pattern recognition. Springer, pp 277–287
Yang B, Yan J, Lei Z, Li SZ (2014) Aggregate channel features for multi-view face detection. In: IEEE international joint conference on Biometrics (IJCB), pp 1–8
Yang S, Luo P, Loy CC, Tang X (2015) From facial parts responses to face detection: a deep learning approach. In: Proceedings of the IEEE International Conference on Computer Vision, pp 3676–3684
Yang S, Luo P, Loy CC, Tang X (2018) Faceness-net: Face detection through deep facial part responses. IEEE Trans Pattern Anal Mach Intell 40(8):1845–1859
Zhang K, Zhang Z, Li Z, Qiao Y (2016) Joint face detection and alignment using multitask cascaded convolutional networks. IEEE Signal Process Lett 23(10):1499–1503
Zhang S, Zhu X, Lei Z, Shi H, Wang X, Li SZ (2017) S3FD: Single shot scale-invariant face detector. arXiv:1708.05237
Zuiderveld K (1994) Contrast limited adaptive histogram equalization. Graphics Gems IV:474–485. Academic Press Professional, Inc.
Acknowledgments
Authors thank to the Ministry of Electronics and Information Technology (MeitY), New Delhi for granting Visvesvaraya Ph.D. fellowship through file no. PhD-MLA∖4(34)∖2014-15 Dated: 10/04/2015.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Sawat, D.D., Hegadi, R.S., Garg, L. et al. Pixel encoding for unconstrained face detection. Multimed Tools Appl 79, 35033–35054 (2020). https://doi.org/10.1007/s11042-020-08800-1
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-020-08800-1