Abstract
Aiming at the problem of poor face detection performance under the classroom scenario with different angle of views, occlusion, and uneven distribution of face scales, a novel classroom face detection method based on the improved multi-task cascaded convolutional neural network (MTCNN) algorithm is proposed in this paper. Firstly, a deep residual feature generation module is introduced to improve the detection accuracy of small-scale faces by utilizing the characteristics of low-level fine granularity and converting the original poor features into high-resolution deformation features. Then, all parts involving landmarks are removed to get the simplified MTCNN model, which is combined with deep residual feature generation module to improve the detection speed while the accuracy is ensured. Finally, an up-and-down cropping strategy is employed to solve the problem of large population and uneven face scale in the classroom scenario. Experimental results demonstrate that the proposed method can achieve superior accuracy and efficiency over some state-of-the-art approaches for face detection on the FDDB dataset, as well as in the real classroom scenario.
Similar content being viewed by others
References
Gza, B., Yx, C.: Efficient face detection and tracking in video sequences based on deep learning. Inf. Sci. 568, 265–285 (2021)
Li, C., Li, R., Sun, J.: CNN face live detection algorithm based on binocular camera. J. Phys. Conf. Ser. 1881(2): 022015 (7pp) (2021)
Li, H., Lin, Z., Shen, X., Brandt, J., Hua, G.: A convolutional neural network cascade for face detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 5325–5334 (2015)
Ranjan, R., Patel, V.M., Chellappa, R.: A deep pyramid deformable part model for face detection. In: 2015 IEEE 7th International Conference on Biometrics Theory, Applications and Systems (BTAS), pp. 1–8 (2015)
Felzenszwalb, P.F., Girshick, R.B., McAllester, D., Ramanan, D.: Object detection with discriminatively trained part-based models. IEEE Trans. Pattern Anal. Mach. Intell. 32(9), 1627–1645 (2009)
Yang, S., Luo, P., Loy, C.C., Tang, X.: From facial parts responses to face detection: a deep learning approach. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 3676–3684 (2015)
Tan, T.H., Kuo, T.Y., Liu, H.: Intelligent lecturer tracking and capturing system based on face detection and wireless sensing technology. Sensors (Basel, Switzerland) 19(19) (2019)
Gupta, S.K., Ashwin, T.S., Guddeti, R.: Students’ affective content analysis in smart classroom environment using deep learning techniques. Multimed. Tools Appl. 78(18), 25321–25348 (2019)
Li, T.: Research on intelligent classroom attendance management based on feature recognition. J. Ambient Intell. Humaniz. Comput. 1–8 (2021)
Zhang, K., Zhang, Z., Li, Z., Qiao, Y.: Joint face detection and alignment using multitask cascaded convolutional networks. IEEE Signal Process. Lett. 23(10), 1499–1503 (2016)
Ma, L.H., Fan, H.Y., Lu, Z.M., et al.: Acceleration of multi-task cascaded convolutional networks. IET Image Process. 14(11), 2435–2441 (2020)
Du, J.: High-precision portrait classification based on MTCNN and its application on similarity judgement. J. Phys. Conf. Ser. 1518(1), 012066 (2020)
Bodla, N., Singh, B., Chellappa, R., Davis, L. S.: Soft-NMS-improving object detection with one line of code. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 5561–5569 (2017)
Ku, H., Dong, W.: Face recognition based on MTCNN and convolutional neural network. Front. Signal Process. 4(1), 37–42 (2020)
Mo, H., Liu, L., Zhu, W., Li, Q., Liu, H., Yin, S., Wei, S.: A multi-task hardwired accelerator for face detection and alignment. IEEE Trans. Circuits Syst. Video Technol. 30(11), 4284–4298 (2019)
Li, J., Liang, X., Wei, Y., Xu, T., Feng, J., Yan, S.: Perceptual generative adversarial networks for small object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1222–1230 (2017)
Luo, J., Liu, J., Lin, J., Wang, Z.: A lightweight face detector by integrating the convolutional neural network with the image pyramid. Pattern Recognit. Lett. 133, 180–187 (2020)
Glorot, X., Bordes, A., Bengio, Y.: Deep sparse rectifier neural networks. In: Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics, pp. 315–323 (2011)
Hassaballah, M., Murakami, K., Ido, S.: Face detection evaluation: a new approach based on the golden ratio Phi. Signal Image Video Process 7(2), 307–316 (2013)
Ho, Y., Wookey, S.: The real-world-weight cross-entropy loss function: modeling the costs of mislabeling. IEEE Access 8, 4806–4813 (2019)
Wang, L., Zhang, Y., Feng, J.: On the Euclidean distance of images. IEEE Trans. Pattern Anal. Mach. Intell. 27(8), 1334–1339 (2005)
Yang, S., Luo, P., Loy, C.C., Tang, X.: A face detection benchmark. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 5525–5533 (2016)
Jain, V., Learned-Miller, E.: Fddb: a benchmark for face detection in unconstrained settings. 2(5) (2010)
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Gu, M., Liu, X. & Feng, J. Classroom face detection algorithm based on improved MTCNN. SIViP 16, 1355–1362 (2022). https://doi.org/10.1007/s11760-021-02087-x
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11760-021-02087-x