Skip to main content
Log in

Classroom face detection algorithm based on improved MTCNN

  • Original Paper
  • Published:
Signal, Image and Video Processing Aims and scope Submit manuscript

Abstract

Aiming at the problem of poor face detection performance under the classroom scenario with different angle of views, occlusion, and uneven distribution of face scales, a novel classroom face detection method based on the improved multi-task cascaded convolutional neural network (MTCNN) algorithm is proposed in this paper. Firstly, a deep residual feature generation module is introduced to improve the detection accuracy of small-scale faces by utilizing the characteristics of low-level fine granularity and converting the original poor features into high-resolution deformation features. Then, all parts involving landmarks are removed to get the simplified MTCNN model, which is combined with deep residual feature generation module to improve the detection speed while the accuracy is ensured. Finally, an up-and-down cropping strategy is employed to solve the problem of large population and uneven face scale in the classroom scenario. Experimental results demonstrate that the proposed method can achieve superior accuracy and efficiency over some state-of-the-art approaches for face detection on the FDDB dataset, as well as in the real classroom scenario.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

References

  1. Gza, B., Yx, C.: Efficient face detection and tracking in video sequences based on deep learning. Inf. Sci. 568, 265–285 (2021)

    Article  Google Scholar 

  2. Li, C., Li, R., Sun, J.: CNN face live detection algorithm based on binocular camera. J. Phys. Conf. Ser. 1881(2): 022015 (7pp) (2021)

  3. Li, H., Lin, Z., Shen, X., Brandt, J., Hua, G.: A convolutional neural network cascade for face detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 5325–5334 (2015)

  4. Ranjan, R., Patel, V.M., Chellappa, R.: A deep pyramid deformable part model for face detection. In: 2015 IEEE 7th International Conference on Biometrics Theory, Applications and Systems (BTAS), pp. 1–8 (2015)

  5. Felzenszwalb, P.F., Girshick, R.B., McAllester, D., Ramanan, D.: Object detection with discriminatively trained part-based models. IEEE Trans. Pattern Anal. Mach. Intell. 32(9), 1627–1645 (2009)

    Article  Google Scholar 

  6. Yang, S., Luo, P., Loy, C.C., Tang, X.: From facial parts responses to face detection: a deep learning approach. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 3676–3684 (2015)

  7. Tan, T.H., Kuo, T.Y., Liu, H.: Intelligent lecturer tracking and capturing system based on face detection and wireless sensing technology. Sensors (Basel, Switzerland) 19(19) (2019)

  8. Gupta, S.K., Ashwin, T.S., Guddeti, R.: Students’ affective content analysis in smart classroom environment using deep learning techniques. Multimed. Tools Appl. 78(18), 25321–25348 (2019)

    Article  Google Scholar 

  9. Li, T.: Research on intelligent classroom attendance management based on feature recognition. J. Ambient Intell. Humaniz. Comput. 1–8 (2021)

  10. Zhang, K., Zhang, Z., Li, Z., Qiao, Y.: Joint face detection and alignment using multitask cascaded convolutional networks. IEEE Signal Process. Lett. 23(10), 1499–1503 (2016)

    Article  Google Scholar 

  11. Ma, L.H., Fan, H.Y., Lu, Z.M., et al.: Acceleration of multi-task cascaded convolutional networks. IET Image Process. 14(11), 2435–2441 (2020)

    Article  Google Scholar 

  12. Du, J.: High-precision portrait classification based on MTCNN and its application on similarity judgement. J. Phys. Conf. Ser. 1518(1), 012066 (2020)

    Article  Google Scholar 

  13. Bodla, N., Singh, B., Chellappa, R., Davis, L. S.: Soft-NMS-improving object detection with one line of code. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 5561–5569 (2017)

  14. Ku, H., Dong, W.: Face recognition based on MTCNN and convolutional neural network. Front. Signal Process. 4(1), 37–42 (2020)

    Article  Google Scholar 

  15. Mo, H., Liu, L., Zhu, W., Li, Q., Liu, H., Yin, S., Wei, S.: A multi-task hardwired accelerator for face detection and alignment. IEEE Trans. Circuits Syst. Video Technol. 30(11), 4284–4298 (2019)

    Article  Google Scholar 

  16. Li, J., Liang, X., Wei, Y., Xu, T., Feng, J., Yan, S.: Perceptual generative adversarial networks for small object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1222–1230 (2017)

  17. Luo, J., Liu, J., Lin, J., Wang, Z.: A lightweight face detector by integrating the convolutional neural network with the image pyramid. Pattern Recognit. Lett. 133, 180–187 (2020)

    Article  Google Scholar 

  18. Glorot, X., Bordes, A., Bengio, Y.: Deep sparse rectifier neural networks. In: Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics, pp. 315–323 (2011)

  19. Hassaballah, M., Murakami, K., Ido, S.: Face detection evaluation: a new approach based on the golden ratio Phi. Signal Image Video Process 7(2), 307–316 (2013)

    Article  Google Scholar 

  20. Ho, Y., Wookey, S.: The real-world-weight cross-entropy loss function: modeling the costs of mislabeling. IEEE Access 8, 4806–4813 (2019)

    Article  Google Scholar 

  21. Wang, L., Zhang, Y., Feng, J.: On the Euclidean distance of images. IEEE Trans. Pattern Anal. Mach. Intell. 27(8), 1334–1339 (2005)

    Article  Google Scholar 

  22. Yang, S., Luo, P., Loy, C.C., Tang, X.: A face detection benchmark. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 5525–5533 (2016)

  23. Jain, V., Learned-Miller, E.: Fddb: a benchmark for face detection in unconstrained settings. 2(5) (2010)

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Meihua Gu.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Gu, M., Liu, X. & Feng, J. Classroom face detection algorithm based on improved MTCNN. SIViP 16, 1355–1362 (2022). https://doi.org/10.1007/s11760-021-02087-x

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11760-021-02087-x

Keywords

Navigation