Abstract
Inthis paper, we present a new approach for emotion recognition from facial images. The proposed method is based on the association of a pretrained convolutional neural network (CNN) model (VGG16, ResNet50) with a multilayer perceptron (MLP) classifier. The pretrained CNN model is used as a feature extractor. For this purpose, we adapt the original architecture by adding a global average pooling layer (GAP) without any fine tuning of the network parameters. In order to avoid overfitting for the MLP classifier, we introduce the early stopping criterion. It is proved that the aforementioned elements contribute in improving the performance of our approach in terms of generalization ability. The procedure for emotion recognition from facial images is applied on the CK+ (extended Cohen-Kanad), JAFFE (Japanese Female Facial Expression) and KDEF (Karolinska Directed Emotional Faces) databases. The k-fold cross validation procedure is used for accuracy estimation. The experimental results show the effectiveness of our facial emotion recognition (FER) approach compared to the existing methods yielding to recognition rates of 100%, 96.40% and 98.78% for the CK+, JAFFE and KDEF databases, respectively. On the other hand, further improvement of our recognition performance is obtained for images from the JAFFE database by performing a data augmentation during the training phase. This allows to achieve an accuracy of 100% for this database. Four other metrics, namely the F1-score, positive and negative likelihood ratios and Mathews correlation coefficient, confirm as well the classification results obtained from accuracy.
Similar content being viewed by others
References
Al Taee EJ, Jasim QM (2020) Blurred facial expression recognition system by using convolution neural network. Webology, 17(2)
Bettadapura V (2012) Face expression recognition and analysis: the state of the art. arXiv:1203.6722
Cai J, Chang O, Tang X, Xue C et al (2018) Wei: facial expression recognition method based on sparse batch normalization CNN. In: Proceedings of 37th Chinese control conference. Wuhan
Caleanu CD (2013) Fase expression recognition: a brief overview of the last decade. In: 2013 IEEE 8th International symposium applied computational intelligence and informatics (SACI), pp 157–161
Chicco D, Jurman G (2020) The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation. BMC Genomics, 21. https://doi.org/10.1186/s12864-019-6413-7
Christopher M (2006) Bishop: pattern recognition and machine learning. Springer
Daniel B (2018) Cross-Validation, https://doi.org/10.1016/B978-0-12-809633-8.20349-X
Dimitrievska V, Ackovska N (2020) Behavior models of emotion-featured robots: a survey. J Intell Robot Syst 100:1031–1053. https://doi.org/10.1007/s10846-020-01219-8
Ekman P, Friesen W (1978) Facial action coding system: a technique for the measurement of facial movement. Consulting Psychologists Press, Palo Alto
Eng SK, Ali H, Cheah AY, Chong YF (2019) Facial expression recognition in JAFFE and KDEF Datasets using histogram of oriented gradients and support vector machine. In: 5th International conference on man machine systems, IOP conf. series: materials science and engineering, vol 705, p 012031
Furlán F, Rubio E, Sossa H, Ponce V (2020) CNN based detectors on planetary environments: a performance evaluation. Frontiers Neurorob 14:85
He K, Zhang X, Ren S, Sun J (2015) Deep residual learning for image recognition. arXiv:1512.03385v1
Hinz T, Barros P, Wermter S (2016) The eects of regularization on learning facial expressions with convolutional neural networks. In: Proceedings of the 25th international conference on articial neural networks (ICANN), Part II, LNCS 9887. Barcelona, pp 80–87
Hung JC, Chang J-W (2021) Multi-level transfer learning for improving the performance of deep neural networks: theory and practice from the tasks of facial emotion recognition and named entity recognition. Appl Soft Comput 109:107491. ISSN 1568-4946, https://doi.org/10.1016/j.asoc.2021.107491
Iwendi C, Srivastava G, Khan SS et al (2020) Cyberbullying detection solutions based on deep learning architectures. Multimedia Systems. https://doi.org/10.1007/s00530-020-00701-5
Jain N, Kumar S, Kumar A, Shamsolmoali P, Zareapoor M (2018) Hybrid deep neural networks for face emotion recognition. Pattern Recogn Lett 115:101–106
Jain DK, Shamsolmoali P, Sehdev P (2019) Extended deep neural network for facial emotion recognition. Pattern Recogn Lett 120:69–74
Kar NB, Babu KS, Sangaiah AK, Bakshi S (2019) Face expression recognition system based on ripplet transform type II and least square SVM. Multimed Tools Applic 78:4789–4812
Kaur J, Sharma A (2017) Performance analysis of face detection by using Viola-Jones algorithm. Int J Comput Intell Res ISSN 0973-1873 13 (5):707–717
Kim J-H, Kim B-G, Roy PP, Jeong D-M (2019) Efficient facial expression recognition algorithm based on hierarchical deep neural network structure. IEEE Access 7:41273–41285. https://doi.org/10.1109/ACCESS.2019.2907327
Kingma DP, Ba J (2015) Adam: a method for stochastic optimization. In: International conference on learning representations (ICLR). arXiv:1412.6980v9
Kwong JCT, Garcia FCC, Abu PAR, Reyes RSJ (2018) Emotion recognition via facial expression: utilization of numerous feature descriptors in different machine learning algorithms. In: Proceedings of TENCON 2018 - 2018 IEEE Region 10 conference. Jeju, pp 28–31
Lauer F, Bentoumi M, Bloch G, Millérioux G, Aknin P (2004) Ho-Kashyap with early stopping vs soft margin SVM for linear classifiers - an application, lecture notes in computer science, vol 3173/2004. Springer, Heidelberg, pp 524–530
LeCun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521:436–44. https://doi.org/10.1038/nature14539
Lekdioui K, Messoussi R, Ruichek Y, Chaabi Y, Touahni R (2017) Facial decomposition for expression recognition using texture/shape descriptors and svm classifer. Signal Processing: Image Communication. https://doi.org/10.1016/j.image.2017.08.001
Li S, Deng W (2018) Deep facial expression recognition: a survey. arXiv:1804.08348v2, 22
Li J, Jin K, Zhou D, Kubota N, Ju Z (2020) Attention mechanism-based CNN for facial expression recognition. Neurocomputing 411:340–350
Liliana DY (2019) Emotion recognition from facial expression using deep convolutional neural network. IOP Conf. Series: Journal of Physics: Conf., Series 1193. IOP Publishing, pp 012004. https://doi.org/10.1088/1742-6596/1193/1/012004
Lima E, Sun X, Dong J, Wang H, Yang Y, Liu L (2017) Learning and transferring convolutional neural network knowledge to ocean front recognition. IEEE Geosci Remote Sens Lett 14(3):354–358
Lin M, Chen Q, Yan S (2014) Network in network. arXiv:1312.4400v3
Liu Y, Li Y, Ma X, Song R (2017) Facial expression recognition with fusion features extracted from salient facial areas. Sensors 17:712. https://doi.org/10.3390/s17040712
Lopes AT, de Aguiar E, De Souza AF, Oliveira-Santos T et al (2017) Facial expression recognition with convolutional neural networks: coping with few data and the training sample order. Pattern Recogn 61:610–628. https://doi.org/10.1016/j.patcog.2016.07.026
Lucey P, Cohn JF, Kanade T, Saragih J, Ambadar Z, Matthews I (2010) The extended Cohn-Kanade dataset (CK+): a complete dataset for action unit and emotion-specified expression. In: 2010 IEEE Computer society conference on computer vision and pattern recognition - workshops. Juin, pp 94–101. https://doi.org/10.1109/CVPRW.2010.5543262
Lundqvist D, Flykt A, Ohman A (1998) The Karolinska directed emotional faces – KDEF, CD ROM from Department of Clinical Neuroscience, Psychology section, Karolinska Institutet ISBN 91-630-7164-9
Lyons M, Akamatsu S, Kamachi M, Gyoba J (1998) Coding facial expressions with gabor wavelets. In: Proceedings third IEEE international conference on automatic face and gesture recognition. IEEE, pp 200–205
Mavani V, Raman S, Miyapuram KP (2017) Facial expression recognition using visual saliency and deep learning. IEEE International conference on computer vision workshops (ICCVW)
Mohammadpour M, Khaliliardali H, Hashemi SMR, AlyanNezhadi MN et al (2017) facial emotion recognition using deep convolutional networks. In: 2017 IEEE 4th International conference on knowledge-based engineering and innovation (KBEI)
Nwankpa CE, Ijomah W, Gachagan A, Marshall S (2018) Activation functions: comparison of trends in practice and research for deep learning. arXiv:1811.03378v1
Oquab M, Bottou L, Laptev I, Sivic J (2014) Learning and transferring mid-level image representations using convolutional neural networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), pp 1717–1724
Pandey S (2014) Review and comparison of face detection algorithms. Int J Comput Sci Inf Technol 5(3):4111–4117
Punyani P, Gupta R, Kumar A (2020) Neural networks for facial age estimation: a survey on recent advances. Artif Intell Rev 53:3299–3347. https://doi.org/10.1007/s10462-019-09765-w
Ruder S (2017) An overview of gradient descent optimization algorithms. arXiv:1609.04747v2
Ruiz-Garcia A, Elshaw M, Altahhan A, Palade V (2018) A hybrid deep learning neural approach for emotion recognition from facial expressions for socially assistive robots. Neural Comput Applic 29:359–373
Russakovsky O, Deng J, Su H, Krause J, Satheesh S, Ma S, Huang Z, Karpathy A, Khosla A, Bernstein M, Berg AC, FeiFei L (2015) ImageNet large scale visual recognition challenge. Int J Comput Vis (IJCV) 115 (3):211–252
Sabri M, Kurita T (2018) Improvement of feature localization for facial expressions by adding noise. Int J Affect Eng 17 (n∘ 1):27–37
Said Y, Barr M (2021) Human emotion recognition based on facial expressions via deep learning on high-resolution images. Multimedia Tools and Applications. https://doi.org/10.1007/s11042-021-10918-9
Sarvamangala DR, Kulkarni RV (2021) Convolutional neural networks in medical image understanding: a survey. Evolutionary Intelligence. https://doi.org/10.1007/s12065-020-00540-3
Simonyan K, Zisserman A (2015) Very deep convolutional networks for large-scale image recognition. arXiv:1409.1556v6
Sokolova M, Japkowicz N, Szpakowicz S (2006) Beyond accuracy, F-Score and ROC: a family of discriminant measures for performance evaluation. In: Sattar A, Kang B (eds) AI 2006: advances in artificial intelligence lecture notes in computer science. https://doi.org/10.1007/11941439_114, vol 4304. Springer, Berlin
Szegedy C, Liu W, Jia Y, Sermanet P, Reed S, Anguelov D, Erhan D, Vanhoucke V, Rabinovich A (2015) Going deeper with convolutions. Computer Vision and Pattern Recognition (CVPR). arXiv:1409.4842
Tharwat A (2021) Classification assessment methods. Appl Comput Inform 17(1):168–192. https://doi.org/10.1016/j.aci.2018.08.003
Ting KM (2011) Confusion matrix. In: Sammut C, Webb GI (eds) Encyclopedia of machine learning. https://doi.org/10.1007/978-0-387-30164-8_157. Springer, Boston
Viola P, Jones M (2001) Rapid object detection using a boosted cascade of simple features. IEEE Comput Soc Conf Comput Vis Pattern Recogn(CVPR’01) 1:511–518
Yang B, Li Z, Cao E (2020) Facial Expression Recognition Based on Multi-dataset. Neur Netw Radioeng 29 (n∘):1
Yao X, Wang X, Wang S-H, Zhang Y-D (2020) A comprehensive survey on convolutional neural network in medical image analysis. Multimedia Tools and Applications. https://doi.org/10.1007/s11042-020-09634-7
Ye Y, Zhang X, Lin Y, Wang H (2019) Facial expression recognition via region-based convolutional fusion network. J Vis Commun Image, R 62:1–11. https://doi.org/10.1016/j.jvcir.2019.04.009
Yolcu G, Oztel I, Kazan S, Oz C, Palaniappan K, Lever TE, Bunyak F (2019) Facial expression recognition for monitoring neurological disorders based on convolutional neural network. Multimed Tools Applic 78:31581–31603
Zhang T, Yu B (2005) Boosting with early stopping: convergence and consistency. The Annals of Statistics, Institute of Mathematical Statistics 33:1538–1579
Zhou B, Khosla A, Lapedriza A, Oliva A, Torralba A (2015) Learningdeep features for discriminative localization. arXiv:1512.04150v1
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Bentoumi, M., Daoud, M., Benaouali, M. et al. Improvement of emotion recognition from facial images using deep learning and early stopping cross validation. Multimed Tools Appl 81, 29887–29917 (2022). https://doi.org/10.1007/s11042-022-12058-0
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-022-12058-0