Abstract
In recent years, deep learning neural frameworks have been given significant attention in programming development, especially in machine learning, machine vision and artificial intelligence (AI). The ability to detect faces has inspired many researchers because a human face shows great dissimilarities in form and figure due to changes in position and expression in different situations. This research aims to produce a method of applying recurrent neural network (RNN) designs using long short-term memory (LSTM) to identify facial expressions. The proposed method involves an improved RNN that uses LSTM to increase the effectiveness of the feature extraction process using input sets which regenerate the input-data from the features. The accuracy and computing time of this technique were studied. With LSTM-RNNs, the results show that the design gives enhanced outcomes compared with other methods, including most image/video face detection methods. The efficiency evaluation of LSTM-RNNs in images and in video frame series shows that there are performance improvements of more than 5% compared with traditional neural networks.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Ahmed, H.A., Rashid, T.A., Sidiq, A.T.: Face behavior recognition through support vector machines. Int. J. Adv. Comput. Sci. Appl. 7(1), 101–108 (2016)
Verburg, M., Menkovski, V.: Micro-expression detection in long videos using optical flow and recurrent neural networks. arXiv:1903.10765, vol. v1 (2019)
Sang, D.V., Van Dat, N., Thuan, D.P.: Facial expression recognition using deep convolutional neural networks. In: Proceedings of 2017 9th International Conference on Knowledge and Systems Engineering, KSE 2017, vol. 2017, pp. 130–135 (2017)
Minaee, S., Abdolrashidi, A.: Deep-Emotion: Facial Expression Recognition Using Attentional Convolutional Network (2019)
Ebrahimi Kahou, S., Michalski, V., Konda, K., Memisevic, R., Pal, C.: Recurrent neural networks for emotion recognition in video. In: Proceedings of the 2015 ACM on International Conference on Multimodal Interaction - ICMI 2015, pp. 467–474 (2015)
Gu, J., Yang, X., De Mello, S., Kautz, J.: Dynamic facial analysis: from Bayesian filtering to recurrent neural network. In: Proceedings - 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017, vol. 2017, pp. 1531–1540 (2017)
Jung, H., Lee, S., Yim, J., Park, S., Kim, J.: Joint fine-tuning in deep neural networks for facial expression recognition. In: Proceedings of the IEEE International Conference on Computer Vision, vol. 2015, pp. 2983–2991 (2015)
Jeyaraj, P.R., Samuel Nadar, E.R.: Computer-assisted medical image classification for early diagnosis of oral cancer employing deep learning algorithm. J. Cancer Res. Clin. Oncol. 145, 829–837 (2019)
Mohsen, H., El-Dahshan, E.S.A., El-Horbaty, E.S.M., Salem, A.B.M.: Classification using deep learning neural networks for brain tumors. Future Comput. Inform. J. 3(1), 68–71 (2018)
Aubreville, M., et al.: Automatic classification of cancerous tissue in laserendomicroscopy images of the oral cavity using deep learning. Sci. Rep. 7(1), 11979 (2017)
Antonio, V.A.A., Ono, N., Saito, A., Sato, T., Altaf-Ul-Amin, M., Kanaya, S.: Classification of lung adenocarcinoma transcriptome subtypes from pathological images using deep convolutional networks. Int. J. Comput. Assist. Radiol. Surg. 13(12), 1905–1913 (2018)
Graves, A., Schmidhuber, J.: Offline handwriting recognition with multidimensional recurrent neural networks. In: NIPS (2009)
Bell, S., Zitnick, C.L., Bala, K., et al.: Inside-Outside Net: Detecting Objects in Context with Skip Pooling and Recurrent Neural Networks, arXiv preprint arXiv:1512.04143 (2015)
Graves, A., Mohamed, A., Hinton, G.: Speech recognition with deep recurrent neural networks. In: Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 6645–6649 (2013)
Graves, A., Jaitly, N.: Towards end-to-end speech recognition with recurrent neural networks. In: Proceedings of International Conference on Machine Learning, pp. 1764–1772 (2014)
Mikolov, T., Karafiát, M., Burget, L., Cernocký, J., Khudanpur, S.: Recurrent neural network based language model. In: Proceedings of INTERSPEECH, vol. 2, pp. 1045–1048 (2010)
Sanin, A., Sanderson, C., Harandi, M.T., Lovell, B.C.: Spatiotemporal covariance descriptors for action and gesture recognition. In: IEEE Workshop on Applications of Computer Vision (2013)
Zhang, T., Zheng, W., Cui, Z., Zong, Y., Li, Y.: Spatial-temporal recurrent neural network for emotion recognition. IEEE Trans. Cybern. arXiv:1705.04515 (99), 1–9 (2018)
Yang et al.: FER using WMDNN based on double-channel facial images. IEEE Access 6, 4630–4640 (2016). [8]
Yao, A., Cai, D., Hu, P., Wang, S., Shan, L., Chen, Y.: HoloNet: towards robust emotion recognition in the wild (2016)
Khorrami, P., Paine, T.L., Brady, K., Dagli, C., Huang, T.S.: How deep neural networks can improve emotion recognition on video data. In: IEEE Conference on Image Processing (ICIP) (2016)
Jain, D.K., Kumar, R., Jain, N.: Decision-based spectral embedding approach for identifying facial behaviour on RGB-D images. In: Modi, N., Verma, P., Trivedi, B. (eds.) Proceedings of International Conference on Communication and Networks. AISC, vol. 508, pp. 677–687. Springer, Singapore (2017). https://doi.org/10.1007/978-981-10-2750-5_69
Jain, D.K., Zhang, Z., Huang, K.: Hybrid patch based diagonal pattern geometric appearance model for facial expression recognition. In: Zhang, Z., Huang, K. (eds.) IVS 2016. CCIS, vol. 664, pp. 107–113. Springer, Singapore (2016). https://doi.org/10.1007/978-981-10-3476-3_13
Chernykh, V., Sterling, G., Prihodko, P.: Emotion Recognition From Speech With Recurrent Neural Networks, arXiv:1701.08071v1 [cs.CL] (2017)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Salih, W.M., Nadher, I., Tariq, A. (2020). Deep Learning for Face Expressions Detection: Enhanced Recurrent Neural Network with Long Short Term Memory. In: Khalaf, M., Al-Jumeily, D., Lisitsa, A. (eds) Applied Computing to Support Industry: Innovation and Technology. ACRIT 2019. Communications in Computer and Information Science, vol 1174. Springer, Cham. https://doi.org/10.1007/978-3-030-38752-5_19
Download citation
DOI: https://doi.org/10.1007/978-3-030-38752-5_19
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-38751-8
Online ISBN: 978-3-030-38752-5
eBook Packages: Computer ScienceComputer Science (R0)