
Part of the book series: Lecture Notes in Computer Science (LNCS, volume 13646)


Abstract

Facial expression recognition is a growing and important field with applications in medicine, security, education, and entertainment. While encouraging approaches have shown accurate results on a wide variety of datasets, in many cases it remains difficult to explain those results. To enable deployment of expression recognition applications in the wild, being able to explain why a particular expression is classified is an important task. Considering this, we propose to model flow-based latent representations of facial expressions, which allows us to further analyze the features and grants us more granular control over which features are produced for recognition. Our work focuses on posed facial expressions with a tractable density over the latent space. We investigate the behaviour of these tractable latent-space features in both subject-dependent and subject-independent expression recognition. We employ a flow-based generative approach with minimal supervision introduced during training and observe that traditional metrics give encouraging results. When subject-independent expressions are evaluated, a shift towards a stochastic nature in the probability space is observed. We evaluate our flow-based representation on the BU-EEG dataset, showing that our approach provides good separation of classes and results in more explainable recognition.
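To make the idea of a tractable, flow-based latent representation concrete, the following is a minimal sketch, not the authors' implementation (their note below points to a Glow implementation, rosinality/glow-pytorch). It uses a simplified RealNVP/Glow-style affine-coupling flow in PyTorch; the class names, the feature dimensionality, and the use of flat per-face feature vectors are all assumptions made for illustration.

import math
import torch
import torch.nn as nn

class AffineCoupling(nn.Module):
    """Affine coupling layer: transforms half the dimensions conditioned on
    the other half, so the Jacobian log-determinant stays tractable."""
    def __init__(self, dim, hidden=128):
        super().__init__()
        self.half = dim // 2
        self.net = nn.Sequential(
            nn.Linear(self.half, hidden), nn.ReLU(),
            nn.Linear(hidden, 2 * (dim - self.half)),
        )

    def forward(self, x):
        x1, x2 = x[:, :self.half], x[:, self.half:]
        log_s, t = self.net(x1).chunk(2, dim=1)
        log_s = torch.tanh(log_s)              # stabilise the scale term
        z2 = x2 * log_s.exp() + t
        return torch.cat([x1, z2], dim=1), log_s.sum(dim=1)

class SimpleFlow(nn.Module):
    """Stack of couplings; maps features x to latent z with an exact log-density."""
    def __init__(self, dim, n_layers=4):
        super().__init__()
        self.layers = nn.ModuleList(AffineCoupling(dim) for _ in range(n_layers))

    def forward(self, x):
        z, log_det = x, x.new_zeros(x.size(0))
        for layer in self.layers:
            z, ld = layer(z)
            log_det = log_det + ld
            z = torch.flip(z, dims=[1])        # cheap "permutation" between couplings
        return z, log_det

    def log_prob(self, x):
        z, log_det = self(x)
        # Standard-normal prior on z gives a tractable density for x.
        prior = (-0.5 * z.pow(2) - 0.5 * math.log(2 * math.pi)).sum(dim=1)
        return prior + log_det

# Usage sketch: fit the flow by maximum likelihood on expression features,
# then analyse or classify the resulting latent z (hypothetical setup).
feat_dim = 64
flow = SimpleFlow(feat_dim)
opt = torch.optim.Adam(flow.parameters(), lr=1e-3)
features = torch.randn(32, feat_dim)           # placeholder for per-face features
opt.zero_grad()
loss = -flow.log_prob(features).mean()         # negative log-likelihood
loss.backward()
opt.step()
with torch.no_grad():
    z, _ = flow(features)                      # tractable latent representation

Because the mapping is invertible and its density is exact, the latent features can be inspected directly (e.g. per-class densities or separation), which is the property the abstract relies on for explainability; the actual paper operates on posed facial expression images rather than the random placeholder features above.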


Notes

  1. https://github.com/rosinality/glow-pytorch.



Acknowledgement

This material is based upon work supported in part by the National Science Foundation under grant CNS-2039373. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation.

Author information


Corresponding author

Correspondence to Shaun Canavan.



Copyright information

© 2023 Springer Nature Switzerland AG

About this paper


Cite this paper

Aathreya, S., Canavan, S. (2023). Expression Recognition Using a Flow-Based Latent-Space Representation. In: Rousseau, JJ., Kapralos, B. (eds) Pattern Recognition, Computer Vision, and Image Processing. ICPR 2022 International Workshops and Challenges. ICPR 2022. Lecture Notes in Computer Science, vol 13646. Springer, Cham. https://doi.org/10.1007/978-3-031-37745-7_11


  • DOI: https://doi.org/10.1007/978-3-031-37745-7_11


  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-37744-0

  • Online ISBN: 978-3-031-37745-7

  • eBook Packages: Computer Science, Computer Science (R0)
