skip to main content
research-article

Meta-MMFNet: Meta-learning-based Multi-model Fusion Network for Micro-expression Recognition

Published:25 September 2023Publication History
Skip Abstract Section

Abstract

Despite its wide applications in criminal investigations and clinical communications with patients suffering from autism, automatic micro-expression recognition remains a challenging problem because of the lack of training data and imbalanced classes problems. In this study, we proposed a meta-learning-based multi-model fusion network (Meta-MMFNet) to solve the existing problems. The proposed method is based on the metric-based meta-learning pipeline, which is specifically designed for few-shot learning and is suitable for model-level fusion. The frame difference and optical flow features were fused, deep features were extracted from the fused feature, and finally in the meta-learning-based framework, weighted sum model fusion method was applied for micro-expression classification. Meta-MMFNet achieved better results than state-of-the-art methods on four datasets. The code is available at https://github.com/wenjgong/meta-fusion-based-method.

REFERENCES

  1. [1] Altae-Tran Han, Ramsundar Bharath, Pappu Aneesh S., and Pande Vijay S.. 2016. Low data drug discovery with one-shot learning. Retrieved from http://arxiv.org/abs/1611.03199.Google ScholarGoogle Scholar
  2. [2] Chen Xiaozhi, Ma Huimin, Wan Ji, Li Bo, and Xia Tian. 2017. Multi-view 3D object detection network for autonomous driving. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR’17). IEEE Computer Society, 65266534. Google ScholarGoogle ScholarCross RefCross Ref
  3. [3] Chen Yinbo, Wang Xiaolong, Liu Zhuang, Xu Huijuan, and Darrell Trevor. 2020. A new meta-baseline for few-shot learning. Retrieved from https://arxiv.org/abs/2003.04390.Google ScholarGoogle Scholar
  4. [4] Cootes Timothy F., Taylor Christopher J., Cooper David H., and Graham Jim. 1995. Active shape models-their training and application. Comput. Vision Image Understand. 61, 1 (1995), 3859.Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. [5] Ekman P.. Lie Catching and Microexpressions. The Philosophy of Deception. 118133.Google ScholarGoogle Scholar
  6. [6] Finn Chelsea, Abbeel Pieter, and Levine Sergey. 2017. Model-Agnostic meta-learning for fast adaptation of deep networks. In Proceedings of the 34th International Conference on Machine Learning (ICML’17) (Proceedings of Machine Learning Research), Precup Doina and Teh Yee Whye (Eds.), Vol. 70. PMLR, 11261135. Retrieved from http://proceedings.mlr.press/v70/finn17a.html.Google ScholarGoogle Scholar
  7. [7] Finn Chelsea, Abbeel Pieter, and Levine Sergey. 2017. Model-Agnostic meta-learning for fast adaptation of deep networks. In Proceedings of the 34th International Conference on Machine Learning (ICML’17). JMLR.org, 11261135.Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. [8] Gan Yee Siang, Liong Sze-Teng, Yau Wei-Chuen, Huang Yen-Chang, and Ken Tan Lit. 2019. OFF-ApexNet on micro-expression recognition system. Signal Process. Image Commun. 74 (2019), 129139. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. [9] Guan Jiazhi and Shen Dongyao. 2021. Transfer spatio-temporal knowledge from emotion-related tasks for facial expression spotting. In Proceedings of the 1st Workshop on Facial Micro-Expression: Advanced Techniques for Facial Expressions Generation and Spotting (FME’21), Cheng Wen-Huang, Li Jingting, and Yap Moi Hoon (Eds.). ACM, 1924. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. [10] Han Xian-Hua, Zheng Yinqiang, and Chen Yen-Wei. 2022. Hyperspectral image reconstruction using multi-scale fusion learning. ACM Trans. Multim. Comput. Commun. Appl. 18, 1 (2022), 16:1–16:21. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. [11] Happy S. L. and Routray Aurobinda. 2019. Fuzzy histogram of optical flow orientations for micro-expression recognition. IEEE Trans. Affect. Comput. 10, 3 (2019), 394406. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. [12] Huang Gao, Liu Zhuang, and Weinberger Kilian Q.. 2016. Densely connected convolutional networks. Retrieved from http://arxiv.org/abs/1608.06993.Google ScholarGoogle Scholar
  13. [13] Huang Xiaohua, Wang Sujing, Liu Xin, Zhao Guoying, Feng Xiaoyi, and Pietikäinen Matti. 2019. Discriminative spatiotemporal local binary pattern with revisited integral projection for spontaneous facial micro-expression recognition. IEEE Trans. Affect. Comput. 10, 1 (2019), 3247. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. [14] Huang Xiaohua, Zhao Guoying, Hong Xiaopeng, Zheng Wenming, and Pietikäinen Matti. 2016. Spontaneous facial micro-expression analysis using spatiotemporal completed local quantized patterns. Neurocomputing 175 (2016), 564578. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. [15] Khor Huai-Qian, See John, Phan Raphael Chung-Wei, and Lin Weiyao. 2018. Enriched long-term recurrent convolutional network for facial micro-expression recognition. In Proceedings of the 13th IEEE International Conference on Automatic Face and Gesture Recognition (FG’18). IEEE Computer Society, 667674. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. [16] Kim Dae Hoe, Baddar Wissam J., and Ro Yong Man. 2016. Micro-Expression recognition with expression-state constrained spatio-temporal feature representations. In Proceedings of the ACM Conference on Multimedia Conference (MM’16), Hanjalic Alan, Snoek Cees, Worring Marcel, Bulterman Dick C. A., Huet Benoit, Kelliher Aisling, Kompatsiaris Yiannis, and Li Jin (Eds.). ACM, 382386. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. [17] Li Jing, Wang Yandan, See John, and Liu Wenbin. 2019. Micro-expression recognition based on 3D flow convolutional neural network. Pattern Anal. Appl. 22, 4 (2019), 13311339. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. [18] Li Qiuyu, Zhan Shu, Xu Liangfeng, and Wu Congzhong. 2019. Facial micro-expression recognition based on the fusion of deep learning and enhanced optical flow. Multim. Tools Appl. 78, 20 (2019), 2930729322. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. [19] Li Xiaobai, Pfister Tomas, Huang Xiaohua, Zhao Guoying, and Pietikäinen Matti. 2013. A Spontaneous Micro-expression Database: Inducement, collection and baseline. In Proceedings of the 10th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition (FG’13). IEEE Computer Society, 16. Google ScholarGoogle ScholarCross RefCross Ref
  20. [20] Li Yante, Huang Xiaohua, and Zhao Guoying. 2021. Joint local and global information learning with single Apex frame detection for micro-expression recognition. IEEE Trans. Image Process. 30 (2021), 249263. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. [21] Lin Chenhan, Long Fei, Huang JianMing, and Li Jun. 2018. Micro-expression recognition based on spatiotemporal gabor filters. In Proceedings of the 8th International Conference on Information Science and Technology (ICIST’18). IEEE, 487491.Google ScholarGoogle ScholarCross RefCross Ref
  22. [22] Liong Sze-Teng, See John, Phan Raphael Chung-Wei, and Wong KokSheik. 2016. Less is more: Micro-expression recognition from video using Apex frame. Retrieved from http://arxiv.org/abs/1606.01721.Google ScholarGoogle Scholar
  23. [23] Liong Sze-Teng, See John, Wong KokSheik, and Phan Raphael C.-W.. 2018. Less is more: Micro-expression recognition from video using apex frame. Signal Process. Image Commun. 62 (2018), 8292. Google ScholarGoogle ScholarCross RefCross Ref
  24. [24] Liu Hanxiao, Simonyan Karen, and Yang Yiming. 2019. DARTS: Differentiable architecture search. In Proceedings of the 7th International Conference on Learning Representations (ICLR’19). Retrieved from https://openreview.net/forum?id=S1eYHoC5FX.Google ScholarGoogle Scholar
  25. [25] Liu Yuchi, Du Heming, Zheng Liang, and Gedeon Tom. 2019. A neural micro-expression recognizer. In Proceedings of the 14th IEEE International Conference on Automatic Face and Gesture Recognition (FG’19). IEEE, 14. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. [26] Liu Yishu, Liu Yingbin, and Ding Liwang. 2018. Scene classification based on two-stage deep feature fusion. IEEE Geosci. Remote. Sens. Lett. 15, 2 (2018), 183186. Google ScholarGoogle ScholarCross RefCross Ref
  27. [27] Lu Hua, Kpalma Kidiyo, and Ronsin Joseph. 2018. Motion descriptors for micro-expression recognition. Signal Process. Image Commun. 67 (2018), 108117. Google ScholarGoogle ScholarCross RefCross Ref
  28. [28] Lu Zhaoyu, Luo Ziqi, Zheng Huicheng, Chen Jikai, and Li Weihong. 2014. A Delaunay-based temporal coding model for micro-expression recognition. In Proceedings of the Asian Conference on Computer Vision (ACCV’14) (Lecture Notes in Computer Science), Jawahar C. V. and Shan Shiguang (Eds.), Vol. 9009. Springer, 698711. Google ScholarGoogle ScholarCross RefCross Ref
  29. [29] Ma Haoyu, Gong Bingchen, and Yu Yizhou. 2022. Structure-aware meta-fusion for image super-resolution. ACM Trans. Multim. Comput. Commun. Appl. 18, 2 (2022), 60:1–60:25. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. [30] Metz Luke, Maheswaranathan Niru, Cheung Brian, and Sohl-Dickstein Jascha. 2019. Meta-learning update rules for unsupervised representation learning. In Proceedings of the 7th International Conference on Learning Representations (ICLR’19). Retrieved from https://openreview.net/forum?id=HkNDsiC9KQ.Google ScholarGoogle Scholar
  31. [31] Miao Yu, Dong Haiwei, Al’Jaam Jihad Mohamad, and El-Saddik Abdulmotaleb. 2019. A deep learning system for recognizing facial expression in real-time. ACM Trans. Multim. Comput. Commun. Appl. 15, 2 (2019), 33:1–33:20. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. [32] Morris Michael W. and Keltner Dacher. 2000. How emotions work: The social functions of emotional expression in negotiations. Res. Organiz. Behav. 22 (2000), 150. Google ScholarGoogle ScholarCross RefCross Ref
  33. [33] Nguyen Nhi Thi Thu, Nguyen Duyen Thi Thu, and Pham Bao The. 2021. Micro-expression recognition based on the fusion between optical flow and dynamic image. In Proceedings of the 5th International Conference on Machine Learning and Soft Computing (ICMLSC’21). ACM, 115120. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. [34] Niu Mingyue, Li Ya, Tao Jianhua, and Wang Su-Jing. 2018. Micro-expression recognition based on local two-order gradient pattern. In Proceedings of the 1st Asian Conference on Affective Computing and Intelligent Interaction (ACII Asia’18). 16. Google ScholarGoogle ScholarCross RefCross Ref
  35. [35] Ojala Timo, Pietikäinen Matti, and Harwood David. 1994. Performance evaluation of texture measures with classification based on Kullback discrimination of distributions. In Proceedings of the 12th IAPR International Conference on Pattern Recognition, Conference A: Computer Vision and Image Processing (ICPR’94). IEEE, 582585. Google ScholarGoogle ScholarCross RefCross Ref
  36. [36] Pan Hang, Xie Lun, and Wang Zhiliang. 2021. Spatio-temporal convolutional attention network for spotting macro- and micro-expression intervals. In Proceedings of the 1st Workshop on Facial Micro-Expression: Advanced Techniques for Facial Expressions Generation and Spotting (FME’21), Cheng Wen-Huang, Li Jingting, and Yap Moi Hoon (Eds.). ACM, 2530. Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. [37] Peng Min, Wang Chongyang, Bi Tao, Shi Yu, Zhou Xiangdong, and Chen Tong. 2019. A novel apex-time network for cross-dataset micro-expression recognition. In Proceedings of the 8th International Conference on Affective Computing and Intelligent Interaction (ACII’19). IEEE, 16. Google ScholarGoogle ScholarCross RefCross Ref
  38. [38] Peng Min, Wu Zhan, Zhang Zhihao, and Chen Tong. 2018. From macro to micro expression recognition: Deep learning on small datasets using transfer learning. In Proceedings of the 13th IEEE International Conference on Automatic Face and Gesture Recognition (FG’18). IEEE Computer Society, 657661. Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. [39] Quang Nguyen Van, Chun Jinhee, and Tokuyama Takeshi. 2019. CapsuleNet for micro-expression recognition. In Proceedings of the 14th IEEE International Conference on Automatic Face and Gesture Recognition (FG’19). IEEE, 17. Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. [40] Real Esteban, Aggarwal Alok, Huang Yanping, and Le Quoc V.. 2019. Regularized evolution for image classifier architecture search. In Proceedings of the 33rd AAAI Conference on Artificial Intelligence (AAAI’19), the 31st Innovative Applications of Artificial Intelligence Conference (IAAI’19), and the 9th AAAI Symposium on Educational Advances in Artificial Intelligence (EAAI’19). AAAI Press, 47804789. Google ScholarGoogle ScholarDigital LibraryDigital Library
  41. [41] Santoro Adam, Bartunov Sergey, Botvinick Matthew, Wierstra Daan, and Lillicrap Timothy P.. 2016. Meta-learning with memory-augmented neural networks. In Proceedings of the 33rd International Conference on Machine Learning (ICML’16) (JMLR Workshop and Conference Proceedings), Balcan Maria-Florina and Weinberger Kilian Q. (Eds.), Vol. 48. JMLR.org, 18421850. Retrieved from http://proceedings.mlr.press/v48/santoro16.html.Google ScholarGoogle Scholar
  42. [42] Snell Jake, Swersky Kevin, and Zemel Richard S.. 2017. Prototypical networks for few-shot learning. In Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems, Guyon Isabelle, Luxburg Ulrike von, Bengio Samy, Wallach Hanna M., Fergus Rob, Vishwanathan S. V. N., and Garnett Roman (Eds.). 40774087. Retrieved from https://proceedings.neurips.cc/paper/2017/hash/cb8da6767461f2812ae4290eac7cbc42-Abstract.html.Google ScholarGoogle Scholar
  43. [43] Song Baolin, Li Ke, Zong Yuan, Zhu Jie, Zheng Wenming, Shi Jingang, and Zhao Li. 2019. Recognizing spontaneous micro-expression using a three-stream convolutional neural network. IEEE Access 7 (2019), 184537184551. Google ScholarGoogle ScholarCross RefCross Ref
  44. [44] Sun Deqing, Roth Stefan, and Black Michael J.. 2014. A quantitative analysis of current practices in optical flow estimation and the principles behind them. Int. J. Comput. Vis. 106, 2 (2014), 115137. Google ScholarGoogle ScholarDigital LibraryDigital Library
  45. [45] Takalkar Madhumita A. and Xu Min. 2017. Image-based facial micro-expression recognition using deep learning on small datasets. In Proceedings of the International Conference on Digital Image Computing: Techniques and Applications (DICTA’17). IEEE, 17. Google ScholarGoogle ScholarCross RefCross Ref
  46. [46] Thrun Sebastian and Pratt Lorien Y.. 1998. Learning to learn: introduction and overview. In Learning to Learn, Thrun Sebastian and Pratt Lorien Y. (Eds.). Springer, 317. Google ScholarGoogle ScholarCross RefCross Ref
  47. [47] Wang Shangfei, Hao Longfei, and Ji Qiang. 2020. Posed and spontaneous expression distinction using latent regression Bayesian networks. ACM Trans. Multim. Comput. Commun. Appl. 16, 3 (2020), 80:1–80:18. Google ScholarGoogle ScholarDigital LibraryDigital Library
  48. [48] Wang Yandan, See John, Phan Raphael C.-W., and Oh Yee-Hui. 2014. LBP with six intersection points: Reducing redundant information in LBP-TOP for micro-expression recognition. In Proceedings of the 12th Asian Conference on Computer Vision (ACCV’14) (Lecture Notes in Computer Science), Cremers Daniel, Reid Ian D., Saito Hideo, and Yang Ming-Hsuan (Eds.), Vol. 9003. Springer, 525537. Google ScholarGoogle ScholarCross RefCross Ref
  49. [49] Wang Zhoutao, Xie Qian, Wei Mingqiang, Long Kun, and Wang Jun. 2022. Multi-feature fusion VoteNet for 3D object detection. ACM Trans. Multim. Comput. Commun. Appl. 18, 1 (2022), 6:1–6:17. Google ScholarGoogle ScholarDigital LibraryDigital Library
  50. [50] Whitehill Jacob, Serpell Zewelanji, Lin Yi-Ching, Foster Aysha, and Movellan Javier R.. 2014. The faces of engagement: Automatic recognition of student engagementfrom facial expressions. IEEE Trans. Affect. Comput. 5, 1 (2014), 8698. Google ScholarGoogle ScholarCross RefCross Ref
  51. [51] Wu Hao-Yu, Rubinstein Michael, Shih Eugene, Guttag John V., Durand Frédo, and Freeman William T.. 2012. Eulerian video magnification for revealing subtle changes in the world. ACM Trans. Graph. 31, 4 (2012), 65:1–65:8. Google ScholarGoogle ScholarDigital LibraryDigital Library
  52. [52] Xia Zhaoqiang, Feng Xiaoyi, Hong Xiaopeng, and Zhao Guoying. 2018. Spontaneous facial micro-expression recognition via deep convolutional network. In Proceedings of the 8th International Conference on Image Processing Theory, Tools and Applications (IPTA’18). IEEE, 16. Google ScholarGoogle ScholarCross RefCross Ref
  53. [53] Xia Zhaoqiang, Hong Xiaopeng, Gao Xingyu, Feng Xiaoyi, and Zhao Guoying. 2020. Spatiotemporal recurrent convolutional networks for recognizing spontaneous micro-expressions. IEEE Trans. Multim. 22, 3 (2020), 626640. Google ScholarGoogle ScholarCross RefCross Ref
  54. [54] Xu Feng, Zhang Junping, and Wang James Z.. 2017. Microexpression identification and categorization using a facial dynamics map. IEEE Trans. Affect. Comput. 8, 2 (2017), 254267. Google ScholarGoogle ScholarDigital LibraryDigital Library
  55. [55] Yan Wen-Jing, Wu Qi, Liu Yong-Jin, Wang Sujing, and Fu Xiaolan. 2013. CASME database: A dataset of spontaneous micro-expressions collected from neutralized faces. In Proceedings of the 10th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition (FG’13). IEEE Computer Society, 17. Google ScholarGoogle ScholarCross RefCross Ref
  56. [56] Yan W. J., Li X., Wang S. J., Zhao G., Liu Y. J., Chen Y. H., and Fu X.. 2014. CASME II: An improved spontaneous micro-expression database and the baseline evaluation. PLoS One 9, 1 (2014), e86041.Google ScholarGoogle ScholarCross RefCross Ref
  57. [57] Zach Christopher, Pock Thomas, and Bischof Horst. 2007. A duality-based approach for realtime TV-L\({}^{\mbox{1}}\) optical flow. In Proceedings of the 29th DAGM Symposium on Pattern Recognition (Lecture Notes in Computer Science), Hamprecht Fred A., Schnörr Christoph, and Jähne Bernd (Eds.), Vol. 4713. Springer, 214223. Google ScholarGoogle ScholarCross RefCross Ref
  58. [58] Zhai Deming, Shi Ruifeng, Jiang Junjun, and Liu Xianming. 2022. Rectified meta-learning from noisy labels for robust image-based plant disease classification. ACM Trans. Multim. Comput. Commun. Appl. 18, 1s (2022), 30:1–30:17. Google ScholarGoogle ScholarDigital LibraryDigital Library
  59. [59] Zhao Guoying and Pietikäinen Matti. 2007. Dynamic texture recognition using local binary patterns with an application to facial expressions. IEEE Trans. Pattern Anal. Mach. Intell. 29, 6 (2007), 915928. Google ScholarGoogle ScholarDigital LibraryDigital Library
  60. [60] Zhou Ling, Mao Qirong, Huang Xiaohua, Zhang Feifei, and Zhang Zhihong. 2021. Feature refinement: An expression-specific feature learning and fusion method for micro-expression recognition. Retrieved from https://arxiv.org/abs/2101.04838.Google ScholarGoogle Scholar
  61. [61] Zong Yuan, Huang Xiaohua, Zheng Wenming, Cui Zhen, and Zhao Guoying. 2018. Learning from hierarchical spatiotemporal descriptors for micro-expression recognition. IEEE Trans. Multim. 20, 11 (2018), 31603172. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Meta-MMFNet: Meta-learning-based Multi-model Fusion Network for Micro-expression Recognition

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    Full Access

    • Published in

      cover image ACM Transactions on Multimedia Computing, Communications, and Applications
      ACM Transactions on Multimedia Computing, Communications, and Applications  Volume 20, Issue 2
      February 2024
      548 pages
      ISSN:1551-6857
      EISSN:1551-6865
      DOI:10.1145/3613570
      • Editor:
      • Abdulmotaleb El Saddik
      Issue’s Table of Contents

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 25 September 2023
      • Online AM: 2 June 2022
      • Accepted: 16 May 2022
      • Revised: 4 April 2022
      • Received: 14 October 2021
      Published in tomm Volume 20, Issue 2

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Full Text

    View this article in Full Text.

    View Full Text