
Fusion of classifier predictions for audio-visual emotion recognition


Abstract:

This paper presents a novel multimodal emotion recognition system based on the analysis of audio and visual cues. MFCC-based features are extracted from the audio channel, and facial-landmark geometric relations are computed from the visual data. Both feature sets are learnt separately using state-of-the-art classifiers. In addition, each emotion video is summarised into a reduced set of key-frames, which a Convolutional Neural Network learns in order to visually discriminate emotions. Finally, the confidence outputs of all classifiers from all modalities define a new feature space, which is itself learnt for final emotion prediction in a late-fusion/stacking fashion. Experiments conducted on the eNTERFACE'05 database show significant performance improvements of the proposed system over state-of-the-art approaches.
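The late-fusion/stacking scheme described above can be sketched as follows. This is a minimal illustration, not the authors' implementation: the feature matrices are random placeholders standing in for the paper's MFCC, landmark-geometry, and CNN key-frame representations, and the classifier choices (SVMs plus a logistic-regression meta-learner) are assumptions.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.svm import SVC

rng = np.random.default_rng(0)

# Placeholder features standing in for the per-modality representations
# (real inputs would be MFCC statistics and facial-landmark geometry).
n_train, n_classes = 200, 6  # eNTERFACE'05 covers six basic emotions
X_audio = rng.normal(size=(n_train, 13))
X_visual = rng.normal(size=(n_train, 20))
y = rng.permutation(np.arange(n_train) % n_classes)  # balanced labels

# Stage 1: learn each modality separately with its own classifier.
clf_audio = SVC(probability=True).fit(X_audio, y)
clf_visual = SVC(probability=True).fit(X_visual, y)

# Stage 2 (stacking): the classifiers' confidence outputs are
# concatenated into a new feature space for a meta-classifier.
Z = np.hstack([clf_audio.predict_proba(X_audio),
               clf_visual.predict_proba(X_visual)])
meta = LogisticRegression(max_iter=1000).fit(Z, y)

# Final prediction fuses both modalities through the meta-classifier.
Z_new = np.hstack([clf_audio.predict_proba(X_audio[:5]),
                   clf_visual.predict_proba(X_visual[:5])])
pred = meta.predict(Z_new)
```

In practice the stacked features would be produced from out-of-fold predictions rather than from the same training data, to keep the meta-classifier from overfitting to overconfident base-classifier outputs.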
Date of Conference: 04-08 December 2016
Date Added to IEEE Xplore: 24 April 2017
Conference Location: Cancun, Mexico
