loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Authors: Georgia Paraskevopoulou 1 ; Evaggelos Spyrou 2 ; 3 and Stavros Perantonis 3

Affiliations: 1 Department of History & Philosophy of Science, National and Kapodistrian University of Athens, Athens, Greece ; 2 Department of Computer Science and Telecommunications, University of Thessaly, Lamia, Greece ; 3 Institute of Informatics and Telecommunications, National Center for Scientific Research - “Demokritos,” Athens, Greece

Keyword(s): Emotion Recognition, Convolutional Neural Network, Spectrograms, Data Augmentation.

Abstract: The recognition of the emotions of humans is crucial for various applications related to human-computer interaction or for understanding the users’ mood in several tasks. Typical machine learning approaches used towards this goal first extract a set of linguistic features from raw data, which are then used to train supervised learning models. Recently, Convolutional Neural Networks (CNNs), which unlike traditional approaches, learn to extract the appropriate features of their inputs, have also been applied as emotion recognition classifiers. In this work, we adopt a CNN architecture that uses spectrograms, extracted from audio signals as inputs and we propose data augmentation techniques to boost the classification performance. The proposed data augmentation approach includes noise addition, shifting of the audio signal, and changing its pitch or its speed. Experimental results indicate that the herein presented approach outperforms previous work which not use augmented data.

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 3.144.17.45

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Paraskevopoulou, G.; Spyrou, E. and Perantonis, S. (2022). A Data Augmentation Approach for Improving the Performance of Speech Emotion Recognition. In Proceedings of the 19th International Conference on Signal Processing and Multimedia Applications - SIGMAP; ISBN 978-989-758-591-3; ISSN 2184-9471, SciTePress, pages 61-69. DOI: 10.5220/0011148000003289

@conference{sigmap22,
author={Georgia Paraskevopoulou. and Evaggelos Spyrou. and Stavros Perantonis.},
title={A Data Augmentation Approach for Improving the Performance of Speech Emotion Recognition},
booktitle={Proceedings of the 19th International Conference on Signal Processing and Multimedia Applications - SIGMAP},
year={2022},
pages={61-69},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0011148000003289},
isbn={978-989-758-591-3},
issn={2184-9471},
}

TY - CONF

JO - Proceedings of the 19th International Conference on Signal Processing and Multimedia Applications - SIGMAP
TI - A Data Augmentation Approach for Improving the Performance of Speech Emotion Recognition
SN - 978-989-758-591-3
IS - 2184-9471
AU - Paraskevopoulou, G.
AU - Spyrou, E.
AU - Perantonis, S.
PY - 2022
SP - 61
EP - 69
DO - 10.5220/0011148000003289
PB - SciTePress