Abstract:
Emotion estimation by speech increase precision through the development of deep learning. However, most of the emotion estimation using deep learning involves supervised ...Show MoreMetadata
Abstract:
Emotion estimation by speech increase precision through the development of deep learning. However, most of the emotion estimation using deep learning involves supervised learning, and it is difficult to get a large data set used for learning. In addition, when the training data environment and the actual data environment are significantly different, it is considered as a problem that the accuracy of emotion estimation greatly deteriorates. Therefore, in this study, in order to solves these problems, we used a smooth emotion estimation model by using virtual adversarial training (VAT), which is a semi supervised learning method, that improves the robustness of the model. VAT attracts attention in machine learning as a method of smoothing a generation model by adding minute and intentional perturbation to training data in learning. We first set hyperparameters in VAT by verification with single corpus and then perform evaluation experiments with cross corpus to show the improvement of model robustness.
Published in: 2019 IEEE International Conference on Big Data, Cloud Computing, Data Science & Engineering (BCD)
Date of Conference: 29-31 May 2019
Date Added to IEEE Xplore: 31 October 2019
ISBN Information: