Loading [a11y]/accessibility-menu.js
An Ensemble Method for Multiple Speech Enhancement Using Deep Learning | IEEE Conference Publication | IEEE Xplore

An Ensemble Method for Multiple Speech Enhancement Using Deep Learning


Abstract:

This paper proposes an ensemble of multiple speech enhancement methods using convolutional neural networks (CNN). Speech enhancement is one of the most important tasks in...Show More

Abstract:

This paper proposes an ensemble of multiple speech enhancement methods using convolutional neural networks (CNN). Speech enhancement is one of the most important tasks in the field of audio signal processing, and various methods have been proposed so far. Each of these methods has its own strengths and weaknesses depending on the target speech signal, the type of noise, and the recording environment. In this study, we aim to construct a novel robust speech enhancement method that works well in various environments by combining the advantages of multiple speech enhancement methods. We formulate an ensemble based on weighted summation of time-frequency masks, and propose a method to estimate the optimal weight values based on the input acoustic signal using a convolutional neural network. The convolutional operation in CNN allows the integration to take into account the changes in time and frequency. In the simulation experiments, we evaluate the effectiveness of the proposed method by computing the mean square error between the ideal mask and the mask generated by each method.
Date of Conference: 17-20 January 2023
Date Added to IEEE Xplore: 15 February 2023
ISBN Information:

ISSN Information:

Conference Location: Atlanta, GA, USA

References

References is not available for this document.