Loading [a11y]/accessibility-menu.js
Sound Event Localization and Detection using Spatial Feature Fusion | IEEE Conference Publication | IEEE Xplore

Sound Event Localization and Detection using Spatial Feature Fusion


Abstract:

Sound event localization and detection (SELD) can identify the category and location of a sound event along with providing valuable information for many applications. Exi...Show More

Abstract:

Sound event localization and detection (SELD) can identify the category and location of a sound event along with providing valuable information for many applications. Existing methods primarily use convolutional recurrent neural networks as a network model and Log-Mel spectrograms to classify sound events. However, there are no dominant spatial features to identify the direction of sound events. The fusion of various spatial features can lead to a better performance in the SELD task. In this study, we propose an optimal feature fusion by systematically analyzing various combinations of spatial features. We used the TAU-NIGENS spatial sound events 2021 dataset to evaluate the SELD task performance of various combinations of spatial features. We found that the combination of interaural phase difference (IPD) and sinIPD had a better performance than the other features and combinations. Finally, we confirmed that the proposed features had a better performance than the state-of-the-art methods.
Date of Conference: 19-21 October 2022
Date Added to IEEE Xplore: 25 November 2022
ISBN Information:

ISSN Information:

Conference Location: Jeju Island, Korea, Republic of

Funding Agency:


References

References is not available for this document.