End-to-End Speaker Diarization Conditioned on Speech Activity and Overlap Detection

Abstract:

In this paper, we present a conditional multitask learning method for end-to-end neural speaker diarization (EEND). EEND has shown promising performance compared with traditional clustering-based methods, especially on overlapping speech. To further improve EEND, we propose a novel multitask learning framework that solves speaker diarization and a desired subtask while explicitly modeling the dependency between the two tasks. Based on the probabilistic chain rule, we optimize speaker diarization conditioned on speech activity detection and overlap detection, both of which are subtasks of speaker diarization. Experimental results show that the proposed method effectively leverages a subtask to model speaker diarization and outperforms conventional EEND systems in terms of diarization error rate.
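
The abstract does not give the model or the loss, so the following is only a minimal sketch of the general idea, under stated assumptions. It assumes frame-level binary diarization labels (one 0/1 entry per speaker per frame), derives speech activity and overlap labels from them, and sums two negative log-likelihood terms in the spirit of a chain-rule factorization, P(subtask | X) * P(diarization | subtask, X). The function names (`derive_subtask_labels`, `binary_nll`, `chain_rule_nll`) are hypothetical, and the sketch ignores the speaker-permutation handling that EEND-style training would need.

```python
import math
from typing import List, Sequence, Tuple


def derive_subtask_labels(diar_labels: Sequence[Sequence[int]]) -> Tuple[List[int], List[int]]:
    """Derive frame-level subtask labels from diarization labels.

    diar_labels[t][s] is 1 if speaker s is active at frame t, else 0.
    Speech activity: at least one speaker is active.
    Overlap: at least two speakers are active.
    """
    sad = [int(sum(frame) >= 1) for frame in diar_labels]
    od = [int(sum(frame) >= 2) for frame in diar_labels]
    return sad, od


def binary_nll(probs: Sequence[float], labels: Sequence[int], eps: float = 1e-7) -> float:
    """Summed binary negative log-likelihood (binary cross-entropy)."""
    total = 0.0
    for p, y in zip(probs, labels):
        p = min(max(p, eps), 1.0 - eps)  # clamp to avoid log(0)
        total -= y * math.log(p) + (1 - y) * math.log(1.0 - p)
    return total


def chain_rule_nll(
    subtask_probs: Sequence[float],
    subtask_labels: Sequence[int],
    diar_probs: Sequence[Sequence[float]],
    diar_labels: Sequence[Sequence[int]],
) -> float:
    """Assumed joint objective: -log P(subtask | X) - log P(diarization | subtask, X).

    diar_probs is assumed to come from a model that already received the
    subtask information as an extra input; that conditioning is not modeled here.
    """
    nll_subtask = binary_nll(subtask_probs, subtask_labels)
    flat_probs = [p for frame in diar_probs for p in frame]
    flat_labels = [y for frame in diar_labels for y in frame]
    nll_diar = binary_nll(flat_probs, flat_labels)
    return nll_subtask + nll_diar


if __name__ == "__main__":
    # Four frames, two speakers: silence, speaker 1 only, both, speaker 2 only.
    labels = [[0, 0], [1, 0], [1, 1], [0, 1]]
    sad, od = derive_subtask_labels(labels)
    print(sad)  # [0, 1, 1, 1]
    print(od)   # [0, 0, 1, 0]
```

Deriving the subtask labels from the diarization labels means no extra annotation is required, which is one reason speech activity and overlap detection are natural subtasks to condition on.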

Published in: 2021 IEEE Spoken Language Technology Workshop (SLT)

Date of Conference: 19-22 January 2021

Date Added to IEEE Xplore: 25 March 2021

DOI: 10.1109/SLT48900.2021.9383555

Conference Location: Shenzhen, China
