Conferences >ICASSP 2019 - 2019 IEEE Inter...

Acoustic Modeling for Distant Multi-talker Speech Recognition with Single- and Multi-channel Branches

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

This paper presents a novel heterogeneous-input multi-channel acoustic model (AM) that has both single-channel and multi-channel input branches. In our proposed training ...Show More

Metadata

Abstract:

This paper presents a novel heterogeneous-input multi-channel acoustic model (AM) that has both single-channel and multi-channel input branches. In our proposed training pipeline, a single-channel AM is trained first, then a multi-channel AM is trained starting from the single-channel AM with a randomly initialized multi-channel input branch. Our model uniquely uses the power of a complemen-tal speech enhancement (SE) module while exploiting the power of jointly trained AM and SE architecture. Our method was the foundation for the Hitachi/JHU CHiME-5 system that achieved the second-best result in the CHiME-5 competition, and this paper details various investigation results that we were not able to present during the competition period. We also evaluated and reconfirmed our method's effectiveness with the AMI Meeting Corpus. Our AM achieved a 30.12% word error rate (WER) for the development set and a 32.33% WER for the evaluation set for the AMI Corpus, both of which are the best results ever reported to the best of our knowledge.

Published in: ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Date of Conference: 12-17 May 2019

Date Added to IEEE Xplore: 17 April 2019

ISBN Information:

ISSN Information:

DOI: 10.1109/ICASSP.2019.8682273

Conference Location: Brighton, UK

Contents

References is not available for this document.

Acoustic Modeling for Distant Multi-talker Speech Recognition with Single- and Multi-channel Branches

Abstract:

Metadata

Abstract:

ISSN Information:

References

IEEE Account

Purchase Details

Profile Information

Need Help?

Acoustic Modeling for Distant Multi-talker Speech Recognition with Single- and Multi-channel Branches

Alerts

Abstract:

Metadata

Abstract:

ISSN Information:

Authors

Figures

References

Citations

Keywords

Metrics

Footnotes

References

IEEE Account

Purchase Details

Profile Information

Need Help?