Abstract:
With the successful application of deep feature learning algorithms, spoken language identification (LID) on long utterances achieves satisfactory performance. However, performance on short utterances degrades drastically even when the LID system is trained on short utterances. The main reason is the large variation of the representations extracted from short utterances, which results in high model confusion. To narrow the performance gap between long and short utterances, we propose a teacher-student representation learning framework based on knowledge distillation to improve LID performance on short utterances. In the proposed framework, in addition to training the student model on short utterances with their true labels, the internal representation at the output of a hidden layer of the student model is supervised by the representation of the corresponding long utterances. By reducing the distance between the internal representations of short and long utterances, the student model can learn robust, discriminative representations for short utterances, which is expected to reduce model confusion. We conducted experiments on our in-house LID dataset and the NIST LRE07 dataset, and the results show the effectiveness of the proposed methods on short-utterance LID tasks.
Published in: IEEE/ACM Transactions on Audio, Speech, and Language Processing (Volume 28)
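As a rough illustration of the training objective described in the abstract (not the authors' released code), the PyTorch-style sketch below combines a cross-entropy loss on short utterances with an L2 distance between the student's hidden-layer embedding of a short utterance and a frozen teacher's embedding of the corresponding long utterance. The model architecture, embedding dimension, number of languages, the MSE distance, and the weighting factor `alpha` are all assumptions made for illustration only.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

# Hypothetical LID backbone: any encoder that exposes a hidden-layer
# (utterance-level) embedding before the language classifier will do.
class LIDModel(nn.Module):
    def __init__(self, feat_dim=40, emb_dim=256, num_langs=14):
        super().__init__()
        self.encoder = nn.GRU(feat_dim, emb_dim, batch_first=True)
        self.classifier = nn.Linear(emb_dim, num_langs)

    def forward(self, x):
        _, h = self.encoder(x)      # h: (1, batch, emb_dim)
        emb = h.squeeze(0)          # internal representation (hidden-layer output)
        return emb, self.classifier(emb)


def teacher_student_loss(student, teacher, short_feats, long_feats, labels, alpha=0.5):
    """Classification loss on short utterances plus a representation-matching
    term against the teacher's embedding of the corresponding long utterance."""
    s_emb, s_logits = student(short_feats)
    with torch.no_grad():                       # teacher is kept frozen
        t_emb, _ = teacher(long_feats)
    ce = F.cross_entropy(s_logits, labels)      # supervision with true labels
    rep = F.mse_loss(s_emb, t_emb)              # distance between internal representations
    return ce + alpha * rep


# Minimal usage example with random tensors standing in for acoustic features.
if __name__ == "__main__":
    teacher, student = LIDModel(), LIDModel()
    short_feats = torch.randn(8, 100, 40)       # 8 short utterances, 100 frames
    long_feats = torch.randn(8, 1000, 40)       # their longer counterparts
    labels = torch.randint(0, 14, (8,))
    loss = teacher_student_loss(student, teacher, short_feats, long_feats, labels)
    loss.backward()                              # only the student receives gradients
```

The key design point is that the distance term pulls the student's short-utterance embeddings toward the more stable long-utterance embeddings of the teacher, while the cross-entropy term keeps them discriminative for the target languages.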