
Sampling-Based Pruned Knowledge Distillation for Training Lightweight RNN-T


Abstract:

We present a novel training method for small-scale RNN-T models, which are widely used in real-world speech recognition applications. Despite efforts to scale down models for edge devices, the demand for even smaller and more compact speech recognition models persists to accommodate a broader range of devices. In this letter, we propose Sampling-based Pruned Knowledge Distillation (SP-KD) for training lightweight RNN-T models. In contrast to conventional knowledge distillation techniques, the proposed method enables student models to distill knowledge from the distribution of teacher models, which is estimated by considering not only the best paths but also less likely paths. Additionally, we prune the output lattice of the RNN-T to comprehensively transfer knowledge from teacher models to student models. Experimental results demonstrate that our proposed method outperforms the baseline in training tiny RNN-T models.
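The letter itself provides no code, but the idea in the abstract can be illustrated with a small sketch: sample alignment paths through the (T, U) RNN-T output lattice from the teacher's distribution, keep only lattice cells near the sampled paths (the pruning step), and minimize the KL divergence between teacher and student distributions over the kept cells. All function names, the path-sampling scheme, and the pruning rule below are our own illustrative assumptions, not the authors' implementation.

```python
import numpy as np

def log_softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)
    return x - np.log(np.exp(x).sum(axis=axis, keepdims=True))

def sample_paths(teacher_probs, num_samples, rng):
    """Sample monotonic alignment paths through the (T, U) lattice.

    At each cell, drawing blank (last vocab index) advances the time axis t;
    drawing any label advances the label axis u, as in standard RNN-T decoding.
    """
    T, U, V = teacher_probs.shape
    blank = V - 1
    paths = []
    for _ in range(num_samples):
        t, u, cells = 0, 0, [(0, 0)]
        while t < T - 1 or u < U - 1:
            if u == U - 1:
                t += 1                                 # only blank moves remain
            elif t == T - 1:
                u += 1                                 # must emit remaining labels
            elif rng.random() < teacher_probs[t, u, blank]:
                t += 1                                 # blank: consume a frame
            else:
                u += 1                                 # label: emit a token
            cells.append((t, u))
        paths.append(cells)
    return paths

def sp_kd_loss(teacher_logits, student_logits, num_samples=4, prune_width=1, seed=0):
    """KL(teacher || student), restricted to lattice cells near sampled paths."""
    T, U, V = teacher_logits.shape
    rng = np.random.default_rng(seed)
    p_t = np.exp(log_softmax(teacher_logits))
    # Pruning mask: keep cells within prune_width (on the label axis) of a path.
    keep = np.zeros((T, U), dtype=bool)
    for path in sample_paths(p_t, num_samples, rng):
        for (t, u) in path:
            keep[t, max(0, u - prune_width):min(U, u + prune_width + 1)] = True
    # Per-cell KL over the full vocabulary, averaged over kept cells only.
    kl = (p_t * (log_softmax(teacher_logits) - log_softmax(student_logits))).sum(-1)
    return kl[keep].mean()
```

Because the KL term uses the teacher's full per-cell distribution (not just its best path), mass from less likely paths contributes whenever their cells survive pruning, which is the distinction the abstract draws against conventional best-path distillation.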
Published in: IEEE Signal Processing Letters ( Volume: 32)
Page(s): 631 - 635
Date of Publication: 13 January 2025

