Abstract:
To develop speaker adaptation algorithms for deep neural networks (DNNs) that are suitable for large-scale online deployment, it is desirable that the adaptation model be represented in a compact form and learned in an unsupervised fashion. In this paper, we propose a novel low-footprint adaptation technique for DNNs that adapts the DNN model through node activation functions. The approach introduces slope and bias parameters in the sigmoid activation functions for each speaker, allowing the adaptation model to be stored in a small-sized storage space. We show that this adaptation technique can be formulated in a linear regression fashion, analogous to other speaker adaptation algorithms that apply additional linear transformations to the DNN layers. We further investigate semi-supervised online adaptation by making use of the user click-through data as a supervision signal. The proposed method is evaluated on short message dictation and voice search tasks in supervised, unsupervised, and semi-supervised setups. Compared with the singular value decomposition (SVD) bottleneck adaptation, the proposed adaptation method achieves comparable accuracy improvements with a much smaller footprint.
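The core idea, a per-speaker slope and bias inside each sigmoid unit, can be sketched as follows. This is a minimal illustration of the general technique, not the paper's implementation; the function and variable names are assumptions for this sketch, and only the slope and bias vectors would be stored per speaker, which is why the footprint is small.

```python
import numpy as np

def adapted_sigmoid(x, slope, bias):
    """Speaker-adapted sigmoid activation.

    x     : pre-activation values of a hidden layer (vector)
    slope : per-unit slope parameters learned for one speaker
    bias  : per-unit bias parameters learned for one speaker

    With slope = 1 and bias = 0 this reduces to the standard
    speaker-independent sigmoid, so the adaptation is a strict
    generalization of the unadapted model.
    """
    return 1.0 / (1.0 + np.exp(-(slope * x + bias)))

# Hypothetical pre-activations for a 3-unit hidden layer.
h = np.array([-1.0, 0.0, 1.0])

# Speaker-independent baseline: identity slope, zero bias.
si_out = adapted_sigmoid(h, np.ones(3), np.zeros(3))

# An adapted speaker stores only these two small vectors,
# rather than a full weight-matrix transformation.
sd_out = adapted_sigmoid(h, np.array([1.2, 0.9, 1.1]),
                            np.array([0.1, -0.2, 0.0]))
```

Note that the slope multiplies the pre-activation and the bias shifts it, which is what allows the method to be rewritten as a linear transformation of the layer inputs, as the abstract's linear-regression formulation suggests.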
Published in: 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Date of Conference: 19-24 April 2015
Date Added to IEEE Xplore: 06 August 2015
Electronic ISBN: 978-1-4673-6997-8