Conferences >2015 IEEE International Confe...

Combination of two-dimensional cochleogram and spectrogram features for deep learning-based ASR

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

This paper explores the use of auditory features based on cochleograms; two dimensional speech features derived from gammatone filters within the convolutional neural net...Show More

Metadata

Abstract:

This paper explores the use of auditory features based on cochleograms; two dimensional speech features derived from gammatone filters within the convolutional neural network (CNN) framework. Furthermore, we also propose various possibilities to combine cochleogram features with log-mel filter banks or spectrogram features. In particular, we combine within low and high levels of CNN framework which we refer to as low-level and high-level feature combination. As comparison, we also construct the similar configuration with deep neural network (DNN). Performance was evaluated in the framework of hybrid neural network - hidden Markov model (NN-HMM) system on TIMIT phoneme sequence recognition task. The results reveal that cochleogram-spectrogram feature combination provides significant advantages. The best accuracy was obtained by high-level combination of two dimensional cochleogram-spectrogram features using CNN, achieved up to 8.2% relative phoneme error rate (PER) reduction from CNN single features or 19.7% relative PER reduction from DNN single features.

Published in: 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Date of Conference: 19-24 April 2015

Date Added to IEEE Xplore: 06 August 2015

Electronic ISBN:978-1-4673-6997-8

ISSN Information:

DOI: 10.1109/ICASSP.2015.7178827

Conference Location: South Brisbane, QLD, Australia

Contents

References is not available for this document.

Combination of two-dimensional cochleogram and spectrogram features for deep learning-based ASR

Abstract:

Metadata

Abstract:

ISSN Information:

References

IEEE Account

Purchase Details

Profile Information

Need Help?

Combination of two-dimensional cochleogram and spectrogram features for deep learning-based ASR

Alerts

Abstract:

Metadata

Abstract:

ISSN Information:

References

IEEE Account

Purchase Details

Profile Information

Need Help?