Journals & Magazines >IEEE Transactions on Multimedia >Volume: 11 Issue: 7

Lipreading With Local Spatiotemporal Descriptors

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

Visual speech information plays an important role in lipreading under noisy conditions or for listeners with a hearing impairment. In this paper, we present local spatiot...Show More

Metadata

Abstract:

Visual speech information plays an important role in lipreading under noisy conditions or for listeners with a hearing impairment. In this paper, we present local spatiotemporal descriptors to represent and recognize spoken isolated phrases based solely on visual input. Spatiotemporal local binary patterns extracted from mouth regions are used for describing isolated phrase sequences. In our experiments with 817 sequences from ten phrases and 20 speakers, promising accuracies of 62% and 70% were obtained in speaker-independent and speaker-dependent recognition, respectively. In comparison with other methods on AVLetters database, the accuracy, 62.8%, of our method clearly outperforms the others. Analysis of the confusion matrix for 26 English letters shows the good clustering characteristics of visemes for the proposed descriptors. The advantages of our approach include local processing and robustness to monotonic gray-scale changes. Moreover, no error prone segmentation of moving lips is needed.

Published in: IEEE Transactions on Multimedia ( Volume: 11, Issue: 7, November 2009)

Page(s): 1254 - 1265

Date of Publication: 18 August 2009

ISSN Information:

DOI: 10.1109/TMM.2009.2030637

Contents

References is not available for this document.

Lipreading With Local Spatiotemporal Descriptors

Abstract:

Metadata

Abstract:

ISSN Information:

References

IEEE Account

Purchase Details

Profile Information

Need Help?

Lipreading With Local Spatiotemporal Descriptors

Alerts

Abstract:

Metadata

Abstract:

ISSN Information:

References

IEEE Account

Purchase Details

Profile Information

Need Help?