Skip to main content

Fast Viseme Recognition for Talking Head Application

  • Conference paper
Image Analysis and Recognition (ICIAR 2005)

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 3656))

Included in the following conference series:

  • 964 Accesses

Abstract

Real time recognition of visual face appearances (visemes) which correspond to phonemes and their speech contexts is presented. We distinguish six major classes of visemes. Features are extracted in the form of normalized image texture. The normalization procedure uses barycentric coordinates in a mesh of triangles superimposed onto a reference facial image. The mesh itself is defined using a subset of FAP points conforming with MPEG-4 standard. The elaborated classifiers were designed by PCA subspace and LDA methods. It appears that the LDA classifier outperforms subspace technique. It is better than the best subspace PCA – in recognition rate by more than 13% times (97% versus 84%) and it is more than 10 times faster (0.5ms versus 7ms) and its time is neglected w.r.t. mouth image normalization time (0.5ms versus 5ms).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 149.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Bober, M., Kucharski, K., Skarbek, W.: Face Recognition by Fisher and Scatter Linear Discriminant Analysis. In: Petkov, N., Westenberg, M.A. (eds.) CAIP 2003. LNCS, vol. 2756, pp. 638–645. Springer, Heidelberg (2003)

    Chapter  Google Scholar 

  2. Fukunaga, K.: Introduction to statistical pattern recognition, 2nd edn. Academic Press, Boston (1990)

    MATH  Google Scholar 

  3. Golub, G., Van Loan, C.: Matrix Computations. Johns Hopkins University Press, Baltimore (1996)

    MATH  Google Scholar 

  4. Jolliffe, I.T.: Principal Component Analysis. Springer, New York (2002)

    MATH  Google Scholar 

  5. Oja, E.: Subspace Methods of Pattern Recognition. Research Studies Press, England (1983)

    Google Scholar 

  6. Ripley, B.D.: Pattern Recognition and Neural Networks. Cambridge University Press, Cambridge (1996)

    MATH  Google Scholar 

  7. Swets, D.L., Weng, J.: Using Discriminant Eigenfeatures for Image Retrieval. IEEE Trans. on PAMI 18(8), 831–837 (1996)

    Google Scholar 

  8. The Hidden Markov Model Toolkit (HTK), http://htk.eng.cam.ac.uk

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Leszczynski, M., Skarbek, W., Badura, S. (2005). Fast Viseme Recognition for Talking Head Application. In: Kamel, M., Campilho, A. (eds) Image Analysis and Recognition. ICIAR 2005. Lecture Notes in Computer Science, vol 3656. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11559573_64

Download citation

  • DOI: https://doi.org/10.1007/11559573_64

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-29069-8

  • Online ISBN: 978-3-540-31938-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics