Abstract
This paper describes a new approach to modeling visual speech, based on an artificial neural network (ANN). The network architecture allows linguistic expert knowledge to be incorporated into the ANN. The goal is to develop a computer animation program as a training aid for learning lip-reading. The current PC version can synchronize the animation program with a special stand-alone speech synthesis computer via a Centronics parallel interface.
Copyright information
© 1994 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Bothe, H.H., Wieden, E.A. (1994). Artificial visual speech synchronized with a speech synthesis system. In: Zagler, W.L., Busby, G., Wagner, R.R. (eds) Computers for Handicapped Persons. ICCHP 1994. Lecture Notes in Computer Science, vol 860. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-58476-5_102
DOI: https://doi.org/10.1007/3-540-58476-5_102
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-58476-6
Online ISBN: 978-3-540-48989-4
eBook Packages: Springer Book Archive