Abstract
In our approach, video-based recognition of sign language requires the extraction of sign parameters. Each sign can be characterised by manual parameters (handshape, hand orientation, location and movement) and non-manual parameters (trunk, head, gaze, facial expression, mouth). This paper introduces a software module that forms part of our automatic sign language recognition system and extracts relevant body regions from digitised video images. Recognising body regions is crucial for determining the location of signs. The proposed module uses a rule-based system to analyse the body contour and compute the 2D positions of the shoulders, the top of the head and the vertical axis of the body. Based on these results, the positions of the eyes are calculated directly from the segmented face of the signer. The positions of the remaining face regions (nose, forehead, mouth, cheek, chin) and trunk regions (shoulder belt, chest, belly, hip) are determined by two estimators that use a-priori known geometric data of the face and fuzzy techniques. Experiments indicate that our approach yields good estimates of the body regions, all of which are computed in real time.
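To make the estimator idea more concrete, the following is a minimal sketch of how face regions might be placed from two eye positions using fixed geometric proportions, with a fuzzy membership grading how well a point fits a region. It is not the authors' implementation: the proportions in `FACE_OFFSETS`, the triangular membership function, and the names `estimate_face_regions` and `classify_location` are all assumptions made for illustration, since the abstract does not give the actual geometric data or fuzzy rules.

```python
import math

# Hypothetical proportions, in multiples of the inter-eye distance, measured
# along the face axis from the midpoint between the eyes. Illustrative values
# only; the paper's a-priori geometric data are not stated in the abstract.
FACE_OFFSETS = {"forehead": -0.8, "nose": 0.6, "mouth": 1.1, "chin": 1.8}


def estimate_face_regions(left_eye, right_eye):
    """Place face-region centres on the axis perpendicular to the eye line.

    Assumes image coordinates (y grows downward) and that `left_eye` is the
    eye with the smaller x coordinate in the image.
    """
    (lx, ly), (rx, ry) = left_eye, right_eye
    d = math.hypot(rx - lx, ry - ly)
    if d == 0:
        raise ValueError("eye positions must differ")
    mx, my = (lx + rx) / 2, (ly + ry) / 2
    # Unit vector pointing down the face, perpendicular to the eye line.
    ux, uy = -(ry - ly) / d, (rx - lx) / d
    return {
        name: (mx + k * d * ux, my + k * d * uy)
        for name, k in FACE_OFFSETS.items()
    }


def triangular_membership(distance, radius):
    """Fuzzy degree in [0, 1]: 1 at the region centre, 0 at `radius` or beyond."""
    return max(0.0, 1.0 - distance / radius)


def classify_location(point, regions, radius):
    """Return the best-matching region name and its membership degree."""
    px, py = point
    degrees = {
        name: triangular_membership(math.hypot(px - cx, py - cy), radius)
        for name, (cx, cy) in regions.items()
    }
    best = max(degrees, key=degrees.get)
    return best, degrees[best]


regions = estimate_face_regions((100, 100), (140, 100))
name, degree = classify_location((121, 143), regions, radius=40)
```

Scaling every offset by the inter-eye distance keeps the sketch independent of how large the signer appears in the image, and the graded membership avoids a hard boundary between neighbouring regions such as nose and mouth.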
Copyright information
© 1998 Springer-Verlag
About this paper
Cite this paper
Hienz, H., Grobel, K. (1998). Automatic estimation of body regions from video images. In: Wachsmuth, I., Fröhlich, M. (eds) Gesture and Sign Language in Human-Computer Interaction. GW 1997. Lecture Notes in Computer Science, vol 1371. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0052995
DOI: https://doi.org/10.1007/BFb0052995
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-64424-8
Online ISBN: 978-3-540-69782-4
eBook Packages: Springer Book Archive