Automatic estimation of body regions from video images

Hienz, Hermann; Grobel, Kirsti

doi:10.1007/BFb0052995

Hermann Hienz¹ &
Kirsti Grobel¹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 1371))

Included in the following conference series:

International Gesture Workshop

101 Accesses
5 Citations

Abstract

In our approach video-based recognition of sign language requires the extraction of sign parameters. Each sign can be characterised by means of manual (handshape, hand orientation, location and movement) and non-manual (trunk, head, gaze, facial expression, mouth) parameters. This paper introduces a software module which is as a part of the developed automatic sign language recognition system able to extract relevant body regions from digitised video images. The recognition of body regions is crucial for determining location of signs. The proposed software module uses a rule-based system for analysing the body contour in order to compute the 2D position of the shoulders, the top of head and the vertical axis of the body. Based on these results the position of the eyes are calculated directly from the segmented face of the signer. The position of the remaining face- (nose, forehead, mouth, cheek, chin) and trunk regions (shoulder belt, chest, belly, hip) are determined by means of two estimators, where a-priori known geometric data of the face and fuzzy technique are used. Experiments indicate that our approach leads to good estimation of body regions, which we all compute in real time.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Böken, S.: Videobasierte Bestimmung der Handposition im Oberkörperbereich. Technical Report, Aachen University of Technology (RWTH), Lehrstuhl für Technische Informatik, 1997.
Google Scholar
Boyes Braem, P.: Einführung in die Gebärdensprache und ihre Erforschung. Signum Press, 1995.
Google Scholar
Chellappa, R., C.L. Wilson and S. Sirohey: Human and Machine Recognition of Faces: A Survey. Proceedings of the IEEE, Vol. 83, No. 5, pp. 705–740, 1995.
Article Google Scholar
Grobel, K., H. Hienz, S. Romainczyk, S. Böken and B. Vetter: Videobasierte Erkennung von Körperregionen zur Bestimmung der Ausführungsstelle einer Gebärde. 9. Aachener Kolloquium ”Signaltheorie” — Bild-und Sprachsignale, Aachen, March 18–20, pp. 313–316, 1997.
Google Scholar
Grobel, K. and H. Hienz: Videobasierte Gebärdenerkennung. In: Kraiss, K.-F. (Ed.): Jahresbericht — Lehrstuhl für Technische Informatik 1995/96, pp. 21–28. Shaker-Verlag, 1997.
Google Scholar
Prillwitz, S. et al.: HamNoSys — Hamburg Notation System for Sign Languages, An Introductory Guide, Version 2.0. Signum Press, 1989.
Google Scholar
Romainczyk, S.: Bestimmung der Schulterposition von Personen aus Videobildern. Technical Report, Aachen University of Technology (RWTH), Lehrstuhl für Technische Informatik, 1996.
Google Scholar
Sobottka, K. and I. Pitas: A Fully Automatic Approach to Facial Feature Detection and Tracking. Proceedings of First International Conference on Audio-and Video-Based Biometric Person Authentification, Crans-Montana, Switzerland, March 12–14, pp. 78–84, Sringer-Verlag, 1997.
Google Scholar
Soede, M. (Ed.): SignPS a System for Sign Writing: Final Report. EEC TIDE Project No. 1202, 1997.
Google Scholar
Tan, M.: Ansichtenbasierte Handformerkennung in Bildfolgen. Diploma Thesis, Aachen University of Technology (RWTH), Lehrstuhl für Technische Informatik, 1997.
Google Scholar
Vetter, B.: Videobasierte Schätzung von Gesichtsbereichen in Echtzeit für die automatische Erkennung der deutschen Gebärdensprache. Technical Report, Aachen University of Technology (RWTH), Lehrstuhl für Technische Informatik, 1997.
Google Scholar
Zimmermann, H. J. Fuzzy Set Theory and its Application. 2nd. Edition, Kluwer Acadamic Publisher, 1991.
Google Scholar

Download references

Author information

Authors and Affiliations

Lehrstuhl für Technische Informatik, Aachen University of Technology (RWTH), Ahornstrasse 55, D-52074, Aachen, Germany
Hermann Hienz & Kirsti Grobel

Authors

Hermann Hienz
View author publications
You can also search for this author in PubMed Google Scholar
Kirsti Grobel
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Ipke Wachsmuth Martin Fröhlich

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Hienz, H., Grobel, K. (1998). Automatic estimation of body regions from video images. In: Wachsmuth, I., Fröhlich, M. (eds) Gesture and Sign Language in Human-Computer Interaction. GW 1997. Lecture Notes in Computer Science, vol 1371. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0052995

Download citation

DOI: https://doi.org/10.1007/BFb0052995
Published: 19 May 2006
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-64424-8
Online ISBN: 978-3-540-69782-4
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics