Skip to main content

Automatic estimation of body regions from video images

  • Conference paper
  • First Online:
Gesture and Sign Language in Human-Computer Interaction (GW 1997)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 1371))

Included in the following conference series:

Abstract

In our approach video-based recognition of sign language requires the extraction of sign parameters. Each sign can be characterised by means of manual (handshape, hand orientation, location and movement) and non-manual (trunk, head, gaze, facial expression, mouth) parameters. This paper introduces a software module which is as a part of the developed automatic sign language recognition system able to extract relevant body regions from digitised video images. The recognition of body regions is crucial for determining location of signs. The proposed software module uses a rule-based system for analysing the body contour in order to compute the 2D position of the shoulders, the top of head and the vertical axis of the body. Based on these results the position of the eyes are calculated directly from the segmented face of the signer. The position of the remaining face- (nose, forehead, mouth, cheek, chin) and trunk regions (shoulder belt, chest, belly, hip) are determined by means of two estimators, where a-priori known geometric data of the face and fuzzy technique are used. Experiments indicate that our approach leads to good estimation of body regions, which we all compute in real time.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Böken, S.: Videobasierte Bestimmung der Handposition im Oberkörperbereich. Technical Report, Aachen University of Technology (RWTH), Lehrstuhl für Technische Informatik, 1997.

    Google Scholar 

  2. Boyes Braem, P.: Einführung in die Gebärdensprache und ihre Erforschung. Signum Press, 1995.

    Google Scholar 

  3. Chellappa, R., C.L. Wilson and S. Sirohey: Human and Machine Recognition of Faces: A Survey. Proceedings of the IEEE, Vol. 83, No. 5, pp. 705–740, 1995.

    Article  Google Scholar 

  4. Grobel, K., H. Hienz, S. Romainczyk, S. Böken and B. Vetter: Videobasierte Erkennung von Körperregionen zur Bestimmung der Ausführungsstelle einer Gebärde. 9. Aachener Kolloquium ”Signaltheorie” — Bild-und Sprachsignale, Aachen, March 18–20, pp. 313–316, 1997.

    Google Scholar 

  5. Grobel, K. and H. Hienz: Videobasierte Gebärdenerkennung. In: Kraiss, K.-F. (Ed.): Jahresbericht — Lehrstuhl für Technische Informatik 1995/96, pp. 21–28. Shaker-Verlag, 1997.

    Google Scholar 

  6. Prillwitz, S. et al.: HamNoSys — Hamburg Notation System for Sign Languages, An Introductory Guide, Version 2.0. Signum Press, 1989.

    Google Scholar 

  7. Romainczyk, S.: Bestimmung der Schulterposition von Personen aus Videobildern. Technical Report, Aachen University of Technology (RWTH), Lehrstuhl für Technische Informatik, 1996.

    Google Scholar 

  8. Sobottka, K. and I. Pitas: A Fully Automatic Approach to Facial Feature Detection and Tracking. Proceedings of First International Conference on Audio-and Video-Based Biometric Person Authentification, Crans-Montana, Switzerland, March 12–14, pp. 78–84, Sringer-Verlag, 1997.

    Google Scholar 

  9. Soede, M. (Ed.): SignPS a System for Sign Writing: Final Report. EEC TIDE Project No. 1202, 1997.

    Google Scholar 

  10. Tan, M.: Ansichtenbasierte Handformerkennung in Bildfolgen. Diploma Thesis, Aachen University of Technology (RWTH), Lehrstuhl für Technische Informatik, 1997.

    Google Scholar 

  11. Vetter, B.: Videobasierte Schätzung von Gesichtsbereichen in Echtzeit für die automatische Erkennung der deutschen Gebärdensprache. Technical Report, Aachen University of Technology (RWTH), Lehrstuhl für Technische Informatik, 1997.

    Google Scholar 

  12. Zimmermann, H. J. Fuzzy Set Theory and its Application. 2nd. Edition, Kluwer Acadamic Publisher, 1991.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Ipke Wachsmuth Martin Fröhlich

Rights and permissions

Reprints and permissions

Copyright information

© 1998 Springer-Verlag

About this paper

Cite this paper

Hienz, H., Grobel, K. (1998). Automatic estimation of body regions from video images. In: Wachsmuth, I., Fröhlich, M. (eds) Gesture and Sign Language in Human-Computer Interaction. GW 1997. Lecture Notes in Computer Science, vol 1371. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0052995

Download citation

  • DOI: https://doi.org/10.1007/BFb0052995

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-64424-8

  • Online ISBN: 978-3-540-69782-4

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics