Skip to main content
Log in

Vision-based arm gesture recognition for a long-range human–robot interaction

  • Published:
The Journal of Supercomputing Aims and scope Submit manuscript

Abstract

This paper proposes a vision-based human arm gesture recognition method for human–robot interaction, particularly at a long distance where speech information is not available. We define four meaningful arm gestures for a long-range interaction. The proposed method is capable of recognizing the defined gestures only with 320×240 pixel-sized low-resolution input images captured from a single camera at a long distance, approximately five meters from the camera. In addition, the system differentiates the target gestures from the users’ normal actions that occur in daily life without any constraints. For human detection at a long distance, the proposed approach combines results from mean-shift color tracking, short- and long-range face detection, and omega shape detection. The system then detects arm blocks using a background subtraction method with a background updating module and recognizes the target gestures based on information about the region, periodical motion, and shape of the arm blocks. From experiments using a large realistic database, a recognition rate of 97.235% is achieved, which is a sufficiently practical level for various pervasive and ubiquitous applications based on human gestures.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Richarz J, Scheidig A, Martin C, Muller S, Gross H (2007) A monocular pointing pose estimator for gestural instruction of a mobile robot. Int J Adv Robot Syst 4(1):139–150

    Google Scholar 

  2. Bremner P, Pipe A, Melhuish C, Fraser M, Subramanian S (2009) Conversational gestures in human–robot interaction. In: IEEE int conf on systems, man, and cybernetics, pp 1645–1649

    Google Scholar 

  3. Aggarwal J, Cai Q (1999) Human motion analysis: a review. Comput Vis Image Underst 73(3):428–440

    Article  Google Scholar 

  4. Davis JW (2001) Hierarchical motion history images for recognizing human motion. In: IEEE workshop on detection and recognition of events in video, pp 39–46

    Chapter  Google Scholar 

  5. Yin X, Zhu X (2006) Hand posture recognition in gesture-based human–robot interaction. In: IEEE int conf on industrial electronics and applications, pp 1–6

    Google Scholar 

  6. Yang H, Park A, Lee S (2007) Gesture spotting and recognition for human–robot interaction. IEEE Trans Robot 23(2):256–270

    Article  Google Scholar 

  7. Hwang B, Kim S, Lee S (2006) A full-body gesture database for automatic gesture recognition. In: Int conf on automatic face and gesture recognition, pp 243–248

    Chapter  Google Scholar 

  8. Li H, Greenspan M (2005) Multi-scale gesture recognition from time-varying contours. In: IEEE int conf on computer vision, pp 236–243

    Google Scholar 

  9. Bien Z, Do J, Kim J, Stefanov D, Park K (2003) User-friendly interaction/interface control of intelligent home for movement-disabled people. In: Int conf on human–computer interaction

    Google Scholar 

  10. Kim S, Lee J, Lee R, Hwang E, Chung M (2008) User-friendly personal photo browsing for mobile devices. ETRI J 30(3):432–440

    Article  Google Scholar 

  11. Medioni G, Choi J, Kuo C, Choudhury A, Zhang L, Fidaleo D (2007) Non-cooperative persons identification at a distance with 3d face modeling. In: IEEE int conf on biometrics, pp 1–6

    Google Scholar 

  12. Baek K, Jang H, Han Y, Hahn H (2005) Efficient small face detection in surveillance images using major color component and LDA scheme. Lect Notes Comput Sci 3802:285–290

    Article  Google Scholar 

  13. Hayashi S, Hasegawa O (2006) A detection techniques for degraded face images. In: IEEE int conf on computer vision and pattern recognition, pp 1506–1512

    Google Scholar 

  14. Comaniciu D, Meer P (2002) Mean shift: a robust approach toward feature space analysis. IEEE Trans Pattern Anal Mach Intell 24(5):603–619

    Article  Google Scholar 

  15. Viola P, Jones M (2001) Rapid object detection using a boosted cascade of simple features. In: IEEE conf. on computer vision and pattern recognition, pp I-511–I-518

    Google Scholar 

  16. Jun B, Kim D (2007) Robust real-time face detection using face certainty map. In: Proceeding of the 2nd international conference on biometrics (ICB 2007), vol 4642, pp 29–38

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to DoHyung Kim.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Kim, D., Lee, J., Yoon, HS. et al. Vision-based arm gesture recognition for a long-range human–robot interaction. J Supercomput 65, 336–352 (2013). https://doi.org/10.1007/s11227-010-0541-9

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11227-010-0541-9

Keywords

Navigation