Skip to main content
Log in

Face localization and tracking in the neural abstraction pyramid

  • Original Article
  • Published:
Neural Computing & Applications Aims and scope Submit manuscript

Abstract

One of the major tasks in some human–computer interface applications, such as face recognition and video telephony, is to localize a human face in an image. In this paper, we propose to use hierarchical neural networks with local recurrent connectivity to solve this task not only in unambiguous situations, but also in the presence of complex backgrounds, difficult lighting, and noise. The networks are trained using a database of gray-scale still images and manually determined eye coordinates. They are able to produce reliable and accurate eye coordinates for unknown images by iteratively refining initial solutions. Because the networks process entire images, there is no need for any time-consuming scanning across positions and scales. Furthermore, the fast network updates allow for real-time face tracking. In this case, the networks are trained using still images that move in random directions. The trained networks are able to accurately track the eye positions in the test image sequences.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6a, b
Fig. 7a, b
Fig. 8
Fig. 9a–c
Fig. 10a, b
Fig. 11a, b

Similar content being viewed by others

References

  1. Amari S-I (1977) Dynamics of pattern formation in lateral-inhibition type neural fields. Biol Cybern 27:77–87

    MATH  Google Scholar 

  2. Behnke S (2001) Learning iterative image reconstruction in the neural abstraction pyramid. Int J Comput Intell Appl, special issue on neural networks at IJCAI-2001 1(4):427–438

    Article  Google Scholar 

  3. Behnke S (2003) Hierarchical neural networks for image interpretation. In: Lecture notes in computer science, LNCS, vol 2766. Springer, Berlin Heidelberg New York

  4. Behnke S, Rojas R (1998) Neural abstraction pyramid: a hierarchical image understanding architecture. In: Proceedings of the international joint conference on neural networks (IJCNN’98), Anchorage, Alaska, May 1998, vol 2, pp 820–825

  5. Bengio Y, Simard P, Frasconi P (1994) Learning long-term dependencies with gradient descent is difficult. IEEE Trans Neural Networ 5(2):157–166

    Article  Google Scholar 

  6. Fukushima K, Miyake S, Ito T (1983) Neocognitron: a neural network model for a mechanism of visual pattern recognition. IEEE Trans Syst Man Cybern 13:826–834

    Google Scholar 

  7. Hjelmas E, Kee Low B (2001) Face detection: a survey. Comput Vis Image Und 83:236–274

    Article  MATH  Google Scholar 

  8. Jesorsky O, Kirchberg KJ, Frischholz RW (2001) Robust face detection using the Hausdorff distance. In: Proceedings of the 3rd international conference on audio- and video-based biometric person authentication (AVBPA 2001), Halmstad, Sweden, June 2001. Lecture notes in computer science, vol 2091. Springer, Berlin Heidelberg New York, pp 90–95

  9. LeCun Y, Bottou L, Bengio Y, Haffner P (1998) Gradient-based learning applied to document recognition. Proc IEEE 86(11):2278–2324

    Article  Google Scholar 

  10. Maio D, Maltoni D (2000) Real-time face localization on gray-scale static images. Pattern Recogn 33:1525–1539

    Article  Google Scholar 

  11. Messer K, Matas J, Kittler J, Luettin J, Maitre G (1999) XM2VTSDB: the extended M2VTS database. In: Proceedings of the 2nd international conference on audio- and video-based biometric person authentication (AVBPA’99), Washington, DC, March 1999, pp 72–77

  12. Riedmiller M, Braun H (1993) A direct adaptive method for faster backpropagation learning: the RPROP algorithm. In: Proceedings of the IEEE international conference on neural networks (ICNN’93), San Francisco, California, March 1993, pp 586–591

  13. Rowley HA, Baluja S, Kanade T (1998) Neural network-based face detection. IEEE Trans Pattern Anal Machine Intell 20:23–38

    Article  Google Scholar 

  14. Seung HS (1998) Learning continuous attractors in recurrent networks. In: Jordan MI, Kearns MJ, Solla SA (eds) Advances in neural information processing systems (NIPS), vol 10. MIT Press, Cambridge, Massachusetts, pp 654–660

  15. Williams RJ, Peng J (1990) An efficient gradient-based algorithm for on-line training of recurrent network trajectories. Neural Comput 2(4):491–501

    Google Scholar 

  16. Yang M-H, Kriegman D, Ahuja N (2002) Detecting faces in images: a survey. IEEE Trans Pattern Anal Machine Intell 24(1):34–58

    Article  MATH  Google Scholar 

  17. Viola P, Jones MJ (2004) Robust Real-Time Face Detection. Int J Computer Vision 57(2):137–154

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sven Behnke.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Behnke, S. Face localization and tracking in the neural abstraction pyramid. Neural Comput & Applic 14, 97–103 (2005). https://doi.org/10.1007/s00521-004-0444-x

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00521-004-0444-x

Keywords

Navigation