Abstract
One of the major tasks in some human–computer interface applications, such as face recognition and video telephony, is to localize a human face in an image. In this paper, we propose to use hierarchical neural networks with local recurrent connectivity to solve this task not only in unambiguous situations, but also in the presence of complex backgrounds, difficult lighting, and noise. The networks are trained using a database of gray-scale still images and manually determined eye coordinates. They are able to produce reliable and accurate eye coordinates for unknown images by iteratively refining initial solutions. Because the networks process entire images, there is no need for any time-consuming scanning across positions and scales. Furthermore, the fast network updates allow for real-time face tracking. In this case, the networks are trained using still images that move in random directions. The trained networks are able to accurately track the eye positions in the test image sequences.
Similar content being viewed by others
References
Amari S-I (1977) Dynamics of pattern formation in lateral-inhibition type neural fields. Biol Cybern 27:77–87
Behnke S (2001) Learning iterative image reconstruction in the neural abstraction pyramid. Int J Comput Intell Appl, special issue on neural networks at IJCAI-2001 1(4):427–438
Behnke S (2003) Hierarchical neural networks for image interpretation. In: Lecture notes in computer science, LNCS, vol 2766. Springer, Berlin Heidelberg New York
Behnke S, Rojas R (1998) Neural abstraction pyramid: a hierarchical image understanding architecture. In: Proceedings of the international joint conference on neural networks (IJCNN’98), Anchorage, Alaska, May 1998, vol 2, pp 820–825
Bengio Y, Simard P, Frasconi P (1994) Learning long-term dependencies with gradient descent is difficult. IEEE Trans Neural Networ 5(2):157–166
Fukushima K, Miyake S, Ito T (1983) Neocognitron: a neural network model for a mechanism of visual pattern recognition. IEEE Trans Syst Man Cybern 13:826–834
Hjelmas E, Kee Low B (2001) Face detection: a survey. Comput Vis Image Und 83:236–274
Jesorsky O, Kirchberg KJ, Frischholz RW (2001) Robust face detection using the Hausdorff distance. In: Proceedings of the 3rd international conference on audio- and video-based biometric person authentication (AVBPA 2001), Halmstad, Sweden, June 2001. Lecture notes in computer science, vol 2091. Springer, Berlin Heidelberg New York, pp 90–95
LeCun Y, Bottou L, Bengio Y, Haffner P (1998) Gradient-based learning applied to document recognition. Proc IEEE 86(11):2278–2324
Maio D, Maltoni D (2000) Real-time face localization on gray-scale static images. Pattern Recogn 33:1525–1539
Messer K, Matas J, Kittler J, Luettin J, Maitre G (1999) XM2VTSDB: the extended M2VTS database. In: Proceedings of the 2nd international conference on audio- and video-based biometric person authentication (AVBPA’99), Washington, DC, March 1999, pp 72–77
Riedmiller M, Braun H (1993) A direct adaptive method for faster backpropagation learning: the RPROP algorithm. In: Proceedings of the IEEE international conference on neural networks (ICNN’93), San Francisco, California, March 1993, pp 586–591
Rowley HA, Baluja S, Kanade T (1998) Neural network-based face detection. IEEE Trans Pattern Anal Machine Intell 20:23–38
Seung HS (1998) Learning continuous attractors in recurrent networks. In: Jordan MI, Kearns MJ, Solla SA (eds) Advances in neural information processing systems (NIPS), vol 10. MIT Press, Cambridge, Massachusetts, pp 654–660
Williams RJ, Peng J (1990) An efficient gradient-based algorithm for on-line training of recurrent network trajectories. Neural Comput 2(4):491–501
Yang M-H, Kriegman D, Ahuja N (2002) Detecting faces in images: a survey. IEEE Trans Pattern Anal Machine Intell 24(1):34–58
Viola P, Jones MJ (2004) Robust Real-Time Face Detection. Int J Computer Vision 57(2):137–154
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Behnke, S. Face localization and tracking in the neural abstraction pyramid. Neural Comput & Applic 14, 97–103 (2005). https://doi.org/10.1007/s00521-004-0444-x
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00521-004-0444-x