Abstract
Visual face tracking is an important building block for intelligent living and working spaces, as it locates persons without human intervention and without requiring users to carry sensors. In this paper we present a novel face tracking system built on a particle filtering framework that facilitates the use of non-linear visual measurements on the facial area. We concentrate on three such non-linear measurement cues, namely object detection, foreground segmentation and colour matching. We derive robust measurement likelihoods under a unified representation scheme and fuse them into our face tracking algorithm. This algorithm is complemented with an optimum selection of the particle filter’s object model and a target handling scheme. The resulting face tracking system is extensively evaluated and compared to baseline systems.
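As a rough illustration of the idea summarised above, the following minimal sketch runs one predict/update/resample cycle of a bootstrap particle filter over a 2-D face position and fuses three cue likelihoods. It is not the authors' algorithm: the product fusion rule (which assumes conditionally independent cues), the random-walk motion model, the toy cue functions and all names (`pf_step`, `fuse`, `CUES`, the numeric constants) are assumptions made purely for illustration.

```python
import math
import random


def gaussian(d2, sigma):
    """Unnormalised Gaussian likelihood for a squared distance d2."""
    return math.exp(-0.5 * d2 / (sigma * sigma))


def fuse(likelihoods):
    """Fuse per-cue likelihoods by product (assumes independent cues)."""
    w = 1.0
    for value in likelihoods:
        w *= value
    return w


def systematic_resample(particles, weights, rng):
    """Draw len(particles) samples with probability proportional to weights."""
    n = len(particles)
    step = sum(weights) / n
    u = rng.uniform(0.0, step)
    out, cum, i = [], weights[0], 0
    for k in range(n):
        target = u + k * step
        while cum < target and i < n - 1:
            i += 1
            cum += weights[i]
        out.append(particles[i])
    return out


def pf_step(particles, cues, rng, motion_sigma=5.0):
    """One cycle: random-walk prediction, cue fusion, estimate, resampling."""
    predicted = [(x + rng.gauss(0.0, motion_sigma),
                  y + rng.gauss(0.0, motion_sigma)) for x, y in particles]
    # Tiny floor keeps the weight sum strictly positive.
    weights = [fuse(c(x, y) for c in cues) + 1e-300 for x, y in predicted]
    total = sum(weights)
    estimate = (sum(w * p[0] for w, p in zip(weights, predicted)) / total,
                sum(w * p[1] for w, p in zip(weights, predicted)) / total)
    return systematic_resample(predicted, weights, rng), estimate


# Hypothetical stand-ins for the three cues named in the abstract.
def detection_cue(x, y):
    # Pretend a face detector fired at pixel (100, 80).
    return gaussian((x - 100.0) ** 2 + (y - 80.0) ** 2, 10.0)


def foreground_cue(x, y):
    # Pretend the foreground mask covers a box around the person.
    return 1.0 if 60.0 <= x <= 140.0 and 40.0 <= y <= 120.0 else 0.1


def colour_cue(x, y):
    # Pretend the colour model matches best near (102, 79).
    return gaussian((x - 102.0) ** 2 + (y - 79.0) ** 2, 20.0)


CUES = [detection_cue, foreground_cue, colour_cue]

if __name__ == "__main__":
    rng = random.Random(0)
    cloud = [(rng.uniform(0, 200), rng.uniform(0, 160)) for _ in range(500)]
    for _ in range(10):
        cloud, est = pf_step(cloud, CUES, rng)
    print("estimated face position:", est)
```

In a real tracker each cue function would be evaluated on image data (detector responses, a foreground mask, a colour histogram comparison) rather than on fixed coordinates, and the state would typically also include the size of the face region.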











Notes
To obtain the sequence and the annotations, contact the authors.
See an example at http://www.youtube.com/watch?v=iI7mWvf0g1M.
Additional information
Part of this work has been carried out in the scope of the EC co-funded projects SMART (FP7-287583) and eWALL (FP7-610658).
About this article
Cite this article
Katsarakis, N., Pnevmatikakis, A., Tan, ZH. et al. Combination of Multiple Measurement Cues for Visual Face Tracking. Wireless Pers Commun 78, 1789–1810 (2014). https://doi.org/10.1007/s11277-014-1900-2
DOI: https://doi.org/10.1007/s11277-014-1900-2