Skip to main content
Log in

Online annotation of faces in personal videos by sequential learning

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

This paper addresses semi-automatic annotation of faces in personal videos. Different from previous offline annotation systems, this paper studies online annotation of faces. During an annotation session, few annotations are requested from the user only for some part of the video online. These annotations are used to train a system that will perform annotation automatically for the rest of the video. The automatic annotation results are presented to the user during the same session and the user is allowed to correct any automatic annotation mistakes. Thus, only fast and accurate face recognition methods are considered. Instead of batch learning, which has been used in the existing annotation systems, this paper proposes sequential learning methods to be used as face classifiers. Different classification methods are tested with various feature extraction methods using the same database so that a fair comparison is made among them. The results are evaluated in terms of recognition accuracies and execution time requirements.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13

Similar content being viewed by others

References

  1. Arandjelovic O, Zisserman A (2005) Automatic face recognition for film character retrieval in feature-length films. IEEE Comput Soc Conf Comput Vis Pattern Recognit(CVPR’05) 1:860–867

    Google Scholar 

  2. Bach F, Lanckriet G, Jordan M (2004) Multiple kernel learning, conic duality and the SMO algorithm, In International Conference on Machine Learning, NY, USA, pp. 6

  3. Belhumeur PN, Hespanha JP, Kriegman DJ (1997) Eigenfaces vs. Fisherfaces: recognition using class specific linear projection. IEEE Trans Pattern Anal Mach Intell 19(7):711–720

    Article  Google Scholar 

  4. Bouguet J (2000) Pyramidal implementation of the Lucas-Kanade feature tracker: description of the algorithm. Technical report, OpenCV Document, Intel Microprocessor Research Labs

  5. Chang CC, Lin CJ (2001) LIBSVM, www.csie.ntu.edu.tu/~cjlin/libsvm/library

  6. Dalal N, Triggs B (2005) Histogram of Oriented Gradients for human detection. IEEE Comput Soc Conf Comp Vis Pattern Recognit 1:886–893

    Google Scholar 

  7. Domeniconi C, Gunopulos D (2001) Incremental Support Vector Machine construction. Proceedings IEEE International Conference on Data Mining, pp. 589–592

  8. Ekenel HK, Stiefelhagen R (2005) Local appearance based face recognition using Discrete Cosine Transform. Proceedings of the 13th European Signal Processing Conference (EUSIPCO 2005)

  9. Everingham M, Sivic J, Zisserman A (2006) “Hello! My name is… Buffy”—Automatic Naming of Characters in TV Video. British Machine Vision Conference

  10. Everingham M, Sivic J, Zisserman A (2009) Taking the bite out of automated naming of characters in TV video. Image Vision Comput 27(5):545–559

    Article  Google Scholar 

  11. Fischer M (2008) Automatic identification of persons in TV series. Universität Karlsruhe (TH) M.S. Thesis

  12. Lowe DG (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vis 60(2):91–110

    Article  Google Scholar 

  13. Ramanan D, Baker S, Kakade S (2007) Leveraging archival video for building face datasets. IEEE 11th International Conference on ICCV 2007, pp. 1–8

  14. Rowley HA, Baluja S, Kanade T (1998) Neural network based face detection. IEEE Trans PAMI 20(1):23–38

    Article  Google Scholar 

  15. Satoh S (2000) Comparative evaluation of face sequence matching for content-based video access. Fourth IEEE International Conference on Automatic Face and Gesture Recognition, pp. 163–168

  16. Shaoning P, Ozawa S, Kasabov N (2005) Incremental linear discriminant analysis for classification of data streams. IEEE Trans Syst Man Cybern 35(5):905–914

    Article  Google Scholar 

  17. Shi J, Tomasi C (1994) Good features to track. Proc IEEE Comput Soc Conf Comput. Vision and Pattern Recogn pp. 593–600

  18. Sivic J, Everingham M, Zisserman A (2005) Person Spotting: video shot retrieval for face sets. Proceedings of the 4th Conference on Image and Video Retrieval, pp. 226–236

  19. Sivic J, Everingham M, Zisserman A (2009) “Who are you?”—Learning person specific classifiers from video. IEEE Conference on Computer Vision and Pattern Recognition, pp. 1145–1152

  20. Syed NA, Liu H, Sung KK (1999) Incremental Learning with Support Vector Machines. International Joint Conference on Artificial Intelligence (IJCAI)

  21. Viola P, Jones M (2001) Rapid object detection using a boosted cascade of simple features. Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 1, pp. 511–518

  22. Wolf L, Hassner T, Taigman Y (2008) Descriptor based methods in the wild. Real Life Images Workshop at the European Conference on Computer Vision (ECCV)

  23. Yilmazturk MC, Ulusoy I, Cicekli N (2010) An analysis of different methods for online and semi-automatic annotation of faces in videos. ISCIS 2010, London, UK

  24. Zhu J, Hoi SCH, Lyu MR (2008) Face annotation using transductive kernel Fisher discriminant. IEEE Trans Multimed 86–96

Download references

Acknowledgments

This work is partially supported by The Scientific and Technical Council of Turkey Grant TUBITAK EEEAG-107E234.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to I. Ulusoy.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Yilmazturk, M.C., Ulusoy, I. & Cicekli, N.K. Online annotation of faces in personal videos by sequential learning. Multimed Tools Appl 63, 591–613 (2013). https://doi.org/10.1007/s11042-011-0884-0

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-011-0884-0

Keywords

Navigation