Abstract
This paper addresses semi-automatic annotation of faces in personal videos. Different from previous offline annotation systems, this paper studies online annotation of faces. During an annotation session, few annotations are requested from the user only for some part of the video online. These annotations are used to train a system that will perform annotation automatically for the rest of the video. The automatic annotation results are presented to the user during the same session and the user is allowed to correct any automatic annotation mistakes. Thus, only fast and accurate face recognition methods are considered. Instead of batch learning, which has been used in the existing annotation systems, this paper proposes sequential learning methods to be used as face classifiers. Different classification methods are tested with various feature extraction methods using the same database so that a fair comparison is made among them. The results are evaluated in terms of recognition accuracies and execution time requirements.
Similar content being viewed by others
References
Arandjelovic O, Zisserman A (2005) Automatic face recognition for film character retrieval in feature-length films. IEEE Comput Soc Conf Comput Vis Pattern Recognit(CVPR’05) 1:860–867
Bach F, Lanckriet G, Jordan M (2004) Multiple kernel learning, conic duality and the SMO algorithm, In International Conference on Machine Learning, NY, USA, pp. 6
Belhumeur PN, Hespanha JP, Kriegman DJ (1997) Eigenfaces vs. Fisherfaces: recognition using class specific linear projection. IEEE Trans Pattern Anal Mach Intell 19(7):711–720
Bouguet J (2000) Pyramidal implementation of the Lucas-Kanade feature tracker: description of the algorithm. Technical report, OpenCV Document, Intel Microprocessor Research Labs
Chang CC, Lin CJ (2001) LIBSVM, www.csie.ntu.edu.tu/~cjlin/libsvm/library
Dalal N, Triggs B (2005) Histogram of Oriented Gradients for human detection. IEEE Comput Soc Conf Comp Vis Pattern Recognit 1:886–893
Domeniconi C, Gunopulos D (2001) Incremental Support Vector Machine construction. Proceedings IEEE International Conference on Data Mining, pp. 589–592
Ekenel HK, Stiefelhagen R (2005) Local appearance based face recognition using Discrete Cosine Transform. Proceedings of the 13th European Signal Processing Conference (EUSIPCO 2005)
Everingham M, Sivic J, Zisserman A (2006) “Hello! My name is… Buffy”—Automatic Naming of Characters in TV Video. British Machine Vision Conference
Everingham M, Sivic J, Zisserman A (2009) Taking the bite out of automated naming of characters in TV video. Image Vision Comput 27(5):545–559
Fischer M (2008) Automatic identification of persons in TV series. Universität Karlsruhe (TH) M.S. Thesis
Lowe DG (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vis 60(2):91–110
Ramanan D, Baker S, Kakade S (2007) Leveraging archival video for building face datasets. IEEE 11th International Conference on ICCV 2007, pp. 1–8
Rowley HA, Baluja S, Kanade T (1998) Neural network based face detection. IEEE Trans PAMI 20(1):23–38
Satoh S (2000) Comparative evaluation of face sequence matching for content-based video access. Fourth IEEE International Conference on Automatic Face and Gesture Recognition, pp. 163–168
Shaoning P, Ozawa S, Kasabov N (2005) Incremental linear discriminant analysis for classification of data streams. IEEE Trans Syst Man Cybern 35(5):905–914
Shi J, Tomasi C (1994) Good features to track. Proc IEEE Comput Soc Conf Comput. Vision and Pattern Recogn pp. 593–600
Sivic J, Everingham M, Zisserman A (2005) Person Spotting: video shot retrieval for face sets. Proceedings of the 4th Conference on Image and Video Retrieval, pp. 226–236
Sivic J, Everingham M, Zisserman A (2009) “Who are you?”—Learning person specific classifiers from video. IEEE Conference on Computer Vision and Pattern Recognition, pp. 1145–1152
Syed NA, Liu H, Sung KK (1999) Incremental Learning with Support Vector Machines. International Joint Conference on Artificial Intelligence (IJCAI)
Viola P, Jones M (2001) Rapid object detection using a boosted cascade of simple features. Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 1, pp. 511–518
Wolf L, Hassner T, Taigman Y (2008) Descriptor based methods in the wild. Real Life Images Workshop at the European Conference on Computer Vision (ECCV)
Yilmazturk MC, Ulusoy I, Cicekli N (2010) An analysis of different methods for online and semi-automatic annotation of faces in videos. ISCIS 2010, London, UK
Zhu J, Hoi SCH, Lyu MR (2008) Face annotation using transductive kernel Fisher discriminant. IEEE Trans Multimed 86–96
Acknowledgments
This work is partially supported by The Scientific and Technical Council of Turkey Grant TUBITAK EEEAG-107E234.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Yilmazturk, M.C., Ulusoy, I. & Cicekli, N.K. Online annotation of faces in personal videos by sequential learning. Multimed Tools Appl 63, 591–613 (2013). https://doi.org/10.1007/s11042-011-0884-0
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-011-0884-0