Abstract
Content-based image retrieval (CBIR) systems traditionally find images within a database that are similar to a query image using low-level features, such as colour histograms. However, this requires the user to provide an image to the system. It is easier for a user to query the CBIR system using search terms, which requires the image content to be described by semantic labels. Finding a relationship between image features and semantic labels is, however, a challenging problem. This paper aims to discover semantic labels for facial features for use in a face image retrieval system. Face image retrieval traditionally uses global face-image information to determine similarity between images; little has been done in the field to use local face-features and semantic labelling. Our work aims to develop a clustering method for the discovery of semantic labels of face-features. We also present a machine-learning-based face-feature localization mechanism, which we show has promise in providing accurate localization.
Notes
The ffs and d notation is not included in the diagram for clarity.
Known as Stratified K-fold cross validation.
Detailed in Sect. 2.2.1.
About this article
Cite this article
Conilione, P.C., Wang, D. Automatic localization and annotation of facial features using machine learning techniques. Soft Comput 15, 1231–1245 (2011). https://doi.org/10.1007/s00500-010-0586-y
DOI: https://doi.org/10.1007/s00500-010-0586-y