A novel training weighted ensemble (TWE) with application to face recognition

doi:10.1016/j.asoc.2011.01.032

Applied Soft Computing

Volume 11, Issue 4, June 2011, Pages 3608-3617

https://doi.org/10.1016/j.asoc.2011.01.032 Get rights and content

Abstract

Individual classifiers that are fully trained are unstable especially when the database conditions are changed. Moreover, designing a unique classifier with the suitable parameters to achieve acceptable performance is a non-trivial task. Combined classifiers, which consist of a set of individually trained classifiers, are introduced to avoid the previous problems. There are two key issues in the combination of classifiers. The first issue is how to obtain the set of base classifiers to combine. The second issue is how to fuse the decisions of those classifiers. In this paper, weak Learning Vector Quantization (LVQ) neural networks have been used as base classifiers. Also, a new combination technique which is based on training-weighted voting is introduced. Other factors that greatly affect the performance of a combined classifier are related to the type of the individual classifiers, the training parameters, database size and nature, etc. These factors have been considered in the design of the proposed combined classifier. TWE has been experimentally tested on five standard face databases: Yale, ORL, Grimace, Faces94 and Faces95 and has demonstrated excellent performance. Analysis of the ensemble stability has shown promising results.

Introduction

Many researchers have solved pattern recognition problems by introducing solutions that are based on a single classifier. In their studies, that unique classifier has been trained very well, and then used to recognize the unseen (test) instances. For each test instance, the decision that is taken by this classifier is considered to be the final decision of the introduced solution. The design of a unique classifier that results in excellent percentage of correct classification is very complex especially when each instance has a huge number of attributes, which is the case in face recognition problems. In [16], the problems that face the design of that unique classifier are discussed in detail.

On the other hand, a lot of studies gave other solutions to pattern recognition problems which are based on a combination of multiple classifiers. In their researches, the final decision of the whole solution is taken after combining the decisions of all the individual classifiers. These studies agreed that the decision taken by a combination of multiple classifiers is better than the decision of only one classifier regardless of the strength of this unique classifier. In [16], the advantages of combining multiple classifiers or what is called a multiple classifier system are given. Face recognition, as one of the pattern recognition problems, is considered to be one of the most important fields especially after the 11th of September 2001 events. The need to automatically recognize the people from their faces becomes much imperative than before. In this paper, we introduce a new approach based on combining multiple classifiers instead of depending on one classifier to achieve better classification results. In Section 2, a literature review is given to explain the used techniques in combining the individual classifiers. Section 3 describes the Learning Vector Quantization (LVQ) neural network as a base classifier and explains the proposed approach. In Section 4, the different image databases used in this research like Yale, ORL and Essex are described. In Section 5, in addition to discussing the implementation of the proposed approach, comparisons against other different approaches are presented. Finally, a conclusion about the proposed approach and its ability to reach the objective of this paper is discussed.

Section snippets

Background and literature review

In pattern recognition literature, it has been shown that combining classifiers gives better results than individual classifiers. Some of these studies are in the areas of word recognition [16], facial-gender recognition [45] and face recognition [60]. In [30], mainly four groups that characterize the combination of the individual classifiers (or the Multiple Classifier System) have been identified: the representation of the input, the architecture of the individual classifiers, the

The proposed approach

In the proposed approach we make a combination of individual neural networks. The type of all individual neural networks is unified, but they are different in the main parameters of that type.

Face databases

To fairly evaluate the proposed combined classifier, a variety of research databases were used, which are, Yale database [65], ORL database [3] and Essex database [57]. Essex database, which includes images of persons of different racial origins, has four databases: Grimace [56], Faces94 [53], Faces95 [54], Faces96 [55]. Eventually, we used five databases: Yale, ORL, Grimace, Faces94 and Faces95 databases. Table 1 summarizes the main attributes of the original images of these databases:

Before

Case study: applying the proposed approach to Yale database

In the first step, we trained many individual classifiers. Each classifier is different than the others in the number of epochs, the learning rate, and/or the number of hidden neurons. With Yale database, we found that the suitable mixture of the values of the main parameters that gave us correct recognition accuracy of the training between 80% and 90% (or around these boundaries) are the following values: (1) number of epochs is between 300 and 400, (2) learning rate is between 0.02 and 0.06,

Conclusions

This paper has introduced a novel training weighted classifier ensemble (TWE), to solve the instability problem of individual pattern classifiers. The proposed classifier consists of multiple individual classifiers which are different in both architecture and training parameters. The achieved classification accuracies of the combined classifier outperform those of the best individual classifiers. For fair evaluation of the proposed combined classifier, the most widely used five face databases

Acknowledgement

The authors would like to acknowledge the support of their respective universities.

References (72)

O. Deniz et al.
Face recognition using independent component analysis and support vector machines
Pattern Recognition Letters
(2003)
E.B. Kong et al.
Error-correcting output coding corrects bias and variance
J. Maver et al.
Recognizing 2-tone images in grey-level parametric eigenspaces
Pattern Recognition Letters
(2002)
X.-N. Song et al.
A complete fuzzy discriminant analysis approach for face recognition
Applied Soft Computing
(2010)
L. Qiao et al.
Sparsity preserving projections with applications to face recognition
Pattern Recognition
(2010)
H. Alam et al.
A pair-wise decision fusion framework: recognition of human faces
K.M. Ali et al.
Error reduction through learning multiple descriptions
Machine Learning
(1996)
AT&T Laboratories Cambridge, The ORL Face Image Database, 2002, URL:...
G. Auda et al.
Voting schemes for cooperative neural network classifiers
R. Battiti et al.
Democracy in neural nets: voting schemes for classification
Neural Networks
(1995)

L. Breiman

Bagging predictors

Machine Learning

(1996)

D. Chen et al.

A simple implementation of the stochastic discrimination for pattern recognition

T.G. Dietterich

Machine learning research: four current directions

The AI Magazine

(1998)

R.P.W. Duin et al.

Experiments with classifier combining rules

A.H. El-Baz, Face Recognition using a Combined Neural Network Based Classifier, Ph.D. Dissertation, Mansoura...

L.V. Fausett

Fundamentals of Neural Networks: Architectures, Algorithms, and Applications

(1994)

Y. Freund et al.

A decision-theoretic generalization of on-line learning and an application to boosting

C. Geng et al.

SIFT features for face recognition

L.K. Hansen et al.

Neural network ensembles

IEEE Transactions on Pattern Analysis and Machine Intelligence

(1990)

T.K. Ho et al.

Combination of decisions by multiple classifiers

T.K. Ho

Data complexity analysis for classifier combination

U. Hoffmann et al.

A boosting approach to P300 detection with application to brain–computer interfaces

L. Hong et al.

Integrating faces and fingerprint for personal identification

IEEE Transactions on Pattern Analysis and Machine Intelligence

(1998)

G. Hua

Face recognition by discriminative orthogonal rank-one tensor decomposition

R. Huang et al.

A hybrid face recognition method using Markov random fields

International Conference on Pattern Recognition

(2004)

A.K. Jain et al.

Statistical pattern recognition: a review

IEEE Transactions on Pattern Analysis and Machine Intelligence

(2000)

C. Ji et al.

Combinations of weak classifiers

IEEE Transactions on Neural Networks

(1997)

X. Jiang et al.

Eigenfeature regularization and extraction in face recognition

IEEE Transactions on Pattern Analysis and Machine Intelligence

(2008)

K.I. Kim et al.

Face recognition using support vector machines with local correlation kernels

International Journal of Pattern Recognition and Artificial Intelligence

(2002)

J. Kittler et al.

On combining classifiers

IEEE Transactions on Pattern Analysis and Machine Intelligence

(1998)

E.M. Kleinberg

Stochastic discrimination

Annals of Mathematics and Artificial Intelligence

(1990)

T. Kohonen

Self-organizing formation of topologically correct feature maps

Biological Cybernetics

(1982)

P. Latinne et al.

Combining different methods and numbers of weak decision trees

Pattern Analysis and Applications

(2002)

S. Lawrence et al.

Face recognition: a convolutional neural-network approach

IEEE Transactions on Neural Networks

(1997)

M.Y.F. Leong, J.M. Thomson, Implementation of a Real-Time Automated DSP Based Face Recognition System: Final Report, A...

Z. Li et al.

Face recognition using improved pairwise coupling support vector machines

Cited by (11)

Robust face descriptor in unconstrained environments
2024, Expert Systems with Applications
This research paper presents the novel face recognition (FR) approach by introducing the novel descriptor so-called directional binary pattern (DBP). The directional features are captured from 3 different directions of the 6x6 image patch by using robust thresholding criteria. For each direction, the first order derivatives are generated in 8 neighborhood blocks of the 6x6 pixel window, where each neighborhood block size is 2x2. By applying thresholding criteria on these first order derivatives generate 4 DBP transformed images from one direction therefore from all three directions 12 DBP transformed images are produced. From all DBP transformed images sub-regional histograms are extracted which are combined finally to produce the entire DBP histogram feature. Furthermore magnitude DBP (MDBP) sub-regional features are incorporated with DBP features to enhance the discriminativity. Then 12 DBP histograms and 6 MDBP histograms are merged to form discriminative histogram feature called as robust face descriptor (RFD). PCA and FLDA are used for feature compression, which pick the relevant features for classification. Ultimately classification duty is performed by SVMs and NN. Whole method is evaluated on ORL, GT, Yale, YB, EYB, SOF, JAFFE and Faces94 datasets. The discovered method proves its excellence by defeating the results of various methods.
Softly combining an ensemble of classifiers learned from a single convolutional neural network for scene categorization
2018, Applied Soft Computing Journal
Citation Excerpt :
Obtained scene features from an example image are illustrated in Fig. 6. It has been demonstrated numerous times that ensemble methods can usually yield superior results to single classifiers [65–67]. To make the ensemble methods effective, a diverse set of classifiers are necessary.
In this paper we propose to train an ensemble of classifiers from a single convolutional neural network (CNN) and softly combine these classifiers for scene categorization. Specifically, we explore the hierarchical structure of a CNN to extract multiple types of features from images, and train a multi-class classifier corresponding to each type of features. To combine these classifiers effectively, a soft combination strategy is introduced. Considering the fact that different images may need to be discriminated by using different types of features, we train a set of auxiliary binary-class classifiers to estimate the quality of categorizing an image by using the corresponding multi-class classifiers, so that a dynamic weight can be assigned to each of the multi-class classifiers for combination. On the other hand, because features extracted from different layers of a CNN differ largely in their levels of abstraction, classifiers trained based on these features have quite different capabilities for scene categorization. To address this issue, in the soft combination strategy we adopt the genetic algorithm to learn another set of static weights for the multi-class classifiers for combination. The static weights are to adapt the multi-class classifiers to given datasets. Finally, to categorize an image, the multi-class classifiers are combined by using both dynamic and static weights. We conduct experiments on two challenging benchmark datasets, MIT-indoor scene 67 and SUN397. Experiment results show that the proposed method is effective for scene categorization and can give superior results to state-of-the-art approaches.
Generalizing intersection kernel support vector machines for color texture based recognition
2016, Journal of Visual Communication and Image Representation
Citation Excerpt :
Image texture provides the information about the spatial arrangement of color or intensities in an image or selected region of the image, which has the advantage of rotational invariance and stronger robustness to noise [4,5,7]. From the view of information fusion [8], a stronger feature can be constructed adopting certain fusion strategy from several heterogeneous features by use of their complementarity and redundancy. Previous studies [9,10] have shown that among various fusion schemes for bottom-layer image features, the one fusing color and texture has the attraction of strong discriminative power.
This paper presents a novel recognition approach in which the component-adaptive color co-occurrence matrices (CACCMs) are designed to characterize color and texture cues in the images, while histogram intersection kernel support vector machines (HIKSVMs) are generalized to the version compatible to color co-occurrence matrix (CCM), called CCM intersection kernel support vector machines (CIKSVMs). An ensemble learning framework is proposed for synchronously training the optimal marginal CIKSVMs and corresponding CACCMs’ extractors. This learning architecture is applicable to an arbitrary color space employed for image coding, while we pay utmost attention to a perceptual uniform color space for the prominent potential in image proprieties’ display. For the formulation of recognition algorithm, the set of multi-channel CACCMs (CAMCMs) of per sample is utilized to get a balance between discriminative power and computational efficiency, while multiple marginal CIKSVMs are combined by weighted majority voting. The effectiveness of our approach is validated by promising results obtained from four experimental datasets.
Combining features of negative correlation learning with mixture of experts in proposed ensemble methods
2012, Applied Soft Computing Journal
Citation Excerpt :
There are also hopes that if a classifier fails, the overall system can recover the error [3]. Combining methods [4,5] is an approach to improve the performance in prediction [6,7] and classification [8,9] particularly for complex problems such as those involving limited number of patterns, high-dimensional feature sets, and highly overlapped classes [10,11]. In this research, we focus on the combining methods in which NNs are used as the base classifier in the combining system.
Both theoretical and experimental studies have shown that combining accurate neural networks (NNs) in the ensemble with negative error correlation greatly improves their generalization abilities. Negative correlation learning (NCL) and mixture of experts (ME), two popular combining methods, each employ different special error functions for the simultaneous training of NNs to produce negatively correlated NNs. In this paper, we review the properties of the NCL and ME methods, discussing their advantages and disadvantages. Characterization of both methods showed that they have different but complementary features, so if a hybrid system can be designed to include features of both NCL and ME, it may be better than each of its basis approaches. In this study, two approaches are proposed to combine the features of both methods in order to solve the weaknesses of one method with the strength of the other method, i.e., gated-NCL (G-NCL) and mixture of negatively correlated experts (MNCE). In the first approach, G-NCL, a dynamic combiner of ME is used to combine the outputs of base experts in the NCL method. The suggested combiner method provides an efficient tool to evaluate and combine the NCL experts by the weights estimated dynamically from the inputs based on the different competences of each expert regarding different parts of the problem. In the second approach, MNCE, the capability of a control parameter for NCL is incorporated in the error function of ME, which enables the training algorithm of ME to efficiently adjust the measure of negative correlation between the experts. This control parameter can be regarded as a regularization term added to the error function of ME to establish better balance in bias–variance–covariance trade-offs and thus improves the generalization ability. The two proposed hybrid ensemble methods, G-NCL and MNCE, are compared with their constituent methods, ME and NCL, in solving several benchmark problems. The experimental results show that our proposed methods preserve the advantages and alleviate the disadvantages of their basis approaches, offering significantly improved performance over the original methods.
Reward-Penalty Weighted Ensemble for Emotion State Classification from Multi-Modal Data Streams
2022, International Journal of Neural Systems
Indoor scene recognition based on weighted voting schemes
2019, 2019 European Conference on Mobile Robots, ECMR 2019 - Proceedings

View all citing articles on Scopus

View full text

A novel training weighted ensemble (TWE) with application to face recognition

Abstract

Introduction

Section snippets

Background and literature review

The proposed approach

Face databases

Case study: applying the proposed approach to Yale database

Conclusions

Acknowledgement

Pattern Recognition Letters

Pattern Recognition Letters

Applied Soft Computing

Pattern Recognition

A pair-wise decision fusion framework: recognition of human faces

Error reduction through learning multiple descriptions

Machine Learning

Voting schemes for cooperative neural network classifiers

Democracy in neural nets: voting schemes for classification

Neural Networks

Bagging predictors

Machine Learning

A simple implementation of the stochastic discrimination for pattern recognition

Machine learning research: four current directions

The AI Magazine

Experiments with classifier combining rules

Fundamentals of Neural Networks: Architectures, Algorithms, and Applications

A decision-theoretic generalization of on-line learning and an application to boosting

SIFT features for face recognition

Neural network ensembles

IEEE Transactions on Pattern Analysis and Machine Intelligence

Combination of decisions by multiple classifiers

Data complexity analysis for classifier combination

A boosting approach to P300 detection with application to brain–computer interfaces

Integrating faces and fingerprint for personal identification

IEEE Transactions on Pattern Analysis and Machine Intelligence

Face recognition by discriminative orthogonal rank-one tensor decomposition

A hybrid face recognition method using Markov random fields

International Conference on Pattern Recognition

Statistical pattern recognition: a review

IEEE Transactions on Pattern Analysis and Machine Intelligence

Combinations of weak classifiers

IEEE Transactions on Neural Networks

Eigenfeature regularization and extraction in face recognition

IEEE Transactions on Pattern Analysis and Machine Intelligence

Face recognition using support vector machines with local correlation kernels

International Journal of Pattern Recognition and Artificial Intelligence

On combining classifiers

IEEE Transactions on Pattern Analysis and Machine Intelligence

Stochastic discrimination

Annals of Mathematics and Artificial Intelligence

Self-organizing formation of topologically correct feature maps

Biological Cybernetics

Combining different methods and numbers of weak decision trees

Pattern Analysis and Applications

Face recognition: a convolutional neural-network approach

IEEE Transactions on Neural Networks

Face recognition using improved pairwise coupling support vector machines