Elsevier

Pattern Recognition

Volume 43, Issue 4, April 2010, Pages 1431-1440
Pattern Recognition

Feature extraction using discrete cosine transform and discrimination power analysis with a face recognition technology

https://doi.org/10.1016/j.patcog.2009.11.001Get rights and content

Abstract

Discrete cosine transform (DCT) is a powerful transform to extract proper features for face recognition. After applying DCT to the entire face images, some of the coefficients are selected to construct feature vectors. Most of the conventional approaches select coefficients in a zigzag manner or by zonal masking. In some cases, the low-frequency coefficients are discarded in order to compensate illumination variations. Since the discrimination power of all the coefficients is not the same and some of them are discriminant than others, so we can achieve a higher true recognition rate by using discriminant coefficients (DCs) as feature vectors. Discrimination power analysis (DPA) is a statistical analysis based on the DCT coefficients properties and discrimination concept. It searches for the coefficients which have more power to discriminate different classes better than others. The proposed approach, against the conventional approaches, is data-dependent and is able to find DCs on each database. The simulations results of the various coefficient selection (CS) approaches on ORL and Yale databases confirm the success of the proposed approach. The DPA-based approaches achieve the performance of PCA/LDA or better with less complexity. The proposed method can be implemented for any feature selection problem as well as DCT coefficients. Also, a new modification of PCA and LDA is proposed namely, DPA–PCA and DPA–LDA. In these modifications DCs which are selected by DPA are used as the input of these transforms. Simulation results of DPA–PCA and DPA–LDA on the ORL and Yale database verify the improvement of the results by using these new modifications.

Introduction

Researches in the field of face recognition have become an attractive subject in the recent years. Activities in this field come from its applications in different areas such as security and surveillance, commercial and law enforcement. Ability for implementation in real time has intensified the attention to this field. Although research in the field of face recognition is active over 30 years and considerable successes in face recognition systems have been achieved, still there are some unsolved problems in it. Illumination variation, rotation and facial expression are the basic existing challenges in this area.

A wide variety of approaches have been developed by researchers for face recognition problem (see [1], [2] for a survey). From one point of view, these various approaches are categorized into two general groups, namely feature-based and holistic approach. In feature-based approaches, shapes and geometrical relationships of the individual facial features including eyes, mouth and nose are analyzed. But in holistic approaches, the face images are analyzed as two-dimensional holistic patterns. Feature-based approaches are more robust against rotation, scale and illumination variations, but their success depends on the facial feature detection. Due to difficulties in facial feature detection, holistic approaches are considered much more frequently than feature-based approaches [1].

Extracting proper features is crucial for satisfactory design of any pattern classifier system. In this way, two types of discrete transforms, statistical and deterministic, have been widely used for feature extraction and data redundancy reduction. The basis vectors of the statistical transforms depend on the statistical specification of the database and different basis vectors are possible for different databases. Deterministic transforms have invariant basis vectors which are independent of the database. Although statistical transforms have a great ability to remove correlation between data, they have high computational complexity. Also computation of the basis vectors for each given database is needed.

Among statistical approaches, principal component analysis (PCA) and linear discriminant analysis (LDA) are two powerful statistical tools for feature extraction and data representation. Kirby and Sirovich [3] were the first to employ Karhunen–Loeve transform (KLT) to represent facial images. Then Turk and Pentland [4]developed a PCA based approach namely “Eigenface”. PCA extracts features that are most efficient for representation, which may not be very useful for classification. Etemad and Chelappa [5], Belhumeur et al. [6] and Zhao et al. [7] then proposed the LDA “Fisherface” method to extract features that are most efficient for classification. Because of some limitations of the PCA and LDA, a variety of modifications have been proposed by authors. For more details see [8], [9], [10], [11], [12], [13], [14], [15], [16].

Proportional advantages of deterministic transforms make them an interesting class of feature extraction approaches. Discrete Fourier transform (DFT) [17], wavelet transform [18], [19], [20] and discrete cosine transform (DCT) [21], [22], [23], [24], [25], [26], [27], [28] are the important approaches of this class. Some special properties of DCT make it a powerful transform in image processing applications, including face recognition. DCT is very close to the KLT and has a strong ability for data decorrelation [26]. There are fast algorithms for DCT realization, which is not the case for KLT.

Combination of statistical and deterministic transforms constructs a third type of feature extraction approaches with both advantages. In this type, DCT reduces the dimension of data to avoid singularity and decreases the computational cost of PCA and LDA. Chen et al. [29] showed that using the PCA or the LDA in the DCT domain yields the same results as the one obtained from the spatial domain. Various combinations of the DCT, PCA and LDA have been surveyed by Pnevmatikakis and Polymenakos [30]. Ramasubramanian and Venkatesh [22] used a combination of the DCT, PCA and the characteristics of the human visual system for encoding and recognition of faces.

After applying the DCT to an image, some coefficients are selected and others are discarded in data dimension reduction. The selection of the DCT coefficients is an important part of the feature extraction process. In most of the approaches which utilize the DCT, not enough attention was given to coefficients selection (CS). The coefficients are usually selected with conventional methods such as zigzag or zonal masking. These conventional approaches are not necessarily efficient in all the applications and for all the databases. Discrimination power analysis (DPA) is a novel approach which selects features (DCT coefficients) with respect to their discrimination power. DPA utilizes statistical analysing of a database, associates each DCT coefficient discrimination to a number, and generates a CS mask. Our proposed feature extraction approach is database dependent and is able to find the best discriminant coefficients for each database.

The rest of the paper is organized as follows: Section 2 is devoted to the feature extraction approach. The whole procedure of our approach is depicted in Section 3. Experimental results and discussion are presented in Section 4, and finally Section 5 includes conclusion.

Section snippets

Feature extraction

In this section, various DCT feature extraction approaches are considered and a new efficient approach is proposed. DCT feature extraction consists of two stages. In the first stage, the DCT is applied to the entire image to obtain the DCT coefficients, and then some of the coefficients are selected to construct feature vectors in the second stage. Dimension of the DCT coefficient matrix is the same as the input image. In fact the DCT, by itself, does not decrease data dimension; so it

The whole procedure

In order to evaluate our proposed method, a simplified face recognition system has been used. Fig. 4 shows the whole procedure of our face recognition approach. This block diagram is proposed only for evaluating our proposed method and is not a real application system. It is clear that a real application system needs some other extra blocks such as preprocessing, etc. Also a better classifier rather than Euclidean distance (ED) is required in a real application. The block diagram is illustrated

Experimental results and discussion

In order to evaluate the proposed approaches, our experiments are performed on two benchmark face databases: the ORL and the Yale database. Besides, for each database, various CS approaches have been compared. All approaches utilize the ED classifier. The experiments are programmed in the MATLAB language (Version 7). In all of the simulations, the database is randomly divided to train and test sets. Five and six images of each individual, respectively, in the ORL and the Yale databases, have

Conclusion

After surveying the various CS approaches in the DCT domain, a new category of CS approaches was proposed. DPA is a data-dependent approach which utilizes the statistical analysis in order to find the most discriminant coefficients. A pm limits the search area and reaps two benefits: first is the better performance especially with a small number of training images, and second is the computational cost reduction. Also a new modification to original PCA and LDA in the DCT domain was proposed,

About the Author—SAEED DABBAGHCHIAN received his B.S. degree from Urmia University, Urmia, Iran, in 2003, and his M.S. from the University of Tabriz, Tabriz, Iran, in 2005, both in electrical engineering. His current research interests include feature extraction, pattern recognition and speech recognition.

References (35)

  • M. Turk et al.

    Eigenfaces for recognition

    International Journal of Cognitive Neuroscience

    (1991)
  • K. Etemad et al.

    Discriminant analysis for recognition of human face images

    Journal of the Optical Society of America A

    (1997)
  • P.N. Belhumeur et al.

    Eigenfaces vs. Fisherfaces: recognition using class specific linear projection

    IEEE Transactions on Pattern Analysis and Machine Intelligence

    (1997)
  • W. Zhao et al.

    Discriminant analysis of principal components for face recognition

    ICFGR

    (1998)
  • J. Lu et al.

    Face recognition using LDA-based algorithms

    IEEE Transactions on Neural Networks

    (2003)
  • Q. Liu et al.

    Improving kernel fisher discriminant analysis for face recognition

    IEEE Transactions on Circuits and Systems for Video Technology

    (2004)
  • M.S. Bartlet et al.

    Face recognition by independent component analysis

    IEEE Transactions on Neural Networks

    (2002)
  • Cited by (131)

    • Zoom based image super-resolution using DCT with LBP as characteristic model

      2022, Journal of King Saud University - Computer and Information Sciences
      Citation Excerpt :

      The succeeding section contains the more detailed explanation of the proposed approach. In the field of image processing, the Discrete Cosine Transform (DCT) is the famous technique of analysis (Rhee and Kang, 1999; Parmar and Scholar, 2014; Dabbaghchian et al., 2010). The drawback of two-dimension wavelet is that it is unable to capture directional information of image.

    View all citing articles on Scopus

    About the Author—SAEED DABBAGHCHIAN received his B.S. degree from Urmia University, Urmia, Iran, in 2003, and his M.S. from the University of Tabriz, Tabriz, Iran, in 2005, both in electrical engineering. His current research interests include feature extraction, pattern recognition and speech recognition.

    About the Author—MASOUMEH P. GHAEMMAGHAMI received his B.S. degree from Urmia University, Urmia, Iran, in 2005, and his M.S. from the Science and Research Branch of Islamic Azad University, Tehran, Iran, in 2009, both in electrical engineering. Her research interests have been focused on the pattern recognition and robust speech recognition.

    About the Author—ALI AGHAGOLZADEH received his B.S. degree from Tabriz University, Tabriz, Iran, in 1985, and his M.S. from Illinois Institute of Technology, USA, in 1988, and Ph.D. from Purdue University, USA, in 1991, all in electrical engineering. He is a Professor of Faculty of Electrical and Computer Engineering in Tabriz University, Tabriz, Iran. His research interests have been focused on image processing and computer vision, image and video coding and transmission.

    View full text