Feature extraction using discrete cosine transform and discrimination power analysis with a face recognition technology

doi:10.1016/j.patcog.2009.11.001

Pattern Recognition

Volume 43, Issue 4, April 2010, Pages 1431-1440

https://doi.org/10.1016/j.patcog.2009.11.001 Get rights and content

Abstract

Discrete cosine transform (DCT) is a powerful transform to extract proper features for face recognition. After applying DCT to the entire face images, some of the coefficients are selected to construct feature vectors. Most of the conventional approaches select coefficients in a zigzag manner or by zonal masking. In some cases, the low-frequency coefficients are discarded in order to compensate illumination variations. Since the discrimination power of all the coefficients is not the same and some of them are discriminant than others, so we can achieve a higher true recognition rate by using discriminant coefficients (DCs) as feature vectors. Discrimination power analysis (DPA) is a statistical analysis based on the DCT coefficients properties and discrimination concept. It searches for the coefficients which have more power to discriminate different classes better than others. The proposed approach, against the conventional approaches, is data-dependent and is able to find DCs on each database. The simulations results of the various coefficient selection (CS) approaches on ORL and Yale databases confirm the success of the proposed approach. The DPA-based approaches achieve the performance of PCA/LDA or better with less complexity. The proposed method can be implemented for any feature selection problem as well as DCT coefficients. Also, a new modification of PCA and LDA is proposed namely, DPA–PCA and DPA–LDA. In these modifications DCs which are selected by DPA are used as the input of these transforms. Simulation results of DPA–PCA and DPA–LDA on the ORL and Yale database verify the improvement of the results by using these new modifications.

Introduction

Researches in the field of face recognition have become an attractive subject in the recent years. Activities in this field come from its applications in different areas such as security and surveillance, commercial and law enforcement. Ability for implementation in real time has intensified the attention to this field. Although research in the field of face recognition is active over 30 years and considerable successes in face recognition systems have been achieved, still there are some unsolved problems in it. Illumination variation, rotation and facial expression are the basic existing challenges in this area.

A wide variety of approaches have been developed by researchers for face recognition problem (see [1], [2] for a survey). From one point of view, these various approaches are categorized into two general groups, namely feature-based and holistic approach. In feature-based approaches, shapes and geometrical relationships of the individual facial features including eyes, mouth and nose are analyzed. But in holistic approaches, the face images are analyzed as two-dimensional holistic patterns. Feature-based approaches are more robust against rotation, scale and illumination variations, but their success depends on the facial feature detection. Due to difficulties in facial feature detection, holistic approaches are considered much more frequently than feature-based approaches [1].

Extracting proper features is crucial for satisfactory design of any pattern classifier system. In this way, two types of discrete transforms, statistical and deterministic, have been widely used for feature extraction and data redundancy reduction. The basis vectors of the statistical transforms depend on the statistical specification of the database and different basis vectors are possible for different databases. Deterministic transforms have invariant basis vectors which are independent of the database. Although statistical transforms have a great ability to remove correlation between data, they have high computational complexity. Also computation of the basis vectors for each given database is needed.

Among statistical approaches, principal component analysis (PCA) and linear discriminant analysis (LDA) are two powerful statistical tools for feature extraction and data representation. Kirby and Sirovich [3] were the first to employ Karhunen–Loeve transform (KLT) to represent facial images. Then Turk and Pentland [4]developed a PCA based approach namely “Eigenface”. PCA extracts features that are most efficient for representation, which may not be very useful for classification. Etemad and Chelappa [5], Belhumeur et al. [6] and Zhao et al. [7] then proposed the LDA “Fisherface” method to extract features that are most efficient for classification. Because of some limitations of the PCA and LDA, a variety of modifications have been proposed by authors. For more details see [8], [9], [10], [11], [12], [13], [14], [15], [16].

Proportional advantages of deterministic transforms make them an interesting class of feature extraction approaches. Discrete Fourier transform (DFT) [17], wavelet transform [18], [19], [20] and discrete cosine transform (DCT) [21], [22], [23], [24], [25], [26], [27], [28] are the important approaches of this class. Some special properties of DCT make it a powerful transform in image processing applications, including face recognition. DCT is very close to the KLT and has a strong ability for data decorrelation [26]. There are fast algorithms for DCT realization, which is not the case for KLT.

Combination of statistical and deterministic transforms constructs a third type of feature extraction approaches with both advantages. In this type, DCT reduces the dimension of data to avoid singularity and decreases the computational cost of PCA and LDA. Chen et al. [29] showed that using the PCA or the LDA in the DCT domain yields the same results as the one obtained from the spatial domain. Various combinations of the DCT, PCA and LDA have been surveyed by Pnevmatikakis and Polymenakos [30]. Ramasubramanian and Venkatesh [22] used a combination of the DCT, PCA and the characteristics of the human visual system for encoding and recognition of faces.

After applying the DCT to an image, some coefficients are selected and others are discarded in data dimension reduction. The selection of the DCT coefficients is an important part of the feature extraction process. In most of the approaches which utilize the DCT, not enough attention was given to coefficients selection (CS). The coefficients are usually selected with conventional methods such as zigzag or zonal masking. These conventional approaches are not necessarily efficient in all the applications and for all the databases. Discrimination power analysis (DPA) is a novel approach which selects features (DCT coefficients) with respect to their discrimination power. DPA utilizes statistical analysing of a database, associates each DCT coefficient discrimination to a number, and generates a CS mask. Our proposed feature extraction approach is database dependent and is able to find the best discriminant coefficients for each database.

The rest of the paper is organized as follows: Section 2 is devoted to the feature extraction approach. The whole procedure of our approach is depicted in Section 3. Experimental results and discussion are presented in Section 4, and finally Section 5 includes conclusion.

Section snippets

Feature extraction

In this section, various DCT feature extraction approaches are considered and a new efficient approach is proposed. DCT feature extraction consists of two stages. In the first stage, the DCT is applied to the entire image to obtain the DCT coefficients, and then some of the coefficients are selected to construct feature vectors in the second stage. Dimension of the DCT coefficient matrix is the same as the input image. In fact the DCT, by itself, does not decrease data dimension; so it

The whole procedure

In order to evaluate our proposed method, a simplified face recognition system has been used. Fig. 4 shows the whole procedure of our face recognition approach. This block diagram is proposed only for evaluating our proposed method and is not a real application system. It is clear that a real application system needs some other extra blocks such as preprocessing, etc. Also a better classifier rather than Euclidean distance (ED) is required in a real application. The block diagram is illustrated

Experimental results and discussion

In order to evaluate the proposed approaches, our experiments are performed on two benchmark face databases: the ORL and the Yale database. Besides, for each database, various CS approaches have been compared. All approaches utilize the ED classifier. The experiments are programmed in the MATLAB language (Version 7). In all of the simulations, the database is randomly divided to train and test sets. Five and six images of each individual, respectively, in the ORL and the Yale databases, have

Conclusion

After surveying the various CS approaches in the DCT domain, a new category of CS approaches was proposed. DPA is a data-dependent approach which utilizes the statistical analysis in order to find the most discriminant coefficients. A pm limits the search area and reaps two benefits: first is the better performance especially with a small number of training images, and second is the computational cost reduction. Also a new modification to original PCA and LDA in the DCT domain was proposed,

About the Author—SAEED DABBAGHCHIAN received his B.S. degree from Urmia University, Urmia, Iran, in 2003, and his M.S. from the University of Tabriz, Tabriz, Iran, in 2005, both in electrical engineering. His current research interests include feature extraction, pattern recognition and speech recognition.

References (35)

M. Li et al.
2D-LDA: a novel statistical linear discriminant analysis for image matrix
Pattern Recognition Letters
(2005)
J.H. Lai et al.
Face recognition using holistic fourier invariant features
Journal of Pattern Recognition
(2001)
C. Garcia et al.
Wavelet packet analysis for face recognition
Journal of Image and Vision Computing
(2000)
B. Li et al.
When eigenfaces are combined with wavelets
Journal of Knowledge based Systems
(2002)
D. Ramasubramanian et al.
Encoding and recognition of faces based on the human visual model and DCT
Journal of Pattern Recognition
(2001)
W. Chen et al.
PCA and LDA in DCT domain
Pattern Recognition Letters
(2005)
C. Sanderson et al.
Features for robust face-based identity verification
Journal of Signal Processing
(2003)
W. Zhao et al.
Face recognition: a literature survey
ACM Computing Surveys
(2003)
R. Chellappa et al.
Human and machine recognition of faces: a survey
Proceeding of IEEE
(1995)
M. Kirby et al.
Application of the Karhunen–Loeve procedure for the characterization of human faces
IEEE Transactions on Pattern Analysis and Machine Intelligence
(1990)

M. Turk et al.

Eigenfaces for recognition

International Journal of Cognitive Neuroscience

(1991)

K. Etemad et al.

Discriminant analysis for recognition of human face images

Journal of the Optical Society of America A

(1997)

P.N. Belhumeur et al.

Eigenfaces vs. Fisherfaces: recognition using class specific linear projection

IEEE Transactions on Pattern Analysis and Machine Intelligence

(1997)

W. Zhao et al.

Discriminant analysis of principal components for face recognition

ICFGR

(1998)

J. Lu et al.

Face recognition using LDA-based algorithms

IEEE Transactions on Neural Networks

(2003)

Q. Liu et al.

Improving kernel fisher discriminant analysis for face recognition

IEEE Transactions on Circuits and Systems for Video Technology

(2004)

M.S. Bartlet et al.

Face recognition by independent component analysis

IEEE Transactions on Neural Networks

(2002)

Cited by (131)

RADIC:A tool for diagnosing COVID-19 from chest CT and X-ray scans using deep learning and quad-radiomics
2023, Chemometrics and Intelligent Laboratory Systems
Deep learning (DL) algorithms have demonstrated a high ability to perform speedy and accurate COVID-19 diagnosis utilizing computed tomography (CT) and X-Ray scans. The spatial information in these images was used to train DL models in the majority of relevant studies. However, training these models with images generated by radiomics approaches could enhance diagnostic accuracy. Furthermore, combining information from several radiomics approaches with time-frequency representations of the COVID-19 patterns can increase performance even further. This study introduces "RADIC", an automated tool that uses three DL models that are trained using radiomics-generated images to detect COVID-19. First, four radiomics approaches are used to analyze the original CT and X-ray images. Next, each of the three DL models is trained on a different set of radiomics, X-ray, and CT images. Then, for each DL model, deep features are obtained, and their dimensions are decreased using the Fast Walsh Hadamard Transform, yielding a time-frequency representation of the COVID-19 patterns. The tool then uses the discrete cosine transform to combine these deep features. Four classification models are then used to achieve classification. In order to validate the performance of RADIC, two benchmark datasets (CT and X-Ray) for COVID-19 are employed. The final accuracy attained using RADIC is 99.4% and 99% for the first and second datasets respectively. To prove the competing ability of RADIC, its performance is compared with related studies in the literature. The results reflect that RADIC achieve superior performance compared to other studies. The results of the proposed tool prove that a DL model can be trained more effectively with images generated by radiomics techniques than the original X-Ray and CT images. Besides, the incorporation of deep features extracted from DL models trained with multiple radiomics approaches will improve diagnostic accuracy.
Zoom based image super-resolution using DCT with LBP as characteristic model
2022, Journal of King Saud University - Computer and Information Sciences
Citation Excerpt :
The succeeding section contains the more detailed explanation of the proposed approach. In the field of image processing, the Discrete Cosine Transform (DCT) is the famous technique of analysis (Rhee and Kang, 1999; Parmar and Scholar, 2014; Dabbaghchian et al., 2010). The drawback of two-dimension wavelet is that it is unable to capture directional information of image.
The prime intention of super-resolution (SR) technique is to restore the high-resolution images from one or more low-resolution (LR) images. These images are captured from the same scene with different acquisition systems with different resolution. Because these acquisition systems, images are suffered for an ill-posed problem with low visualization and picture information. Therefore, in this paper, the zoom-based super-resolution approach is proposed for super-resolution of low resolute images which are acquired from different camera zoom-lens. In this approach, three LR images of the same static scene which are acquired using three distinct zoom factors are used. Learning-based SR technique is used to enhance the spatial resolution of these LR images. The training dataset comprises three sets of captured images which are LR images, an enhanced version of LR images-HR1 and enhanced version of HR1 images-HR2. High-frequency details of the super-resolute image are learned in form of the discrete cosine transform (DCT) coefficients of HR training images. Finally, the super-resolved versions of LR observations, captured at different zoom-factors, are combined. The experimental results show that this proposed approach can be applied to various types of natural images in grayscale as well as color. The experimental results also show that this proposed approach performs better than existing approaches.
Singular spectrum analysis based sleeping stage classification via electrooculogram
2024, Multimedia Tools and Applications
Computer-aided Diagnosis of Various Diseases Using Ultrasonography Images
2024, Current Medical Imaging
A Survey on the Machine Learning Techniques for Automated Diagnosis from Ultrasound Images
2024, Current Medical Imaging
Proposed homomorphic DWT for cancelable palmprint recognition technique
2024, Multimedia Tools and Applications

View all citing articles on Scopus

About the Author—MASOUMEH P. GHAEMMAGHAMI received his B.S. degree from Urmia University, Urmia, Iran, in 2005, and his M.S. from the Science and Research Branch of Islamic Azad University, Tehran, Iran, in 2009, both in electrical engineering. Her research interests have been focused on the pattern recognition and robust speech recognition.

About the Author—ALI AGHAGOLZADEH received his B.S. degree from Tabriz University, Tabriz, Iran, in 1985, and his M.S. from Illinois Institute of Technology, USA, in 1988, and Ph.D. from Purdue University, USA, in 1991, all in electrical engineering. He is a Professor of Faculty of Electrical and Computer Engineering in Tabriz University, Tabriz, Iran. His research interests have been focused on image processing and computer vision, image and video coding and transmission.

View full text

Feature extraction using discrete cosine transform and discrimination power analysis with a face recognition technology

Abstract

Introduction

Section snippets

Feature extraction

The whole procedure

Experimental results and discussion

Conclusion

Pattern Recognition Letters

Journal of Pattern Recognition

Journal of Image and Vision Computing

Journal of Knowledge based Systems

Journal of Pattern Recognition

Pattern Recognition Letters

Journal of Signal Processing

Face recognition: a literature survey

ACM Computing Surveys

Human and machine recognition of faces: a survey

Proceeding of IEEE

Application of the Karhunen–Loeve procedure for the characterization of human faces

IEEE Transactions on Pattern Analysis and Machine Intelligence

Eigenfaces for recognition

International Journal of Cognitive Neuroscience

Discriminant analysis for recognition of human face images

Journal of the Optical Society of America A

Eigenfaces vs. Fisherfaces: recognition using class specific linear projection

IEEE Transactions on Pattern Analysis and Machine Intelligence

Discriminant analysis of principal components for face recognition

ICFGR

Face recognition using LDA-based algorithms

IEEE Transactions on Neural Networks

Improving kernel fisher discriminant analysis for face recognition

IEEE Transactions on Circuits and Systems for Video Technology

Face recognition by independent component analysis

IEEE Transactions on Neural Networks