Hand shape recognition based on coherent distance shape contexts

doi:10.1016/j.patcog.2012.02.018

Pattern Recognition

Volume 45, Issue 9, September 2012, Pages 3348-3359

https://doi.org/10.1016/j.patcog.2012.02.018 Get rights and content

Abstract

In this paper, we propose a novel hand shape recognition method named as Coherent Distance Shape Contexts (CDSC), which is based on two classical shape representations, i.e., Shape Contexts (SC) and Inner-distance Shape Contexts (IDSC). CDSC has good ability to capture discriminative features from hand shape and can well deal with the inexact correspondence problem of hand landmark points. Particularly, it can extract features mainly from the contour of fingers. Thus, it is very robust to different hand poses or elastic deformations of finger valleys. In order to verify the effectiveness of CDSC, we create a new hand image database containing 4000 grayscale left hand images of 200 subjects, on which CDSC has achieved the accurate identification rate of 99.60% for identification and the Equal Error Rate of 0.9% for verification, which are comparable with the state-of-the-art hand shape recognition methods.

Highlights

► We experimentally prove that the parts of contour around finger valleys are not reliable. ► The proposed method takes into account the negative effect of changes of finger valleys. ► This method could deal with the inexact correspondence problem of landmark points. ► A speed-up procedure and a fusion procedure are proposed for low cost and robustness.

Introduction

In information and network society, there are many occasions in which the personal authentication is required, e.g., access control to a building, information security in a computer or smart-phone, visitor management, electronic payment, etc. There is no doubt that biometrics is one of the most important and effective solutions for this task. Generally, biometrics is a field of technology that uses automated methods for identifying or verifying a person based on a physiological or behavioral trait [1]. In real applications, the traits that are commonly measured in different systems are face, fingerprints, iris, hand shape, palmprint, hand vein, ear, gait, voice, etc.

Among all kinds of biometrics techniques, hand shape biometrics has drawn wide attention from researchers, and has been investigated for a long time [2]. As often noted in the literatures, hand shape biometrics is attractive due to its following advantages [2], [3]. (1) It is user-friendly since hand shape can be captured in a relatively convenient, non-intrusive manner by using inexpensive sensors. (2) Other hand features such as palmprint, palm vein and knuckleprint can be easily integrated to an existing hand shape based biometric system to form a more robust multimodal biometrics system. (3) It is more acceptable to the public mainly because it lacks criminal connotation. (4) It is less prone to disturbances and more robust to environmental conditions and to individual anomalies.

There have been a number of previous studies for hand shape biometrics. In the early study of this technique, most commercial systems [4] and some of the research systems [5], [6], [7], [8] required a platform with pegs to guide the placement of the user's hand. This manner, however, is not user-friendly and is unreliable since pegs often cause heavy deformation of the hand silhouette. Recently, some more user-friendly systems use non-contact image acquisition setups to capture hand images [9], [10], [11], [12]. Obviously, such setups would introduce more variations of hand pose, which will result in poor recognition performance. In order to make a trade-off between the user-friendliness and reliability, most systems capture hand images by using peg-free and contact manner [3], [13], [14], [15], [16], [17], [18], [19].

Generally speaking, the approaches of peg-free and contact hand shape recognition could be divided into three categories, i.e., the contour-based, the region-based, and the geometry-based, respectively. The contour-based approaches usually extract features from the whole shape contour, which consists of discrete points. G. Amayeh et al. [13] proposed using high-order Zernike moments, which are powerful shape descriptors, to capture hand shape features, and applied an efficient algorithm to deal with several practical issues of the moments including computational cost and lack of accuracy. H. Dutagaci et al. [17] proposed a normalization algorithm to adjust the fingers to standard positions, and extracted shape features by applying Linear Discriminant Analysis (LDA) on Discrete Fourier Transform (DFT) coefficients of the contour. The region-based methods usually extract features from the binary images, where the shape is represented as a connected region. E. Yoruk et al. [3], [14] proposed a global hand shape biometrics approach, which is pivoted on the normalization of the deformable hand shape and extract independent component analysis (ICA) features to compare the hand region images. They notice that the thumb can potentially arrive in a curved posture even after the whole hand exerting on the platen, which would cause the normalization of the hand shape unreliable in a real application environment. So it is believed that omitting the thumb is a good choice to ensure a robust system [2]. J. M. Ramirez-Cortes et al. [19] proposed an hand shape identification approach using the morphological pattern spectrum. In this method, the structuring element of the morphological operations is given by a disk with an incremental radius of one pixel per iteration, and a spectrum-based feature vector is obtained from the hand shape image for recognition. The geometry-based methods extract physical or abstract shape geometry from the shape. In geometry-based methods, a lot of landmark points (e.g., finger tips or valleys) would be determined before the feature extraction. G. Fouquier [15] presented an authentication system based on simple finger geometric features. In this method, for each finger, each boundary point is projected on the finger major axis. The lengths of the projected segments are computed and a histogram of these lengths is considered as features for recognition. M. Adan et al. [16] proposed a Natural Reference System (NRS) of the hand's natural layout and used it to extract the polar representation of the hand's contour for comparison. Under the NRS some abstract geometric features are extracted and used for recognition, such as the lengths between landmark points of NRS.

Though the existing hand shape recognition approaches have achieved promising recognition performances, they still have some shortcomings. For example, most contour based approaches extracted features from whole contour of a hand for recognition. Parts of contour around finger valleys, however, may have obvious changes caused by different hand poses, which is not reliable for robust recognition. Thus, it is difficult to obtain satisfying recognition results for those contour-based approaches in which the negative effect of changes of finger valleys has not been fully taken into account. In order to remove this negative effect, a normalization procedure has been proposed to adjust the fingers to some standard positions [14]. This procedure, however, would cost extra processing time and is not available when the fingers are at extreme positions, such as the hands are fully extended. For those geometry-based methods, the geometry features are usually extracted from some specific positions of the hands. Obviously, this manner could not capture enough discriminative information [17]. More importantly, it is difficult to robustly extract those landmark points when the fingers are at different positions. Exactly speaking, some landmark points of a hand could be easily determined, but they may not exactly correspond to landmark points of another hand due to the position variations of fingers. This inexact correspondence problem would result in unreliable features, therefore, cause recognition errors.

In the past decade, many shape matching algorithms have been proposed [20], [21], [22], [23], [24], [25], [26], [27], [28], [29], [30], [31], and they are applied to different shape-based applications. In this paper, we propose a novel contour-based hand shape recognition method, which is based on two classical shape matching methods, namely Shape Contexts (SC) [20] and Inner-distance Shape Contexts (IDSC) [32]. It should be noted that both SC and IDSC have not been exploited for the application of hand shape recognition so far. Both of them uniformly extract points along the contour, and utilize the global and local geometry simultaneously. The features extracted by SC and IDSC not only contain the geometry features in common geometry-based methods, but also take into account the distributions of those features, which intrinsically capture more discriminative information of the shapes than traditional geometry-base methods. Meanwhile, the matching procedure of SC and IDSC is based on the method of dynamic programming (DP), which could find the exact correspondences of contour points between two hands and relieve the inexact correspondence problem in traditional geometry-based methods. However, original SC and IDSC have some limits. For example, using Euclidean distances between contour points, SC is quite sensitive to different hand poses. Though the inner-distance introduced in IDSC largely solves the hand pose problem, it is still affected by elastic deformations of finger valleys. Therefore, they are not suitable for the hand shape biometrics. In order to effectively overcome these limits, we fully consider the prior knowledge, and then propose a new method by modifying many details of SC and IDSC. For example, SC uses the Euclidean distance and IDSC uses the inner-distance to construct features, both of which suffer from the unreliable parts of the contour. To address this problem, we introduce a new distance measure, namely coherent distance, which could capture the robust distance features of a shape and is more suitable for hand shape recognition.

The main contributions of our method are as follows: First, we prove that the parts of contour around finger valleys are not reliable for robust hand shape recognition since they have obvious changes caused by different hand poses. Second, we propose a novel hand shape recognition method that takes into account the negative effect of changes of finger valleys. Meanwhile, this method could also well deal with the inexact correspondence problem and capture enough discriminative information of the hand shapes, therefore, can achieve good recognition performance. Third, considering the practical issues including computational cost and robustness, a speed-up procedure and a fusion procedure are proposed.

The rest of this paper is organized as follows: Section 2 describes the image collection device and the preprocessing algorithm. Section 3 makes a brief review of the methods of SC and IDSC. Section 4 presents the details of the proposed method. Section 5 introduces the new hand image database and reports the experimental results, and Section 6 concludes the whole paper.

Section snippets

Hand images collection and preprocessing

In order to verify the effectiveness of the proposed method, a new hand shape image database has been created. To do so, a simple device is designed for hand image collection as shown in Fig. 1. This device mainly consists of a peg-free platform and a digital CCD camera, whose type is Cannon Powershot SX110 IS. This camera is controlled by a laptop computer via USB. The software of Cannon Remote Capture is used to capture images in daylight. During image acquisition, the hand is required to lay

Shape Contexts (SC) for 2-D Shapes

Due to its simplicity and discriminability, SC has become a quite popular method for shape recognition in recent years [20]. It describes the relative spatial distribution (distance and orientation) of the feature points by the information of other points. Given n sample points x₁, x₂, …, x_n on a shape, the shape context at point x_i is defined as a 2-D histogram h_i of the distance and angle joint distribution of the remaining n-1 points, where the bins uniformly divide the log-polar space.

Coherent distance shape contexts

As Fig. 5 illustrated, the method of SC extracts features based on Euclidean distances, which is quite sensitive to different hand poses, therefore, is not suitable for hand shape biometrics. Though the method of IDSC largely solves the articulation problem in shape retrieval, it is still not robust enough for hand shape recognition. The main reason is that the inner-distance between two points belonging to different fingers is often affected by elastic deformations of finger valleys. Fig. 7

Hand shape database

To achieve convincing experimental results, the size of the database is critical. Though there are many reported peg-free and contact hand shape databases, we believe that only those containing more than 100 subjects should be considered. To the best of our knowledge, there are eight reported hand shape databases satisfying this criterion, which are summarized in Table 1. Among them, only the BioSecure database is publicly available at the price of one thousand euros, while the rest of them are

Conclusions

In this paper, we proposed a novel hand shape recognition method named as Coherent Distance Shape Contexts (CDSC), which is based on two classical shape representations, i.e., Shape Contexts (SC) and Inner-distance Shape Contexts (IDSC). It is well known that SC and IDSC have good abilities to capture very discriminative features for a shape and to well deal with the inexact correspondence problem in shape matching. The method of CDSC is derived from SC (or IDSC), thus it also has these

Acknowledgments

This work is supported by the grants of the National Science Foundation of China, no. 61175022, 61100161, 61005010, 60705007, 60975005 and 60905023; and the grants of the Knowledge Innovation Program of the Chinese Academy of Sciences (Y023A11292 and Y023A61121).

Rong-Xiang Hu received the B.Sc. degree in computer science from Hefei University of Technology, Hefei, China, in 2006. From September 2006, he is a Master-Doctoral Program student in department of automation, University of Science and Technology of China, Hefei, China. His research interests include pattern recognition, machine learning and image processing.

References (38)

N. Duta
A survey of biometric technology based on hand shape
Pattern Recognition
(2009)
E. Yoruk et al.
Hand biometrics
Image and Vision Computing
(2006)
A. Kumar
Personal authentication using hand images
Pattern Recognition Letters
(2006)
D.L. Woodard
Finger surface as a biometric identifier
Computer Vision and Image Understanding
(2005)
M. Adan
Biometric verification identification based on hands natural layout
Image and Vision Computing
(2008)
C.L. Su
Hand image recognition by the techniques of hand shape scaling and image weight scaling
Expert Systems with Applications
(2008)
A.R. Backes et al.
A complex network-based approach for boundary shape analysis
Pattern Recognition
(2009)
W. Xiong et al.
Model-guided deformable hand shape recognition without positioning aids
Pattern Recognition
(2005)
A.K. Jain
An introduction to biometric recognition
IEEE Transactions on Circuits And Systems for Video Technology
(2004)
D.P. Sidlauskas
HAND: give me five
IEEE Spectrum
(1994)

M. Golfarelli

On the error-reject trade-off in biometric verification systems

IEEE Transactions on Pattern Analysis and Machine Intelligence

(1997)

A.K. Jain, Deformable matching of hand shapes for verification, in: International Conference on Image Processing, ICIP,...

R. Sanchez-Reillo

Biometric identification through hand geometry measurements

IEEE Transactions on Pattern Analysis and Machine Intelligence

(2000)

A. Morales, Contact-free hand biometric system for real environments, in: European Signal Processing Conference...

G. Zheng et al.

Application of projective invariants in hand geometry biometrics

IEEE Transactions on Information Forensics And Security

(2007)

V. Kanhangad et al.

Human hand identification with 3D hand pose variations

(2010)

G. Amayeh

Peg-free hand shape verification using high order zernike moments

(2006)

E. Yoruk

Shape-based hand recognition

IEEE Transactions on Image Processing

(2006)

G. Fouquier

The biosecure geometry-based system for hand modality

in Acoustics, Speech and Signal Processing,ICASSP Honolulu, HI

(2007)

Cited by (33)

Cross ratio arrays: A descriptor invariant to severe projective deformation and robust to occlusion for planar shape recognition
2021, Computers and Graphics (Pergamon)
Citation Excerpt :
It has been shown to work well for objects defined by contours, for handwritten digits and for 3D objects with planar views, and it was used with some degree of success to automatically detect trademark infringement. Improvements on the shape context method have been proposed to make it more efficient and effective to rigid transformations [26,27], robust to more shape deformations [28], capable of handling separate transformations on articulated parts (IDSC) [29], among others. A different path was taken by [30], that developed a shape descriptor that uses an approximation to the shape contours through a set of conic sections.
Recognizing planar objects such as characters, symbols and logos is considered a hard problem in computer vision due to the possibility of these shapes suffer from different kinds of disturbances: projective or other nonlinear deformations, occlusions and spurious insertions. Different planar descriptors have been proposed by taking advantage of geometric features that are invariant to certain image transformations, usually linear ones such as scaling, rotation and affinity. In this work, a new descriptor of planar shapes robust to projective deformations is proposed: Cross Ratio Arrays (CRA). The descriptor is based on tracing rays across the images and collecting their intersections with the borders of the shapes to assemble arrays of computed cross ratios, one of the most fundamental projective invariants. Higher level arrays are built out of these sets of arrays from both query and template shapes, which ultimately allows us to identify correspondences in these shapes to estimate how projectively deformed one shape is from the other. Experiments with synthetic shapes as well as real world scene shapes suffering from severe projective deformations were conducted, with CRA outperforming state-of-the-art descriptors. In addition, tests were performed with different levels of occlusion and weak nonlinear deformations to evidence CRA’s robustness to such cases.
User authorization based on hand geometry without special equipment
2018, Pattern Recognition
Citation Excerpt :
Aside from the results achieved by each protocol, the table includes information on the methods of data acquisition and additional requirements imposed by each method. Please note that our method does not require more prerequisites than any other method (i.e., no need for pegs nor dedicated device) and the only case where fewer images used (i.e. Hu et al. [13]) is significantly outperformed by HGCIA, i.e. comparable False Rejection Rate, yet our method provides significantly lower False Acceptance Rate. The main difference between our approach and Kumar et al. [4] are superior FAR and FRR parameters when we compare to the method using solely hand geometry, the second method provided by the authors results in similar FAR and FRR, yet multiple biometrics result in more people excluded from using the method (e.g. due to accidents and disabilities).
Bioidentification is one of the most popular methods of user identification, one of the reasons is the fact that the ‘access tokens’ are part of user’s body and cannot be simply lost or forgotten. Recently, the popularity of biometric methods increases even more due to the improved accuracy of measuring devices and lower error rates offered by the algorithms. Despite the technological progress, prices of the top tier equipment remain on a constant, high level. In this paper, we propose a hand geometry user identification algorithm that utilizes data acquired by a standard office scanner, and that has reasonable execution times of both data collection and the identification process. In order to achieve an algorithm that is as accurate as current state of the art algorithms (or even outperforms them) and utilizes only devices that are commonly available in most offices we had to include several non-standard hand geometric features, e.g. the crookedness of the fingers.
Our algorithm was tested on 60 volunteer adults, with age ranging from early 20s to late 50s. The most important results are False Acceptance Rate (FAR) equal to 0.0% and False Rejection Rate (FRR) at the level of 1.19%, with Equal Error Rate (EER) 0.59% during authorization (referred to as verification mode since algorithm verifies the claimed identity), when using a template derived from three images. In identification mode we got results FAR=0.0%, FRR=1.19% and ERR=0.59%. We achieved Identification Rate (IR) equal to 100.0% when taking only subjects from the database in identification mode. The subjects were not limited regarding the position of their hand on the scanner, nor were hand injuries and jewelry a disqualifying factor. Moreover, we describe the performance and time needed in a real-life experiments, showing that the algorithm may be used by people without technical background at low cost and is adequate for systems that require medium to high level of security.
Fast hand posture classification using depth features extracted from random line segments
2017, Pattern Recognition
Citation Excerpt :
Priyal and Bora [1] used Krawtchouk moment features to represent binary silhouettes of hands which are rotationally normalized based on gesture geometry, and compared Krawtchouk moment features with another two features that are also based on moments. Hu et al. [2] proposed a novel hand shape recognition method based on Coherent Distance Shape Contexts that takes into account the negative effect of changes of finger valleys. Ren et al. [3] described hand shape with a curve that records the relative distance between each hand contour vertex and a centre point.
In this paper we propose a set of fast-computable depth features for static hand posture classification from a single depth image. The proposed features, which are extracted from pixels on randomly positioned line segments, are specially designed as low-level cues to be used in random forest classifier which combines the cues to discover high-level unseen informative structure in an infinite dimensional feature space. The proposed features, while being simple, can effectively capture both hand geometry shape and depth texture information. The accuracy and speed performance of the recognition algorithm based on the proposed features is evaluated with American Sign Language (ASL) finger spelling dataset and with two new hand posture datasets. The proposed algorithm has a recognition accuracy rate that is comparable to the state-of-the-art methods, while being much faster in both training and testing phases. Our implementation of the proposed algorithm runs at about 600fps using only one thread of an i7 CPU. A pre-trained demo program is available to public.
Hierarchical projective invariant contexts for shape recognition
2016, Pattern Recognition
Citation Excerpt :
In [20], Wang et al. propose the height function (HF) descriptor, in which one sample point is characterized by the distances of all the other sample points to its tangent line. Some works combine these methods or use triangle-area formed by the sample points in order to improve the performance [27–29]. These descriptors are able to discriminate subtle shape differences, but they represent shapes by pair-wise measures between points along the contour, resulting in a long feature vector for each point.
Shape descriptors play an important role in various computer vision tasks. Many existing descriptors are typically derived from pair-wise measures, such as distances and angles, which may vary with severe geometrical deformations including affine and projective transformations. In this paper, we propose a new shape descriptor from a newly developed projective invariant, the characteristic number (CN). This new descriptor is invariant to projective (or perspective) transformations by computing CN values on a series of 5 sample points along the shape contour with the intervals varying from coarse to fine. This hierarchical strategy yields a compacter descriptor so that the time complexity for both descriptor construction and shape matching are less or comparable to many existing methods. We also use the derived points out of the contour and the ratio of two invariant values, in order to improve the stability at finer scales and robustness to noise. We demonstrate the performance of the descriptor by comparing with the state-of-the-art on the MCD and other public shape sets with severe perspective transformations and other type variations including noise, missing parts and articulated deformations.
FINGER BIOMETRIC RECOGNITION WITH FEATURE SELECTION
2023, arXiv
Current Challenges of Hand-Based Biometric Systems
2023, Cyber Security and Business Intelligence: Innovations and Machine Learning for Cyber Risk Management

View all citing articles on Scopus

Wei Jia received the B.Sc. degree in informatics from Central China Normal University, Wuhan, China, in 1998, the M.Sc. degree in computer science from Hefei University of Technology, Hefei, China, in 2004, and the Ph.D. degree in pattern recognition and intelligence system from University of Science and Technology of China, Hefei, China, in 2008. He is currently an associate professor in Hefei Institutes of Physical Science, Chinese Academy of Science. His research interests include biometrics, pattern recognition, and image processing.

David Zhang graduated in Computer Science from Peking University. He received his M.Sc. in Computer Science in 1982 and his Ph.D. in 1985 from the Harbin Institute of Technology. In 1994 he received his second Ph.D. in Electrical and Computer Engineering from the University of Waterloo, Ontario, Canada. Currently, he is a Chair Professor at the Hong Kong Polytechnic University where he is the Founding Director of the Biometrics Technology Center supported by the Hong Kong SAR Government in 1998. He is the associate Editor of more than ten international journals including IEEE Transactions and Pattern Recognition; Technical Committee Chair of IEEE CIS and the author of more than 10 books and 200 journal papers. Professor Zhang is a Croucher Senior Research Fellow, Distinguished Speaker of the IEEE Computer Society, and a Fellow of both IEEE and IAPR.

Jie Gui received the B.E degree in Computer Science in Hohai University in 2004, the M.Sc. degree in computer science from Chinese Academy of Science, China, in 2007, and the Ph.D. degree in pattern recognition and intelligence system from University of Science and Technology of China, Hefei, China, in 2010. He is currently an assistant professor in Hefei Institutes of Physical Science, Chinese Academy of Science. His research interests are machine learning, pattern recognition, and image processing.

Liang-Tu Song received the B.E. and M.E. degrees in computer science from Anhui Agriculture University, Hefei, China, in 1987 and 1990, respectively, and the Ph.D. degree in pattern recognition and intelligence system from University of Science and Technology of China, Hefei, China, in 2007. He has been a professor in the Institute of Intelligent Machines, Chinese Academy of Sciences, since 2008. His research interests include data acquisition, pattern recognition and intelligent systems.

View full text

Hand shape recognition based on coherent distance shape contexts

Abstract

Highlights

Introduction

Section snippets

Hand images collection and preprocessing

Shape Contexts (SC) for 2-D Shapes

Coherent distance shape contexts

Hand shape database

Conclusions

Acknowledgments

Pattern Recognition

Image and Vision Computing

Pattern Recognition Letters

Computer Vision and Image Understanding

Image and Vision Computing

Expert Systems with Applications

Pattern Recognition

Pattern Recognition

An introduction to biometric recognition

IEEE Transactions on Circuits And Systems for Video Technology

HAND: give me five

IEEE Spectrum

On the error-reject trade-off in biometric verification systems

IEEE Transactions on Pattern Analysis and Machine Intelligence

Biometric identification through hand geometry measurements

IEEE Transactions on Pattern Analysis and Machine Intelligence

Application of projective invariants in hand geometry biometrics

IEEE Transactions on Information Forensics And Security

Human hand identification with 3D hand pose variations

Peg-free hand shape verification using high order zernike moments

Shape-based hand recognition

IEEE Transactions on Image Processing

The biosecure geometry-based system for hand modality

in Acoustics, Speech and Signal Processing,ICASSP Honolulu, HI