Elsevier

Pattern Recognition

Volume 45, Issue 9, September 2012, Pages 3348-3359
Pattern Recognition

Hand shape recognition based on coherent distance shape contexts

https://doi.org/10.1016/j.patcog.2012.02.018Get rights and content

Abstract

In this paper, we propose a novel hand shape recognition method named as Coherent Distance Shape Contexts (CDSC), which is based on two classical shape representations, i.e., Shape Contexts (SC) and Inner-distance Shape Contexts (IDSC). CDSC has good ability to capture discriminative features from hand shape and can well deal with the inexact correspondence problem of hand landmark points. Particularly, it can extract features mainly from the contour of fingers. Thus, it is very robust to different hand poses or elastic deformations of finger valleys. In order to verify the effectiveness of CDSC, we create a new hand image database containing 4000 grayscale left hand images of 200 subjects, on which CDSC has achieved the accurate identification rate of 99.60% for identification and the Equal Error Rate of 0.9% for verification, which are comparable with the state-of-the-art hand shape recognition methods.

Highlights

► We experimentally prove that the parts of contour around finger valleys are not reliable. ► The proposed method takes into account the negative effect of changes of finger valleys. ► This method could deal with the inexact correspondence problem of landmark points. ► A speed-up procedure and a fusion procedure are proposed for low cost and robustness.

Introduction

In information and network society, there are many occasions in which the personal authentication is required, e.g., access control to a building, information security in a computer or smart-phone, visitor management, electronic payment, etc. There is no doubt that biometrics is one of the most important and effective solutions for this task. Generally, biometrics is a field of technology that uses automated methods for identifying or verifying a person based on a physiological or behavioral trait [1]. In real applications, the traits that are commonly measured in different systems are face, fingerprints, iris, hand shape, palmprint, hand vein, ear, gait, voice, etc.

Among all kinds of biometrics techniques, hand shape biometrics has drawn wide attention from researchers, and has been investigated for a long time [2]. As often noted in the literatures, hand shape biometrics is attractive due to its following advantages [2], [3]. (1) It is user-friendly since hand shape can be captured in a relatively convenient, non-intrusive manner by using inexpensive sensors. (2) Other hand features such as palmprint, palm vein and knuckleprint can be easily integrated to an existing hand shape based biometric system to form a more robust multimodal biometrics system. (3) It is more acceptable to the public mainly because it lacks criminal connotation. (4) It is less prone to disturbances and more robust to environmental conditions and to individual anomalies.

There have been a number of previous studies for hand shape biometrics. In the early study of this technique, most commercial systems [4] and some of the research systems [5], [6], [7], [8] required a platform with pegs to guide the placement of the user's hand. This manner, however, is not user-friendly and is unreliable since pegs often cause heavy deformation of the hand silhouette. Recently, some more user-friendly systems use non-contact image acquisition setups to capture hand images [9], [10], [11], [12]. Obviously, such setups would introduce more variations of hand pose, which will result in poor recognition performance. In order to make a trade-off between the user-friendliness and reliability, most systems capture hand images by using peg-free and contact manner [3], [13], [14], [15], [16], [17], [18], [19].

Generally speaking, the approaches of peg-free and contact hand shape recognition could be divided into three categories, i.e., the contour-based, the region-based, and the geometry-based, respectively. The contour-based approaches usually extract features from the whole shape contour, which consists of discrete points. G. Amayeh et al. [13] proposed using high-order Zernike moments, which are powerful shape descriptors, to capture hand shape features, and applied an efficient algorithm to deal with several practical issues of the moments including computational cost and lack of accuracy. H. Dutagaci et al. [17] proposed a normalization algorithm to adjust the fingers to standard positions, and extracted shape features by applying Linear Discriminant Analysis (LDA) on Discrete Fourier Transform (DFT) coefficients of the contour. The region-based methods usually extract features from the binary images, where the shape is represented as a connected region. E. Yoruk et al. [3], [14] proposed a global hand shape biometrics approach, which is pivoted on the normalization of the deformable hand shape and extract independent component analysis (ICA) features to compare the hand region images. They notice that the thumb can potentially arrive in a curved posture even after the whole hand exerting on the platen, which would cause the normalization of the hand shape unreliable in a real application environment. So it is believed that omitting the thumb is a good choice to ensure a robust system [2]. J. M. Ramirez-Cortes et al. [19] proposed an hand shape identification approach using the morphological pattern spectrum. In this method, the structuring element of the morphological operations is given by a disk with an incremental radius of one pixel per iteration, and a spectrum-based feature vector is obtained from the hand shape image for recognition. The geometry-based methods extract physical or abstract shape geometry from the shape. In geometry-based methods, a lot of landmark points (e.g., finger tips or valleys) would be determined before the feature extraction. G. Fouquier [15] presented an authentication system based on simple finger geometric features. In this method, for each finger, each boundary point is projected on the finger major axis. The lengths of the projected segments are computed and a histogram of these lengths is considered as features for recognition. M. Adan et al. [16] proposed a Natural Reference System (NRS) of the hand's natural layout and used it to extract the polar representation of the hand's contour for comparison. Under the NRS some abstract geometric features are extracted and used for recognition, such as the lengths between landmark points of NRS.

Though the existing hand shape recognition approaches have achieved promising recognition performances, they still have some shortcomings. For example, most contour based approaches extracted features from whole contour of a hand for recognition. Parts of contour around finger valleys, however, may have obvious changes caused by different hand poses, which is not reliable for robust recognition. Thus, it is difficult to obtain satisfying recognition results for those contour-based approaches in which the negative effect of changes of finger valleys has not been fully taken into account. In order to remove this negative effect, a normalization procedure has been proposed to adjust the fingers to some standard positions [14]. This procedure, however, would cost extra processing time and is not available when the fingers are at extreme positions, such as the hands are fully extended. For those geometry-based methods, the geometry features are usually extracted from some specific positions of the hands. Obviously, this manner could not capture enough discriminative information [17]. More importantly, it is difficult to robustly extract those landmark points when the fingers are at different positions. Exactly speaking, some landmark points of a hand could be easily determined, but they may not exactly correspond to landmark points of another hand due to the position variations of fingers. This inexact correspondence problem would result in unreliable features, therefore, cause recognition errors.

In the past decade, many shape matching algorithms have been proposed [20], [21], [22], [23], [24], [25], [26], [27], [28], [29], [30], [31], and they are applied to different shape-based applications. In this paper, we propose a novel contour-based hand shape recognition method, which is based on two classical shape matching methods, namely Shape Contexts (SC) [20] and Inner-distance Shape Contexts (IDSC) [32]. It should be noted that both SC and IDSC have not been exploited for the application of hand shape recognition so far. Both of them uniformly extract points along the contour, and utilize the global and local geometry simultaneously. The features extracted by SC and IDSC not only contain the geometry features in common geometry-based methods, but also take into account the distributions of those features, which intrinsically capture more discriminative information of the shapes than traditional geometry-base methods. Meanwhile, the matching procedure of SC and IDSC is based on the method of dynamic programming (DP), which could find the exact correspondences of contour points between two hands and relieve the inexact correspondence problem in traditional geometry-based methods. However, original SC and IDSC have some limits. For example, using Euclidean distances between contour points, SC is quite sensitive to different hand poses. Though the inner-distance introduced in IDSC largely solves the hand pose problem, it is still affected by elastic deformations of finger valleys. Therefore, they are not suitable for the hand shape biometrics. In order to effectively overcome these limits, we fully consider the prior knowledge, and then propose a new method by modifying many details of SC and IDSC. For example, SC uses the Euclidean distance and IDSC uses the inner-distance to construct features, both of which suffer from the unreliable parts of the contour. To address this problem, we introduce a new distance measure, namely coherent distance, which could capture the robust distance features of a shape and is more suitable for hand shape recognition.

The main contributions of our method are as follows: First, we prove that the parts of contour around finger valleys are not reliable for robust hand shape recognition since they have obvious changes caused by different hand poses. Second, we propose a novel hand shape recognition method that takes into account the negative effect of changes of finger valleys. Meanwhile, this method could also well deal with the inexact correspondence problem and capture enough discriminative information of the hand shapes, therefore, can achieve good recognition performance. Third, considering the practical issues including computational cost and robustness, a speed-up procedure and a fusion procedure are proposed.

The rest of this paper is organized as follows: Section 2 describes the image collection device and the preprocessing algorithm. Section 3 makes a brief review of the methods of SC and IDSC. Section 4 presents the details of the proposed method. Section 5 introduces the new hand image database and reports the experimental results, and Section 6 concludes the whole paper.

Section snippets

Hand images collection and preprocessing

In order to verify the effectiveness of the proposed method, a new hand shape image database has been created. To do so, a simple device is designed for hand image collection as shown in Fig. 1. This device mainly consists of a peg-free platform and a digital CCD camera, whose type is Cannon Powershot SX110 IS. This camera is controlled by a laptop computer via USB. The software of Cannon Remote Capture is used to capture images in daylight. During image acquisition, the hand is required to lay

Shape Contexts (SC) for 2-D Shapes

Due to its simplicity and discriminability, SC has become a quite popular method for shape recognition in recent years [20]. It describes the relative spatial distribution (distance and orientation) of the feature points by the information of other points. Given n sample points x1, x2, …, xn on a shape, the shape context at point xi is defined as a 2-D histogram hi of the distance and angle joint distribution of the remaining n-1 points, where the bins uniformly divide the log-polar space.

Coherent distance shape contexts

As Fig. 5 illustrated, the method of SC extracts features based on Euclidean distances, which is quite sensitive to different hand poses, therefore, is not suitable for hand shape biometrics. Though the method of IDSC largely solves the articulation problem in shape retrieval, it is still not robust enough for hand shape recognition. The main reason is that the inner-distance between two points belonging to different fingers is often affected by elastic deformations of finger valleys. Fig. 7

Hand shape database

To achieve convincing experimental results, the size of the database is critical. Though there are many reported peg-free and contact hand shape databases, we believe that only those containing more than 100 subjects should be considered. To the best of our knowledge, there are eight reported hand shape databases satisfying this criterion, which are summarized in Table 1. Among them, only the BioSecure database is publicly available at the price of one thousand euros, while the rest of them are

Conclusions

In this paper, we proposed a novel hand shape recognition method named as Coherent Distance Shape Contexts (CDSC), which is based on two classical shape representations, i.e., Shape Contexts (SC) and Inner-distance Shape Contexts (IDSC). It is well known that SC and IDSC have good abilities to capture very discriminative features for a shape and to well deal with the inexact correspondence problem in shape matching. The method of CDSC is derived from SC (or IDSC), thus it also has these

Acknowledgments

This work is supported by the grants of the National Science Foundation of China, no. 61175022, 61100161, 61005010, 60705007, 60975005 and 60905023; and the grants of the Knowledge Innovation Program of the Chinese Academy of Sciences (Y023A11292 and Y023A61121).

Rong-Xiang Hu received the B.Sc. degree in computer science from Hefei University of Technology, Hefei, China, in 2006. From September 2006, he is a Master-Doctoral Program student in department of automation, University of Science and Technology of China, Hefei, China. His research interests include pattern recognition, machine learning and image processing.

References (38)

  • M. Golfarelli

    On the error-reject trade-off in biometric verification systems

    IEEE Transactions on Pattern Analysis and Machine Intelligence

    (1997)
  • A.K. Jain, Deformable matching of hand shapes for verification, in: International Conference on Image Processing, ICIP,...
  • R. Sanchez-Reillo

    Biometric identification through hand geometry measurements

    IEEE Transactions on Pattern Analysis and Machine Intelligence

    (2000)
  • A. Morales, Contact-free hand biometric system for real environments, in: European Signal Processing Conference...
  • G. Zheng et al.

    Application of projective invariants in hand geometry biometrics

    IEEE Transactions on Information Forensics And Security

    (2007)
  • V. Kanhangad et al.

    Human hand identification with 3D hand pose variations

    (2010)
  • G. Amayeh

    Peg-free hand shape verification using high order zernike moments

    (2006)
  • E. Yoruk

    Shape-based hand recognition

    IEEE Transactions on Image Processing

    (2006)
  • G. Fouquier

    The biosecure geometry-based system for hand modality

    in Acoustics, Speech and Signal Processing,ICASSP Honolulu, HI

    (2007)
  • Cited by (33)

    • Cross ratio arrays: A descriptor invariant to severe projective deformation and robust to occlusion for planar shape recognition

      2021, Computers and Graphics (Pergamon)
      Citation Excerpt :

      It has been shown to work well for objects defined by contours, for handwritten digits and for 3D objects with planar views, and it was used with some degree of success to automatically detect trademark infringement. Improvements on the shape context method have been proposed to make it more efficient and effective to rigid transformations [26,27], robust to more shape deformations [28], capable of handling separate transformations on articulated parts (IDSC) [29], among others. A different path was taken by [30], that developed a shape descriptor that uses an approximation to the shape contours through a set of conic sections.

    • User authorization based on hand geometry without special equipment

      2018, Pattern Recognition
      Citation Excerpt :

      Aside from the results achieved by each protocol, the table includes information on the methods of data acquisition and additional requirements imposed by each method. Please note that our method does not require more prerequisites than any other method (i.e., no need for pegs nor dedicated device) and the only case where fewer images used (i.e. Hu et al. [13]) is significantly outperformed by HGCIA, i.e. comparable False Rejection Rate, yet our method provides significantly lower False Acceptance Rate. The main difference between our approach and Kumar et al. [4] are superior FAR and FRR parameters when we compare to the method using solely hand geometry, the second method provided by the authors results in similar FAR and FRR, yet multiple biometrics result in more people excluded from using the method (e.g. due to accidents and disabilities).

    • Fast hand posture classification using depth features extracted from random line segments

      2017, Pattern Recognition
      Citation Excerpt :

      Priyal and Bora [1] used Krawtchouk moment features to represent binary silhouettes of hands which are rotationally normalized based on gesture geometry, and compared Krawtchouk moment features with another two features that are also based on moments. Hu et al. [2] proposed a novel hand shape recognition method based on Coherent Distance Shape Contexts that takes into account the negative effect of changes of finger valleys. Ren et al. [3] described hand shape with a curve that records the relative distance between each hand contour vertex and a centre point.

    • Hierarchical projective invariant contexts for shape recognition

      2016, Pattern Recognition
      Citation Excerpt :

      In [20], Wang et al. propose the height function (HF) descriptor, in which one sample point is characterized by the distances of all the other sample points to its tangent line. Some works combine these methods or use triangle-area formed by the sample points in order to improve the performance [27–29]. These descriptors are able to discriminate subtle shape differences, but they represent shapes by pair-wise measures between points along the contour, resulting in a long feature vector for each point.

    • Current Challenges of Hand-Based Biometric Systems

      2023, Cyber Security and Business Intelligence: Innovations and Machine Learning for Cyber Risk Management
    View all citing articles on Scopus

    Rong-Xiang Hu received the B.Sc. degree in computer science from Hefei University of Technology, Hefei, China, in 2006. From September 2006, he is a Master-Doctoral Program student in department of automation, University of Science and Technology of China, Hefei, China. His research interests include pattern recognition, machine learning and image processing.

    Wei Jia received the B.Sc. degree in informatics from Central China Normal University, Wuhan, China, in 1998, the M.Sc. degree in computer science from Hefei University of Technology, Hefei, China, in 2004, and the Ph.D. degree in pattern recognition and intelligence system from University of Science and Technology of China, Hefei, China, in 2008. He is currently an associate professor in Hefei Institutes of Physical Science, Chinese Academy of Science. His research interests include biometrics, pattern recognition, and image processing.

    David Zhang graduated in Computer Science from Peking University. He received his M.Sc. in Computer Science in 1982 and his Ph.D. in 1985 from the Harbin Institute of Technology. In 1994 he received his second Ph.D. in Electrical and Computer Engineering from the University of Waterloo, Ontario, Canada. Currently, he is a Chair Professor at the Hong Kong Polytechnic University where he is the Founding Director of the Biometrics Technology Center supported by the Hong Kong SAR Government in 1998. He is the associate Editor of more than ten international journals including IEEE Transactions and Pattern Recognition; Technical Committee Chair of IEEE CIS and the author of more than 10 books and 200 journal papers. Professor Zhang is a Croucher Senior Research Fellow, Distinguished Speaker of the IEEE Computer Society, and a Fellow of both IEEE and IAPR.

    Jie Gui received the B.E degree in Computer Science in Hohai University in 2004, the M.Sc. degree in computer science from Chinese Academy of Science, China, in 2007, and the Ph.D. degree in pattern recognition and intelligence system from University of Science and Technology of China, Hefei, China, in 2010. He is currently an assistant professor in Hefei Institutes of Physical Science, Chinese Academy of Science. His research interests are machine learning, pattern recognition, and image processing.

    Liang-Tu Song received the B.E. and M.E. degrees in computer science from Anhui Agriculture University, Hefei, China, in 1987 and 1990, respectively, and the Ph.D. degree in pattern recognition and intelligence system from University of Science and Technology of China, Hefei, China, in 2007. He has been a professor in the Institute of Intelligent Machines, Chinese Academy of Sciences, since 2008. His research interests include data acquisition, pattern recognition and intelligent systems.

    View full text