
1 Introduction

As a biometric identification technology, automatic fingerprint recognition is widely used in judicial, government, commercial and financial fields because of its advantages such as easy access, strong operability and high reliability. An automatic fingerprint identification system (AFIS) [1] generally includes fingerprint acquisition, image enhancement, feature extraction, matching and other modules. Since the 1990s, the algorithms for each part of AFIS have been continuously improved [2,3,4]. Because of the importance of the fingerprint orientation field, many researchers have studied its estimation to improve the accuracy of fingerprint recognition.

One commonly used method is the gradient-based algorithm, which performs a difference operation on the latent image and is therefore very sensitive to image quality. Hong et al. [5] improved this method by filtering the orientation field with a low-pass filter while correcting isolated erroneous directions. Another is the model-based approach, which mainly uses global constraints to model the orientation field mathematically. Sherlock et al. [6] proposed a zero-pole model that describes the fingerprint orientation field based on the locations of singular points. However, this method fails when the fingerprint contains no singularity.

Dictionary-based approaches have also been proposed to improve latent orientation field estimation. Feng et al. [7] proposed a fingerprint orientation field extraction algorithm based on prior knowledge of fingerprint structure: a dictionary is constructed from a set of ground-truth orientation fields, and a compatibility constraint among neighboring orientation patches is enforced during estimation. The dictionary-based approach generalizes better than the model-based approach, but its performance relies on large and diverse dictionaries, which results in higher computational cost.

In recent years, deep learning has made remarkable achievements in the field of pattern recognition. Convolutional neural networks (CNNs) are widely used in image classification, object recognition, object detection and other fields [8,9,10]. Cao et al. [11] proposed a learning-based approach that uses a ConvNet to classify the orientation field of a latent patch as one of a set of representative orientation patterns. However, the quality of the pattern set is directly affected by the quality of the database. In 2017, Yao et al. [12] proposed an end-to-end deep convolutional network that combines domain knowledge with the representation ability of deep learning; for orientation estimation, a classification network based on DeepLab v2 [13] is adopted. This pipeline achieves better results with its expert network-marked labels, but it still has difficulty converging and easily falls into local optima.

Inspired by the abundant achievements in semantic segmentation [14] in recent years, we propose an effective orientation extraction framework for latent fingerprints. Considering the poor quality of latent images, we first design a preprocessing method combining local total variation (LTV) decomposition, band-pass filtering and Gabor filtering so that the input to the network is improved. The processed images are then passed to the proposed convolutional neural network for high-accuracy orientation field prediction. Experimental results on the test database show that the proposed system outperforms state-of-the-art fingerprint orientation estimation algorithms.

The contributions of this paper are summarized as follows:

  1. A new algorithm system specifically designed for fingerprint orientation estimation, consisting of a preprocessing part and a deep neural network part. Domain knowledge and the generalization ability of the network are combined in this system.

  2. An effective preprocessing scheme that enhances the latent ridge structure of poor-quality fingerprints through a specially designed combination of algorithms.

  3. A novel deep regression neural network (DRNN) with higher accuracy, faster training speed and easier convergence.

  4. A new structure derived from the traditional boosting algorithm, introduced into the proposed DRNN to solve the label discontinuity problem and significantly improve network performance.

2 Proposed Method

2.1 Methods Overview

The basic idea is to build an algorithm system specifically for fingerprint orientation estimation. In recent years, many works [12, 19, 20] have shown the necessity and the trend of combining the domain knowledge of traditional image algorithms with deep learning. Following this idea, we propose an algorithm that consists of a preprocessing part and a fully convolutional network part. First, the preprocessing part is introduced, which roughly extracts the effective information of the input images with a designed combination of traditional methods, including cartoon-texture decomposition and Gabor filtering. Second, we discuss how to construct a deep neural network that predicts the local orientations and makes full use of the preprocessed fingerprints (Fig. 1).

Fig. 1.

The block diagram of the proposed method.

2.2 Latent Fingerprint Preprocessing

First, the LTV model, a nonlinear filter pair that retains both the essential features of Meyer's model and the simplicity and speed of the linear model, is used to decompose the images. Then, a Log-Gabor filter [15] is used to enhance the latent ridge structure in the marked ROI. Each latent image is divided into non-overlapping blocks of 64 × 64 pixels. To avoid the edge effect of the filter, only the 16 × 16 pixels in the center of each block are kept after filtering. In the frequency domain, the two-dimensional Log-Gabor transfer function is defined in two parts:

$$ G(w) = \exp\left( -\frac{\left[ \ln\left( w/w_{0} \right) \right]^{2}}{2\left[ \ln\left( k/w_{0} \right) \right]^{2}} \right) $$
(1)
$$ G(\theta) = \exp\left( -\frac{\left( \theta - \theta_{0} \right)^{2}}{2\sigma_{\theta}^{2}} \right) $$
(2)

The final Log-Gabor filter is obtained as follows:

$$ G(w,\theta) = G(w) \cdot G(\theta) $$
(3)

Since the center frequency of the Log-Gabor filter needs to be determined in advance, an automatic optimization method is used to find the appropriate frequency iteratively. Then a bank of 12 directional filters is generated and used to obtain responses in 12 directions, of which the two orientations with the highest responses are selected. Finally, the enhanced blocks are combined to generate the whole enhanced latent.
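For concreteness, the following is a minimal sketch of how such a 12-orientation Log-Gabor filter bank could be applied to one block in the frequency domain. The center frequency w0, the bandwidth ratio k/w0 and the angular spread σθ below are illustrative placeholder values, not the ones found by the automatic optimization described above.

```python
import numpy as np

def log_gabor_bank(block, w0=0.1, k_ratio=0.55, sigma_theta=np.pi / 12, n_orient=12):
    """Filter one 64 x 64 block with 12 oriented Log-Gabor filters in the
    frequency domain and return the responses, shape (n_orient, H, W).
    All parameter values are illustrative defaults, not the tuned ones."""
    h, w = block.shape
    fy, fx = np.meshgrid(np.fft.fftfreq(h), np.fft.fftfreq(w), indexing="ij")
    radius = np.hypot(fx, fy)
    radius[0, 0] = 1.0                              # avoid log(0) at the DC term
    theta = np.arctan2(fy, fx)

    # Radial part, Eq. (1): a Gaussian on the logarithmic frequency axis.
    radial = np.exp(-(np.log(radius / w0) ** 2) / (2 * np.log(k_ratio) ** 2))
    radial[0, 0] = 0.0                              # Log-Gabor has no DC component

    spectrum = np.fft.fft2(block)
    responses = []
    for i in range(n_orient):
        theta0 = i * np.pi / n_orient
        # Angular part, Eq. (2): distance on the orientation circle (period pi),
        # so ridges at theta0 and theta0 + pi are treated identically.
        d = np.abs(np.angle(np.exp(1j * (theta - theta0))))
        d = np.minimum(d, np.pi - d)
        angular = np.exp(-(d ** 2) / (2 * sigma_theta ** 2))
        # Eq. (3): the full transfer function is the product of both parts.
        responses.append(np.real(np.fft.ifft2(spectrum * radial * angular)))
    return np.stack(responses)
```

In the full pipeline, the two orientations with the highest response energy would then be selected for each block, and only its 16 × 16 center kept.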

2.3 Deep Regression Neural Network

DRNN.

Fingerprint orientation estimation can be regarded as a pixel-level segmentation problem after down-sampling. Instead of the classification networks widely used for image segmentation [14], a deep regression neural network (DRNN) is designed in this work. The outputs of the network are directly the predicted angles, allowing continuous estimates. Meanwhile, we find that with small samples and a relatively large number of categories, it is hard for classification networks to converge in practice. This is probably because, in a segmentation network, the last layer assigns every pixel to a class; this layer can be regarded as an aggregation of classification outputs, which is much sparser than the output of a single classification network. Suppose the aggregation's size is 20 × 20 × 90 (the setting in our network during training); then there are 20 × 20 = 400 ones in the aggregation and 20 × 20 × 89 = 35,600 zeros. Positive samples are far fewer than negative ones, as demonstrated in Fig. 2. A regression network, in contrast, has dense outputs and avoids this problem. The right panel of Fig. 2 shows the loss decrease over training iterations for the DRNN and the classification network. The two networks share the same backbone structure; the only difference is the final layer.
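As an illustration of the difference between the two output layers, the PyTorch-style sketch below contrasts a classification head with a regression head on the same backbone features; the 256-channel feature map and the layer names are hypothetical, chosen only to match the 20 × 20 × 90 example above.

```python
import torch
import torch.nn as nn

# Suppose the shared backbone ends with 256 feature channels on a 20 x 20 grid.
features = torch.randn(1, 256, 20, 20)

# Classification head: one of 90 discrete angle classes per pixel.
# Its one-hot target contains 400 ones vs. 35,600 zeros (sparse supervision).
cls_head = nn.Conv2d(256, 90, kernel_size=1)
cls_logits = cls_head(features)             # shape (1, 90, 20, 20)

# Regression head (DRNN): one continuous angle value per pixel.
# Every output element carries supervision, so the training signal is dense.
reg_head = nn.Conv2d(256, 1, kernel_size=1)
reg_angles = reg_head(features)             # shape (1, 1, 20, 20)
```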

Fig. 2.

(a) Demonstration of the classification layer of a segmentation network. Positive samples (red) are far fewer than negative ones (blue), causing sample imbalance during training. (b) Convergence curves of the classification network (top) and the regression network (bottom) with the same structure except for the final output layer. The x axis is the number of training iterations and the y axis is the normalized loss. The regression network converges much faster: its loss becomes stable after 800 iterations, while that of the classification network is still decreasing after 1800 iterations. Note that the final magnitudes of the losses do not reflect the two networks' performance (Color figure online).

Boosting Structure.

A new structure derived from the traditional boosting algorithm is introduced into the output part of the proposed DRNN. Boosting is a general machine learning technique for improving the accuracy of any given learning algorithm [16]. A boosting algorithm requires several weak learners and fuses their outputs with a certain strategy. As a result, the boosting structure solves the problem of discontinuity around 0° and produces a much more accurate output.

Our expected outputs are angles ranging from 0 to 180°. Angles near 0° and those approaching 180° are continuous in physical meaning but have a huge gap in scale, which causes mutations in the labels, as displayed in Fig. 3. This is the problem of discontinuity around 0°, and labels around 0° in physical meaning are called bad zones in the rest of this paper. Convolutional layers have the property of smoothing neighboring outputs, after which bad-zone outputs deviate. As shown in the left panel of Fig. 3, labels closer to a bad zone result in larger deviations in the outputs; the output changes by nearly 90° when the label is close to 0°. For this reason, if the regression result is directly taken as the final output, the proposed DRNN is only a weak learner in this situation. In this work, the boosting algorithm is introduced to upgrade this weak learner.
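A toy example makes this deviation concrete: linearly smoothing two almost parallel orientations that lie on opposite sides of the wrap-around point pulls the result toward 90°. The doubled-angle average in the last lines is shown only for comparison and is not part of the proposed method, which instead relies on the boosting structure described next.

```python
import numpy as np

# Two neighbouring pixels whose true ridge orientations are almost identical
# (2 deg and 178 deg differ by only 4 deg on the orientation circle).
neighbours = np.array([2.0, 178.0])

# Linear smoothing, as performed implicitly by convolutional layers,
# treats the labels as ordinary real numbers:
print(neighbours.mean())        # 90.0 -> roughly 90 deg away from the true orientation

# Doubling the angles before averaging removes the 0/180 deg wrap-around:
doubled = np.deg2rad(2 * neighbours)
circular_mean = np.rad2deg(np.angle(np.mean(np.exp(1j * doubled)))) / 2 % 180
print(circular_mean)            # ~0.0 -> the physically correct average
```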

Fig. 3.

Illustration of the label discontinuity around 0° caused by the angle definition (left). A cliff-type descent is observed, which is extremely harmful to network performance. Example of network outputs with a single pass way (middle) and after using the boosting structure (right). Predictions biased toward 90° in the middle panel are clearly corrected by the boosting structure.

Instead of only one output layer of angles, the network is adjusted to three identical pass ways, each with a different definition of 0°. Figure 4 shows the angle definitions of the three pass ways, in which the definitions for pass ways 2 and 3 are obtained from pass way 1 by (4) and (5). After this transformation, the bad zones of the three pass ways no longer overlap.

Fig. 4.

Demonstration of the angle definitions in the three pass ways. Angles increase counterclockwise. The 0° and 180° of the last two pass ways are defined as the first one's 60° and 120°, respectively.

$$ x_{2}' = \left\{ \begin{array}{ll} x_{2} + 120, & x_{2} < 60 \\ x_{2} - 60, & x_{2} \ge 60 \end{array} \right. $$
(4)
$$ x_{3}' = \left\{ \begin{array}{ll} x_{3} + 60, & x_{3} < 120 \\ x_{3} - 120, & x_{3} \ge 120 \end{array} \right. $$
(5)

The outputs of the three pass ways are first reversed to the normal definition, and outputs 1, 2 and 3 are the single results of the three pass ways at the same position. The output strategy of the network is then: if the difference between outputs 1 and 2 is less than 10°, the output is the average of the first two output channels; otherwise the output is the last one. At the modest cost of network simplicity, the impact of bad zones is eliminated, bringing a large improvement in output accuracy. Detailed data are given in the experiments section, and a minimal sketch of the fusion rule is given below.
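The sketch below implements this fusion rule as described, assuming per-pixel angle maps in degrees; details not specified in the text, such as the array layout, are filled in as assumptions.

```python
import numpy as np

def fuse_passways(out1, out2, out3, tol=10.0):
    """Fuse the three pass-way regressions into one orientation field (degrees).

    A minimal sketch of the fusion rule described above; out1/out2/out3 are
    per-pixel angle maps, each in its own 0-degree definition.
    """
    # Reverse pass ways 2 and 3 to the common (pass way 1) definition:
    # their zero axes sit at 60 and 120 degrees of pass way 1, respectively.
    out2 = (out2 + 60.0) % 180.0
    out3 = (out3 + 120.0) % 180.0

    # If pass ways 1 and 2 agree within the tolerance, average them;
    # otherwise one of them is near its bad zone, so fall back to pass way 3,
    # whose bad zone (around 120 degrees) does not overlap with theirs.
    agree = np.abs(out1 - out2) < tol
    return np.where(agree, (out1 + out2) / 2.0, out3)
```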

Network Architecture.

In practice, fingerprint images differ in size and aspect ratio, so a fully convolutional network is proposed for this task. The first part of the network consists of three Conv-ReLU blocks. Instead of pooling, a convolutional layer with stride 2 is used in each block to compress the variables, giving 8× down-sampling in total. This is because pooling layers create invariance to small shifts and distortions [17], which is an advantage in object detection tasks, whereas this task is sensitive to local rotation. Following the results in [18], the kernel sizes of the first part are set to 7 × 7, 5 × 5 and 3 × 3, respectively (Fig. 5).
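A minimal PyTorch-style sketch of this first part is given below; the kernel sizes and strides follow the text, while the channel widths are illustrative placeholders.

```python
import torch.nn as nn

# Sketch of the first part of the network: three Conv-ReLU blocks.
# Each block down-samples by 2 with a strided convolution instead of pooling,
# giving 8x down-sampling in total. Channel widths are illustrative only.
front_end = nn.Sequential(
    nn.Conv2d(1,  32, kernel_size=7, stride=2, padding=3), nn.ReLU(inplace=True),
    nn.Conv2d(32, 64, kernel_size=5, stride=2, padding=2), nn.ReLU(inplace=True),
    nn.Conv2d(64, 128, kernel_size=3, stride=2, padding=1), nn.ReLU(inplace=True),
)
```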

Fig. 5.

Detailed network architecture. Pooling layers are replaced by stride-2 convolutional layers. Each pass way generates area information at two different scales. The three pass ways have the same kernel sizes and together constitute the boosting structure.

The second part of the network uses ASPP [13] layers of the same size in three parallel pass ways. In each pass way, two atrous convolutional layers with different sample rates are deployed, and the feature maps of both layers are fused together. The final layer is the direct overlap of the three pass ways' outputs; applying the boosting algorithm, the predicted orientation field is produced.
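The following sketch shows one possible realization of a single pass way under these assumptions; the channel widths and the two atrous sample rates (6 and 12) are placeholders rather than the trained configuration.

```python
import torch
import torch.nn as nn

class PassWay(nn.Module):
    """One of the three parallel pass ways: two atrous convolutions with
    different sample rates whose feature maps are fused, followed by a
    1x1 regression head producing one angle map. Channel widths and the
    dilation rates are illustrative, not the trained values."""

    def __init__(self, in_ch=128, mid_ch=128):
        super().__init__()
        self.branch_a = nn.Conv2d(in_ch, mid_ch, 3, padding=6,  dilation=6)
        self.branch_b = nn.Conv2d(in_ch, mid_ch, 3, padding=12, dilation=12)
        self.head = nn.Conv2d(mid_ch, 1, kernel_size=1)

    def forward(self, x):
        # Fuse the two atrous branches, then regress one angle per pixel.
        fused = torch.relu(self.branch_a(x)) + torch.relu(self.branch_b(x))
        return self.head(fused)

# Three pass ways with identical kernel sizes but different 0-degree label
# definitions; their outputs are combined by the boosting rule sketched above.
pass_ways = nn.ModuleList([PassWay() for _ in range(3)])
```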

Label, Loss Function and Training.

As the second part of the network has three pass ways, the labels are also transformed to match the designed regression outputs. Instead of the traditional quadratic error between the labels and the regression results, the loss function is defined as:

$$ loss = \frac{1}{NM}\sum \left( 1 - \left( 20 \cdot labels - 1 \right)^{2} \right) \cdot 100 \cdot \left( new\_labels - reg\_result \right)^{2} $$
(6)

where N is the size of the output orientation field, M is the batch size, \( reg\_result \) is the regression result of the network's second part, labels denotes the original labels, and new_labels denotes the transformed labels. According to the scale of the loss, the scale of the labels can be adjusted by multiplying by a constant; to some extent, the DRNN's convergence speed can be controlled in this way. In our experiments, rather than [0, 180), we found that smaller labels mapped into the range [0, 0.01) help the network converge much more easily. To improve the accuracy of the results, the weight \( 1 - \left( 20 \cdot labels - 1 \right)^{2} \) is added to the loss function, so that bad zones receive negligible weight: we do not care what the bad zones predict and only consider the accuracy of the effective areas. To speed up training and improve network performance, all input images are first normalized and masked.
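Written out as code, Eq. (6) is a weighted mean-squared error. The sketch below assumes PyTorch tensors and labels already rescaled as described; it illustrates the formula rather than reproducing the authors' implementation.

```python
import torch

def drnn_loss(reg_result, new_labels, labels):
    """Weighted regression loss of Eq. (6), written as a PyTorch sketch.

    reg_result : regression output of the network's second part
    new_labels : labels transformed into each pass way's angle definition
    labels     : original (rescaled) labels, used only to weight the loss
    The 1/(N*M) factor is handled by the mean over all elements.
    """
    weight = 1.0 - (20.0 * labels - 1.0) ** 2     # bad zones get negligible weight
    return (weight * 100.0 * (new_labels - reg_result) ** 2).mean()
```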

After reversing the regression result to a single channel using the method of the boosting structure, the accuracy is defined as:

$$ accuracy = 1 - \frac{\sum \left| labels - output \right|}{N} $$
(7)

In the training process, to increase the number of samples, we segment the training images into overlapping 160 × 160 blocks. Latent fingerprints are used directly as inputs; their labels are of lower quality than those of library fingerprints, but their patterns match the required inputs. In the testing process, we use the test images directly as input, because the input size of our system is not constrained and the impact of edge effects can be eliminated.
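A minimal sketch of this block extraction is given below. The stride of 80 pixels and the 1/8-resolution label crop, which matches the network's 8× down-sampling, are our own assumptions for illustration; the text only states that the 160 × 160 blocks overlap.

```python
import numpy as np

def extract_training_blocks(image, label, block=160, stride=80):
    """Cut a training fingerprint and its orientation label map into
    overlapping 160 x 160 blocks. The label map is assumed to be stored at
    1/8 of the image resolution, matching the network's down-sampling."""
    blocks = []
    h, w = image.shape[:2]
    for y in range(0, h - block + 1, stride):
        for x in range(0, w - block + 1, stride):
            blocks.append((image[y:y + block, x:x + block],
                           label[y // 8:(y + block) // 8, x // 8:(x + block) // 8]))
    return blocks
```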

3 Experiments

3.1 Database

The database used in this paper was collected by Beijing Hisign Technology Co., Ltd, a winner of FVC-Ongoing 2017. The fingerprints are divided into two groups, library fingerprints and latent fingerprints; every latent image has a matched library fingerprint image, for a total of 2164 pairs. 500 pairs are used as testing samples and the rest are used for training. Each latent fingerprint is 512 × 512 pixels at 500 ppi, and each library fingerprint is 640 × 640 pixels at 500 ppi. The latent images' orientations are to be estimated and used to enhance the input latent fingerprints. Lacking ground-truth orientation information, labels are produced by the fingerprint recognition SDK of Beijing Hisign Technology Co., Ltd. The library fingerprints' labels are more accurate, while the latent images' labels contain more mistakes.

3.2 Identification Performance

To evaluate the quality of our output orientations, an objective comparison with other methods is made. The Gabor-based algorithm extracts the orientation field from the Gabor phase. The template-based algorithm extracts orientation fields by first clustering label block templates and then classifying fingerprint blocks into templates with a learned deep network. FingerNet extracts the orientation field with a learned fully convolutional network based on DeepLab v2, and is re-trained and tested on the same data set as ours. As our labels were generated with the Hisign SDK, the SDK's own performance is also reported. After obtaining the output orientation fields of each method, the same enhancement method is used to fuse the orientation information with the latent fingerprint images. Finally, the Hisign SDK is used to obtain the matching accuracy of each method. The results are shown in Table 1.

Table 1. Matching results of each method on the testing dataset

The Cumulative Match Characteristic (CMC) curves of the above seven methods on the 500 latent images are shown in Fig. 6. Following the control variable principle, FingerNet (yellow) is re-trained and tested on the same data set as ours. For easier comparison, the results of some methods are placed separately in another figure below, so that the trend of the recognition rate curves can be seen clearly.

Fig. 6.

Identification performance (CMC curves) of different algorithms on all 500 latents.

The results show that our method achieves an accuracy of over 85% in the rank-1 matching test, which is clearly better than the Gabor-based or masking method. The result is also 1.6 percentage points higher than that obtained with FingerNet's outputs. The boosting structure and the preprocessing make a clear contribution to the improvement of output quality. Compared with the SDK's result, our method obtains some increase in accuracy, which indicates that the network has generalization ability and corrects some mistakes made by the SDK.

Figure 7 shows the threshold-recall curves of the proposed method and FingerNet. Recall is defined as the proportion of test images whose average angular precision is higher than the threshold. It shows that the proposed method produces results closer to the labels than FingerNet. Figure 8 visually compares the orientation fields from the top two algorithms on latent fingerprints, together with the original latent images. We observe that the proposed algorithm outperforms the other algorithms on latent fingerprints.
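For reference, the threshold-recall curve of Fig. 7 can be computed as sketched below, assuming one average angular precision value per test image; the function name and inputs are hypothetical.

```python
import numpy as np

def threshold_recall(per_image_precision, thresholds):
    """Recall at each threshold: the fraction of test images whose average
    angular precision exceeds that threshold (the curve shown in Fig. 7)."""
    per_image_precision = np.asarray(per_image_precision)
    return [float(np.mean(per_image_precision > t)) for t in thresholds]
```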

Fig. 7.

Threshold-recall curves of proposed method and FingerNet.

Fig. 8.

Result comparison of different methods on several cropped latents. (a) Original fingerprints, (b) orientation fields obtained by the proposed method, and (c)–(d) enhanced images obtained by the proposed method and the SDK method.

4 Conclusion and Future Work

We propose a complete system, comprising preprocessing and orientation estimation, to produce more accurate orientation fields for latent fingerprints. The system combines domain knowledge obtained from preprocessing with contextual information generated by the deep learning method, and outperforms other orientation estimation algorithms. For better and faster training, a regression rather than a classification network is designed to produce the output orientation field. To eliminate errors in bad zones, a boosting algorithm and a new output structure are adopted in the network design.

Future work will include (1) integration of the whole system, (2) optimization of the network and the preprocessing, and (3) extension of this system to enhancement and matching.