SpPCANet: a simple deep learning-based feature extraction approach for 3D face recognition

Dutta, Koushik; Bhattacharjee, Debotosh; Nasipuri, Mita

doi:10.1007/s11042-020-09554-6

SpPCANet: a simple deep learning-based feature extraction approach for 3D face recognition

Published: 20 August 2020

Volume 79, pages 31329–31352, (2020)
Cite this article

Multimedia Tools and Applications Aims and scope Submit manuscript

535 Accesses
12 Citations
Explore all metrics

Abstract

A Sparse Principal Component Analysis Network (SpPCANet) based feature extraction is proposed here for 3D face recognition. The network consists of three basic components: (1) Multistage sparse principal component analysis filters, (2) Binary hashing, and (3) Block-wise histogram computation. Here, the sparse principal component analysis is used to learn multistage filter banks at the convolution stage, which is followed by binary hashing for indexing and block-wise histogram for pooling. Finally, a linear support vector machine (SVM) is used for classifying the features extracted by SpPCANet. The proposed network SpPCANet is a lightweight deep learning network. Three well-known 3D face databases, namely, Frav3D, Bosphorus3D, and Casia3D, are used for validating the proposed system. This proposed network has been extensively studied by varying different parameters, such as the number of filters at the convolution layer and the size of filters at the convolution layer and size of non-overlapping blocks at the pooling layer. Handling all types of variation of faces available in Frav3D, Bosphorus3D, and Casia3D databases, the system has acquired 96.93%, 98.54%, and 88.80% recognition rates, respectively.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Hybrid descriptors and Weighted PCA-EFMNet for Face Verification in the Wild

Article 09 July 2019

2DPCANet: a deep leaning network for face recognition

Article 04 July 2017

Spatial pyramid face feature representation and weighted dissimilarity matching for improved face recognition

Article 28 August 2017

References

Alain G, Bengio Y (2014) What regularized auto-encoders learn from the data-generating distribution. J Mach Learn Res 15:3743–3773
MathSciNet MATH Google Scholar
Alyuz N, Gokberk B, Dibeklioglu H, Savran A, Salah AA, Akarun L, Sankur B (2008) 3D Face Recognition Benchmarks on the Bosphorus3D Database with Focus on Facial Expressions. In: Schouten B, Juul NC, Drygajlo A, Tistarelli M (eds) Biometrics and Identity Management. BioID 2008. Lecture notes in computer science, vol 5372. Springer, Berlin Heidelberg, pp 57–66
Google Scholar
Bagchi P, Bhattacharjee D, Nasipuri M (2015). 3D face recognition using surface Normals. In: Proc. IEEE region 10 conference, TENCON - 2015. https://doi.org/10.1109/TENCON.2015.7372819
Besl PJ, McKay ND (1992) A method for registration of 3-D shapes. IEEE trans. Pattern anal. Mach. Intell. (T-PAMI) 14(2):239–256
Article Google Scholar
BOSPHORUS3D3D: http://bosphorus.ee.boun.edu.tr/default.aspx
Bruna J, Mallat S (2013) Invariant scattering convolution networks. IEEE trans. Pattern anal. Mach. Intell. (T-PAMI) 35(8):1872–1886
Article Google Scholar
CASIA3D: http://www.idealtest.org/dbDetailForUser.do?id=8
Chan TH, Jia K, Gao S, Lu J, Zeng Z, Ma Y (2015) PCANet: a simple deep learning baseline for image classification? IEEE Trans Image Process 24(12):5017–5032
Article MathSciNet Google Scholar
Chandrakala M, Ravi S (2018) Effective 3D face recognition technique based on Gabor and LTP features. International Journal of Engineering and Advanced Technology (IJEAT) 8(2S):284–290
Google Scholar
Chhatrala R, Jadhav D (2017) Gait recognition based on curvelet transform and PCANet. Pattern Recog. Image Anal 27(3):525–531. https://doi.org/10.1134/S1054661817030075
Article Google Scholar
Chouchane A, Belahcene M (2015) 3D and 2D face recognition using integral projection curves based depth and intensity images. Int J Intell Syst Technol Appl 14(1):50–69
Google Scholar
Chouchane A, Belahcene M, Ouamane A, Bourennane S (2014) 3D face recognition based on histograms of local descriptors. In: Proc. 4th international conference on image processing theory, tools and applications (IPTA), Paris, France. https://doi.org/10.1109/IPTA.2014.7001925
Cortes C, Vapnik V (1995) Support-vector networks. Mach Learn 20(3):273–297
MATH Google Scholar
Drira H, Amor BB, Srivastava A, Daoudi M, Slama R (2013) 3D face recognition under expressions, occlusions and pose variations. IEEE trans. Pattern anal. Mach. Intell. (T-PAMI) 35(9):2270–2283
Article Google Scholar
Dutta K, Bhattacharjee D, Nasipuri M (2016). Expression and occlusion invariant 3D face recognition based on region classifier. In: Proc. 1st international conference on information technology, information systems and electrical engineering (ICITISEE), pp. 99-104. https://doi.org/10.1109/ICITISEE.2016.7803055
Dutta K, Bhattacharjee D, Nasipuri M (2019) 3D face recognition based on volumetric representation of range image. In: Chaki R, Cortesi a, Saeed K, Chaki N (eds) advance computing and Systems for Security. Advance in Intelligent Systems and Computing 883:175–189. https://doi.org/10.1007/978-981-13-3702-4_11
Article Google Scholar
FRAV3D: http://www.frav.es/databases
Ganguly S, Bhattacharjee D, and Nasipuri M (2014). 2.5D face images: acquisition, processing and application. In Proc. ICC 2014 -computer networks and security, pp. 36-44
Ganguly S, Bhattacharjee D, Nasipuri M (2014) 3D face recognition from range images based on curvature analysis. ICTACT Journal on image and video processing 4(3):748–753. https://doi.org/10.21917/ijivp.2014.0108
Article Google Scholar
Gilani SZ, Mian A (2016). Towards large-scale 3D face recognition. International Conference on Digital Image Computing: Techniques and Applications (DICTA)
Huang D, Du Y, He Q, Song W, Liu K (2016) Scene classification in high resolution remotely sensed images based on PCANet. Web Technologies and Applications, APWeb, Springer, Cham 9865:179–190. https://doi.org/10.1007/978-3-319-45835-9_16
Article Google Scholar
Krizhevsky A, Sutskever I, Hinton GE (2012) ImageNet classification with deep convolutional neural networks. In: Proc. 25th international conference on neural information processing systems. Lake Tahoe, Nevada, pp 1097–1105
Google Scholar
LeCun Y, Bottou L, Bengio Y, Haffner P (1998) Gradient-based learning applied to document recognition. In: Proc of the IEEE 86(11):2278–2324
Google Scholar
Lee H, Grosse R, Rananth R, and Ng A Y (2009). Convolutional deep belief networks for scalable unsupervised learning of hierarchical representations. In: Proc. 26th Annu. ICML, pp. 609–616. https://doi.org/10.1145/1553374.1553453
Li B, Dong Y, Zhao D, Wen Z, and Yang L (2016). A PCANet based method for vehicle make recognition. In: Proc. 19th international conference on intelligent transportation systems (ITSC), IEEE. Pp. 2404-2409. https://doi.org/10.1109/ITSC.2016.7795943
Li C, Tan Y, Wang D, Ma P (2017) Research on 3D face recognition method in cloud environment based on semi supervised clustering algorithm. Multimed Tools Appl 6:17055–17073
Article Google Scholar
Ng CJ, and Teoh ABJ (2015). DCTNet: a simple learning-free approach for face recognition. In: Proc. APSIPA, pp. 761–768. https://doi.org/10.1109/APSIPA.2015.7415375
Ouamane A, Belahcene M, Bourennane S (2013). Multimodal 3D and 2D face authentication approach using extended LBP and statistic local features proposed. In: Proc. European workshop on visual information processing (EUVIP), pp. 130-135
Parvathy SB, Naveen S, Moni RS (2014). A novel approach for multimodal face recognition system based on modular PCA. In: proc. 1st international conference on computational systems and communications (ICCSC), pp. 127-132. https://doi.org/10.1109/COMPSC.2014.7032634
Ratyal N, Taj IA, Sajid M, Mahmood A, Razzaq S, Dar SH, Ali N, Usman M, Baig MJA, Mussadiq U (2019). Deeply learned pose invariant image analysis with applications in 3D face recognition. Mathematical problems in engineering. https://doi.org/10.1155/2019/3547416
Schroff F, Kalenichenko D, and Philbin J (2015). FaceNet: a unified embedding for face recognition and clustering. In: Proc IEEE Conference on Computer Vision and Pattern Recognition (CVPR) https://doi.org/10.1109/CVPR.2015.7298682
Sghaier S, Farhat W, Souani C (2018) Novel technique for 3D face recognition using anthropometric methodology. International Journal of Ambient Computing and Intelligence 9(1):60–77
Article Google Scholar
Soltanpour S, Wu QMJ (2017). High-order local Normal derivative pattern (LNDP) for 3D face recognition. International conference on image processing (ICIP). Pp. 2811-2815
Szegedy C, Liu W, Jia Y, Sermanet P, Reed S, Anguelov D, Erhan D, Vanhoucke V, and Rabinovich A (2015). Going deeper with convolutions. In: Proc IEEE Conference on Computer Vision and Pattern Recognition (CVPR) https://doi.org/10.1109/CVPR.2015.7298594
Tian L, Fan C, Ming Y (2015). Stacked PCA network (SPCANet): an effective deep learning for face recognition. In: Proc. IEEE International Conference on Digital Signal Processing, pp. 1039–1043. https://doi.org/10.1109/ICDSP.2015.7252036
Tibshirani R (1996) Regression shrinkage and selection via the lasso. J R Stat Soc Ser B Methodol 58(1):267–288 https://www.jstor.org/stable/2346178
MathSciNet MATH Google Scholar
Tong M, Chen Y, Zhao M, Bu H, Xi S (2019) A deep discriminative and robust nonnegative matrix factorization network method with soft label constraint. Neural Comput & Applic 31:7447–7475. https://doi.org/10.1007/s00521-018-3554-6
Article Google Scholar
Tong M, Li M, Bai H, Lei M, Zhao M (2020) DKD–DAD: a novel framework with discriminative kinematic descriptor and deep attention-pooled descriptor for action recognition. Neural Comput & Applic 32:5285–5302. https://doi.org/10.1007/s00521-019-04030-1
Article Google Scholar
Tong M, Zhao M, Chen Y, Wang H (2019) D3-LND: a two-stream framework with discriminant deep descriptor, linear CMDT and nonlinear KCMDT descriptors for action recognition. Neurocomputing 325:90–100. https://doi.org/10.1016/j.neucom.2018.09.086
Article Google Scholar
Wang X, Ruan Q, Jin Y, and An G (2014). Three-dimensional face recognition under expression variation. EURASIP Journal on Image and Video Processing https://doi.org/10.1186/1687-5281-2014-51, 2014
Werghi N, Berretti S, Bimbo A D, Pala P (2013). Local descriptors matching for 3D face recognition. In: Proc. IEEE international conference on image processing (ICIP), Australia, pp. 3710-3714. https://doi.org/10.1109/ICIP.2013.6738765
Zeiler MD, Fergus R (2014) Visualizing and Understanding Convolutional Networks. In: Fleet D, Pajdla T, Schiele B, Tuytelaars T (eds) Computer Vision – ECCV 2014. Lecture notes in computer science. Springer, Cham. https://doi.org/10.1007/978-3-319-10590-1_53
Chapter Google Scholar
Zhang L, Ding Z, Li H, Shen Y, Lu J (2014) 3D face recognition based on multiple Keypoint descriptors and sparse representation. PLoS One 9(6):e100120. https://doi.org/10.1371/journal.pone.0100120
Article Google Scholar
Zhang Y, Geng T, Wu X, Zhou J, and Gao D (2018). ICANet: a simple cascade linear convolution network for face recognition. EURASIP Journal on Image and Video Processing https://doi.org/10.1186/s13640-018-0288-4, 2018
Zheng D, Du J, Fan W, Wang J, Zhai C (2016) Deep learning with PCANet for human age estimation. In: Huang DS, Jo KH (eds) international conference on intelligent computing (ICIC), lecture notes in computer science, springer. Cham. 9772:300–310. https://doi.org/10.1007/978-3-319-42294-7_26
Article Google Scholar
Zou H, Hastie T (2005) Regularization and variable selection via the elastic net. J R Stat Soc Ser B (Stat Methodol) 67(2):301–320 https://www.jstor.org/stable/3647580
Article MathSciNet Google Scholar
Zou H, Hastie T, Tibshirani R (2006) Sparse principal component analysis. J Comput Graph Stat 15(2):265–286. https://doi.org/10.1198/106186006X113430
Article MathSciNet Google Scholar

Download references

Acknowledgments

The first author is grateful to the Ministry of Electronics and Information Technology (MeitY), Govt. of India, for the grant of the Visvesvaraya doctorate fellowship award. The authors are also thankful to CMATER laboratory of the Department of Computer Science and Engineering, Jadavpur University, Kolkata, India, for providing the necessary infrastructure for this work.

Author information

Authors and Affiliations

Computer Science and Engineering, Jadavpur University, 188, Raja S. C. Maulik Road, Kolkata, 700032, India
Koushik Dutta, Debotosh Bhattacharjee & Mita Nasipuri

Authors

Koushik Dutta
View author publications
You can also search for this author in PubMed Google Scholar
Debotosh Bhattacharjee
View author publications
You can also search for this author in PubMed Google Scholar
Mita Nasipuri
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Koushik Dutta.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Dutta, K., Bhattacharjee, D. & Nasipuri, M. SpPCANet: a simple deep learning-based feature extraction approach for 3D face recognition. Multimed Tools Appl 79, 31329–31352 (2020). https://doi.org/10.1007/s11042-020-09554-6

Download citation

Received: 10 August 2019
Revised: 30 June 2020
Accepted: 06 August 2020
Published: 20 August 2020
Issue Date: November 2020
DOI: https://doi.org/10.1007/s11042-020-09554-6

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

SpPCANet: a simple deep learning-based feature extraction approach for 3D face recognition

Access this article

Similar content being viewed by others

Hybrid descriptors and Weighted PCA-EFMNet for Face Verification in the Wild

2DPCANet: a deep leaning network for face recognition

Spatial pyramid face feature representation and weighted dissimilarity matching for improved face recognition

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

SpPCANet: a simple deep learning-based feature extraction approach for 3D face recognition

Access this article

Similar content being viewed by others

Hybrid descriptors and Weighted PCA-EFMNet for Face Verification in the Wild

2DPCANet: a deep leaning network for face recognition

Spatial pyramid face feature representation and weighted dissimilarity matching for improved face recognition

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation