Abstract
Hyperspectral imaging technology, combining traditional imaging and spectroscopy technologies to simultaneously acquire spatial and spectral information, is deemed to be an intuitive medium for robust face recognition. However, the intrinsic structure of hyperspectral images is more complicated than ordinary gray-scale or RGB images, how to fully explore the discriminant and correlation features with only a limited number of hyperspectral samples for deep learning training has not been well studied. In response to these problems, this paper proposes an end-to-end multiscale fusion lightweight convolution neural network (CNN) framework for hyperspectral face recognition, termed as the features fusion with channel attention network (FFANet). Firstly, to capture richer subtle details, we introduce Second-Order Efficient Channel Attention (SECA) as the variant of Efficient Channel Attention (ECA) into the framework. The difference from ECA is that SECA can extract the second-order information of each channel to improve the network’s feature extraction ability and is more suitable for the complexity of hyperspectral data. Secondly, we further fuse multiscale information to yield a comprehensive and discriminative representation learning. Finally, the joint of Self-Supervision and Knowledge Distillation (SSKD) is exploited to train an efficient deep model, which can learn more dark knowledge from the trained teacher network. The experimental results on three benchmark hyperspectral face databases of PolyU, CMU, and UWA show that the proposed approach has achieved competitive accuracy and efficiency on the basis of significantly reducing the storage space and computation overheads. These characteristics also show its wide applicability on edge/mobile devices.
Similar content being viewed by others
References
Liang H, Gao J, Qiang N (2021) A novel framework based on wavelet transform and principal component for face recognition under varying illumination. Appl Intell 51:1762–1783
Li Y, Xie T, Wang P, Wang J, Liu S, Zhou X, Zhang X (2018) Joint spectral-spatial hyperspectral image classification based on hierarchical subspace switch ensemble learning algorithm. Appl Intell 48:4128–4148
Osia N, Bourlai T (2014) A spectral independent approach for physiological and geometric based face recognition in the visible, middle-wave and long-wave infrared bands. Image Vision Comput 32:847–859
Uzair M, Mahmood A, Mian A (2015) Hyperspectral face recognition with spatio-spectral information fusion and PLS regression. IEEE Trans Image Process 24:1127–1137
Wei D, Zhang L, Hu N, Liu L, Ma N, Zhao Y (2017) Hyperspectral face recognition with spatial-spectral fusion information and gabor feature. Trans Beijing Inst Technol 37:1077–1083
Uzair M, Mahmood A, Mian A (2013) Hyperspectral face recognition using 3d-DCT and partial least squares. In: Proceedings of the British machine vision conference, pp 1–7
Shen L, Zheng S (2012) Hyperspectral face recognition using 3D Gabor wavelets. In: Proceedings of the international conference on pattern recognition, pp 1574–1577
Liang J, Zhou J, Gao Y (2015) 3D local derivative pattern for hyperspectral face recognition. In: IEEE international conference and workshops on automatic face and gesture recognition, pp 1–6
Aman G, Hasan D (2018) 3D Discrete wavelet transform-based feature extraction for hyperspectral face recognition. IET Biomet 7:49–55
Pan Z, Healey G, Tromberg B (2009) Comparison of Spectral-Only and Spectral/Spatial face recognition for personal identity verification EURASIP. Journal Advances In Signal Processing
Xie Z, Li Y, Niu J, Shi L (2019) Hyperspectral face recognition using block based convolution neural network and AdaBoost band selection. Int Conf Syst Inform 1270–1274
Di W, Zhang L, Zhang D, Pan Q (2010) Studies on hyperspectral face recognition in visible spectrum with feature band selection. IEEE Trans Syst Man Cybern Part A Syst Humans 40:1354–1361
Li Y, Wu Y, Zhang N (2018) An image-level classification framework for Hyperspectral image with CNNs. In: Proceedings of 14th IEEE international conference on signal processing (ICSP), pp 586–590
Xie Z, Niu J, Li Y (2020) Hyperspectral face recognition based on SLRC for single sample problem. Proc SPIE 11428:114280Q
Appice A, Guccione P, Acciaro E, Malerba D (2020) Detecting salient regions in a bi-temporal hyperspectral scene by iterating clustering and classification. Appl Intell 50:3179–3200
Taherkhani F, Dawson J, Nasrabadi N Prasad S, Chanussot J (eds) (2020) Deep sparse band selection for hyperspectral face recognition. Springer, Cham
Hsieh T, Kiang F (2020) Comparison of CNN algorithms on hyperspectral image classification in agricultural lands. Sensor 20:1734
Wu X, He R, Sun Z (2018) A light CNN for deep face representation with noisy labels. In: IEEE transactions on information forensics and security, vol 13, pp 2884–2896
Wang Q, Wu B, Zhu P, Li P (2020) ECA-Net: efficient channel attention for deep convolutional neural networks. In: IEEE conference on computer vision and pattern recognition, pp 11534–11542
Larochelle H, Hinton GE (2010) Learning to combine foveal glimpses with a third-order boltzmann machine. In: Proceedings of the 23rd international conference on neural information processing systems, pp 143–1251
Hu J, Shen L, Albanie S (2017) Squeeze-and-excitation Networks. In: IEEE conference on computer vision and pattern recognition, pp 7132–7141
Gardner MW, Dorling SR (1998) Artificial neural networks (the multilayer perceptron)—a review of applications in the atmospheric sciences. Atmospheric Environ 32:2627–2636
Woo S, Park J, Lee J, Kweon IS (2018) CBAM: Convolutional Block attention module. In: European conference on computer vision, pp 3–19
Hu J, Shen L, Albanie S, Sun G (2018) Gather-excite: Exploiting feature context in convolutional neural networks. In: Proceedings of international conference on neural information processing system, pp 1–11
Chollet F (2017) Xception: Deep learning with depthwise separable convolutions. In: IEEE conference on computer vision and pattern recognition, pp 1251–1258
Li X, Lei L, Sun Y, Li M, Kuang G (2020) Multimodal bilinear fusion network with Second-Order Attention-Based channel selection for land cover classification. IEEE J Select Topics Appl Earth Observ Remote Sens 13:1011–1026
Roy AG, Navab N, Wachinge C (2019) Recalibrating fully convolutional networks with spatial and channel squeeze and excitation blocks. IEEE Trans Med Imaging 38:540–549
Hinton G, Vinyals O, Dean J (2015) Distilling the knowledge in a neural network NIPS. Deep Learning Workshop
Zagoruyko S, Komodakis N (2017) Paying more attention to attention: Improving the performance of convolutional neural networks via attention transfer. International Conference on Learning Representations
Yim J, Joo D, Bae J, Kim J (2017) A gift from knowledge distillation: Fast optimization, network minimization and transfer learning. In: IEEE conference on computer vision and pattern recognition, pp 4133–4141
Kim J, Park S, Kwak N (2018) Paraphrasing complex network: Network compression via factor transfer. Neural Inform Process Syst 2760–2769
Liu Y, Cao J, Li B, Yuan C, Hu W, Li Y, Duan Y (2019) Knowledge distillation via instance relationship graph. IEEE Conf Comput Vision Pattern Recognit 7096–7104
Xu G, Liu Z, Li X, Loy C (2020) Knowledge distillation meets Self-Supervision. European Conf Comput Vision 588–604
Gao Z, Xie J, Wang Q, Li P (2019) Global second-order pooling convolutional networks. IEEE Conf Comput Vision Pattern Recognit 3024–3033
Lin M, Chen Q, Yan S (2014) Network in network. International Conference on Learning Representation
Simonyan K, Zisserman A (2015) Very deep convolutional networks for large-scale image recognition. International Conference on Learning Representation
Mu G, Huang D, Hu G, Sun J, Wang Y (2019) Led3d: A Lightweight and Efficient Deep Approach to Recognizing Low-quality 3D Faces. IEEE Conf Comput Vision Pattern Recognit 5773–5782
PolyU-HSFD (2010) http://www4.comp.polyu.edu.hk/biometrics
Denes L, Metes P, Liu Y (2002) Hyperspectral face database, Robotics Inst Pittsburgh, PA, Tech. Rep CMU-RI-TR-02-25
Yi D, Lei Z, Liao S, Li SZ (2014) Learning face representation from scratch, arXiv:1411.7923
Wang K, Gao X, Zhao Y, Li X, Dou D, Xu C-Z (2020) Pay attention to features, transfer learn faster CNNs. In: International conference on learning representations
Robila S (2008) Toward hyperspectral face recognition. Image Processing: Algorithms and Systems VI 6812
Krizhevsky A, Sutskever I, Hinton G (2012) ImageNet classification with deep convolutional neural networks. In: Conference and workshop on neural information processing systems, p 25
mollahosseini A, chan D, mahoor MH (2016) Going deeper in facial expression recognition using deep neural networks. In: IEEE winter conference on applications of computer vision, pp 1–10
He K, Zhang X, Ren S (2016) Deep residual learning for image recognition. In: IEEE conference on computer vision and pattern recognition, pp 770–778
Howard A, Sandler M, Chu G, Chen L, et al. (2019) Searching for MobileNet v3. IEEE Int Conf Comput Vision 2019:1314–1324
Ma N, Zhang X, Zheng H, Sun J (2018) ShuffleNet v2: Practical Guidelines for efficient CNN architecture design. European Conf Comput Vision 2018:116–131
Chen S, Liu Y, Gao X, Han Z (2018) Mobilefacenets: efficient CNNs for accurate real-time face verification on Mobile Devices. Chinese Conf Biomet Recognit 2018:428–438
Zhao M, Jia Z, Cai Y, Chen X, Gong D (2021) Advanced variations of two-dimensional principal component analysis for face recognition. Neurocomputing, 446, In Press
Wu F, Jing X, Dong X, Hu R, Yue D, Wang L, Wang R, Guo C (2020) Intraspectrum discrimination and interspectrum correlation analysis deep network for multispectral face recognition. IEEE Trans Cybern 45(2):242–252
Acknowledgements
This research is supported by the National Nature Science Foundation of China (No.61861020), Science & Technology Project of Education Bureau of Jiangxi Province (No. GJJ190578)
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Niu, JY., Xie, ZH., Li, Y. et al. Scale fusion light CNN for hyperspectral face recognition with knowledge distillation and attention mechanism. Appl Intell 52, 6181–6195 (2022). https://doi.org/10.1007/s10489-021-02721-8
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10489-021-02721-8