A real-time personal authentication system based on incremental feature extraction and classification of audiovisual information

Jang, Young-Min; Lee, Minho; Ozawa, Seiichi

doi:10.1007/s12530-011-9033-2

A real-time personal authentication system based on incremental feature extraction and classification of audiovisual information

Original Paper
Published: 03 June 2011

Volume 2, pages 261–272, (2011)
Cite this article

Evolving Systems Aims and scope Submit manuscript

Young-Min Jang¹,
Minho Lee¹ &
Seiichi Ozawa²

221 Accesses
5 Citations
Explore all metrics

Abstract

We propose a new approach to a real-time personal authentication system based on incrementally updated visual (face) and audio (voice) features of persons. The proposed system consists of real-time face detection, incremental audiovisual feature extraction, and incremental neural classifier model with long-term memory. The face detection part, a biologically motivated face-color preferable selective attention model first localizes face candidate regions in natural scenes, and then the Adaboost-based face detection identifies human faces from the localized face-candidate regions. The mel-frequency cepstral coefficient is used for vocal feature extraction of speakers. Moreover, incremental principal component analysis (IPCA) is used to reduce the dimensions of audiovisual features and to update them incrementally. The features extracted by IPCA is fed to the resource allocating network with long-term memory which learns facial and vocal features incrementally and recognizes faces in real time. Experimental results show that the proposed system can enhance the test performance incrementally without serious forgetting. In addition, a multi-modal (facial and vocal) feature effectively increases the robustness of the personal authentication system in noisy environments.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Real-Time Face Detection and Face Recognition: Study of Approaches

A Performance Analysis of Face and Speech Recognition in the Video and Audio Stream Using Machine Learning Classification Techniques

Face Recognition for Mobile Self-authentication with Online Model Update

References

Angelov P, Filev D (2004a) Flexible models with evolving structure. Int J Intell Syst 19(4):327–340
Article MATH Google Scholar
Angelov P, Filev D (2004b) An approach to online identification of Takagi-Sugeno fuzzy models. IEEE Trans Syst Man Cybern B Cybern 34(1):484–498
Article Google Scholar
Ban SW, Lee M, Yang HS (2004) A face detection using biologically motivated bottom-up saliency map model and top-down perception model. Neurocomputing 56:475–480
Article Google Scholar
Campbell JP (1997) Speaker recognition: a tutorial. Proc IEEE 85(9):1437–1462
Article Google Scholar
Carpenter GA, Grossberg S (1988) The ART of adaptive pattern recognition by a self-organizing neural network. Computer 21(3):77–88
Article Google Scholar
Choi SB, Jung BS, Ban SW, Niitsuma H, Lee M (2006) Biologically motivated vergence control system using human-like selective attention model. Neurocomputing 69:537–558
Article Google Scholar
Davis SB, Mermelstein P (1980) Comparison of parametric representations for mono-syllabic word recognition in continuously spoken sentences. IEEE Trans Acoust Speech Signal Process 28(4):357–366
Article Google Scholar
Goldstein EB (1996) Sensation and perception, 4th edn. An International Thomson Publishing Company, Mexico
Google Scholar
Hall P, Martin R (1998) Incremental eigenanalysis for classification. In: Proceedings of British machine vision conference, vol 1, pp 286–295
Harris S (2008) All-in-one CISSP exam guide, 4th edn. McGraw-Hill, pp 184–185
Hasan R, Jamil M, Rabbani G, Rahman S (2004) Speaker identification using mel frequency cepstral coefficients. In: Proceedings of 3rd international conference on electrical and computer engineering (ICECE), pp 565–568
Haykin S (1999) Neural networks—a comprehensive foundation, 2nd edn. Prentice Hall, Englewood Cliffs
Iglesias JA, Angelov P, Ledezma A, Sanchis A (2010) Evolving classification of agent’s behaviors: a general approach. Evol Syst 1(3):161–172
Article Google Scholar
Jeong S, Ban SW, Lee M (2008) Stereo saliency map considering affective factors and selective motion analysis in a dynamic environment. Neural Netw 21(10):1420–1430
Article Google Scholar
Kasabov NK, Song Q (2002) DENFIS: Dynamic evolving neuralfuzzy inference system and its application for time-series prediction. IEEE Trans Fuzzy Syst 10(2):144–154
Article Google Scholar
Kim B, Ban SW, Lee M (2008) Improving AdaBoost based face detection using face-color preferable selective attention. In: Intelligent data engineering and automated learning—IDEAL 2008, LNCS 5326. Springer, Berlin, pp 88–95
Matyás V, Ríha Z (2002) Biometric authentication—security and usability. In: Proceedings of IFIP TC6/TC11 sixth joint working conference on communications and multimedia security, pp 227–239
Otsu N (1979) A threshold selection method from gray-level histogram. IEEE Trans System Man Cybern 9(1):62–66
Article MathSciNet Google Scholar
Ozawa S, Toh SL, Abe S, Pang S, Kasabov N (2005) Incremental learning of feature space and classifier for face recognition. Neural Netw 18(5–6):575–584
Article Google Scholar
Ozawa S, Pang S, Kasabov N (2008) Incremental learning of chunk data for on-line pattern classification systems. IEEE Trans Neural Netw 19(6):1061–1074
Article Google Scholar
Pacheco J, Rubio J, Guillen J (2009) Detection and following of a face in movement using a neural network. In: Advances in neural networks research—ISNN 2009, AISC 56. Springer, Berlin, pp 481–490
Park HM (2003) Adaptive filtering methods for acoustic noise reduction and noisy speech recognition. Doctor’s thesis, Department of Electrical Engineering and Computer Science, Division of Electrical Engineering, Korea Advanced Institute of Science and Technology
Platt J (1991) A resource-allocating network for function interpolation. Neural Comput 3(2):213–225
Article MathSciNet Google Scholar
Rubio JJ (2009) SOFMLS: online self-organizing fuzzy modified least square network. IEEE Trans Fuzzy Syst 17(6):1296–1309
Article MathSciNet Google Scholar
Rubio JJ, Pacheco J (2009) An stable online clustering fuzzy neural network for nonlinear systems identification. Neural Comput Appl 18(6):633–641
Article Google Scholar
Rubio JJ, Vazquez DM, Pacheco J (2010) Backpropagation to train an evolving radial basis function neural network. Evol Syst 1(3):173–180
Article Google Scholar
Viola P, Jones MJ (2004) Robust real-time face detection. Int J Comput Vision 57(2):137–154
Article Google Scholar
Xu M, Duan L-Y, Cai J, Chia L-T, Xu C, Tian Q (2004) HMM-based audio keyword generation. In: Advances in multimedia information processing, LNCS, vol 3333. Springer, Berlin, pp 566–574

Download references

Acknowledgments

This research was supported by the Converging Research Center Program funded by the Ministry of Education, Science and Technology (2010K001130) (50%) and also the National Research Foundation of Korea (NRF) Grant (NRF-2010-616-D00096) (50%).

Author information

Authors and Affiliations

School of Electrical Engineering, Kyungpook National University, 1370 Sankyuk-Dong, Puk-Gu, Taegu, 702-701, Korea
Young-Min Jang & Minho Lee
Graduate School of Engineering, Kobe University, Rokko-dai, Nada, Kobe, 657-8501, Japan
Seiichi Ozawa

Authors

Young-Min Jang
View author publications
You can also search for this author in PubMed Google Scholar
Minho Lee
View author publications
You can also search for this author in PubMed Google Scholar
Seiichi Ozawa
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Minho Lee.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Jang, YM., Lee, M. & Ozawa, S. A real-time personal authentication system based on incremental feature extraction and classification of audiovisual information. Evolving Systems 2, 261–272 (2011). https://doi.org/10.1007/s12530-011-9033-2

Download citation

Received: 16 January 2011
Accepted: 20 April 2011
Published: 03 June 2011
Issue Date: December 2011
DOI: https://doi.org/10.1007/s12530-011-9033-2

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A real-time personal authentication system based on incremental feature extraction and classification of audiovisual information

Abstract

Access this article

Similar content being viewed by others

Real-Time Face Detection and Face Recognition: Study of Approaches

A Performance Analysis of Face and Speech Recognition in the Video and Audio Stream Using Machine Learning Classification Techniques

Face Recognition for Mobile Self-authentication with Online Model Update

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

A real-time personal authentication system based on incremental feature extraction and classification of audiovisual information

Abstract

Access this article

Similar content being viewed by others

Real-Time Face Detection and Face Recognition: Study of Approaches

A Performance Analysis of Face and Speech Recognition in the Video and Audio Stream Using Machine Learning Classification Techniques

Face Recognition for Mobile Self-authentication with Online Model Update

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation