Skip to main content
Log in

A real-time personal authentication system based on incremental feature extraction and classification of audiovisual information

  • Original Paper
  • Published:
Evolving Systems Aims and scope Submit manuscript

Abstract

We propose a new approach to a real-time personal authentication system based on incrementally updated visual (face) and audio (voice) features of persons. The proposed system consists of real-time face detection, incremental audiovisual feature extraction, and incremental neural classifier model with long-term memory. The face detection part, a biologically motivated face-color preferable selective attention model first localizes face candidate regions in natural scenes, and then the Adaboost-based face detection identifies human faces from the localized face-candidate regions. The mel-frequency cepstral coefficient is used for vocal feature extraction of speakers. Moreover, incremental principal component analysis (IPCA) is used to reduce the dimensions of audiovisual features and to update them incrementally. The features extracted by IPCA is fed to the resource allocating network with long-term memory which learns facial and vocal features incrementally and recognizes faces in real time. Experimental results show that the proposed system can enhance the test performance incrementally without serious forgetting. In addition, a multi-modal (facial and vocal) feature effectively increases the robustness of the personal authentication system in noisy environments.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13

Similar content being viewed by others

References

  • Angelov P, Filev D (2004a) Flexible models with evolving structure. Int J Intell Syst 19(4):327–340

    Article  MATH  Google Scholar 

  • Angelov P, Filev D (2004b) An approach to online identification of Takagi-Sugeno fuzzy models. IEEE Trans Syst Man Cybern B Cybern 34(1):484–498

    Article  Google Scholar 

  • Ban SW, Lee M, Yang HS (2004) A face detection using biologically motivated bottom-up saliency map model and top-down perception model. Neurocomputing 56:475–480

    Article  Google Scholar 

  • Campbell JP (1997) Speaker recognition: a tutorial. Proc IEEE 85(9):1437–1462

    Article  Google Scholar 

  • Carpenter GA, Grossberg S (1988) The ART of adaptive pattern recognition by a self-organizing neural network. Computer 21(3):77–88

    Article  Google Scholar 

  • Choi SB, Jung BS, Ban SW, Niitsuma H, Lee M (2006) Biologically motivated vergence control system using human-like selective attention model. Neurocomputing 69:537–558

    Article  Google Scholar 

  • Davis SB, Mermelstein P (1980) Comparison of parametric representations for mono-syllabic word recognition in continuously spoken sentences. IEEE Trans Acoust Speech Signal Process 28(4):357–366

    Article  Google Scholar 

  • Goldstein EB (1996) Sensation and perception, 4th edn. An International Thomson Publishing Company, Mexico

    Google Scholar 

  • Hall P, Martin R (1998) Incremental eigenanalysis for classification. In: Proceedings of British machine vision conference, vol 1, pp 286–295

  • Harris S (2008) All-in-one CISSP exam guide, 4th edn. McGraw-Hill, pp 184–185

  • Hasan R, Jamil M, Rabbani G, Rahman S (2004) Speaker identification using mel frequency cepstral coefficients. In: Proceedings of 3rd international conference on electrical and computer engineering (ICECE), pp 565–568

  • Haykin S (1999) Neural networks—a comprehensive foundation, 2nd edn. Prentice Hall, Englewood Cliffs

  • Iglesias JA, Angelov P, Ledezma A, Sanchis A (2010) Evolving classification of agent’s behaviors: a general approach. Evol Syst 1(3):161–172

    Article  Google Scholar 

  • Jeong S, Ban SW, Lee M (2008) Stereo saliency map considering affective factors and selective motion analysis in a dynamic environment. Neural Netw 21(10):1420–1430

    Article  Google Scholar 

  • Kasabov NK, Song Q (2002) DENFIS: Dynamic evolving neuralfuzzy inference system and its application for time-series prediction. IEEE Trans Fuzzy Syst 10(2):144–154

    Article  Google Scholar 

  • Kim B, Ban SW, Lee M (2008) Improving AdaBoost based face detection using face-color preferable selective attention. In: Intelligent data engineering and automated learning—IDEAL 2008, LNCS 5326. Springer, Berlin, pp 88–95

  • Matyás V, Ríha Z (2002) Biometric authentication—security and usability. In: Proceedings of IFIP TC6/TC11 sixth joint working conference on communications and multimedia security, pp 227–239

  • Otsu N (1979) A threshold selection method from gray-level histogram. IEEE Trans System Man Cybern 9(1):62–66

    Article  MathSciNet  Google Scholar 

  • Ozawa S, Toh SL, Abe S, Pang S, Kasabov N (2005) Incremental learning of feature space and classifier for face recognition. Neural Netw 18(5–6):575–584

    Article  Google Scholar 

  • Ozawa S, Pang S, Kasabov N (2008) Incremental learning of chunk data for on-line pattern classification systems. IEEE Trans Neural Netw 19(6):1061–1074

    Article  Google Scholar 

  • Pacheco J, Rubio J, Guillen J (2009) Detection and following of a face in movement using a neural network. In: Advances in neural networks research—ISNN 2009, AISC 56. Springer, Berlin, pp 481–490

  • Park HM (2003) Adaptive filtering methods for acoustic noise reduction and noisy speech recognition. Doctor’s thesis, Department of Electrical Engineering and Computer Science, Division of Electrical Engineering, Korea Advanced Institute of Science and Technology

  • Platt J (1991) A resource-allocating network for function interpolation. Neural Comput 3(2):213–225

    Article  MathSciNet  Google Scholar 

  • Rubio JJ (2009) SOFMLS: online self-organizing fuzzy modified least square network. IEEE Trans Fuzzy Syst 17(6):1296–1309

    Article  MathSciNet  Google Scholar 

  • Rubio JJ, Pacheco J (2009) An stable online clustering fuzzy neural network for nonlinear systems identification. Neural Comput Appl 18(6):633–641

    Article  Google Scholar 

  • Rubio JJ, Vazquez DM, Pacheco J (2010) Backpropagation to train an evolving radial basis function neural network. Evol Syst 1(3):173–180

    Article  Google Scholar 

  • Viola P, Jones MJ (2004) Robust real-time face detection. Int J Comput Vision 57(2):137–154

    Article  Google Scholar 

  • Xu M, Duan L-Y, Cai J, Chia L-T, Xu C, Tian Q (2004) HMM-based audio keyword generation. In: Advances in multimedia information processing, LNCS, vol 3333. Springer, Berlin, pp 566–574

Download references

Acknowledgments

This research was supported by the Converging Research Center Program funded by the Ministry of Education, Science and Technology (2010K001130) (50%) and also the National Research Foundation of Korea (NRF) Grant (NRF-2010-616-D00096) (50%).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Minho Lee.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Jang, YM., Lee, M. & Ozawa, S. A real-time personal authentication system based on incremental feature extraction and classification of audiovisual information. Evolving Systems 2, 261–272 (2011). https://doi.org/10.1007/s12530-011-9033-2

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s12530-011-9033-2

Keywords

Navigation