Abstract
We propose a new approach to a real-time personal authentication system based on incrementally updated visual (face) and audio (voice) features of persons. The proposed system consists of real-time face detection, incremental audiovisual feature extraction, and incremental neural classifier model with long-term memory. The face detection part, a biologically motivated face-color preferable selective attention model first localizes face candidate regions in natural scenes, and then the Adaboost-based face detection identifies human faces from the localized face-candidate regions. The mel-frequency cepstral coefficient is used for vocal feature extraction of speakers. Moreover, incremental principal component analysis (IPCA) is used to reduce the dimensions of audiovisual features and to update them incrementally. The features extracted by IPCA is fed to the resource allocating network with long-term memory which learns facial and vocal features incrementally and recognizes faces in real time. Experimental results show that the proposed system can enhance the test performance incrementally without serious forgetting. In addition, a multi-modal (facial and vocal) feature effectively increases the robustness of the personal authentication system in noisy environments.
Similar content being viewed by others
References
Angelov P, Filev D (2004a) Flexible models with evolving structure. Int J Intell Syst 19(4):327–340
Angelov P, Filev D (2004b) An approach to online identification of Takagi-Sugeno fuzzy models. IEEE Trans Syst Man Cybern B Cybern 34(1):484–498
Ban SW, Lee M, Yang HS (2004) A face detection using biologically motivated bottom-up saliency map model and top-down perception model. Neurocomputing 56:475–480
Campbell JP (1997) Speaker recognition: a tutorial. Proc IEEE 85(9):1437–1462
Carpenter GA, Grossberg S (1988) The ART of adaptive pattern recognition by a self-organizing neural network. Computer 21(3):77–88
Choi SB, Jung BS, Ban SW, Niitsuma H, Lee M (2006) Biologically motivated vergence control system using human-like selective attention model. Neurocomputing 69:537–558
Davis SB, Mermelstein P (1980) Comparison of parametric representations for mono-syllabic word recognition in continuously spoken sentences. IEEE Trans Acoust Speech Signal Process 28(4):357–366
Goldstein EB (1996) Sensation and perception, 4th edn. An International Thomson Publishing Company, Mexico
Hall P, Martin R (1998) Incremental eigenanalysis for classification. In: Proceedings of British machine vision conference, vol 1, pp 286–295
Harris S (2008) All-in-one CISSP exam guide, 4th edn. McGraw-Hill, pp 184–185
Hasan R, Jamil M, Rabbani G, Rahman S (2004) Speaker identification using mel frequency cepstral coefficients. In: Proceedings of 3rd international conference on electrical and computer engineering (ICECE), pp 565–568
Haykin S (1999) Neural networks—a comprehensive foundation, 2nd edn. Prentice Hall, Englewood Cliffs
Iglesias JA, Angelov P, Ledezma A, Sanchis A (2010) Evolving classification of agent’s behaviors: a general approach. Evol Syst 1(3):161–172
Jeong S, Ban SW, Lee M (2008) Stereo saliency map considering affective factors and selective motion analysis in a dynamic environment. Neural Netw 21(10):1420–1430
Kasabov NK, Song Q (2002) DENFIS: Dynamic evolving neuralfuzzy inference system and its application for time-series prediction. IEEE Trans Fuzzy Syst 10(2):144–154
Kim B, Ban SW, Lee M (2008) Improving AdaBoost based face detection using face-color preferable selective attention. In: Intelligent data engineering and automated learning—IDEAL 2008, LNCS 5326. Springer, Berlin, pp 88–95
Matyás V, Ríha Z (2002) Biometric authentication—security and usability. In: Proceedings of IFIP TC6/TC11 sixth joint working conference on communications and multimedia security, pp 227–239
Otsu N (1979) A threshold selection method from gray-level histogram. IEEE Trans System Man Cybern 9(1):62–66
Ozawa S, Toh SL, Abe S, Pang S, Kasabov N (2005) Incremental learning of feature space and classifier for face recognition. Neural Netw 18(5–6):575–584
Ozawa S, Pang S, Kasabov N (2008) Incremental learning of chunk data for on-line pattern classification systems. IEEE Trans Neural Netw 19(6):1061–1074
Pacheco J, Rubio J, Guillen J (2009) Detection and following of a face in movement using a neural network. In: Advances in neural networks research—ISNN 2009, AISC 56. Springer, Berlin, pp 481–490
Park HM (2003) Adaptive filtering methods for acoustic noise reduction and noisy speech recognition. Doctor’s thesis, Department of Electrical Engineering and Computer Science, Division of Electrical Engineering, Korea Advanced Institute of Science and Technology
Platt J (1991) A resource-allocating network for function interpolation. Neural Comput 3(2):213–225
Rubio JJ (2009) SOFMLS: online self-organizing fuzzy modified least square network. IEEE Trans Fuzzy Syst 17(6):1296–1309
Rubio JJ, Pacheco J (2009) An stable online clustering fuzzy neural network for nonlinear systems identification. Neural Comput Appl 18(6):633–641
Rubio JJ, Vazquez DM, Pacheco J (2010) Backpropagation to train an evolving radial basis function neural network. Evol Syst 1(3):173–180
Viola P, Jones MJ (2004) Robust real-time face detection. Int J Comput Vision 57(2):137–154
Xu M, Duan L-Y, Cai J, Chia L-T, Xu C, Tian Q (2004) HMM-based audio keyword generation. In: Advances in multimedia information processing, LNCS, vol 3333. Springer, Berlin, pp 566–574
Acknowledgments
This research was supported by the Converging Research Center Program funded by the Ministry of Education, Science and Technology (2010K001130) (50%) and also the National Research Foundation of Korea (NRF) Grant (NRF-2010-616-D00096) (50%).
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Jang, YM., Lee, M. & Ozawa, S. A real-time personal authentication system based on incremental feature extraction and classification of audiovisual information. Evolving Systems 2, 261–272 (2011). https://doi.org/10.1007/s12530-011-9033-2
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12530-011-9033-2