A real-time personal authentication system based on incremental feature extraction and classification of audiovisual information

We propose a new approach to real-time personal authentication based on incrementally updated visual (face) and audio (voice) features of persons. The proposed system consists of real-time face detection, incremental audiovisual feature extraction, and an incremental neural classifier model with long-term memory. In the face detection part, a biologically motivated face-color preferable selective attention model first localizes face candidate regions in natural scenes, and an AdaBoost-based face detector then identifies human faces within the localized candidate regions. Mel-frequency cepstral coefficients are used as the vocal features of speakers. Incremental principal component analysis (IPCA) is used to reduce the dimensionality of the audiovisual features and to update them incrementally. The features extracted by IPCA are fed to a resource allocating network with long-term memory, which learns facial and vocal features incrementally and recognizes faces in real time. Experimental results show that the proposed system can improve its test performance incrementally without serious forgetting. In addition, the multi-modal (facial and vocal) feature effectively increases the robustness of the personal authentication system in noisy environments.
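To illustrate the idea of a classifier that allocates resources incrementally, the following is a minimal sketch of a resource-allocating network in the spirit of Platt's original formulation. It is not the authors' implementation: it omits the long-term memory, adapts only the output weights (not the centers), and the class name `TinyRAN`, its thresholds, and its learning rate are all hypothetical choices made for this example. Inputs stand in for the reduced feature vectors that IPCA would produce.

```python
import math


class TinyRAN:
    """Simplified resource-allocating network (illustrative only).

    Assumptions: Gaussian hidden units, one-hot targets, no long-term
    memory, and LMS updates on output weights only.
    """

    def __init__(self, n_classes, err_thresh=0.3, dist_thresh=0.5,
                 overlap=0.8, lr=0.1):
        self.n_classes = n_classes
        self.err_thresh = err_thresh    # minimum output error to count as novel
        self.dist_thresh = dist_thresh  # minimum distance to the nearest center
        self.overlap = overlap          # width factor for newly allocated units
        self.lr = lr                    # LMS learning rate
        self.centers, self.widths, self.weights = [], [], []

    def _activations(self, x):
        # Gaussian response of every hidden unit to input x.
        return [math.exp(-sum((a - b) ** 2 for a, b in zip(x, c)) / (w * w))
                for c, w in zip(self.centers, self.widths)]

    def predict(self, x):
        acts = self._activations(x)
        return [sum(a * wt[k] for a, wt in zip(acts, self.weights))
                for k in range(self.n_classes)]

    def classify(self, x):
        out = self.predict(x)
        return max(range(self.n_classes), key=out.__getitem__)

    def learn(self, x, label):
        target = [1.0 if k == label else 0.0 for k in range(self.n_classes)]
        err = [t - o for t, o in zip(target, self.predict(x))]
        err_norm = math.sqrt(sum(e * e for e in err))
        nearest = min((math.dist(x, c) for c in self.centers),
                      default=float("inf"))
        if err_norm > self.err_thresh and nearest > self.dist_thresh:
            # Novel sample: allocate a unit centered on x whose output
            # weights cancel the current error at x.
            width = self.overlap * (nearest if math.isfinite(nearest) else 1.0)
            self.centers.append(list(x))
            self.widths.append(width)
            self.weights.append(err)
        else:
            # Familiar sample: small LMS correction of existing weights.
            for a, wt in zip(self._activations(x), self.weights):
                for k in range(self.n_classes):
                    wt[k] += self.lr * err[k] * a
```

In this sketch, a sample that is both poorly predicted and far from every existing center triggers the allocation of a new hidden unit, while any other sample only nudges the existing output weights. The network can therefore grow as new persons are presented, without retraining from scratch.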
