Person authentication using face, teeth and voice modalities for mobile device security

In this paper, we propose an enhanced multimodal personal authentication system for mobile device security. The proposed approach fuses information obtained from face, teeth and voice modalities to improve performance. To integrate three modalities, we employ various fusion techniques such as the weighted-summation rule, K-NN, Fisher and Gaussian classifiers, and we then evaluate the authentication performance of the proposed system. The performance is evaluated on a database consisting of 1000 biometric traits that correspond to the face, teeth and voice modalities of 50 persons, i.e., 20 biometric traits per individual, in which these biometric traits are simultaneously collected by a smart-phone device. The experiment results integrating the three modalities showed the error rates of 1.64%, 4.70%, 3.06% and 1.98% for the weighted-summation rule, K-NN, Fisher and Gaussian classifier, respectively, and that the weight-summation rule outperformed the other classification approaches. In contrast, the error rates regarding a single modality were 5.09%, 7.75% and 8.98% for face, teeth, and voice modalities, respectively. From these results, we confirmed that the proposed method achieved a significant performance improvement over the methods using a single modality, and the results showed that the proposed method was very effective through various fusion experiments.

[1]  K. Chamnongthai,et al.  Improved PCA-Based Personal Identification Method Using Invariance Moment , 2005, 2005 3rd International Conference on Intelligent Sensing and Information Processing.

[2]  Paul A. Viola,et al.  Robust Real-Time Face Detection , 2001, International Journal of Computer Vision.

[3]  Kwang-Seok Hong,et al.  Multimodal biometric authentication using teeth image and voice in mobile environment , 2008, IEEE Transactions on Consumer Electronics.

[4]  Arun Ross,et al.  Score normalization in multimodal biometric systems , 2005, Pattern Recognit..

[5]  Bor-Chen Kuo,et al.  Gaussian mixture classifier with regularized covariance estimator for hyperspectral data classification , 2003, IGARSS 2003. 2003 IEEE International Geoscience and Remote Sensing Symposium. Proceedings (IEEE Cat. No.03CH37477).

[6]  Peter E. Hart,et al.  Nearest neighbor pattern classification , 1967, IEEE Trans. Inf. Theory.

[7]  Arun Ross,et al.  An introduction to biometrics , 2008, ICPR 2008.

[8]  Douglas A. Reynolds,et al.  Speaker Verification Using Adapted Gaussian Mixture Models , 2000, Digit. Signal Process..

[9]  Arun Ross,et al.  Information fusion in biometrics , 2003, Pattern Recognit. Lett..

[10]  Tae-Kyung Cho,et al.  Teeth Image Recognition for Biometrics , 2006, IEICE Trans. Inf. Syst..

[11]  Kwang-Seok Hong,et al.  Teeth recognition based on multiple attempts in mobile device , 2010, J. Netw. Comput. Appl..

[12]  Keinosuke Fukunaga,et al.  Introduction to Statistical Pattern Recognition , 1972 .

[13]  Arun Ross,et al.  An introduction to biometric recognition , 2004, IEEE Transactions on Circuits and Systems for Video Technology.

[14]  Aggelos K. Katsaggelos,et al.  Audio-Visual Biometrics , 2006, Proceedings of the IEEE.

[15]  Oscar E. Agazzi,et al.  Keyword Spotting in Poorly Printed Documents using Pseudo 2-D Hidden Markov Models , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[16]  Sridha Sridharan,et al.  Adaptive Fusion of Speech and Lip Information for Robust Speaker Identification , 2001, Digit. Signal Process..

[17]  Monson H. Hayes,et al.  An embedded HMM-based approach for face detection and recognition , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[18]  Douglas D. O'Shaughnessy,et al.  Speech communication : human and machine , 1987 .

[19]  A. Murat Tekalp,et al.  Multimodal speaker identification using an adaptive classifier cascade based on modality reliability , 2005, IEEE Transactions on Multimedia.