Multimodal Authentication Using Asynchronous HMMs

It has often been shown that authenticating a person's identity with multiple modalities is more robust than relying on a single one. Various fusion techniques exist, most of them operating at the level of the output scores of each single-modality system. In this paper, we present a novel HMM architecture able to model the joint probability distribution of pairs of asynchronous sequences (such as speech and video streams) describing the same event. We show how this model can be used for audio-visual person authentication. Results on the M2VTS database show robust performance of the system under various audio noise conditions when compared to other state-of-the-art techniques.
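
As a rough illustration of what modelling the joint distribution of two asynchronous streams can look like (the abstract does not give the exact formulation, so the symbols below, an audio stream $x_1^T$, a slower video stream $y_1^S$, hidden states $q_t$, an alignment variable $\tau_t$ and a per-state emission probability $\epsilon_i$, are assumptions made for this sketch), one common asynchronous-HMM forward recursion defines
\[
\alpha(t, s, i) \;=\; p(x_1^t,\, y_1^s,\, \tau_t = s,\, q_t = i),
\]
computed as
\[
\alpha(t, s, i) \;=\; \epsilon_i\, p(x_t, y_s \mid q_t = i) \sum_j p(q_t = i \mid q_{t-1} = j)\, \alpha(t-1, s-1, j)
\;+\; (1-\epsilon_i)\, p(x_t \mid q_t = i) \sum_j p(q_t = i \mid q_{t-1} = j)\, \alpha(t-1, s, j),
\]
where $\tau_t$ is the index of the last video frame accounted for by time $t$, and the joint likelihood follows as $p(x_1^T, y_1^S) = \sum_i \alpha(T, S, i)$. The first term covers time steps at which the state emits both an audio and a video observation; the second covers steps at which only audio is emitted, which is how such a model absorbs the differing rates of the two streams.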