Secured vocal access to telephone servers

A number of applications of man-machine interaction over the telephone requires a combination of speech recognition and speaker verification. This paper describes work carried out at IDIAP in the framework of national and European projects. A generic interactive voice server (IVS) is described by means of a graphical formalism. It includes speech recognition based on speaker independent flexible vocabulary technology and speaker verification performed by a number of techniques executed in parallel, and combined for optimal decision making.

[1]  S. Chiba,et al.  Dynamic programming algorithm optimization for spoken word recognition , 1978 .

[2]  Yevgeny Ludovik,et al.  Intelligent answering machine-secretary , 1995, EUROSPEECH.

[3]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[4]  Frédéric Bimbot,et al.  Variable-length sequence matching for phonetic transcription using joint multigrams , 1995, EUROSPEECH.

[5]  Gérard Chollet,et al.  Swiss French PolyPhone and PolyVar: telephone speech databases to model inter- and intra-speaker variability , 1996 .

[6]  Yasuhisa Niimi,et al.  Modeling dialogue control strategies to relieve speech recognition errors , 1995, EUROSPEECH.

[7]  Gérard Chollet,et al.  Combining methods to improve speaker verification decision , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[8]  Aaron E. Rosenberg,et al.  Connected word talker verification using whole word hidden Markov models , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[9]  Gérard Chollet,et al.  Swiss PolyPhone and PolyVar: Building Databases for Speech Recognition and Speaker Verification , 1996 .

[10]  B. L. Zeigler,et al.  Dialog design for a speech-interactive automation system , 1994, Proceedings of 2nd IEEE Workshop on Interactive Voice Technology for Telecommunications Applications.

[11]  Gérard Chollet,et al.  Assessment of speaker verification systems , 1995 .

[12]  Ivan Magrin-Chagnolleau,et al.  Second-order statistical measures for text-independent speaker identification , 1995, Speech Commun..