FDVQ based keyword spotter which incorporates a semi-supervised learning for primary processing

In this paper, we present a novel hybrid keyword spotting system that combines supervised and semi-supervised competitive learning algorithms. The first stage is an S-SOM (Semi-supervised Self-Organizing Map) module, which is specifically designed to discriminate between keywords (KWs) and non-keywords (NKWs). The second stage is an FDVQ (Fuzzy Dynamic Vector Quantization) module, which discriminates among the KWs detected by the first stage. Experiments on the Switchboard database show an improvement of about 6% in system accuracy compared with our previous best keyword spotter.
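The two-stage structure described above can be illustrated with a minimal sketch. It is not the S-SOM or FDVQ algorithm from the paper: the first stage is replaced here by a plain nearest-prototype gate, and the second by fuzzy c-means style memberships over one prototype per keyword. The function names, the fuzziness exponent `m`, the toy prototypes, and the assumption that each utterance is already reduced to a fixed-length feature vector are all hypothetical.

```python
import math

def nearest_label(x, prototypes):
    """Stage 1 stand-in: label ("KW" or "NKW") of the closest prototype."""
    vector, label = min(prototypes, key=lambda p: math.dist(x, p[0]))
    return label

def fuzzy_memberships(x, keyword_prototypes, m=2.0):
    """Stage 2 stand-in: fuzzy c-means style membership of x in each keyword.

    keyword_prototypes maps a keyword to a single prototype vector
    (an illustrative simplification).
    """
    dists = {kw: math.dist(x, vec) for kw, vec in keyword_prototypes.items()}
    # An exact match on a prototype gets full membership.
    for kw, d in dists.items():
        if d == 0.0:
            return {k: (1.0 if k == kw else 0.0) for k in dists}
    exponent = 2.0 / (m - 1.0)
    inverse = {kw: d ** -exponent for kw, d in dists.items()}
    total = sum(inverse.values())
    return {kw: v / total for kw, v in inverse.items()}

def spot(x, gate_prototypes, keyword_prototypes):
    """Return the detected keyword, or None if stage 1 rejects x as NKW."""
    if nearest_label(x, gate_prototypes) != "KW":
        return None
    memberships = fuzzy_memberships(x, keyword_prototypes)
    return max(memberships, key=memberships.get)

# Toy 2-D prototypes, invented purely for illustration.
gate = [((0.0, 0.0), "NKW"), ((5.0, 5.0), "KW"), ((5.0, -5.0), "KW")]
keywords = {"yes": (5.0, 5.0), "no": (5.0, -5.0)}

print(spot((4.5, 4.0), gate, keywords))
print(spot((0.5, -0.2), gate, keywords))
```

The point of the cascade is that the second stage only ever sees inputs the first stage has accepted as keywords, so it can specialize in separating keywords from one another rather than also modeling non-keyword speech.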
