A keyword selection strategy for dialogue move recognition and multi-class topic identification

The concept of usefulness for keyword selection in topic identification problems is reformulated and extended to the multi-class domain. The derivation is shown to be a generalisation of that for the two class problem. The technique is applied to both multinomial and Poisson based estimates of word probability, and shown to outperform or compare favourably to various information theoretic techniques classifying dialogue moves in the map task corpus, and reports in the LOB corpus.

[1]  Hermann Ney,et al.  On the Estimation of 'Small' Probabilities by Leaving-One-Out , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  Herbert Gish,et al.  Approaches to topic identification on the switchboard corpus , 1994, Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing.

[3]  Roger K. Moore,et al.  A theory of word frequencies and its application to dialogue move recognition , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[4]  Michael J. Carey,et al.  Discriminative phonemes for speaker identification , 1994, ICSLP.

[5]  Roger K. Moore,et al.  The application of dynamic programming techniques to non-word based topic spotting , 1995, EUROSPEECH.

[6]  NeyHermann,et al.  On the Estimation of 'Small' Probabilities by Leaving-One-Out , 1995 .

[7]  A. Gorin On automated language acquisition , 1989 .

[8]  R. Gallager Information Theory and Reliable Communication , 1968 .

[9]  Anne H. Anderson,et al.  The Hcrc Map Task Corpus , 1991 .