A Methodology for Evaluating Spoken Language Dialogue Systems and Their Components

As spoken language dialogue systems (SLDSs) proliferate in the market place, the issue of SLDS evaluation has come to attract wide interest from research and industry alike. Yet it is only recently that spoken dialogue engineering researchers have come to face SLDSs evaluation in its full complexity. This paper presents results of the European DISC project concerning technical evaluation and usability evaluation of SLDSs and their components. The paper presents a methodology for complete and correct evaluation of SLDSs and components together with a generic evaluation template for describing the evaluation criteria needed.

[1]  Niels Ole Bernsen,et al.  Designing Interactive Speech Systems , 1998, Springer London.

[2]  N. M. Fraser,et al.  Call routing by name recognition: field trial results for the Operetta/sup TM/ system , 1996, Proceedings of IVTTA '96. Workshop on Interactive Voice Technology for Telecommunications Applications.

[3]  Lou Boves,et al.  Overview of the ARISE project , 1999, EUROSPEECH.

[4]  Scott McGlashan,et al.  Units of dialogue management: an example , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[5]  Laila Dybkjær,et al.  The disc approach to spoken language systems development and evaluation , 1998 .

[6]  Niels Ole Bernsen,et al.  Designing interactive speech systems - from first ideas to user testing , 1998 .