Considering the subjectivity to rationalise evaluation approaches: The example of Spoken Dialogue Systems

In this paper we present a reading grid that helps evaluators take the subjectivity factor into account when designing evaluation protocols. Although most contributions to Spoken Dialogue Systems evaluation tend to objectify their approach for the sake of rationalisation, we believe that subjectivity must be considered for evaluations to be valuable. The first section shows how closely evaluation processes depend on their contexts and on the evaluators' perspectives. We then present an anthropocentric framework that establishes the evaluator as a mediator between contextual elements and a rationalising corpus of evaluation procedures. Finally, we anticipate the benefits our framework brings at both the individual and community levels.
