Deeper Spoken Language Understanding for Man-Machine Dialogue on Broader Application Domains: A Logical Alternative to Concept Spotting

LOGUS is a French-speaking spoken language understanding (SLU) system which carries out a deeper analysis than those achieved by standard concept spotters. It is designed for multi-domain conversational systems or for systems that are working on complex application domains. Based on a logical approach, the system adapts the ideas of incremental robust parsing to the issue of SLU. The paper provides a detailed description of the system as well as results from two evaluation campaigns that concerned all of current French-speaking SLU systems. The observed error rates suggest that our logical approach can stand comparison with concept spotters on restricted application domains, but also that its behaviour is promising for larger domains. The question of the generality of the approach is precisely addressed by our current investigations on a new task: SLU for an emotional robot companion for young hospital patents.

[1]  M. McShane,et al.  Semantics-based resolution of fragments and underspecified structures , 2005 .

[2]  Robert C. Berwick,et al.  Principle-Based Parsing , 1987 .

[3]  Daniel Vanderveken,et al.  Chapter 2. Universal Grammar and Speech Act Theory , 2001 .

[4]  Klaus Zechner Automatic Construction of Frame Representations for Spontaneous Speech in Unrestricted Domains , 1998, COLING-ACL.

[5]  Steven Greenberg,et al.  LINGUISTIC DISSECTION OF SWITCHBOARD-CORPUS AUTOMATIC SPEECH RECOGNITION SYSTEMS , 2000 .

[6]  Jean-Yves Antoine,et al.  Predictive and objective evaluation of speech understanding: the "challenge" evaluation campaign of the I3 speech workgroup of the French CNRS , 2002, LREC.

[7]  James F. Allen,et al.  Generic Parsing for Multi-Domain Semantic Interpretation , 2005, IWPT.

[8]  J. Austin How to do things with words , 1962 .

[9]  Csr Young,et al.  How to Do Things With Words , 2009 .

[10]  David McKelvie,et al.  The Syntax of Disfluency in Spontaneous Spoken Language , 1998 .

[11]  Julia Hirschberg,et al.  Error handling in spoken dialogue systems , 2005, Speech Commun..

[12]  Victor Zue,et al.  JUPlTER: a telephone-based conversational interface for weather information , 2000, IEEE Trans. Speech Audio Process..

[13]  Lori Lamel,et al.  Challenges in real-life emotion annotation and machine learning based detection , 2005, Neural Networks.

[14]  Daniel Vanderveken,et al.  UNIVERSAL GRAMMAR AND SPEECH ACT THEORY , 2001 .

[15]  Gökhan Tür,et al.  Introduction to the Special Issue on Spoken Language Understanding in Conversational Systems , 2005, Speech Commun..

[16]  Steven Abney,et al.  Parsing By Chunks , 1991 .

[17]  Frédéric Béchet,et al.  The French MEDIA/EVALDA Project: the Evaluation of the Understanding Capability of Spoken Language Dialogue Systems , 2004, LREC.

[18]  Roberto Basili,et al.  Parsing engineering and empirical robustness , 2002, Natural Language Engineering.

[19]  James F. Allen,et al.  Improving Robustness by Modeling Spontaneous Speech Events , 2001 .

[20]  Jean-Pierre Chanod,et al.  Robustness beyond shallowness: incremental deep parsing , 2002, Natural Language Engineering.

[21]  James F. Allen,et al.  Human-Machine Collaborative Planning , 2002 .

[22]  Alex Waibel,et al.  Stochastically-Based Semantic Analysis , 1999 .

[23]  Mark-Jan Nederhof,et al.  Robust grammatical analysis for spoken dialogue systems , 1999, Natural Language Engineering.