Dialog Annotation for Stochastic Generation

Individuals who successfully make their livelihood by talking with others, for example travel agents, can be presumed to have optimized their language for the task at hand in terms of conciseness and intelligibility. It makes sense to exploit this effort for the purpose of building better generation components for a spoken dialog system. The Stochastic Generation technique, introduced by Oh and Rudnicky (2002), is one such approach. In this approach, utterances in a corpus of domain expert utterances are classified as to speech act and individual concepts tagged. Statistical n-gram models are built for each speech-act class then used generatively to create novel utterances. These have been shown to be comparable in quality to human productions. The class and tag scheme is concrete and closely tied to the domain at hand; we believe this produces a distinct advantage in speed of implementation and quality of results. The current paper describes the classification and tagging procedures used for Stochastic Generation, and discusses the advantages and limitations of the techniques.

[1]  Giuseppe Riccardi,et al.  How may I help you? , 1997, Speech Commun..

[2]  Alexander I. Rudnicky,et al.  Creating natural dialogs in the carnegie mellon communicator system , 1999, EUROSPEECH.

[3]  Marilyn A. Walker,et al.  SPoT: A Trainable Sentence Planner , 2001, NAACL.

[4]  Stephanie Seneff,et al.  GENESIS-II: a versatile system for language generation in conversational system applications , 2000, INTERSPEECH.

[5]  Ronald Rosenfeld,et al.  Statistical language modeling using the CMU-cambridge toolkit , 1997, EUROSPEECH.

[6]  Alexander I. Rudnicky,et al.  Stochastic natural language generation for spoken dialog systems , 2002, Comput. Speech Lang..

[7]  Frederick Jelinek,et al.  Statistical methods for speech recognition , 1997 .

[8]  Allen L. Gorin,et al.  User Interface Issues for Natural Spoken Dialog Systems , 1998 .

[9]  Alexander I. Rudnicky,et al.  The carnegie mellon communicator corpus , 2002, INTERSPEECH.

[10]  Helmut Horacek,et al.  A Flexible Shallow Approach to Text Generation , 1998, INLG.

[11]  Victor Zue,et al.  JUPlTER: a telephone-based conversational interface for weather information , 2000, IEEE Trans. Speech Audio Process..

[12]  Scott Axelrod Natural Language Generation in the IBM Flight Information System , 2000 .

[13]  Alexander I. Rudnicky,et al.  Task-based dialog management using an agenda , 2000 .

[14]  Kevin Knight,et al.  The Practical Value of N-Grams Is in Generation , 1998, INLG.

[15]  Yoram Singer,et al.  Unsupervised Models for Named Entity Classification , 1999, EMNLP.

[16]  Adwait Ratnaparkhi,et al.  Trainable Methods for Surface Natural Language Generation , 2000, ANLP.

[17]  Michael Elhadad,et al.  An Overview of SURGE: a Reusable Comprehensive Syntactic Realization Component , 1996, INLG.

[18]  Jacqueline C. Kowtko,et al.  Data Collection and Analysis in the Air Travel Planning Domain , 1989, HLT.

[19]  Amanda Stent,et al.  Content planning and generation in continuous-speech spoken dialog systems∗ , 2000 .

[20]  Ehud Reiter,et al.  Knowledge Acquisition for Natural Language Generation , 2000, INLG.

[21]  Maxine Eskénazi,et al.  Data collection and processing in the carnegie mellon communicator , 1999, EUROSPEECH.