Spoken language understanding using weakly supervised learning

In this paper, we present a weakly supervised learning approach for spoken language understanding in domain-specific dialogue systems. We model the task of spoken language understanding as a two-stage classification problem. Firstly, the topic classifier is used to identify the topic of an input utterance. Secondly, with the restriction of the recognized target topic, the slot classifiers are trained to extract the corresponding slot-value pairs. It is mainly data-driven and requires only minimally annotated corpus for training whilst retaining the understanding robustness and deepness for spoken language. More importantly, it allows that weakly supervised strategies are employed for training the two kinds of classifiers, which could significantly reduce the number of labeled sentences. We investigated active learning and naive self-training for the two kinds of classifiers. Also, we propose a practical method for bootstrapping topic-dependent slot classifiers from a small amount of labeled sentences. Experiments have been conducted in the context of the Chinese public transportation information inquiry domain and the English DARPA Communicator domain. The experimental results show the effectiveness of our proposed SLU framework and demonstrate the possibility to reduce human labeling efforts significantly.

[1]  Daphne Koller,et al.  Support Vector Machine Active Learning with Applications to Text Classification , 2000, J. Mach. Learn. Res..

[2]  David Yarowsky,et al.  DECISION LISTS FOR LEXICAL AMBIGUITY RESOLUTION: Application to Accent Restoration in Spanish and French , 1994, ACL.

[3]  Andrew McCallum,et al.  Employing EM and Pool-Based Active Learning for Text Classification , 1998, ICML.

[4]  Sadaoki Furui,et al.  Combination of finite state automata and neural network for spoken language understanding , 2003, INTERSPEECH.

[5]  Brendan J. Frey,et al.  Combination of statistical and rule-based approaches for spoken language understanding , 2002, INTERSPEECH.

[6]  Helen M. Meng,et al.  Semiautomatic Acquisition of Semantic Structures for Understanding Domain-Specific Natural Language Queries , 2002, IEEE Trans. Knowl. Data Eng..

[7]  Marilyn A. Walker,et al.  PARADISE: A Framework for Evaluating Spoken Dialogue Agents , 1997, ACL.

[8]  Andreas Stolcke,et al.  Inducing Probabilistic Grammars by Bayesian Model Merging , 1994, ICGI.

[9]  Alex Acero,et al.  Combining Statistical and Knowledge-Based Spoken Language Understanding in Conditional Models , 2006, ACL.

[10]  Ye-Yi Wang,et al.  Grammar learning for spoken language understanding , 2001, IEEE Workshop on Automatic Speech Recognition and Understanding, 2001. ASRU '01..

[11]  Eugene Charniak Natural language learning , 1995, CSUR.

[12]  Douglas E. Appelt,et al.  GEMINI: A Natural Language System for Spoken-Language Understanding , 1993, ACL.

[13]  Sheri Hunnicutt,et al.  An experimental dialog system: WAXHOLM , 1993 .

[14]  Stephanie Seneff,et al.  TINA: A Natural Language System for Spoken Language Applications , 1992, Comput. Linguistics.

[15]  Alexander H. Waibel,et al.  Modeling with Structures in Statistical Machine translation , 1998, ACL.

[16]  James R. Curran,et al.  Bootstrapping POS-taggers using unlabelled data , 2003, CoNLL.

[17]  Richard M. Schwartz,et al.  Hidden Understanding Models of Natural Language , 1994, ACL.

[18]  Giuseppe Riccardi,et al.  How may I help you? , 1997, Speech Commun..

[19]  Lawton Hikwa Information and Knowledge Management and Access to Information , 2006 .

[20]  Chin-Hui Lee,et al.  Metrics for measuring domain independence of semantic classes , 2001, INTERSPEECH.

[21]  Bob Carpenter,et al.  Natural language call routing: a robust, self-organizing approach , 1998, ICSLP.

[22]  Victor Zue,et al.  JUPlTER: a telephone-based conversational interface for weather information , 2000, IEEE Trans. Speech Audio Process..

[23]  Rayid Ghani,et al.  Analyzing the effectiveness and applicability of co-training , 2000, CIKM '00.

[24]  R. Rivest Learning Decision Lists , 1987, Machine Learning.

[25]  Greg Schohn,et al.  Less is More: Active Learning with Support Vector Machines , 2000, ICML.

[26]  David Yarowsky,et al.  Unsupervised Word Sense Disambiguation Rivaling Supervised Methods , 1995, ACL.

[27]  Anoop Sarkar,et al.  Applying Co-Training Methods to Statistical Parsing , 2001, NAACL.

[28]  Sheri Hunnicutt,et al.  An experimental dialogue system: waxholm , 1993, EUROSPEECH.

[29]  Daphne Koller,et al.  Support Vector Machine Active Learning with Application sto Text Classification , 2000, ICML.

[30]  Craig A. Knoblock,et al.  Active + Semi-supervised Learning = Robust Multi-View Learning , 2002, ICML.

[31]  Steve J. Young,et al.  Semantic processing using the Hidden Vector State model , 2005, Comput. Speech Lang..

[32]  David A. Cohn,et al.  Improving generalization with active learning , 1994, Machine Learning.

[33]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[34]  Andrew R. Golding,et al.  A Bayesian Hybrid Method for Context-sensitive Spelling Correction , 1996, VLC@ACL.

[35]  Ye-Yi Wang A robust parser for spoken language understanding , 1999, EUROSPEECH.

[36]  Wolfgang Minker,et al.  A stochastic case frame approach for natural language understanding , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[37]  Roberto Pieraccini,et al.  A Learning Approach to Natural Language Understanding , 1994, ArXiv.

[38]  Gökhan Tür,et al.  The AT&T spoken language understanding system , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[39]  Taylor L. Booth,et al.  Grammatical Inference: Introduction and Survey-Part I , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[40]  Steven P. Abney,et al.  Bootstrapping , 2002, ACL.

[41]  Min Tang,et al.  Active Learning for Statistical Natural Language Parsing , 2002, ACL.

[42]  Jean-Luc Gauvain,et al.  Field trials of a telephone service for rail travel information , 1996, Proceedings of IVTTA '96. Workshop on Interactive Voice Technology for Telecommunications Applications.

[43]  Srinivas Bangalore,et al.  Combining prior knowledge and boosting for call classification in spoken language dialogue , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[44]  Mark E. Epstein,et al.  Fertility Models for Statistical Natural Language Understanding , 1997, ACL.

[45]  Yoram Singer,et al.  Unsupervised Models for Named Entity Classification , 1999, EMNLP.

[46]  Feng Gao,et al.  A spoken language understanding approach using successive learners , 2006, INTERSPEECH.

[47]  Hermann Ney,et al.  Natural language understanding using statistical machine translation , 2001, INTERSPEECH.

[48]  Liang Gu,et al.  Portability challenges in developing interactive dialogue systems , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[49]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[50]  Jean-Luc Gauvain,et al.  The LIMSI RailTel System: Field trial of a telephone service for rail travel information , 1997, Speech Commun..

[51]  Wayne H. Ward,et al.  Recent Improvements in the CMU Spoken Language Understanding System , 1994, HLT.

[52]  Steve Young,et al.  A data-driven spoken language understanding system , 2003, 2003 IEEE Workshop on Automatic Speech Recognition and Understanding (IEEE Cat. No.03EX721).

[53]  Avrim Blum,et al.  The Bottleneck , 2021, Monopsony Capitalism.

[54]  Renato De Mori,et al.  The Application of Semantic Classification Trees to Natural Language Understanding , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[55]  Ru-Zhan Lu,et al.  Embedded machine learning systems for robust spoken language parsing , 2005, 2005 International Conference on Natural Language Processing and Knowledge Engineering.

[56]  Gökhan Tür,et al.  Combining active and semi-supervised learning for spoken language understanding , 2005, Speech Commun..

[57]  Feng Gao,et al.  A Weakly Supervised Learning Approach for Spoken Language Understanding , 2006, EMNLP.

[58]  Alex Acero,et al.  Spoken Language Understanding "” An Introduction to the Statistical Framework , 2005 .

[59]  Chung Hee Hwang,et al.  The TRAINS project: a case study in building a conversational planning agent , 1994, J. Exp. Theor. Artif. Intell..

[60]  P. J. Price,et al.  Evaluation of Spoken Language Systems: the ATIS Domain , 1990, HLT.

[61]  Rada Mihalcea,et al.  Co-training and Self-training for Word Sense Disambiguation , 2004, CoNLL.