Towards shallow grammar induction for an adaptive assistive vocal interface : a concept tagging approach

This paper describes research within the ALADIN project, which aims to develop an adaptive, assistive vocal interface for people with a physical impairment. One of the components in this interface is a self-learning grammar module, which maps a user’s utterance to its intended meaning. This paper describes a case study of the learnability of this task on the basis of a corpus of commands for the card game patience. The collection, transcription and annotation of this corpus is outlined in this paper, followed by results of preliminary experiments using a shallow concept-tagging approach. Encouraging results are observed during learning curve experiments, that gauge the minimal amount of training data needed to trigger accurate concept tagging of previously unseen utterances.

[1]  Christopher D. Manning,et al.  The unsupervised learning of natural language structure , 2005 .

[2]  C. de Marcken On the Unsupervised Induction of Phrase-Structure Grammars , 1999 .

[3]  Alex Acero,et al.  SGStudio: rapid semantic grammar development for spoken language understanding , 2005, INTERSPEECH.

[4]  Kallirroi Georgila,et al.  Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08) , 2008 .

[5]  Guy De Pauw Agent-Based Unsupervised Grammar Induction , 2005, EUMAS.

[6]  Walter Daelemans,et al.  Memory-based shallow parser for python , 2010 .

[7]  Jan Noyes,et al.  Speech recognition technology for individuals with disabilities , 1992 .

[8]  Jason Weston,et al.  Natural Language Processing (Almost) from Scratch , 2011, J. Mach. Learn. Res..

[9]  Walter Daelemans,et al.  MBT : Memory Based Tagger, version 1.0, Reference Guide , 2002 .

[10]  Bradford Starkie Programming Spoken Dialogs Using Grammatical Inference , 2001, Australian Joint Conference on Artificial Intelligence.

[11]  Walter Daelemans,et al.  Memory-Based Language Processing , 2009, Studies in natural language processing.

[12]  Alex Acero,et al.  Rapid development of spoken language understanding grammars , 2006, Speech Commun..

[13]  Hermann Ney,et al.  A Comparison of Various Methods for Concept Tagging for Spoken Language Understanding , 2008, LREC.

[14]  Mitchell P. Marcus,et al.  Text Chunking using Transformation-Based Learning , 1995, VLC@ACL.