The Imagination of Crowds: Conversational AAC Language Modeling using Crowdsourcing and Large Data Sources

Augmented and alternative communication (AAC) devices enable users with certain communication disabilities to participate in everyday conversations. Such devices often rely on statistical language models to improve text entry by offering word predictions. These predictions can be improved if the language model is trained on data that closely reflects the style of the users' intended communications. Unfortunately, there is no large dataset consisting of genuine AAC messages. In this paper we demonstrate how we can crowd-source the creation of a large set of fictional AAC messages. We show that these messages model conversational AAC better than the currently used datasets based on telephone conversations or newswire text. We leverage our crowdsourced messages to intelligently select sentences from much larger sets of Twitter, blog and Usenet data. Compared to a model trained only on telephone transcripts, our best performing model reduced perplexity on three test sets of AAC-like communications by 60--82% relative. This translated to a potential keystroke savings in a predictive keyboard interface of 5--11%.

[1]  Andreas Stolcke,et al.  Web resources for language modeling in conversational speech recognition , 2007, TSLP.

[2]  Nestor Garay-Vitoria,et al.  Text prediction systems: a survey , 2006, Universal Access in the Information Society.

[3]  Sheri Hunnicutt Using syntactic and semantic information in a word prediction aid , 1989, EUROSPEECH.

[4]  D R Beukelman,et al.  Frequency of word occurrence in communication samples produced by adult communication aid users. , 1984, The Journal of speech and hearing disorders.

[5]  Andreas Stolcke,et al.  SRILM - an extensible language modeling toolkit , 2002, INTERSPEECH.

[6]  Alan Ritter,et al.  Unsupervised Modeling of Twitter Conversations , 2010, NAACL.

[7]  Constantine Stephanidis,et al.  Universal access in the information society , 1999, HCI.

[8]  Kathleen F. McCoy,et al.  Topic modeling in fringe word prediction for AAC , 2006, IUI '06.

[9]  Gregory W. Lesher DOMAIN-SPECIFIC WORD PREDICTION FOR AUGMENTATIVE COMMUNICATION , 2001 .

[10]  Hermann Ney,et al.  Improved backing-off for M-gram language modeling , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[11]  Andreas Stolcke,et al.  Entropy-based Pruning of Backoff Language Models , 2000, ArXiv.

[12]  Jianfeng Gao,et al.  Toward a unified approach to statistical language modeling for Chinese , 2002, TALIP.

[13]  Treebank Penn,et al.  Linguistic Data Consortium , 1999 .

[14]  Tonio Wandmacher,et al.  Methods to Integrate a Language Model with Semantic Information for a Word Prediction Component , 2007, EMNLP.

[15]  Akshay Java,et al.  The ICWSM 2009 Spinn3r Dataset , 2009 .

[16]  Keith Trnka Adaptive Language Modeling for Word Prediction , 2008, ACL.

[17]  John J. Godfrey,et al.  SWITCHBOARD: telephone speech corpus for research and development , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[18]  Jianhua Li,et al.  Semantic knowledge in word completion , 2005, Assets '05.

[19]  Kathleen F. McCoy,et al.  User Interaction with Word Prediction: The Effects of Prediction Quality , 2009, TACC.

[20]  Keh-Jiann Chen,et al.  Chinese language model adaptation based on document classification and multiple domain-specific language models , 1997, EUROSPEECH.

[21]  David J. Ward,et al.  Artificial intelligence: Fast hands-free writing by gaze direction , 2002, Nature.

[22]  F ChenStanley,et al.  An Empirical Study of Smoothing Techniques for Language Modeling , 1996, ACL.

[23]  Horabail S. Venkatagiri Efficient keyboard layouts for sequential access in augmentative and alternative communication , 1999 .

[24]  David J. Ward,et al.  Fast Hands-free Writing by Gaze Direction , 2002, ArXiv.

[25]  Nestor Garay-Vitoria,et al.  Intelligent word-prediction to enhance text input rate (a syntactic analysis-based word-prediction aid for people with severe motor and speech disability) , 1997, IUI '97.

[26]  Thorsten Brants,et al.  Study on interaction between entropy pruning and kneser-ney smoothing , 2010, INTERSPEECH.

[27]  William D. Lewis,et al.  Intelligent Selection of Language Model Training Data , 2010, ACL.

[28]  Tamara Broderick,et al.  Fast and Flexible Selection with a Single Switch , 2009, PloS one.

[29]  Tonio Wandmacher,et al.  Sibylle, An Assistive Communication System Adapting to the Context and Its User , 2008, TACC.