Predicting Sentences using N-Gram Language Models

We explore the benefit that users in several application areas can experience from a "tab-complete" editing assistance function. We develop an evaluation metric and adapt N-gram language models to the problem of predicting the subsequent words, given an initial text fragment. Using an instance-based method as baseline, we empirically study the predictability of call-center emails, personal emails, weather reports, and cooking recipes.

[1]  Robert L. Mercer,et al.  An Estimate of an Upper Bound for the Entropy of English , 1992, CL.

[2]  Ian H. Witten,et al.  The Reactive Keyboard , 1992 .

[3]  Hendrik Blockeel,et al.  The Learning Shell: Automated Macro Construction , 2001, User Modeling.

[4]  Nico Jacobs,et al.  The Learning Shell , 2000, AAAI 2000.

[5]  Russell Greiner,et al.  Predicting UNIX Command Lines: Adjusting to User Patterns , 2000, AAAI/IAAI.

[6]  George F. Foster,et al.  Adaptive Language and Translation Models for Interactive Machine Translation , 2004, EMNLP.

[7]  Philippe Langlais,et al.  Trans Type: Development-Evaluation Cycles to Boost Translator's Productivity , 2002, Machine Translation.

[8]  Claude E. Shannon,et al.  Prediction and Entropy of Printed English , 1951 .

[9]  Tina Magnuson,et al.  Measuring the effectiveness of word prediction: The advantage of long-term use , 2002 .

[10]  Philippe Langlais,et al.  Translators at work with TRANSTYPE: Resource and Evaluation , 2002, LREC.

[11]  George F. Foster,et al.  User-Friendly Text Prediction For Translators , 2002, EMNLP.

[12]  Yiming Yang,et al.  The Enron Corpus: A New Dataset for Email Classi(cid:12)cation Research , 2004 .

[13]  Tobias Scheffer,et al.  Sentence Completion , 1921, SIGIR '04.

[14]  Brian D. Davison,et al.  Predicting Sequences of User Actions , 1998 .

[15]  Matjaz Debevc,et al.  An adaptive short list for documents on the World Wide Web , 1997, IUI '97.

[16]  George F. Foster,et al.  Unit Completion for a Computer-aided Translation Typing System , 2004, Machine Translation.

[17]  Hendrik Blockeel,et al.  Sequence prediction with mixed order Markov chains , 2002 .

[18]  Hiroshi Motoda,et al.  Machine Learning Techniques to Make Computers Easier to Use , 1997, IJCAI.