论文信息 - Detecting Missing Hyphens in Learner Text

Detecting Missing Hyphens in Learner Text

We present a method for automatically detecting missing hyphens in English text. Our method goes beyond a purely dictionary-based approach and also takes context into account. We evaluate our model on artificially generated data as well as naturally occurring learner text. Our best-performing model achieves high precision and reasonable recall, making it suitable for inclusion in a system that gives feedback to language learners.

Nitin Madnani | Martin Chodorow | Aoife Cahill | Susanne Wolff

[1] Michiel Bacchiani,et al. Restoring punctuation and capitalization in transcribed speech , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[2] Dan Roth,et al. University of Illinois System in HOO Text Correction Shared Task , 2011, ENLG.

[3] Helen Yannakoudakis,et al. A New Dataset and Method for Automatically Grading ESOL Texts , 2011, ACL.

[4] Nitin Madnani,et al. Robust Systems for Preposition Error Correction Using Wikipedia Revisions , 2013, NAACL.

[5] Martin Chodorow,et al. Correcting Comma Errors in Learner Essays, and Restoring Commas in Newswire Text , 2012, NAACL.

[6] Adam Kilgarriff,et al. Helping Our Own: The HOO 2011 Pilot Shared Task , 2011, ENLG.