Synthetic Data for English Lexical Normalization: How Close Can We Get to Manually Annotated Data?
暂无分享,去创建一个
[1] Chin-Hui Lee,et al. Tweet Normalization with Syllables , 2015, ACL.
[2] Yi Yang,et al. A Log-Linear Model for Unsupervised Text Normalization , 2013, EMNLP.
[3] Rob van der Goot. MoNoise: A Multi-lingual and Easy-to-use Lexical Normalization Tool , 2019, ACL.
[4] A. Cüneyd Tantug,et al. Normalizing Non-canonical Turkish Texts Using Machine Translation Approaches , 2019, ACL.
[5] Timothy Baldwin,et al. Automatically Constructing a Normalisation Dictionary for Microblogs , 2012, EMNLP.
[6] Timothy Baldwin,et al. Lexical Normalisation of Short Text Messages: Makn Sens a #twitter , 2011, ACL.
[7] Yang Liu,et al. Improving Text Normalization via Unsupervised Model and Discriminative Reranking , 2014, ACL.
[8] Candice Proudfoot,et al. An analysis of the relationship between writing skills and Short Messaging Service language : a self–regulatory perspective , 2011 .
[9] Carlos G'omez-Rodr'iguez,et al. Towards robust word embeddings for noisy texts , 2019, Applied Sciences.
[10] Gertjan van Noord,et al. A Taxonomy for In-depth Evaluation of Normalization for User Generated Content , 2018, LREC.
[11] Jeffrey Pennington,et al. GloVe: Global Vectors for Word Representation , 2014, EMNLP.
[12] Jacob Eisenstein,et al. What to do about bad language on the internet , 2013, NAACL.
[13] L. Venkata Subramaniam,et al. Unsupervised cleansing of noisy text , 2010, COLING.
[14] Jennifer Foster,et al. GenERRate: Generating Errors for Use in Grammatical Error Detection , 2009, BEA@NAACL.
[15] Walter Daelemans,et al. Multimodular Text Normalization of Dutch User-Generated Content , 2016, ACM Trans. Intell. Syst. Technol..
[16] van der Goot,et al. Normalization and parsing algorithms for uncertain input , 2019 .
[17] Ming Zhou,et al. Recognizing Named Entities in Tweets , 2011, ACL.
[18] Chris Dyer,et al. Part-of-Speech Tagging for Twitter : Word Clusters and Other Advances , 2012 .
[19] Gertjan van Noord,et al. MoNoise: Modeling Noise Using a Modular Normalization System , 2017, ArXiv.
[20] Eduard H. Hovy,et al. Unsupervised Mining of Lexical Variants from Noisy Text , 2011, ULNLP@EMNLP.
[21] Jeffrey Dean,et al. Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.
[22] Robert L. Mercer,et al. Class-Based n-gram Models of Natural Language , 1992, CL.
[23] Sebastian Riedel,et al. Wronging a Right: Generating Better Errors to Improve Grammatical Error Detection , 2018, EMNLP.