Char2Subword: Extending the Subword Embedding Space from Pre-trained Models Using Robust Character Compositionality
暂无分享,去创建一个
[1] Aditi Raghunathan,et al. Robust Encodings: A Framework for Combating Adversarial Typos , 2020, ACL.
[2] Mohammad Norouzi,et al. Dynamic Programming Encoding for Subword Segmentation in Neural Machine Translation , 2020, ACL.
[3] Thamar Solorio,et al. LinCE: A Centralized Benchmark for Linguistic Code-switching Evaluation , 2020, LREC.
[4] Doug Downey,et al. Don’t Stop Pretraining: Adapt Language Models to Domains and Tasks , 2020, ACL.
[5] Greg Durrett,et al. Byte Pair Encoding is Suboptimal for Language Model Pretraining , 2020, FINDINGS.
[6] T. Solorio,et al. From English to Code-Switching: Transfer Learning with Strong Morphological Clues , 2019, ACL.
[7] Lav R. Varshney,et al. CTRL: A Conditional Transformer Language Model for Controllable Generation , 2019, ArXiv.
[8] Omer Levy,et al. RoBERTa: A Robustly Optimized BERT Pretraining Approach , 2019, ArXiv.
[9] Omer Levy,et al. What Does BERT Look at? An Analysis of BERT’s Attention , 2019, BlackboxNLP@ACL.
[10] Roland Vollgraf,et al. Pooled Contextualized Embeddings for Named Entity Recognition , 2019, NAACL.
[11] Graham Neubig,et al. Multilingual Neural Machine Translation With Soft Decoupled Encoding , 2019, ICLR.
[12] Ankur Bapna,et al. Revisiting Character-Based Neural Machine Translation with Capacity and Compression , 2018, EMNLP.
[13] Roland Vollgraf,et al. Contextual String Embeddings for Sequence Labeling , 2018, COLING.
[14] Lukasz Kaiser,et al. Attention is All you Need , 2017, NIPS.
[15] Quoc V. Le,et al. Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation , 2016, ArXiv.
[16] Parminder Bhatia,et al. Morphological Priors for Probabilistic Neural Word Embeddings , 2016, EMNLP.
[17] Kevin Gimpel,et al. Bridging Nonlinearities and Stochastic Regularizers with Gaussian Error Linear Units , 2016, ArXiv.
[18] Yann LeCun,et al. Very Deep Convolutional Networks for Text Classification , 2016, EACL.
[19] Christopher D. Manning,et al. Achieving Open Vocabulary Neural Machine Translation with Hybrid Word-Character Models , 2016, ACL.
[20] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[21] Alexander M. Rush,et al. Character-Aware Neural Language Models , 2015, AAAI.
[22] Wang Ling,et al. Finding Function in Form: Compositional Character Models for Open Vocabulary Word Representation , 2015, EMNLP.
[23] Timothy Baldwin,et al. Shared Tasks of the 2015 Workshop on Noisy User-generated Text: Twitter Lexical Normalization and Named Entity Recognition , 2015, NUT@IJCNLP.
[24] Jeffrey Pennington,et al. GloVe: Global Vectors for Word Representation , 2014, EMNLP.
[25] Omer Levy,et al. word2vec Explained: deriving Mikolov et al.'s negative-sampling word-embedding method , 2014, ArXiv.
[26] Jeffrey Dean,et al. Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.
[27] Christopher D. Manning,et al. Better Word Representations with Recursive Neural Networks for Morphology , 2013, CoNLL.
[28] Jacob Eisenstein,et al. What to do about bad language on the internet , 2013, NAACL.
[29] Jacob Eisenstein,et al. Phonological Factors in Social Media Writing , 2013 .
[30] Graham Neubig,et al. Substring-based machine translation , 2013, Machine Translation.
[31] Ming-Wei Chang,et al. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.
[32] Alexander M. Fraser,et al. Target-side Word Segmentation Strategies for Neural Machine Translation , 2017, WMT.