A taxonomy and review of generalization research in NLP

[1]  Yoav Goldberg,et al.  Measuring Causal Effects of Data Statistics on Language Model's 'Factual' Predictions , 2022, ArXiv.

[2]  Christophe Servan,et al.  On the cross-lingual transferability of multilingual prototypical models across NLU tasks , 2022, METANLP.

[3]  Shannon L. Spruit,et al.  No Language Left Behind: Scaling Human-Centered Machine Translation , 2022, ArXiv.

[4]  Ronan Le Bras,et al.  Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models , 2022, ArXiv.

[5]  Yulia Tsvetkov,et al.  ORCA: Interpreting Prompted Language Models via Locating Supporting Data Evidence in the Ocean of Pretraining Data , 2022, ArXiv.

[6]  Yusuke Oda,et al.  Are Prompt-based Models Clueless? , 2022, ACL.

[7]  M. Dascalu,et al.  Domain Adaptation in Multilingual and Multi-Domain Monolingual Settings for Complex Word Identification , 2022, ACL.

[8]  Tiago Pimentel,et al.  Naturalistic Causal Probing for Morpho-Syntax , 2022, TACL.

[9]  Xi Victoria Lin,et al.  OPT: Open Pre-trained Transformer Language Models , 2022, ArXiv.

[10]  Anders Søgaard,et al.  Generalized Quantifiers as a Source of Error in Multilingual NLU Benchmarks , 2022, DADC.

[11]  Arman Cohan,et al.  Improving the Generalizability of Depression Detection by Leveraging Clinical Questionnaires , 2022, ACL.

[12]  Jack G. M. FitzGerald,et al.  MASSIVE: A 1M-Example Multilingual Natural Language Understanding Dataset with 51 Typologically-Diverse Languages , 2022, ACL.

[13]  T. Poibeau,et al.  Does BERT really agree ? Fine-grained Analysis of Lexical Dependence on a Syntactic Task , 2022, FINDINGS.

[14]  Rakesh R Menon,et al.  CLUES: A Benchmark for Learning Classifiers using Natural Language Explanations , 2022, ACL.

[15]  Tianchuan Du,et al.  Towards Generalizeable Semantic Product Search by Text Similarity Pre-training on Search Click Logs , 2022, ECNLP.

[16]  Abed Alhakim Freihat,et al.  Using Linguistic Typology to Enrich Multilingual Lexicons: the Case of Lexical Gaps in Kinship , 2022, LREC.

[17]  G. Neumann,et al.  Few-Shot Cross-lingual Transfer for Coarse-grained De-identification of Code-Mixed Clinical Texts , 2022, BIONLP.

[18]  E. Mosca,et al.  “That Is a Suspicious Reaction!”: Interpreting Logits Variation to Detect NLP Adversarial Attacks , 2022, ACL.

[19]  Andrew M. Dai,et al.  PaLM: Scaling Language Modeling with Pathways , 2022, J. Mach. Learn. Res..

[20]  Lu Wang,et al.  Efficient Argument Structure Extraction with Transfer Learning and Active Learning , 2022, FINDINGS.

[21]  Lisa Anne Hendricks,et al.  Training Compute-Optimal Large Language Models , 2022, ArXiv.

[22]  Dilek Z. Hakkani-Tür,et al.  What is wrong with you?: Leveraging User Sentiment for Automatic Dialog Evaluation , 2022, FINDINGS.

[23]  Dipankar Das,et al.  Can Unsupervised Knowledge Transfer from Social Discussions Help Argument Mining? , 2022, ACL.

[24]  Alessandro Sordoni,et al.  Better Language Model with Hypernym Class Prediction , 2022, ACL.

[25]  Seong Jae Hwang,et al.  The Change that Matters in Discourse Parsing: Estimating the Impact of Domain Shift on Parser Error , 2022, FINDINGS.

[26]  Tal Linzen,et al.  Coloring the Blank Slate: Pre-training Imparts a Hierarchical Inductive Bias to Sequence-to-sequence Models , 2022, FINDINGS.

[27]  Reut Tsarfaty,et al.  Morphological Reinflection with Multiple Arguments: An Extended Annotation schema and a Georgian Case Study , 2022, ACL.

[28]  M. Shoeybi,et al.  Multi-Stage Prompting for Knowledgeable Dialogue Generation , 2022, FINDINGS.

[29]  Orhan Firat,et al.  Multilingual Mix: Example Interpolation Improves Multilingual Neural Machine Translation , 2022, ACL.

[30]  P. Blunsom,et al.  Revisiting the Compositional Generalization Abilities of Neural Sequence Models , 2022, ACL.

[31]  Shafiq R. Joty,et al.  Continual Few-shot Relation Learning via Embedding Space Regularization and Data Augmentation , 2022, ACL.

[32]  Tao Shen,et al.  ClarET: Pre-training a Correlation-Aware Context-To-Event Transformer for Event-Centric Generation and Classification , 2022, ACL.

[33]  Peter A. Cholak,et al.  Overcoming a Theoretical Limitation of Self-Attention , 2022, ACL.

[34]  Yixin Cao,et al.  Prompt for Extraction? PAIE: Prompting Argument Interaction for Event Argument Extraction , 2022, ACL.

[35]  Matt Gardner,et al.  Impact of Pretraining Term Frequencies on Few-Shot Reasoning , 2022, ArXiv.

[36]  Alexander M. Rush,et al.  PromptSource: An Integrated Development Environment and Repository for Natural Language Prompts , 2022, ACL.

[37]  Reza Yazdani Aminabadi,et al.  Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B, A Large-Scale Generative Language Model , 2022, ArXiv.

[38]  Zhilin Yang,et al.  ZeroPrompt: Scaling Prompt-Based Pretraining to 1, 000 Tasks Improves Zero-Shot Generalization , 2022, EMNLP.

[39]  Dragomir R. Radev,et al.  UnifiedSKG: Unifying and Multi-Tasking Structured Knowledge Grounding with Text-to-Text Language Models , 2022, EMNLP.

[40]  Yuri Burda,et al.  Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets , 2022, ArXiv.

[41]  Zoey Liu,et al.  Data-driven Model Generalizability in Crosslinguistic Low-resource Morphological Segmentation , 2022, TACL.

[42]  Xi Victoria Lin,et al.  Few-shot Learning with Multilingual Generative Language Models , 2021, EMNLP.

[43]  Xi Victoria Lin,et al.  Efficient Large Scale Language Modeling with Mixtures of Experts , 2021, EMNLP.

[44]  Po-Sen Huang,et al.  Scaling Language Models: Methods, Analysis & Insights from Training Gopher , 2021, ArXiv.

[45]  Jane A. Yu,et al.  Quantifying Adaptability in Pre-trained Language Models with 500 Tasks , 2021, NAACL.

[46]  Sanket Vaibhav Mehta,et al.  ExT5: Towards Extreme Multi-Task Scaling for Transfer Learning , 2021, ArXiv.

[47]  Edward Grefenstette,et al.  A Survey of Zero-shot Generalisation in Deep Reinforcement Learning , 2021, J. Artif. Intell. Res..

[48]  Dawn Song,et al.  Grounded Graph Decoding Improves Compositional Generalization in Question Answering , 2021, EMNLP.

[49]  Jacob Andreas,et al.  How Do Neural Sequence Models Generalize? Local and Global Cues for Out-of-Distribution Prediction , 2021, EMNLP.

[50]  Daniel Khashabi,et al.  Hey AI, Can You Solve Complex Tasks by Talking to Agents? , 2021, FINDINGS.

[51]  Sanket Vaibhav Mehta,et al.  Improving Compositional Generalization with Self-Training for Data-to-Text Generation , 2021, ACL.

[52]  H. Mobahi,et al.  Sharpness-Aware Minimization Improves Language Model Generalization , 2021, ACL.

[53]  Bin Ma,et al.  A Unified Speaker Adaptation Approach for ASR , 2021, EMNLP.

[54]  Phu Mon Htut,et al.  BBQ: A hand-built bias benchmark for question answering , 2021, FINDINGS.

[55]  Alexander M. Rush,et al.  Multitask Prompted Training Enables Zero-Shot Task Generalization , 2021, ICLR.

[56]  Greg Durrett,et al.  ASPECTNEWS: Aspect-Oriented Summarization of News Documents , 2021, ACL.

[57]  Alexander M. Fraser,et al.  Why don’t people use character-level machine translation? , 2021, FINDINGS.

[58]  Ashwin Srinivasan,et al.  Zero-Shot Dense Retrieval with Momentum Adversarial Domain Invariant Representations , 2021, FINDINGS.

[59]  Dzmitry Bahdanau,et al.  LAGr: Label Aligned Graphs for Better Systematic Generalization in Semantic Parsing , 2021, ACL.

[60]  Stanislas Dehaene,et al.  Causal Transformers Perform Below Chance on Recursive Nested Constructions, Unlike Humans , 2021, ArXiv.

[61]  Dzmitry Bahdanau,et al.  Compositional Generalization in Dependency Parsing , 2021, ACL.

[62]  Chenhao Tan,et al.  Investigating the Effect of Natural Language Explanations on Out-of-Distribution Generalization in Few-shot NLI , 2021, INSIGHTS.

[63]  Mirella Lapata,et al.  Disentangled Sequence to Sequence Learning for Compositional Generalization , 2021, ACL.

[64]  Zhengyuan Liu,et al.  DMRST: A Joint Framework for Document-Level Multilingual RST Discourse Segmentation and Parsing , 2021, CODI.

[65]  Zhengyuan Liu,et al.  Improving Multi-Party Dialogue Discourse Parsing via Domain Integration , 2021, CODI.

[66]  Mark O. Riedl,et al.  Situated Dialogue Learning through Procedural Environment Generation , 2021, ACL.

[67]  David Restrepo Amariles,et al.  JuriBERT: A Masked-Language Model Adaptation for French Legal Text , 2021, NLLP.

[68]  Aleksandr Drozd,et al.  Generalization in NLI: Ways (Not) To Go Beyond Simple Heuristics , 2021, INSIGHTS.

[69]  D. Katz,et al.  LexGLUE: A Benchmark Dataset for Legal Language Understanding in English , 2021, ACL.

[70]  Mohit Bansal,et al.  Inducing Transformer’s Compositional Generalization Ability via Auxiliary Sequence Prediction Tasks , 2021, EMNLP.

[71]  Dan Friedman,et al.  Single-dataset Experts for Multi-dataset Question Answering , 2021, EMNLP.

[72]  Kai-Wei Chang,et al.  Relation-Guided Pre-Training for Open-Domain Question Answering , 2021, EMNLP.

[73]  Kevin Gimpel,et al.  On Generalization in Coreference Resolution , 2021, CRAC.

[74]  Kazuma Hashimoto,et al.  RNG-KBQA: Generation Augmented Iterative Ranking for Knowledge Base Question Answering , 2021, Annual Meeting of the Association for Computational Linguistics.

[75]  Marco Luca Sbodio,et al.  Neural Unification for Logic Reasoning over Natural Language , 2021, EMNLP.

[76]  Sarkar Snigdha Sarathi Das,et al.  CONTaiNER: Few-Shot Named Entity Recognition via Contrastive Learning , 2021, ACL.

[77]  Ellie Pavlick,et al.  Frequency Effects on Syntactic Rule Learning in Transformers , 2021, EMNLP.

[78]  I. Augenstein,et al.  How Does Counterfactually Augmented Data Impact Models for Social Computing Constructs? , 2021, EMNLP.

[79]  Xianpei Han,et al.  Honey or Poison? Solving the Trigger Curse in Few-shot Event Detection via Causal Intervention , 2021, EMNLP.

[80]  Albert Y.S. Lam,et al.  Effectiveness of Pre-training for Few-shot Intent Classification , 2021, EMNLP.

[81]  Michael J.Q. Zhang,et al.  SituatedQA: Incorporating Extra-Linguistic Contexts into QA , 2021, EMNLP.

[82]  Songfang Huang,et al.  Raise a Child in Large Language Model: Towards Effective and Generalizable Fine-tuning , 2021, EMNLP.

[83]  Matthew Purver,et al.  Exploring Underexplored Limitations of Cross-Domain Text-to-SQL Generalization , 2021, EMNLP.

[84]  Nikhil Ramesh,et al.  Entity-Based Knowledge Conflicts in Question Answering , 2021, EMNLP.

[85]  Mari Ostendorf,et al.  DIALKI: Knowledge Identification in Conversational Systems through Dialogue-Document Contextualization , 2021, EMNLP.

[86]  Zhou Yu,et al.  Zero-Shot Dialogue State Tracking via Cross-Task Transfer , 2021, EMNLP.

[87]  Qi Zhang,et al.  Low-Resource Dialogue Summarization with Domain-Agnostic Multi-Source Pretraining , 2021, EMNLP.

[88]  Eric Nyberg,et al.  Exploring Strategies for Generalizable Commonsense Reasoning with Pre-trained Models , 2021, EMNLP.

[89]  Miguel Ballesteros,et al.  How much pretraining data do language models need to learn syntax? , 2021, EMNLP.

[90]  Jonathan Herzig,et al.  Finding needles in a haystack: Sampling Structurally-diverse Training Sets from Synthetic Data for Compositional Generalization , 2021, EMNLP.

[91]  M Saiful Bari,et al.  Nearest Neighbour Few-Shot Learning for Cross-lingual Classification , 2021, EMNLP.

[92]  Quoc V. Le,et al.  Finetuned Language Models Are Zero-Shot Learners , 2021, ICLR.

[93]  S. Riedel,et al.  Challenges in Generalization in Open Domain Question Answering , 2021, NAACL-HLT.

[94]  Xu Sun,et al.  Text AutoAugment: Learning Compositional Augmentation Policy for Text Classification , 2021, EMNLP.

[95]  Einat Minkov,et al.  Fight Fire with Fire: Fine-tuning Hate Detectors using Large Samples of Generated Hate Speech , 2021, EMNLP.

[96]  Jin Yong Yoo,et al.  Towards Improving Adversarial Training of NLP Models , 2021, EMNLP.

[97]  Peng Cui,et al.  Towards Out-Of-Distribution Generalization: A Survey , 2021, ArXiv.

[98]  Xiaoxi Mao,et al.  LOT: A Story-Centric Benchmark for Evaluating Chinese Long Text Understanding and Generation , 2021, TACL.

[99]  J. Schmidhuber,et al.  The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers , 2021, EMNLP.

[100]  Elia Bruni,et al.  The Paradox of the Compositionality of Natural Language: A Neural Machine Translation Case Study , 2021, ACL.

[101]  Reut Tsarfaty,et al.  (Un)solving Morphological Inflection: Lemma Overlap Artificially Inflates Models’ Performance , 2021, ACL.

[102]  J. Ainslie,et al.  Making Transformers Solve Compositional Tasks , 2021, ACL.

[103]  Luke Zettlemoyer,et al.  Noisy Channel Language Model Prompting for Few-Shot Text Classification , 2021, ACL.

[104]  Olivier Bonami,et al.  Not quite there yet: Combining analogical patterns and encoder-decoder networks for cognitively plausible inflection , 2021, Proceedings of the 18th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology.

[105]  Panagiotis Kouris,et al.  Abstractive Text Summarization: Enhancing Sequence-to-Sequence Models Using Word Sense Disambiguation and Semantic Content Generalization , 2021, CL.

[106]  Ramón Fernández Astudillo,et al.  Structural Guidance for Transformer Language Models , 2021, ACL.

[107]  Emmanuele Chersoni,et al.  Did the Cat Drink the Coffee? Challenging Transformers with Generalized Event Knowledge , 2021, STARSEM.

[108]  Y. Gal,et al.  Shifts: A Dataset of Real Distributional Shift Across Multiple Large-Scale Tasks , 2021, NeurIPS Datasets and Benchmarks.

[109]  Kyle Lo,et al.  FLEX: Unifying Evaluation for Few-Shot NLP , 2021, NeurIPS.

[110]  Mingyue Han,et al.  Doing Good or Doing Right? Exploring the Weakness of Commonsense Causal Reasoning Models , 2021, ACL.

[111]  E. Kharitonov,et al.  Can Transformers Jump Around Right in Natural Language? Assessing Performance Transfer from SCAN , 2021, BLACKBOXNLP.

[112]  He He,et al.  An Investigation of the (In)effectiveness of Counterfactually Augmented Data , 2021, ACL.

[113]  Hongxia Jin,et al.  Enhancing the generalization for Intent Classification and Out-of-Domain Detection in SLU , 2021, ACL.

[114]  Rifat Shahriyar,et al.  XL-Sum: Large-Scale Multilingual Abstractive Summarization for 44 Languages , 2021, FINDINGS.

[115]  Matthew Richardson,et al.  KaggleDBQA: Realistic Evaluation of Text-to-SQL Parsers , 2021, ACL.

[116]  Pradeep Ravikumar,et al.  Improving Compositional Generalization in Classification Tasks via Structure Annotations , 2021, ACL.

[117]  Vivek Srikumar,et al.  X-Fact: A New Benchmark Dataset for Multilingual Fact Checking , 2021, ACL.

[118]  Nurul Lubis,et al.  Domain-independent User Simulation with Transformers for Task-oriented Dialogue Systems , 2021, SIGDIAL.

[119]  Marco Baroni On the proper role of linguistically-oriented deep net analysis in linguistic theorizing , 2021, ArXiv.

[120]  Dilek Z. Hakkani-Tür,et al.  Generative Conversational Networks , 2021, SIGDIAL.

[121]  Dietrich Klakow,et al.  Modeling Profanity and Hate Speech in Social Media with Semantic Subspaces , 2021, WOAH.

[122]  Sebastian Ruder,et al.  Parameter-efficient Multi-task Fine-tuning for Transformers via Shared Hypernetworks , 2021, ACL.

[123]  Marco Damonte,et al.  One Semantic Parser to Parse Them All: Sequence to Sequence Multi-Task Learning on Semantic Parsing Datasets , 2021, STARSEM.

[124]  Megha Srivastava,et al.  Question Generation for Adaptive Education , 2021, ACL.

[125]  Kenny Smith,et al.  Meta-Learning to Compositionally Generalize , 2021, ACL.

[126]  Jacob Andreas,et al.  Lexicon Learning for Few Shot Sequence Modeling , 2021, ACL.

[127]  Pascale Fung,et al.  X2Parser: Cross-Lingual and Cross-Domain Framework for Task-Oriented Compositional Semantic Parsing , 2021, REPL4NLP.

[128]  Ethan Gotlieb Wilcox,et al.  A Targeted Assessment of Incremental Processing in Neural Language Models and Humans , 2021, ACL.

[129]  Kai-Wei Chang,et al.  Syntax-augmented Multilingual BERT for Cross-lingual Transfer , 2021, ACL.

[130]  Milad Shokouhi,et al.  A Dataset and Baselines for Multilingual Reply Suggestion , 2021, ACL.

[131]  Prateek Yadav,et al.  multiPRover: Generating Multiple Proofs for Improved Interpretability in Rule Reasoning , 2021, NAACL.

[132]  Marten van Schijndel,et al.  Uncovering Constraint-Based Behavior in Neural Models via Targeted Fine-Tuning , 2021, ACL.

[133]  Chinmay Choudhary,et al.  Improving the Performance of UDify with Linguistic Typology Knowledge , 2021, SIGTYP.

[134]  Sarah Ita Levitan,et al.  Detecting Multilingual COVID-19 Misinformation on Social Media via Contextualized Embeddings , 2021, NLP4IF.

[135]  Marco Brambilla,et al.  Content-based Stance Classification of Tweets about the 2020 Italian Constitutional Referendum , 2021, SOCIALNLP.

[136]  Ekaterina Vylomova,et al.  Anlirika: An LSTM–CNN Flow Twister for Spoken Language Identification , 2021, SIGTYP.

[137]  Francis M. Tyers,et al.  Do RNN States Encode Abstract Phonological Alternations? , 2021, NAACL.

[138]  Ahmed Khoumsi,et al.  Domain Adaptation for Arabic Cross-Domain and Cross-Dialect Sentiment Analysis from Contextualized Word Embedding , 2021, NAACL.

[139]  Jacob Andreas,et al.  Compositional Generalization for Neural Semantic Parsing via Span-level Supervised Attention , 2021, NAACL.

[140]  Yongjing Yin,et al.  On Compositional Generalization of Neural Machine Translation , 2021, ACL.

[141]  Constantin Orasan,et al.  An Exploratory Analysis of Multilingual Word-Level Quality Estimation with Cross-Lingual Transformers , 2021, ACL.

[142]  Diyi Yang,et al.  HiddenCut: Simple Data Augmentation for Natural Language Understanding with Better Generalizability , 2021, ACL.

[143]  Ce Zhang,et al.  Exploration and Exploitation: Two Ways to Improve Chinese Spelling Correction Models , 2021, ACL.

[144]  Jakub Szymanik,et al.  Language Models Use Monotonicity to Assess NPI Licensing , 2021, FINDINGS.

[145]  Xiaodong Liu,et al.  Super Tickets in Pre-Trained Language Models: From Model Compression to Improving Generalization , 2021, ACL.

[146]  Douwe Kiela,et al.  True Few-Shot Learning with Language Models , 2021, NeurIPS.

[147]  Minlie Huang,et al.  OpenMEVA: A Benchmark for Evaluating Open-ended Story Generation Metrics , 2021, ACL.

[148]  Mingxuan Wang,et al.  Learning Language Specific Sub-network for Multilingual Machine Translation , 2021, ACL.

[149]  Haitao Zheng,et al.  Few-NERD: A Few-shot Named Entity Recognition Dataset , 2021, ACL.

[150]  Gholamreza Haffari,et al.  Neural-Symbolic Commonsense Reasoner with Relation Predictors , 2021, ACL.

[151]  Kathleen McKeown,et al.  Adversarial Learning for Zero-Shot Stance Detection on Social Media , 2021, NAACL.

[152]  I. Kobayashi,et al.  OCHADAI-KYOTO at SemEval-2021 Task 1: Enhancing Model Generalization and Robustness for Lexical Complexity Prediction , 2021, SEMEVAL.

[153]  Zili Zhou,et al.  Encoding Explanatory Knowledge for Zero-shot Science Question Answering , 2021, IWCS.

[154]  Gerasimos Lampouras,et al.  Generalising Multilingual Concept-to-Text NLG with Language Agnostic Delexicalisation , 2021, ACL.

[155]  Juho Lee,et al.  Learning to Perturb Word Embeddings for Out-of-distribution QA , 2021, ACL.

[156]  Kentaro Inui,et al.  Learning to Learn to be Right for the Right Reasons , 2021, NAACL.

[157]  Xiang Zhou,et al.  Hidden Biases in Unreliable News Detection Datasets , 2021, EACL.

[158]  Xiang Ren,et al.  X-METRA-ADA: Cross-lingual Meta-Transfer learning Adaptation to Natural Language Understanding and Question Answering , 2021, NAACL.

[159]  S. Riedel,et al.  Fantastically Ordered Prompts and Where to Find Them: Overcoming Few-Shot Prompt Order Sensitivity , 2021, ACL.

[160]  Ngoc Thang Vu,et al.  AmericasNLI: Evaluating Zero-shot Natural Language Understanding of Pretrained Multilingual Models in Truly Low-resource Languages , 2021, ACL.

[161]  Hannaneh Hajishirzi,et al.  Cross-Task Generalization via Natural Language Crowdsourcing Instructions , 2021, ACL.

[162]  Bill Yuchen Lin,et al.  Learn Continually, Generalize Rapidly: Lifelong Knowledge Accumulation for Few-shot Learning , 2021, EMNLP.

[163]  S. Riedel,et al.  Improving Question Answering Model Robustness with Synthetic Adversarial Data Generation , 2021, EMNLP.

[164]  Xiang Ren,et al.  CrossFit: A Few-shot Learning Challenge for Cross-task Generalization in NLP , 2021, EMNLP.

[165]  Nanyun Peng,et al.  Improving Zero-Shot Cross-Lingual Transfer Learning via Robust Training , 2021, EMNLP.

[166]  Oyvind Tafjord,et al.  Explaining Answers with Entailment Trees , 2021, EMNLP.

[167]  Marek Rei,et al.  Memorisation versus Generalisation in Pre-trained Language Models , 2021, ACL.

[168]  Dan Roth,et al.  Back to Square One: Artifact Detection, Training and Commonsense Disentanglement in the Winograd Schema , 2021, EMNLP.

[169]  Diyi Yang,et al.  Structure-Aware Abstractive Conversation Summarization via Discourse and Action Graphs , 2021, NAACL.

[170]  Jinlan Fu,et al.  XTREME-R: Towards More Challenging and Nuanced Multilingual Evaluation , 2021, EMNLP.

[171]  Jason Weston,et al.  Retrieval Augmentation Reduces Hallucination in Conversation , 2021, EMNLP.

[172]  Shrey Desai,et al.  Span Pointer Networks for Non-Autoregressive Task-Oriented Semantic Parsing , 2021, EMNLP.

[173]  Douwe Kiela,et al.  Masked Language Modeling and the Distributional Hypothesis: Order Word Matters Pre-training for Little , 2021, EMNLP.

[174]  Snigdha Chaturvedi,et al.  Is Everything in Order? A Simple Way to Order Sentences , 2021, EMNLP.

[175]  Mans Hulden,et al.  Can a Transformer Pass the Wug Test? Tuning Copying Bias in Neural Morphological Inflection Models , 2021, ACL.

[176]  Xuezhi Wang,et al.  Continual Learning for Text Classification with Information Disentanglement Based Regularization , 2021, NAACL.

[177]  Wenpeng Yin,et al.  Learning to Synthesize Data for Semantic Parsing , 2021, NAACL.

[178]  T. Zhao,et al.  Adversarial Regularization as Stackelberg Game: An Unrolled Optimization Approach , 2021, EMNLP.

[179]  Dan Klein,et al.  Adapting Language Models for Zero-shot Learning by Meta-tuning on Dataset and Prompt Collections , 2021, EMNLP.

[180]  Kai Yu,et al.  ShadowGNN: Graph Projection Neural Network for Text-to-SQL Parser , 2021, NAACL.

[181]  Tharindu Ranasinghe,et al.  TransWiC at SemEval-2021 Task 2: Transformer-based Multilingual and Cross-lingual Word-in-Context Disambiguation , 2021, SEMEVAL.

[182]  Zhiyi Ma,et al.  Dynabench: Rethinking Benchmarking in NLP , 2021, NAACL.

[183]  Erenay Dayanik,et al.  Disentangling Document Topic and Author Gender in Multiple Languages: Lessons for Adversarial Debiasing , 2021, WASSA.

[184]  Graham Neubig,et al.  MasakhaNER: Named Entity Recognition for African Languages , 2021, Transactions of the Association for Computational Linguistics.

[185]  Pascale Fung,et al.  AdaptSum: Towards Low-Resource Domain Adaptation for Abstractive Summarization , 2021, NAACL.

[186]  Timothy Baldwin,et al.  Evaluating Document Coherence Modeling , 2021, Transactions of the Association for Computational Linguistics.

[187]  David Ifeoluwa Adelani,et al.  The Effect of Domain and Diacritics in Yoruba–English Neural Machine Translation , 2021, MTSUMMIT.

[188]  Sanjeev Khudanpur,et al.  Learning Feature Weights using Reward Modeling for Denoising Parallel Corpora , 2021, WMT.

[189]  Franck Dernoncourt,et al.  Towards Interpreting and Mitigating Shortcut Learning Behavior of NLU models , 2021, NAACL.

[190]  Emily M. Bender,et al.  On the Dangers of Stochastic Parrots: Can Language Models Be Too Big? 🦜 , 2021, FAccT.

[191]  Diana Inkpen,et al.  Conditional Adversarial Networks for Multi-Domain Text Classification , 2021, ADAPTNLP.

[192]  Phil Blunsom,et al.  Mind the Gap: Assessing Temporal Generalization in Neural Language Models , 2021, NeurIPS.

[193]  Karin Verspoor,et al.  Memorization vs. Generalization : Quantifying Data Leakage in NLP Performance Evaluation , 2021, EACL.

[194]  Lucas Weber,et al.  Language Modelling as a Multi-Task Problem , 2021, EACL.

[195]  Hitomi Yanaka,et al.  Exploring Transitivity in Neural NLI Models through Veridicality , 2021, EACL.

[196]  Sonal Gupta,et al.  Muppet: Massive Multi-task Representations with Pre-Finetuning , 2021, EMNLP.

[197]  Roi Reichart,et al.  Model Compression for Domain Adaptation through Causal Effect Estimation , 2021, Transactions of the Association for Computational Linguistics.

[198]  Xiang Ren,et al.  Learning to Generate Task-Specific Adapters from Task Description , 2021, ACL.

[199]  Valentin Hofmann,et al.  Superbizarre Is Not Superb: Derivational Morphology Improves BERT’s Interpretation of Complex Words , 2021, ACL.

[200]  Jackie Chi Kit Cheung,et al.  Optimizing Deeper Transformers on Small Datasets , 2020, ACL.

[201]  Jianfeng Gao,et al.  RADDLE: An Evaluation Benchmark and Analysis Platform for Robust Task-oriented Dialog Systems , 2020, ACL.

[202]  Mohit Bansal,et al.  I like fish, especially dolphins: Addressing Contradictions in Dialogue Modeling , 2020, ACL.

[203]  Magdalena Biesialska,et al.  Continual Lifelong Learning in Natural Language Processing: A Survey , 2020, COLING.

[204]  Pang Wei Koh,et al.  WILDS: A Benchmark of in-the-Wild Distribution Shifts , 2020, ICML.

[205]  Yoav Goldberg,et al.  Amnesic Probing: Behavioral Explanation with Amnesic Counterfactuals , 2020, Transactions of the Association for Computational Linguistics.

[206]  Nathan Schneider,et al.  Supertagging the Long Tail with Tree-Structured Decoding of Complex Categories , 2020, Transactions of the Association for Computational Linguistics.

[207]  Jiafeng Guo,et al.  Event Coreference Resolution with their Paraphrases and Argument-aware Embeddings , 2020, COLING.

[208]  Valeria de Paiva,et al.  Hy-NLI: a Hybrid system for Natural Language Inference , 2020, COLING.

[209]  Eduard Hovy,et al.  On the Systematicity of Probing Contextualized Word Representations: The Case of Hypernymy in BERT , 2020, STARSEM.

[210]  Miikka Silfverberg,et al.  Noise Isn’t Always Negative: Countering Exposure Bias in Sequence-to-Sequence Inflection Models , 2020, COLING.

[211]  Robert Frank,et al.  Sequence-to-Sequence Networks Learn the Meaning of Reflexive Anaphora , 2020, CRAC.

[212]  Kentaro Inui,et al.  Efficient Estimation of Influence of a Training Instance , 2020, SUSTAINLP.

[213]  Matt Gardner,et al.  Learning from Task Descriptions , 2020, EMNLP.

[214]  Ramesh Nallapati,et al.  Unsupervised Domain Adaptation for Cross-lingual Text Labeling , 2020, FINDINGS.

[215]  Daniel Gillick,et al.  Entity Linking in 100 Languages , 2020, EMNLP.

[216]  Alexander Rush,et al.  Sequence-level Mixed Sample Data Augmentation , 2020, EMNLP.

[217]  Coleman Haley,et al.  This is a BERT. Now there are several of them. Can they generalize to novel words? , 2020, BLACKBOXNLP.

[218]  Roger Levy,et al.  Investigating Novel Verb Learning in BERT: Selectional Preference Classes and Alternation-Based Syntactic Generalization , 2020, BLACKBOXNLP.

[219]  Mark Dredze,et al.  Do Models of Mental Health Based on Social Media Data Generalize? , 2020, FINDINGS.

[220]  Richard Tobin,et al.  Not a cute stroke: Analysis of Rule- and Neural Network-based Information Extraction Systems for Brain Radiology Reports , 2020, LOUHI.

[221]  Yaohui Jin,et al.  Modeling Content Importance for Summarization with Pre-trained Language Models , 2020, EMNLP.

[222]  Yejin Choi,et al.  Social Chemistry 101: Learning to Reason about Social and Moral Norms , 2020, EMNLP.

[223]  Caiming Xiong,et al.  The Thieves on Sesame Street Are Polyglots — Extracting Multilingual Models from Monolingual APIs , 2020, EMNLP.

[224]  Saptarashmi Bandyopadhyay,et al.  Natural Language Response Generation from SQL with Generalization and Back-translation , 2020, INTEXSEMPAR.

[225]  Swaroop Mishra,et al.  Do We Need to Create Big Datasets to Learn a Task? , 2020, SUSTAINLP.

[226]  Quan Wang,et al.  Event Extraction as Multi-turn Question Answering , 2020, FINDINGS.

[227]  Kaiyu Huang,et al.  A Joint Multiple Criteria Model in Transfer Learning for Cross-domain Chinese Word Segmentation , 2020, EMNLP.

[228]  Han Wang,et al.  Enhancing Generalization in Natural Language Inference by Syntax , 2020, FINDINGS.

[229]  Ayan Sengupta,et al.  DATAMAFIA at WNUT-2020 Task 2: A Study of Pre-trained Language Models along with Regularization Techniques for Downstream Tasks , 2020, WNUT.

[230]  Yonatan Belinkov,et al.  Findings of the WMT 2020 Shared Task on Machine Translation Robustness , 2020, WMT.

[231]  Timothy Baldwin,et al.  Target Word Masking for Location Metonymy Resolution , 2020, COLING.

[232]  Jia Deng,et al.  Strongly Incremental Constituency Parsing with Graph Neural Networks , 2020, NeurIPS.

[233]  Dan Roth,et al.  Temporal Reasoning on Implicit Events from Distant Supervision , 2020, NAACL.

[234]  Greg Durrett,et al.  Effective Distant Supervision for Temporal Relation Extraction , 2020, ADAPTNLP.

[235]  Ming-Wei Chang,et al.  Compositional Generalization and Natural Language Variation: Can a Semantic Parsing Approach Handle Both? , 2020, ACL.

[236]  Xiaodong Liu,et al.  Posterior Differential Regularization with f-divergence for Improving Model Robustness , 2020, NAACL.

[237]  Mirella Lapata,et al.  Meta-Learning for Domain Generalization in Semantic Parsing , 2020, NAACL.

[238]  Jimmy J. Lin,et al.  Scientific Claim Verification with VerT5erini , 2020, LOUHI.

[239]  Jungo Kasai,et al.  XOR QA: Cross-lingual Open-Retrieval Question Answering , 2020, NAACL.

[240]  Mirella Lapata,et al.  Compositional Generalization via Semantic Tagging , 2020, EMNLP.

[241]  Sungjin Lee,et al.  Self-Supervised Contrastive Learning for Efficient User Satisfaction Prediction in Conversational Agents , 2020, NAACL.

[242]  Siva Reddy,et al.  Explicitly Modeling Syntax in Language Models with Incremental Parsing and a Dynamic Oracle , 2020, NAACL.

[243]  Holger Schwenk,et al.  Beyond English-Centric Multilingual Machine Translation , 2020, J. Mach. Learn. Res..

[244]  Xiaocheng Feng,et al.  Incorporating Commonsense Knowledge into Abstractive Dialogue Summarization via Heterogeneous Graph Networks , 2020, CCL.

[245]  Shrey Desai,et al.  Compressive Summarization with Plausibility and Salience Modeling , 2020, EMNLP.

[246]  Helen Yannakoudakis,et al.  Grammatical Error Correction in Low Error Density Domains: A New Benchmark and Analyses , 2020, EMNLP.

[247]  Benjamin Newman,et al.  The EOS Decision and Length Extrapolation , 2020, BLACKBOXNLP.

[248]  Svetlana Kiritchenko,et al.  On Cross-Dataset Generalization in Automatic Detection of Online Abuse , 2020, ALW.

[249]  Alessandro Raganato,et al.  XL-WiC: A Multilingual Benchmark for Evaluating Semantic Contextualization , 2020, EMNLP.

[250]  Roger Levy,et al.  Structural Supervision Improves Few-Shot Learning and Syntactic Generalization in Neural Language Models , 2020, EMNLP.

[251]  Tal Linzen,et al.  COGS: A Compositional Generalization Challenge Based on Semantic Interpretation , 2020, EMNLP.

[252]  Jonathan Berant,et al.  Improving Compositional Generalization in Semantic Parsing , 2020, FINDINGS.

[253]  Xuanjing Huang,et al.  An Empirical Study of Cross-Dataset Evaluation for Neural Summarization Systems , 2020, FINDINGS.

[254]  Siddharth Dalmia,et al.  On Long-Tailed Phenomena in Neural Machine Translation , 2020, FINDINGS.

[255]  Samuel R. Bowman,et al.  Counterfactually-Augmented SNLI Training Data Does Not Yield Better Generalization Than Unaugmented Data , 2020, INSIGHTS.

[256]  Kathleen McKeown,et al.  Zero-Shot Stance Detection: A Dataset and Model Using Generalized Topic Representations , 2020, EMNLP.

[257]  Claire Cardie,et al.  WikiLingua: A New Benchmark Dataset for Multilingual Abstractive Summarization , 2020, FINDINGS.

[258]  Asish Ghoshal,et al.  Low-Resource Domain Adaptation for Compositional Task-Oriented Semantic Parsing , 2020, EMNLP.

[259]  Iryna Gurevych,et al.  Improving QA Generalization by Concurrent Modeling of Multiple Biases , 2020, FINDINGS.

[260]  Yoav Goldberg,et al.  Exposing Shallow Heuristics of Relation Extraction Models with Challenge Data , 2020, EMNLP.

[261]  Dragomir R. Radev,et al.  Universal Natural Language Processing with Limited Annotations: Try Few-shot Textual Entailment as a Start , 2020, EMNLP.

[262]  Wenhu Chen,et al.  KGPT: Knowledge-Grounded Pre-Training for Data-to-Text Generation , 2020, EMNLP.

[263]  Ralph Weischedel,et al.  Learning to Generalize for Sequential Decision Making , 2020, FINDINGS.

[264]  Anette Frank,et al.  X-SRL: A Parallel Cross-Lingual Semantic Role Labeling Dataset , 2020, EMNLP.

[265]  Siva Reddy,et al.  Measuring Systematic Generalization in Neural Proof Generation with Transformers , 2020, NeurIPS.

[266]  M. Choudhury,et al.  TaxiNLI: Taking a Ride up the NLU Hill , 2020, CONLL.

[267]  Tiancheng Zhao,et al.  SPARTA: Efficient Open-Domain Question Answering via Sparse Transformer Matching Retrieval , 2020, NAACL.

[268]  Sameer Singh,et al.  Paired Examples as Indirect Supervision in Latent Decision Models , 2020, EMNLP.

[269]  Yejin Choi,et al.  Dataset Cartography: Mapping and Diagnosing Datasets with Training Dynamics , 2020, EMNLP.

[270]  Philip S. Yu,et al.  Composed Variational Natural Language Generation for Few-shot Intents , 2020, FINDINGS.

[271]  Kyunghyun Cho,et al.  SSMBA: Self-Supervised Manifold Based Data Augmentation for Improving Out-of-Domain Robustness , 2020, EMNLP.

[272]  Kumud Chauhan,et al.  NEU at WNUT-2020 Task 2: Data Augmentation To Tell BERT That Death Is Not Necessarily Informative , 2020, WNUT.

[273]  Christopher DuBois,et al.  On the Transferability of Minimal Prediction Preserving Inputs in Question Answering , 2020, NAACL.

[274]  Trapit Bansal,et al.  Self-Supervised Meta-Learning for Few-Shot Natural Language Classification Tasks , 2020, EMNLP.

[275]  Gregor Betz,et al.  Critical Thinking for Language Models , 2020, IWCS.

[276]  Jonathan Berant,et al.  Span-based Semantic Parsing for Compositional Generalization , 2020, ACL.

[277]  Haoran Li,et al.  MTOP: A Comprehensive Multilingual Task-Oriented Semantic Parsing Benchmark , 2020, EACL.

[278]  Sebastian Riedel,et al.  Question and Answer Test-Train Overlap in Open-Domain Question Answering Datasets , 2020, EACL.

[279]  Joachim Daiber,et al.  MKQA: A Linguistically Diverse Benchmark for Multilingual Open Domain Question Answering , 2020, Transactions of the Association for Computational Linguistics.

[280]  Lifu Tu,et al.  An Empirical Study on Robustness to Spurious Correlations using Pre-trained Language Models , 2020, Transactions of the Association for Computational Linguistics.

[281]  Tao Yu,et al.  DART: Open-Domain Structured Data Record to Text Generation , 2020, NAACL.

[282]  Franck Dernoncourt,et al.  Exploiting the Syntax-Model Consistency for Neural Relation Extraction , 2020, ACL.

[283]  Cornelia Caragea,et al.  Cross-Lingual Disaster-related Multi-label Tweet Classification with Manifold Mixup , 2020, ACL.

[284]  Shruti Rijhwani,et al.  Temporally-Informed Analysis of Named Entity Recognition , 2020, ACL.

[285]  Ming-Wei Chang,et al.  Exploring Unexplored Generalization Challenges for Cross-Database Semantic Parsing , 2020, ACL.

[286]  Deniz Yuret,et al.  Joint Training with Semantic Role Labeling for Better Generalization in Natural Language Inference , 2020, REPL4NLP.

[287]  Yoshua Bengio,et al.  Compositional Generalization by Factorizing Alignment and Translation , 2020, ACL.

[288]  Ryan Cotterell,et al.  SIGMORPHON 2020 Shared Task 0: Typologically Diverse Morphological Inflection , 2020, SIGMORPHON.

[289]  M. Marelli,et al.  Mechanisms for handling nested dependencies in neural-network language models and humans , 2020, Cognition.

[290]  Percy Liang,et al.  Selective Question Answering under Domain Shift , 2020, ACL.

[291]  Alvaro Soto,et al.  Translating Natural Language Instructions for Behavioral Robot Navigation with a Multi-Head Attention Mechanism , 2020, WINLP.

[292]  Rajaswa Patil,et al.  LRG at SemEval-2020 Task 7: Assessing the Ability of BERT and Derivative Models to Perform Short-Edits Based Humor Grading , 2020, SEMEVAL.

[293]  Mark Chen,et al.  Language Models are Few-Shot Learners , 2020, NeurIPS.

[294]  Uri Shalit,et al.  CausaLM: Causal Model Explanation Through Counterfactual Language Models , 2020, CL.

[295]  Aakanksha Naik,et al.  Towards Open Domain Event Trigger Identification using Adversarial Domain Adaptation , 2020, ACL.

[296]  Mihir Kale,et al.  Text-to-Text Pre-Training for Data-to-Text Tasks , 2020, INLG.

[297]  Adam Lopez,et al.  Inflecting When There’s No Majority: Limitations of Encoder-Decoder Neural Networks as Cognitive Models for German Plurals , 2020, ACL.

[298]  Peter Szolovits,et al.  Entity-Enriched Neural Models for Clinical Question Answering , 2020, BIONLP.

[299]  Dilek Z. Hakkani-Tür,et al.  Schema-Guided Natural Language Generation , 2020, INLG.

[300]  Koustuv Sinha,et al.  Probing Linguistic Systematicity , 2020, ACL.

[301]  Sameer Singh,et al.  Beyond Accuracy: Behavioral Testing of NLP Models with CheckList , 2020, ACL.

[302]  Roger P. Levy,et al.  A Systematic Assessment of Syntactic Generalization in Neural Language Models , 2020, ACL.

[303]  Tal Linzen,et al.  How Can We Accelerate Progress Towards Human-like Linguistic Generalization? , 2020, ACL.

[304]  Bill Yuchen Lin,et al.  RICA: Evaluating Robust Inference Capabilities Based on Commonsense Axioms , 2020, EMNLP.

[305]  Hannaneh Hajishirzi,et al.  UnifiedQA: Crossing Format Boundaries With a Single QA System , 2020, FINDINGS.

[306]  Emily Denton,et al.  Social Biases in NLP Models as Barriers for Persons with Disabilities , 2020, ACL.

[307]  Jianfeng Gao,et al.  RMM: A Recursive Mental Model for Dialog Navigation , 2020, FINDINGS.

[308]  Xiang Ren,et al.  Teaching Machine Comprehension with Compositional Explanations , 2020, FINDINGS.

[309]  Anders Sogaard,et al.  We Need To Talk About Random Splits , 2020, EACL.

[310]  Piotr Rybak,et al.  KLEJ: Comprehensive Benchmark for Polish Language Understanding , 2020, ACL.

[311]  Xiang Yue,et al.  Clinical Reading Comprehension: A Thorough Analysis of the emrQA Dataset , 2020, ACL.

[312]  Bernard J. Jansen,et al.  A Multi-Platform Arabic News Comment Dataset for Offensive Language Detection , 2020, LREC.

[313]  A. Korhonen,et al.  XCOPA: A Multilingual Dataset for Causal Commonsense Reasoning , 2020, EMNLP.

[314]  Arzucan Özgür,et al.  Analyzing ELMo and DistilBERT on Socio-political News Classification , 2020, AESPEN.

[315]  Afroz Ahamad,et al.  AccentDB: A Database of Non-Native English Accents to Assist Neural Speech Recognition , 2020, LREC.

[316]  Michael J. Paul,et al.  Why Overfitting Isn’t Always Bad: Retrofitting Cross-Lingual Word Embeddings to Dictionaries , 2020, ACL.

[317]  Chitta Baral,et al.  Self-Supervised Knowledge Triplet Learning for Zero-shot Question Answering , 2020, EMNLP.

[318]  Greg Durrett,et al.  Robust Question Answering Through Sub-part Alignment , 2020, NAACL.

[319]  Nathan Schneider,et al.  Lexical Semantic Recognition , 2020, MWE.

[320]  Dan Jurafsky,et al.  Learning Music Helps You Read: Using Transfer to Study Linguistic Structure in Language Models , 2020, EMNLP.

[321]  Sylvain Lamprier,et al.  MLSUM: The Multilingual Summarization Corpus , 2020, EMNLP.

[322]  Michael A. Lepori,et al.  Representations of Syntax [MASK] Useful: Effects of Constituency and Dependency Structure in Recursive LSTMs , 2020, ACL.

[323]  Rudolf Rosa,et al.  Universal Dependencies according to BERT: both more specific and more general , 2020, FINDINGS.

[324]  Christopher Potts,et al.  Neural Natural Language Inference Models Partially Embed Theories of Lexical Entailment and Negation , 2020, BLACKBOXNLP.

[325]  Veselin Stoyanov,et al.  General Purpose Text Embeddings from Pre-trained Language Models for Scalable Inference , 2020, FINDINGS.

[326]  Rico Sennrich,et al.  Improving Massively Multilingual Neural Machine Translation and Zero-Shot Translation , 2020, ACL.

[327]  R. Thomas McCoy,et al.  Syntactic Data Augmentation Increases Robustness to Inference Heuristics , 2020, ACL.

[328]  Yu Hong,et al.  DuReader_robust: A Chinese Dataset Towards Evaluating Robustness and Generalization of Machine Reading Comprehension in Real-World Applications , 2020, ACL.

[329]  Doug Downey,et al.  Don’t Stop Pretraining: Adapt Language Models to Domains and Tasks , 2020, ACL.

[330]  Sampo Pyysalo,et al.  Universal Dependencies v2: An Evergrowing Multilingual Treebank Collection , 2020, LREC.

[331]  Xiao Huang,et al.  TriggerNER: Learning with Entity Triggers as Explanations for Named Entity Recognition , 2020, ACL.

[332]  Tim Rocktäschel,et al.  There is Strength in Numbers: Avoiding the Hypothesis-Only Bias in Natural Language Inference via Ensemble Adversarial Training , 2020, ArXiv.

[333]  Mohit Bansal,et al.  Adversarial Augmentation Policy Search for Domain and Cross-Lingual Generalization in Reading Comprehension , 2020, FINDINGS.

[334]  Dilek Z. Hakkani-Tür,et al.  From Machine Reading Comprehension to Dialogue State Tracking: Bridging the Gap , 2020, NLP4CONVAI.

[335]  Dawn Song,et al.  Pretrained Transformers Improve Out-of-Distribution Robustness , 2020, ACL.

[336]  Tatsuya Kawahara,et al.  Designing Precise and Robust Dialogue Response Evaluators , 2020, ACL.

[337]  Zhiyuan Liu,et al.  More Data, More Relations, More Context and More Openness: A Review and Outlook for Relation Extraction , 2020, AACL.

[338]  Noah A. Smith,et al.  Evaluating Models’ Local Decision Boundaries via Contrast Sets , 2020, FINDINGS.

[339]  Xiujun Li,et al.  Optimus: Organizing Sentences via Pre-trained Modeling of a Latent Space , 2020, EMNLP.

[340]  Xiaodong Fan,et al.  XGLUE: A New Benchmark Datasetfor Cross-lingual Pre-training, Understanding and Generation , 2020, EMNLP.

[341]  Kentaro Inui,et al.  Do Neural Models Learn Systematicity of Monotonicity Inference in Natural Language? , 2020, ACL.

[342]  Weijia Xu,et al.  End-to-End Slot Alignment and Recognition for Cross-Lingual NLU , 2020, EMNLP.

[343]  Orhan Firat,et al.  XTREME: A Massively Multilingual Multi-task Benchmark for Evaluating Cross-lingual Generalization , 2020, ICML.

[344]  Elena Kochkina,et al.  Cost-Sensitive BERT for Generalisable Sentence Classification on Imbalanced Data , 2020, EMNLP.

[345]  Armando Solar-Lezama,et al.  Learning Compositional Rules via Neural Program Synthesis , 2020, NeurIPS.

[346]  Eunsol Choi,et al.  TyDi QA: A Benchmark for Information-Seeking Question Answering in Typologically Diverse Languages , 2020, Transactions of the Association for Computational Linguistics.

[347]  Jianfeng Gao,et al.  Few-shot Natural Language Generation for Task-Oriented Dialog , 2020, FINDINGS.

[348]  Pasquale Minervini,et al.  Undersensitivity in Neural Reading Comprehension , 2020, FINDINGS.

[349]  Bill Yuchen Lin,et al.  CommonGen: A Constrained Text Generation Challenge for Generative Commonsense Reasoning , 2020, FINDINGS.

[350]  Sebastian Riedel,et al.  Beat the AI: Investigating Adversarial Human Annotation for Reading Comprehension , 2020, Transactions of the Association for Computational Linguistics.

[351]  Ryan Cotterell,et al.  Parameter Space Factorization for Zero-Shot Learning across Tasks and Languages , 2020, Transactions of the Association for Computational Linguistics.

[352]  Timo Schick,et al.  Exploiting Cloze-Questions for Few-Shot Text Classification and Natural Language Inference , 2020, EACL.

[353]  R. Thomas McCoy,et al.  Does Syntax Need to Grow on Trees? Sources of Hierarchical Inductive Bias in Sequence-to-Sequence Networks , 2020, TACL.

[354]  Yoav Goldberg,et al.  oLMpics-On What Language Model Pre-training Captures , 2019, Transactions of the Association for Computational Linguistics.

[355]  Xiao Wang,et al.  Measuring Compositional Generalization: A Comprehensive Method on Realistic Data , 2019, ICLR.

[356]  Benjamin Van Durme,et al.  Reading the Manual: Event Extraction as Definition Comprehension , 2019, SPNLP.

[357]  Samuel R. Bowman,et al.  BLiMP: The Benchmark of Linguistic Minimal Pairs for English , 2019, Transactions of the Association for Computational Linguistics.

[358]  Frank F. Xu,et al.  How Can We Know What Language Models Know? , 2019, Transactions of the Association for Computational Linguistics.

[359]  Khalil Mrini,et al.  Rethinking Self-Attention: Towards Interpretability in Neural Parsing , 2019, FINDINGS.

[360]  A. McCallum,et al.  Learning to Few-Shot Learn Across Diverse Natural Language Classification Tasks , 2019, COLING.

[361]  Elia Bruni,et al.  Location Attention for Extrapolation to Longer Sequences , 2019, ACL.

[362]  Xiaodong Liu,et al.  RAT-SQL: Relation-Aware Schema Encoding and Linking for Text-to-SQL Parsers , 2019, ACL.

[363]  Jianfeng Gao,et al.  SMART: Robust and Efficient Fine-Tuning for Pre-trained Natural Language Models through Principled Regularized Optimization , 2019, ACL.

[364]  R. Thomas McCoy,et al.  BERTs of a feather do not generalize together: Large variability in generalization across models with similar test set performance , 2019, BLACKBOXNLP.

[365]  Florian Metze,et al.  On Compositionality in Neural Machine Translation , 2019, ArXiv.

[366]  François Yvon,et al.  Generic and Specialized Word Embeddings for Multi-Domain Machine Translation , 2019, IWSLT.

[367]  Yaser Al-Onaizan,et al.  Robustness to Capitalization Errors in Named Entity Recognition , 2019, EMNLP.

[368]  Tomoki Taniguchi,et al.  CLER: Cross-task Learning with Expert Representation to Generalize Reading and Understanding , 2019, EMNLP.

[369]  Haoyang Huang,et al.  Improving the Robustness of Deep Reading Comprehension Models by Leveraging Syntax Prior , 2019, EMNLP.

[370]  Christopher Potts,et al.  Posing Fair Generalization Tasks for Natural Language Inference , 2019, EMNLP.

[371]  Yan Xu,et al.  Generalizing Question Answering System with Pre-trained Language Model Fine-tuning , 2019, EMNLP.

[372]  Fenglin Liu,et al.  Self-Adaptive Scaling for Learnable Residual Structure , 2019, CoNLL.

[373]  Maria Chang,et al.  Graph Enhanced Cross-Domain Text-to-SQL Generation , 2019, EMNLP.

[374]  Mikel Artetxe,et al.  On the Cross-lingual Transferability of Monolingual Representations , 2019, ACL.

[375]  Ryan Cotterell,et al.  The SIGMORPHON 2019 Shared Task: Morphological Analysis in Context and Cross-Lingual Transfer for Inflection , 2019, Proceedings of the 16th Workshop on Computational Research in Phonetics, Phonology, and Morphology.

[376]  Peter J. Liu,et al.  Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer , 2019, J. Mach. Learn. Res..

[377]  Danqi Chen,et al.  MRQA 2019 Shared Task: Evaluating Generalization in Reading Comprehension , 2019, EMNLP.

[378]  Donggyu Kim,et al.  Domain-agnostic Question-Answering with Adversarial Training , 2019, EMNLP.

[379]  Ido Dagan,et al.  Diversify Your Datasets: Analyzing Generalization via Controlled Variance in Adversarial Datasets , 2019, CoNLL.

[380]  Holger Schwenk,et al.  MLQA: Evaluating Cross-lingual Extractive Question Answering , 2019, ACL.

[381]  Liang Zhao,et al.  Compositional Generalization for Primitive Substitutions , 2019, EMNLP.

[382]  Roi Blanco,et al.  Book QA: Stories of Challenges and Opportunities , 2019, EMNLP.

[383]  Florian Schmidt Generalization in Generation: A closer look at Exposure Bias , 2019, EMNLP.

[384]  Frédéric Béchet,et al.  Robust Semantic Parsing with Adversarial Learning for Domain Generalization , 2019, NAACL.

[385]  Tom M. Mitchell,et al.  Look-up and Adapt: A One-shot Semantic Parser , 2019, EMNLP.

[386]  Hal Daumé,et al.  Global Voices: Crossing Borders in Automatic News Summarization , 2019, EMNLP.

[387]  Graham Neubig,et al.  Domain Differential Adaptation for Neural Machine Translation , 2019, EMNLP.

[388]  Marcus Bishop,et al.  Learning Invariant Representations of Social Media Users , 2019, EMNLP.

[389]  Zachary Chase Lipton,et al.  Learning the Difference that Makes a Difference with Counterfactually-Augmented Data , 2019, ICLR.

[390]  Antonia Baumann,et al.  Multilingual Language Models for Named Entity Recognition in German and English , 2019, RANLP.

[391]  Rabeeh Karimi Mahabadi,et al.  End-to-End Bias Mitigation by Modelling Biases in Corpora , 2019, ACL.

[392]  Jason Weston,et al.  Finding Generalizable Evidence by Learning to Convince Q&A Models , 2019, EMNLP.

[393]  Mirella Lapata,et al.  Learning Semantic Parsers from Denotations with Latent Structured Alignments and Abstract Programs , 2019, EMNLP.

[394]  Hung-yi Lee,et al.  LAMOL: LAnguage MOdeling for Lifelong Language Learning , 2019, ICLR.

[395]  Ryan Cotterell,et al.  Don’t Forget the Long Tail! A Comprehensive Analysis of Morphological Generalization in Bilingual Lexicon Induction , 2019, EMNLP.

[396]  Shikha Bordia,et al.  Investigating BERT’s Knowledge of Language: Five Analysis Methods with NPIs , 2019, EMNLP.

[397]  Nanyun Peng,et al.  The Woman Worked as a Babysitter: On Biases in Language Generation , 2019, EMNLP.

[398]  Noah A. Smith,et al.  Topics to Avoid: Demoting Latent Confounds in Text Classification , 2019, EMNLP.

[399]  Jason Baldridge,et al.  Learning Dense Representations for Entity Retrieval , 2019, CoNLL.

[400]  Todor Mihaylov,et al.  Discourse-Aware Semantic Self-Attention for Narrative Reading Comprehension , 2019, EMNLP.

[401]  Emiel Krahmer,et al.  Neural data-to-text generation: A comparison between pipeline and end-to-end architectures , 2019, EMNLP.

[402]  Elia Bruni,et al.  Compositionality Decomposed: How do Neural Networks Generalise? , 2019, J. Artif. Intell. Res..

[403]  Yoav Goldberg,et al.  Are We Modeling the Task or the Annotator? An Investigation of Annotator Bias in Natural Language Understanding Datasets , 2019, EMNLP.

[404]  Fei Liu,et al.  MoverScore: Text Generation Evaluating with Contextualized Embeddings and Earth Mover Distance , 2019, EMNLP.

[405]  Jason Baldridge,et al.  PAWS-X: A Cross-lingual Adversarial Dataset for Paraphrase Identification , 2019, EMNLP.

[406]  Hazem M. Hajj,et al.  Improved Generalization of Arabic Text Classifiers , 2019, WANLP@ACL 2019.

[407]  Yang Yu,et al.  Out-of-Domain Detection for Low-Resource Text Classification Tasks , 2019, EMNLP.

[408]  José Manuél Gómez-Pérez,et al.  An Empirical Study on Pre-trained Embeddings and Language Models for Bot Detection , 2019, RepL4NLP@ACL.

[409]  Hazem M. Hajj,et al.  hULMonA: The Universal Language Model in Arabic , 2019, WANLP@ACL 2019.

[410]  Pascale Fung,et al.  Learning Multilingual Meta-Embeddings for Code-Switching Named Entity Recognition , 2019, RepL4NLP@ACL.

[411]  Joelle Pineau,et al.  CLUTRR: A Diagnostic Benchmark for Inductive Reasoning from Text , 2019, EMNLP.

[412]  Omer Levy,et al.  RoBERTa: A Robustly Optimized BERT Pretraining Approach , 2019, ArXiv.

[413]  Dan Klein,et al.  Cross-Domain Generalization of Neural Constituency Parsers , 2019, ACL.

[414]  Yan Song,et al.  Knowledge-aware Pronoun Coreference Resolution , 2019, ACL.

[415]  Dan Roth,et al.  Zero-Shot Open Entity Typing as Type-Compatible Grounding , 2019, EMNLP.

[416]  Youmna Farag,et al.  Multi-Task Learning for Coherence Modeling , 2019, ACL.

[417]  Marie-Catherine de Marneffe,et al.  Do You Know That Florence Is Packed with Visitors? Evaluating State-of-the-art Models of Speaker Commitment , 2019, ACL.

[418]  Rick Siow Mong Goh,et al.  Dual Adversarial Neural Transfer for Low-Resource Named Entity Recognition , 2019, ACL.

[419]  Tong Zhang,et al.  Reinforced Training Data Selection for Domain Adaptation , 2019, ACL.

[420]  Michael J. Paul,et al.  Neural Temporality Adaptation for Document Classification: Diachronic Word Embeddings and Domain Adaptation Models , 2019, ACL.

[421]  Kyle Gorman,et al.  We Need to Talk about Standard Splits , 2019, ACL.

[422]  Maosong Sun,et al.  XQA: A Cross-lingual Open-domain Question Answering Dataset , 2019, ACL.

[423]  Goran Glavas,et al.  Multilingual and Cross-Lingual Graded Lexical Entailment , 2019, ACL.

[424]  Partha Pratim Talukdar,et al.  Zero-shot Word Sense Disambiguation using Sense Definition Embeddings , 2019, ACL.

[425]  Eser Kandogan,et al.  HEIDL: Learning Linguistic Expressions with Deep Learning and Human-in-the-Loop , 2019, ACL.

[426]  Charibeth Cheng,et al.  Localization of Fake News Detection via Multitask Transfer Learning , 2019, LREC.

[427]  Ming-Wei Chang,et al.  Zero-Shot Entity Linking by Reading Entity Descriptions , 2019, ACL.

[428]  Johan Bos,et al.  Can Neural Networks Understand Monotonicity Reasoning? , 2019, BlackboxNLP@ACL.

[429]  Lonneke van der Plas,et al.  Learning to Predict Novel Noun-Noun Compounds , 2019, MWE-WN@ACL.

[430]  Andrew McCallum,et al.  Energy and Policy Considerations for Deep Learning in NLP , 2019, ACL.

[431]  A. Korhonen,et al.  Are we there yet? Encoder-decoder neural networks as cognitive models of English past tense inflection , 2019, ACL.

[432]  Elia Bruni,et al.  Transcoding Compositionally: Using Attention to Find More Generalizable Solutions , 2019, BlackboxNLP@ACL.

[433]  Eva Schlinger,et al.  How Multilingual is Multilingual BERT? , 2019, ACL.

[434]  Jaime Carbonell,et al.  Domain Adaptation of Neural Machine Translation by Lexicon Induction , 2019, ACL.

[435]  Mathijs Mul,et al.  Siamese recurrent networks learn first-order logic reasoning and exhibit zero-shot compositional generalization , 2019, ArXiv.

[436]  Dragomir R. Radev,et al.  SParC: Cross-Domain Semantic Parsing in Context , 2019, ACL.

[437]  Dan Roth,et al.  Improving Generalization in Coreference Resolution via Adversarial Training , 2019, *SEMEVAL.

[438]  Jonathan Berant,et al.  MultiQA: An Empirical Investigation of Generalization and Transfer in Reading Comprehension , 2019, ACL.

[439]  Jackie Chi Kit Cheung,et al.  A Cross-Domain Transferable Neural Coherence Model , 2019, ACL.

[440]  Samuel R. Bowman,et al.  Human vs. Muppet: A Conservative Estimate of Human Performance on the GLUE Benchmark , 2019, ACL.

[441]  Marco Baroni,et al.  CNNs found to jump around more skillfully than RNNs: Compositional Generalization in Seq2seq Convolutional Networks , 2019, ACL.

[442]  Marcelo Finger,et al.  A logical-based corpus for cross-lingual evaluation , 2019, EMNLP.

[443]  Omer Levy,et al.  SuperGLUE: A Stickier Benchmark for General-Purpose Language Understanding Systems , 2019, NeurIPS.

[444]  William Yang Wang,et al.  Few-Shot NLG with Pre-Trained Language Model , 2019, ACL.

[445]  Mark Dredze,et al.  Beto, Bentz, Becas: The Surprising Cross-Lingual Effectiveness of BERT , 2019, EMNLP.

[446]  Jack Hessel,et al.  Something’s Brewing! Early Prediction of Controversy-causing Posts from Discussion Features , 2019, NAACL.

[447]  Graham Neubig,et al.  Density Matching for Bilingual Word Embedding , 2019, NAACL.

[448]  Ankur P. Parikh,et al.  Consistency by Agreement in Zero-Shot Neural Machine Translation , 2019, NAACL.

[449]  Mai ElSherief,et al.  Learning to Decipher Hate Symbols , 2019, NAACL.

[450]  Pushmeet Kohli,et al.  Analysing Mathematical Reasoning Abilities of Neural Models , 2019, ICLR.

[451]  Jason Baldridge,et al.  PAWS: Paraphrase Adversaries from Word Scrambling , 2019, NAACL.

[452]  Ryan Cotterell,et al.  A Probabilistic Generative Model of Linguistic Typology , 2019, NAACL.

[453]  Marco Baroni,et al.  The emergence of number and syntax units in LSTM language models , 2019, NAACL.

[454]  Lucy Vasserman,et al.  Nuanced Metrics for Measuring Unintended Bias with Real Data for Text Classification , 2019, WWW.

[455]  Roger Levy,et al.  Structural Supervision Improves Learning of Non-Local Grammatical Dependencies , 2019, NAACL.

[456]  Orhan Firat,et al.  Massively Multilingual Neural Machine Translation , 2019, NAACL.

[457]  Jian Sun,et al.  Induction Networks for Few-Shot Text Classification , 2019, EMNLP.

[458]  R. Thomas McCoy,et al.  Right for the Wrong Reasons: Diagnosing Syntactic Heuristics in Natural Language Inference , 2019, ACL.

[459]  Armand Joulin,et al.  Cooperative Learning of Disjoint Syntax and Semantics , 2019, NAACL.

[460]  Trevor Cohn,et al.  Massively Multilingual Transfer for NER , 2019, ACL.

[461]  Heike Adel,et al.  Adversarial Training for Satire Detection: Controlling for Confounding Variables , 2019, NAACL.

[462]  Lei Yu,et al.  Learning and Evaluating General Linguistic Intelligence , 2019, ArXiv.

[463]  Lucy Vasserman,et al.  Measuring and Mitigating Unintended Bias in Text Classification , 2018, AIES.

[464]  Alfio Gliozzo,et al.  Learning Relational Representations by Analogy using Hierarchical Siamese Networks , 2018, NAACL.

[465]  Lu Chen,et al.  DIAG-NRE: A Neural Pattern Diagnosis Framework for Distantly Supervised Neural Relation Extraction , 2018, ACL.

[466]  Samuel R. Bowman,et al.  Sentence Encoders on STILTs: Supplementary Training on Intermediate Labeled-data Tasks , 2018, ArXiv.

[467]  Graeme Hirst,et al.  Using context to identify the language of face-saving , 2018, ArgMining@EMNLP.

[468]  Stergios Chatzikyriakidis,et al.  Testing the Generalization Power of Neural Network Models across NLI Benchmarks , 2018, BlackboxNLP@ACL.

[469]  Omer Levy,et al.  pair2vec: Compositional Word-Pair Embeddings for Cross-Sentence Inference , 2018, NAACL.

[470]  Ngoc Thang Vu,et al.  Sequence-to-Sequence Models for Data-to-Text Natural Language Generation: Word- vs. Character-based Processing and Output Diversity , 2018, INLG.

[471]  Inioluwa Deborah Raji,et al.  Model Cards for Model Reporting , 2018, FAT.

[472]  Jan Snajder,et al.  Cross-Domain Detection of Abusive Language Online , 2018, ALW.

[473]  Anders Søgaard,et al.  Sentiment analysis under temporal shift , 2018, WASSA@EMNLP.

[474]  Tao Yu,et al.  Spider: A Large-Scale Human-Labeled Dataset for Complex and Cross-Domain Semantic Parsing and Text-to-SQL Task , 2018, EMNLP.

[475]  Guillaume Lample,et al.  XNLI: Evaluating Cross-lingual Sentence Representations , 2018, EMNLP.

[476]  Jason Weston,et al.  Jump to better conclusions: SCAN both left and right , 2018, BlackboxNLP@EMNLP.

[477]  Marilyn A. Walker,et al.  Can Neural Generators for Dialogue Learn Sentence Planning and Discourse Structuring? , 2018, INLG.

[478]  Hwee Tou Ng,et al.  Adaptive Semi-supervised Learning for Cross-domain Sentiment Classification , 2018, EMNLP.

[479]  Graham Neubig,et al.  MTNT: A Testbed for Machine Translation of Noisy Text , 2018, EMNLP.

[480]  Dieuwke Hupkes,et al.  Do Language Models Understand Anything? On the Ability of LSTMs to Understand Negative Polarity Items , 2018, BlackboxNLP@EMNLP.

[481]  José Camacho-Collados,et al.  WiC: the Word-in-Context Dataset for Evaluating Context-Sensitive Meaning Representations , 2018, NAACL.

[482]  Jaime G. Carbonell,et al.  Adapting Word Embeddings to New Languages with Morphological and Phonological Subword Representations , 2018, EMNLP.

[483]  Pascale Fung,et al.  Reducing Gender Bias in Abusive Language Detection , 2018, EMNLP.

[484]  Yejin Choi,et al.  SWAG: A Large-Scale Adversarial Dataset for Grounded Commonsense Inference , 2018, EMNLP.

[485]  Florian Mohnert,et al.  Under the Hood: Using Diagnostic Classifiers to Investigate and Improve how Language Models Track Agreement Information , 2018, BlackboxNLP@EMNLP.

[486]  Zachary C. Lipton,et al.  How Much Reading Does Reading Comprehension Require? A Critical Investigation of Popular Benchmarks , 2018, EMNLP.

[487]  Ralf Krestel,et al.  Aggression Identification Using Deep Learning and Data Augmentation , 2018, TRAC@COLING 2018.

[488]  Marco Baroni,et al.  Rearranging the Familiar: Testing Compositional Generalization in Recurrent Networks , 2018, BlackboxNLP@EMNLP.

[489]  Gerard de Melo,et al.  A Helping Hand: Transfer Learning for Deep Sentiment Analysis , 2018, ACL.

[490]  Ryan Cotterell,et al.  Recurrent Neural Networks in Linguistic Theory: Revisiting Pinker and Prince (1988) and the Past Tense Debate , 2018, TACL.

[491]  J. Tenenbaum Building Machines that Learn and Think Like People , 2018, AAMAS.

[492]  Ananth Balashankar,et al.  RECIPE: Applying Open Domain Question Answering to Privacy Policies , 2018, QA@ACL.

[493]  Isabelle Augenstein,et al.  Character-level Supervision for Low-resource POS Tagging , 2018, DeepLo@ACL.

[494]  Michael J. Paul,et al.  Examining Temporality in Document Classification , 2018, ACL.

[495]  Gerhard Weikum,et al.  diaNED: Time-Aware Named Entity Disambiguation for Diachronic Corpora , 2018, ACL.

[496]  Jianxin Li,et al.  Time-evolving Text Classification with Deep Neural Networks , 2018, IJCAI.

[497]  Richard Socher,et al.  The Natural Language Decathlon: Multitask Learning as Question Answering , 2018, ArXiv.

[498]  James Henderson,et al.  GILE: A Generalized Input-Label Embedding for Text Classification , 2018, TACL.

[499]  Iryna Gurevych,et al.  A Retrospective Analysis of the Fake News Challenge Stance-Detection Task , 2018, COLING.

[500]  Rui Wang,et al.  A Survey of Domain Adaptation for Neural Machine Translation , 2018, COLING.

[501]  Mari Ostendorf,et al.  Estimating Linguistic Complexity for Science Texts , 2018, BEA@NAACL-HLT.

[502]  Ryan Cotterell,et al.  Are All Languages Equally Hard to Language-Model? , 2018, NAACL.

[503]  Dragomir R. Radev,et al.  Improving Text-to-SQL Evaluation Methodology , 2018, ACL.

[504]  Joachim Bingel,et al.  Cross-lingual complex word identification with multitask learning , 2018, BEA@NAACL-HLT.

[505]  Pushpak Bhattacharyya,et al.  Leveraging Orthographic Similarity for Multilingual Neural Transliteration , 2018, TACL.

[506]  Yen-Chun Chen,et al.  Fast Abstractive Summarization with Reinforce-Selected Sentence Rewriting , 2018, ACL.

[507]  Cécile Paris,et al.  Cross-Target Stance Classification with Self-Attention Networks , 2018, ACL.

[508]  Sebastian Riedel,et al.  Behavior Analysis of NLI Models: Uncovering the Influence of Three Factors on Robustness , 2018, NAACL.

[509]  Matthias Grabmair,et al.  Towards Inference-Oriented Reading Comprehension: ParallelQA , 2018, ArXiv.

[510]  Ido Dagan,et al.  Paraphrase to Explicate: Revealing Implicit Noun-Compound Relations , 2018, ACL.

[511]  Yoav Goldberg,et al.  Breaking NLI Systems with Sentences that Require Simple Lexical Inferences , 2018, ACL.

[512]  Niranjan Balasubramanian,et al.  The Fine Line between Linguistic Generalization and Failure in Seq2Seq-Attention Models , 2018, ArXiv.

[513]  Rachel Rudinger,et al.  Hypothesis Only Baselines in Natural Language Inference , 2018, *SEMEVAL.

[514]  Timothy Baldwin,et al.  What’s in a Domain? Learning Domain-Robust Text Representations using Adversarial Training , 2018, NAACL.

[515]  Maxine Eskénazi,et al.  Zero-Shot Dialog Generation with Cross-Domain Latent Actions , 2018, SIGDIAL Conference.

[516]  Ari Rappoport,et al.  Multitask Parsing Across Semantic Representations , 2018, ACL.

[517]  Samuel R. Bowman,et al.  GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding , 2018, BlackboxNLP@EMNLP.

[518]  Sharon Goldwater,et al.  Evaluating Historical Text Normalization Systems: How Well Do They Generalize? , 2018, NAACL.

[519]  Dan Roth,et al.  End-Task Oriented Textual Entailment via Deep Explorations of Inter-Sentence Interactions , 2018, ACL.

[520]  Zhong Zhou,et al.  Massively Parallel Cross-Lingual Learning in Low-Resource Target Language Translation , 2018, WMT.

[521]  Jonathan Berant,et al.  Decoupling Structure and Lexicon for Zero-Shot Semantic Parsing , 2018, EMNLP.

[522]  Edouard Grave,et al.  Colorless Green Recurrent Networks Dream Hierarchically , 2018, NAACL.

[523]  Timnit Gebru,et al.  Datasheets for datasets , 2018, Commun. ACM.

[524]  Omer Levy,et al.  Annotation Artifacts in Natural Language Inference Data , 2018, NAACL.

[525]  Marco Baroni,et al.  Memorize or generalize? Searching for a compositional RNN in a haystack , 2018, ArXiv.

[526]  Ambedkar Dukkipati,et al.  Instance-based Inductive Deep Transfer Learning by Cross-Dataset Querying with Locality Sensitive Hashing , 2018, EMNLP.

[527]  Nitish Gupta,et al.  Neural Compositional Denotational Semantics for Question Answering , 2018, EMNLP.

[528]  Luke S. Zettlemoyer,et al.  Deep Contextualized Word Representations , 2018, NAACL.

[529]  Ruslan Salakhutdinov,et al.  Investigating the Working of Text Classifiers , 2018, COLING.

[530]  Sebastian Ruder,et al.  Universal Language Model Fine-tuning for Text Classification , 2018, ACL.

[531]  Gary Marcus,et al.  Deep Learning: A Critical Appraisal , 2018, ArXiv.

[532]  Dan Roth,et al.  Mapping to Declarative Knowledge for Word Problem Solving , 2017, TACL.

[533]  Willem H. Zuidema,et al.  Visualisation and 'diagnostic classifiers' reveal how recurrent and recursive neural networks process hierarchical structure , 2017, J. Artif. Intell. Res..

[534]  Marco Baroni,et al.  Generalization without Systematicity: On the Compositional Skills of Sequence-to-Sequence Recurrent Networks , 2017, ICML.

[535]  Brendan T. O'Connor,et al.  A Dataset and Classifier for Recognizing Social Media English , 2017, NUT@EMNLP.

[536]  Lemao Liu,et al.  Instance Weighting for Neural Machine Translation Domain Adaptation , 2017, EMNLP.

[537]  Benno Stein,et al.  Unit Segmentation of Argumentative Texts , 2017, ArgMining@EMNLP.

[538]  Mark A. Finlayson,et al.  A Simpler and More Generalizable Story Detector using Verb and Character Features , 2017, EMNLP.

[539]  Robert Malouf,et al.  Abstractive morphological learning with a recurrent neural network , 2017 .

[540]  Zhen-Hua Ling,et al.  Recurrent Neural Network-Based Sentence Encoder with Gated Attention for Natural Language Inference , 2017, RepEval@EMNLP.

[541]  Michael Strube,et al.  Using Linguistic Features to Improve the Generalization Capability of Neural Coreference Resolvers , 2017, EMNLP.

[542]  Percy Liang,et al.  Adversarial Examples for Evaluating Reading Comprehension Systems , 2017, EMNLP.

[543]  Young-Bum Kim,et al.  Domain Attention with an Ensemble of Experts , 2017, ACL.

[544]  Masao Utiyama,et al.  Sentence Embedding for Neural Machine Translation Domain Adaptation , 2017, ACL.

[545]  Omer Levy,et al.  Zero-Shot Relation Extraction via Reading Comprehension , 2017, CoNLL.

[546]  Le-Minh Nguyen,et al.  Natural Language Generation for Spoken Dialogue System using RNN Encoder-Decoder Networks , 2017, CoNLL.

[547]  Stefan Riezler,et al.  Bandit Structured Prediction for Neural Sequence-to-Sequence Learning , 2017, ACL.

[548]  Samuel R. Bowman,et al.  A Broad-Coverage Challenge Corpus for Sentence Understanding through Inference , 2017, NAACL.

[549]  Bonnie L. Webber,et al.  Detecting negation scope is easy, except when it isn’t , 2017, EACL.

[550]  Michael Strube,et al.  Lexical Features in Coreference Resolution: To be Used With Caution , 2017, ACL.

[551]  Gary Geunbae Lee,et al.  Neural sentence embedding using only in-domain sentences for out-of-domain sentence detection in dialog systems , 2017, Pattern Recognit. Lett..

[552]  Markus Freitag,et al.  Fast Domain Adaptation for Neural Machine Translation , 2016, ArXiv.

[553]  Kalina Bontcheva,et al.  Broad Twitter Corpus: A Diverse Named Entity Recognition Resource , 2016, COLING.

[554]  Fan Yang,et al.  Leveraging Multiple Domains for Sentiment Classification , 2016, COLING.

[555]  Emmanuel Dupoux,et al.  Assessing the Ability of LSTMs to Learn Syntax-Sensitive Dependencies , 2016, TACL.

[556]  Roi Reichart,et al.  Neural Structural Correspondence Learning for Domain Adaptation , 2016, CoNLL.

[557]  Richard Socher,et al.  Pointer Sentinel Mixture Models , 2016, ICLR.

[558]  Anette Frank,et al.  Modal Sense Classification At Large: Paraphrase-Driven Sense Projection, Semantically Enriched Classification Models and Cross-Genre Evaluations , 2016, LILT.

[559]  Barbara Plank,et al.  What to do about non-standard (or non-canonical) language in NLP , 2016, KONVENS.

[560]  Nanyun Peng,et al.  Multi-task Domain Adaptation for Sequence Tagging , 2016, Rep4NLP@ACL.

[561]  Brendan T. O'Connor,et al.  Demographic Dialectal Variation in Social Media: A Case Study of African-American English , 2016, EMNLP.

[562]  Nathanael Chambers,et al.  A Corpus and Cloze Evaluation for Deeper Understanding of Commonsense Stories , 2016, NAACL.

[563]  Nadir Durrani,et al.  How to Avoid Unwanted Pregnancies: Domain Adaptation using Neural Network Models , 2015, EMNLP.

[564]  Christopher Potts,et al.  A large annotated corpus for learning natural language inference , 2015, EMNLP.

[565]  Christopher Potts,et al.  Tree-Structured Composition in Neural Networks without Tree-Structured Architectures , 2015, CoCo@NIPS.

[566]  Dirk Hovy,et al.  Crowdsourcing and annotating NER for Twitter #drift , 2014, LREC.

[567]  Marco Marelli,et al.  A SICK cure for the evaluation of compositional distributional semantic models , 2014, LREC.

[568]  Jianfeng Gao,et al.  Domain Adaptation via Pseudo In-Domain Data Selection , 2011, EMNLP.

[569]  Federico Sangati,et al.  Accurate Parsing with Compact Tree-Substitution Grammars: Double-DOP , 2011, EMNLP.

[570]  Marcello Federico,et al.  Domain Adaptation for Statistical Machine Translation with Monolingual Resources , 2009, WMT@EACL.

[571]  Jason Weston,et al.  A unified architecture for natural language processing: deep neural networks with multitask learning , 2008, ICML '08.

[572]  F.C.K. Wong,et al.  Generalisation towards Combinatorial Productivity in Language Acquisition by Simple Recurrent Networks , 2007, 2007 International Conference on Integration of Knowledge Intensive Multi-Agent Systems.

[573]  John Blitzer,et al.  Biographies, Bollywood, Boom-boxes and Blenders: Domain Adaptation for Sentiment Classification , 2007, ACL.

[574]  Hal Daumé,et al.  Frustratingly Easy Domain Adaptation , 2007, ACL.

[575]  Dan Klein,et al.  Improved Inference for Unlexicalized Parsing , 2007, NAACL.

[576]  Satoshi Nakamura,et al.  Out-of-Domain Utterance Detection Using Classification Confidences of Multiple Topics , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[577]  John Blitzer,et al.  Domain Adaptation with Structural Correspondence Learning , 2006, EMNLP.

[578]  Gary F. Marcus,et al.  Connectionism: with or without rules? Response to J.L. McClelland and D.C. Plaut (1999) , 1999, Trends in Cognitive Sciences.

[579]  D. Plaut,et al.  Does generalization in infant learning implicate abstract algebra-like rules? , 1999, Trends in Cognitive Sciences.

[580]  G. Marcus Rethinking Eliminative Connectionism , 1998, Cognitive Psychology.

[581]  Ronald Rosenfeld,et al.  A maximum entropy approach to adaptive statistical language modelling , 1996, Comput. Speech Lang..

[582]  Michael Collins,et al.  A New Statistical Parser Based on Bigram Lexical Dependencies , 1996, ACL.

[583]  Gary F. Marcus,et al.  German Inflection: The Exception That Proves the Rule , 1995, Cognitive Psychology.

[584]  Hermann Ney,et al.  Improved backing-off for M-gram language modeling , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[585]  David M. Magerman Statistical Decision-Tree Models for Parsing , 1995, ACL.

[586]  Beatrice Santorini,et al.  Building a Large Annotated Corpus of English: The Penn Treebank , 1993, CL.

[587]  J. Fodor,et al.  Connectionism and cognitive architecture: A critical analysis , 1988, Cognition.

[588]  László Dezsö,et al.  Universal Grammar , 1981, Certainty in Action.

[589]  J. Berko The Child's Learning of English Morphology , 1958 .

[590]  Emmanouil Antonios Platanios,et al.  Bridging the Generalization Gap in Text-to-SQL Parsing with Schema Expansion , 2022, ACL.

[591]  Shima Asaadi,et al.  Knowledge Distillation Meets Few-Shot Learning: An Approach for Few-Shot Intent Classification Within and Across Domains , 2022, NLP4CONVAI.

[592]  Dinh Q. Phung,et al.  Domain Generalisation of NMT: Fusing Adapters with Leave-One-Domain-Out Training , 2022, FINDINGS.

[593]  Jey Han Lau,et al.  Cloze Evaluation for Deeper Understanding of Commonsense Stories in Indonesian , 2022, CSRR.

[594]  Lyle Ungar,et al.  Measuring the Language of Self-Disclosure across Corpora , 2022, FINDINGS.

[595]  M. Fomicheva,et al.  Bias Mitigation in Machine Translation Quality Estimation , 2022, ACL.

[596]  Shizhu He,et al.  Leveraging Explicit Lexico-logical Alignments in Text-to-SQL Parsing , 2022, ACL.

[597]  A. A. Krizhanovsky,et al.  SIGMORPHON–UniMorph 2022 Shared Task 0: Generalization and Typologically Diverse Morphological Inflection , 2022, SIGMORPHON.

[598]  E. Hobley,et al.  Improving Generalization of Hate Speech Detection Systems to Novel Target Groups via Domain Adaptation , 2022, WOAH.

[599]  Xiaojie Wang,et al.  Learn to Adapt for Generalized Zero-Shot Text Classification , 2022, ACL.

[600]  Yangqiu Song,et al.  Rare and Zero-shot Word Sense Disambiguation using Z-Reweighting , 2022, ACL.

[601]  Noah A. Smith,et al.  Benchmarking Generalization via In-Context Instructions on 1, 600+ Language Tasks , 2022, ArXiv.

[602]  Xuanjing Huang,et al.  Flooding-X: Improving BERT’s Resistance to Adversarial Attacks via Loss-Restricted Fine-Tuning , 2022, ACL.

[603]  Rik Koncel-Kedziorski,et al.  Cross-Lingual G EN QA: Open-Domain Question Answering with Answer Sentence Generation , 2022 .

[604]  David Jurgens,et al.  Classification without (Proper) Representation: Political Heterogeneity in Social Media and Its Implications for Classification and Behavioral Analysis , 2022, FINDINGS.

[605]  Alexander I. Rudnicky,et al.  An Empirical study to understand the Compositional Prowess of Neural Dialog Models , 2022, INSIGHTS.

[606]  Di Wu,et al.  Challenges to Open-Domain Constituency Parsing , 2022, FINDINGS.

[607]  Tiansi Dong,et al.  How Can Cross-lingual Knowledge Contribute Better to Fine-Grained Entity Typing? , 2022, FINDINGS.

[608]  Roberto Zamparelli,et al.  Multilingualism Encourages Recursion: a Transfer Study with mBERT , 2022, SIGTYP.

[609]  Mohit Bansal,et al.  GraDA: Graph Generative Data Augmentation for Commonsense Reasoning , 2022, DLG4NLP.

[610]  Matt Gardner,et al.  Impact of Pretraining Term Frequencies on Few-Shot Numerical Reasoning , 2022, EMNLP.

[611]  Shafiq R. Joty,et al.  Effective Fine-Tuning Methods for Cross-lingual Adaptation , 2021, EMNLP.

[612]  Y. Taya,et al.  Multi-Layer Random Perturbation Training for improving Model Generalization Efficiently , 2021, BLACKBOXNLP.

[613]  Snigdha Chaturvedi,et al.  How Helpful is Inverse Reinforcement Learning for Table-to-Text Generation? , 2021, ACL.

[614]  Pawan Goyal,et al.  Attribute Value Generation from Product Title using Language Models , 2021, ECNLP.

[615]  Adina Williams,et al.  Generalising to German Plural Noun Classes, from the Perspective of a Recurrent Neural Network , 2021, CONLL.

[616]  Jing Jiang,et al.  Cross-Topic Rumor Detection using Topic-Mixtures , 2021, EACL.

[617]  Hao He,et al.  Diagnosing the First-Order Logical Reasoning Ability Through LogicNLI , 2021, EMNLP.

[618]  Cane Wing-ki Leung,et al.  Improving Model Generalization: A Chinese Named Entity Recognition Case Study , 2021, ACL.

[619]  Luheng He,et al.  QA-Driven Zero-shot Slot Filling with Weak Supervision Pretraining , 2021, ACL.

[620]  E. Hinrichs,et al.  Automatic Classification of Attributes in German Adjective-Noun Phrases , 2021, IWCS.

[621]  Senja Pollak,et al.  Zero-shot Cross-lingual Content Filtering: Offensive Language and Hate Speech Detection , 2021, HACKASHOP.

[622]  Ulf Leser,et al.  Extend, don’t rebuild: Phrasing conditional graph modification as autoregressive sequence labelling , 2021, EMNLP.

[623]  Nigel Collier,et al.  Synthetic Examples Improve Cross-Target Generalization: A Study on Stance Detection on a Twitter corpus. , 2021, WASSA.

[624]  Akhil Kedia,et al.  Keep Learning: Self-supervised Meta-learning for Learning from Inference , 2021, EACL.

[625]  Colin Wilson,et al.  Were We There Already? Applying Minimal Generalization to the SIGMORPHON-UniMorph Shared Task on Cognitively Plausible Morphological Inflection , 2021, Proceedings of the 18th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology.

[626]  Yohan Lee,et al.  Improving End-to-End Task-Oriented Dialog System with A Simple Auxiliary Task , 2021, EMNLP.

[627]  Vaibhava Goel,et al.  CNNBiF: CNN-based Bigram Features for Named Entity Recognition , 2021, EMNLP.

[628]  Victor Petrén Bach Hansen,et al.  Guideline Bias in Wizard-of-Oz Dialogues , 2021, BPPF.

[629]  Gerhard Heyer,et al.  On Classifying whether Two Texts are on the Same Side of an Argument , 2021, EMNLP.

[630]  Johan Bos,et al.  Evaluating Text Generation from Discourse Representation Structures , 2021, GEM.

[631]  I. Kobayashi,et al.  Towards a Language Model for Temporal Commonsense Reasoning , 2021, RANLP.

[632]  J. Piskorski,et al.  Fine-grained Event Classification in News-like Text Snippets - Shared Task 2, CASE 2021 , 2021, CASE.

[633]  Hinrich Schütze,et al.  Multidomain Pretrained Language Models for Green NLP , 2021, ADAPTNLP.

[634]  Wenbin Hu,et al.  BanditMTL: Bandit-based Multi-task Learning for Text Classification , 2021, ACL.

[635]  Marcello Federico,et al.  A Statistical Extension of Byte-Pair Encoding , 2021, IWSLT.

[636]  Hitomi Yanaka,et al.  Assessing the Generalization Capacity of Pre-trained Language Models through Japanese Adversarial Natural Language Inference , 2021, BLACKBOXNLP.

[637]  Francis M. Tyers,et al.  SIGMORPHON 2021 Shared Task on Morphological Reinflection: Generalization Across Languages , 2021, Proceedings of the 18th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology.

[638]  Timothy J. Hazen,et al.  Increasing Robustness to Spurious Correlations using Forgettable Examples , 2021, EACL.

[639]  Proceedings of the Fourth Workshop on NLP for Internet Freedom: Censorship, Disinformation, and Propaganda , 2021 .

[640]  Ali Ghodsi,et al.  How to Select One Among All ? An Empirical Study Towards the Robustness of Knowledge Distillation in Natural Language Understanding , 2021, EMNLP.

[641]  Judith Yue Li,et al.  Semi-supervised Meta-learning for Cross-domain Few-shot Intent Classification , 2021, METANLP.

[642]  Natalie Schluter,et al.  MassiveSumm: a very large-scale, very multilingual, news summarisation dataset , 2021, EMNLP.

[643]  Dragomir R. Radev,et al.  Testing Cross-Database Semantic Parsers With Canonical Utterances , 2021, EVAL4NLP.

[644]  Baolin Peng,et al.  Few-Shot Named Entity Recognition: An Empirical Baseline Study , 2021, EMNLP.

[645]  Junlan Feng,et al.  Counterfactual Matters: Intrinsic Probing For Dialogue State Tracking , 2021, EANCS.

[646]  Edward Grefenstette,et al.  A Survey of Generalisation in Deep Reinforcement Learning , 2021, ArXiv.

[647]  Maria Barrett,et al.  Spurious Correlations in Cross-Topic Argument Mining , 2021, STARSEM.

[648]  David,et al.  IA On Learning the Past Tenses of English Verbs , 2021 .

[649]  Jose G. Moreno,et al.  Using a Frustratingly Easy Domain and Tagset Adaptation for Creating Slavic Named Entity Recognition Systems , 2021, BSNLP.

[650]  Martha Palmer,et al.  Predicate Representations and Polysemy in VerbNet Semantic Parsing , 2021, IWCS.

[651]  Xing Han,et al.  Multi-Pair Text Style Transfer for Unbalanced Data via Task-Adaptive Meta-Learning , 2021, METANLP.

[652]  Minh Le Nguyen,et al.  Learning Cross-lingual Representations for Event Coreference Resolution with Multi-view Alignment and Optimal Transport , 2021, MRL.

[653]  Wei Xu,et al.  WIKIBIAS: Detecting Multi-Span Subjective Biases in Language , 2021, EMNLP.

[654]  Nicholas Andrews,et al.  Learning Universal Authorship Representations , 2021, EMNLP.

[655]  Adam Ek,et al.  Training Strategies for Neural Multilingual Morphological Inflection , 2021, Proceedings of the 18th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology.

[656]  Pavel Pecina,et al.  Solving SCAN Tasks with Data Augmentation and Input Embeddings , 2021, RANLP.

[657]  Nigel Collier,et al.  Adversarial Training for News Stance Detection: Leveraging Signals from a Multi-Genre Corpus. , 2021, HACKASHOP.

[658]  Itzik Malkiel,et al.  Maximal Multiverse Learning for Promoting Cross-Task Generalization of Fine-Tuned Language Models , 2021, EACL.

[659]  Sarvnaz Karimi,et al.  Combining Shallow and Deep Representations for Text-Pair Classification , 2021, ALTA.

[660]  Chunyan Miao,et al.  MulDA: A Multilingual Data Augmentation Framework for Low-Resource Cross-Lingual NER , 2021, ACL.

[661]  Frankie Robertson,et al.  Word Discriminations for Vocabulary Inventory Prediction , 2021, RANLP.

[662]  Zornitsa Kozareva,et al.  Few-shot Learning with Multilingual Language Models , 2021, ArXiv.

[663]  Pararth Shah,et al.  Multi-Action Dialog Policy Learning with Interactive Human Teaching , 2020, SIGDIAL.

[664]  Roger Levy,et al.  Cloze Distillation: Improving Neural Language Models with Human Next-Word Prediction , 2020, CoNLL.

[665]  A. Waibel,et al.  Supervised Adaptation of Sequence-to-Sequence Speech Recognition Systems using Batch-Weighting , 2020, LIFELONGNLP.

[666]  S. Chatzikyriakidis,et al.  How does Punctuation Affect Neural Models in Natural Language Inference , 2020, PAM.

[667]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[668]  Anna Feldman Proceedings of the Second Workshop on Natural Language Processing for Internet Freedom: Censorship, Disinformation, and Propaganda , 2019 .

[669]  Ilya Sutskever,et al.  Language Models are Unsupervised Multitask Learners , 2019 .

[670]  Yejin Choi,et al.  An Adversarial Winograd Schema Challenge at Scale , 2019 .

[671]  Joachim Bingel,et al.  Bridging the Gaps: Multi Task Learning for Domain Transfer of Hate Speech Detection , 2018 .

[672]  Mans Hulden,et al.  A Neural Morphological Analyzer for Arapaho Verbs Learned from a Finite State Transducer , 2018 .

[673]  Gary Geunbae Lee,et al.  Out-of-domain Detection based on Generative Adversarial Network , 2018, EMNLP.

[674]  Joe Pater,et al.  Seq2Seq Models with Dropout can Learn Generalizable Reduplication , 2018 .

[675]  Heng Ji,et al.  Cross-lingual Name Tagging and Linking for 282 Languages , 2017, ACL.

[676]  Chenhui Chu,et al.  An Empirical Comparison of Domain Adaptation Methods for Neural Machine Translation , 2017, ACL.

[677]  Paul Cook,et al.  Supervised and unsupervised approaches to measuring usage similarity , 2017 .

[678]  Ines Rehbein,et al.  Authorship Attribution with Convolutional Neural Networks and POS-Eliding , 2017 .

[679]  Antal van den Bosch,et al.  Sarcastic Soulmates: Intimacy and irony markers in social media messaging , 2016, LILT.

[680]  Willem H. Zuidema,et al.  Diagnostic Classifiers Revealing how Neural Networks Process Hierarchical Structure , 2016, CoCo@NIPS.

[681]  Christopher D. Manning,et al.  Stanford Neural Machine Translation Systems for Spoken Language Domains , 2015, IWSLT.

[682]  Francisco Herrera,et al.  A unifying view on dataset shift in classification , 2012, Pattern Recognit..

[683]  J. Scott Istituto Dalle Molle di Studi Sull’Intelligenza Artificiale (IDSIA) | USI-SUPSI , 2010 .

[684]  Neil D. Lawrence,et al.  When Training and Test Sets Are Different: Characterizing Learning Transfer , 2009 .

[685]  G. Marcus The Algebraic Mind: Integrating Connectionism and Cognitive Science , 2001 .

[686]  R. Rosenfeld A Maximum Entropy Approach to Adaptive Statistical Language Modeling , 2001 .

[687]  urgen Schmidhuber Towards Compositional Learning in Dynamic Networks , 1990 .

[688]  Emily M. Bender Linguistic I Ssues in L Anguage Technology Lilt on Achieving and Evaluating Language-independence in Nlp on Achieving and Evaluating Language-independence in Nlp , 2022 .

[689]  Cees G. M. Snoek,et al.  Meta-Learning with Variational Semantic Memory for Word Sense Disambiguation , 2021, ACL.

[690]  M. de Rijke,et al.  UvA-DARE (Digital Academic Repository) Learning to Ask Conversational Questions by Optimizing Levenshtein Distance , 2022 .