Paper information - A taxonomy and review of generalization research in NLP - 字舞流文

A taxonomy and review of generalization research in NLP

Arabella J. Sinclair | Mikel Artetxe | Koustuv Sinha | D. Hupkes | Verna Dankers | Naomi Saphra | Tiago Pimentel | Khuyagbaatar Batsuren | Yanai Elazar | Christos Christodoulopoulos | Zhijing Jin | Dennis Ulmer | Maria Ryskina | Ryan Cotterell | Leila Khalatbari | Rita Frieske | Mario Giulianelli | Karim Lasri | Florian Schottmann | Kaiser Sun

[1] Yoav Goldberg,et al. Measuring Causal Effects of Data Statistics on Language Model's 'Factual' Predictions , 2022, ArXiv.

[2] Christophe Servan,et al. On the cross-lingual transferability of multilingual prototypical models across NLU tasks , 2022, METANLP.

[3] Shannon L. Spruit,et al. No Language Left Behind: Scaling Human-Centered Machine Translation , 2022, ArXiv.

[4] Ronan Le Bras,et al. Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models , 2022, ArXiv.

[5] Yulia Tsvetkov,et al. ORCA: Interpreting Prompted Language Models via Locating Supporting Data Evidence in the Ocean of Pretraining Data , 2022, ArXiv.

[6] Yusuke Oda,et al. Are Prompt-based Models Clueless? , 2022, ACL.

[7] M. Dascalu,et al. Domain Adaptation in Multilingual and Multi-Domain Monolingual Settings for Complex Word Identification , 2022, ACL.

[8] Tiago Pimentel,et al. Naturalistic Causal Probing for Morpho-Syntax , 2022, TACL.

[9] Xi Victoria Lin,et al. OPT: Open Pre-trained Transformer Language Models , 2022, ArXiv.

[10] Anders Søgaard,et al. Generalized Quantifiers as a Source of Error in Multilingual NLU Benchmarks , 2022, DADC.

[11] Arman Cohan,et al. Improving the Generalizability of Depression Detection by Leveraging Clinical Questionnaires , 2022, ACL.

[12] Jack G. M. FitzGerald,et al. MASSIVE: A 1M-Example Multilingual Natural Language Understanding Dataset with 51 Typologically-Diverse Languages , 2022, ACL.

[13] T. Poibeau,et al. Does BERT really agree? Fine-grained Analysis of Lexical Dependence on a Syntactic Task , 2022, FINDINGS.

[14] Rakesh R Menon,et al. CLUES: A Benchmark for Learning Classifiers using Natural Language Explanations , 2022, ACL.

[15] Tianchuan Du,et al. Towards Generalizeable Semantic Product Search by Text Similarity Pre-training on Search Click Logs , 2022, ECNLP.

[16] Abed Alhakim Freihat,et al. Using Linguistic Typology to Enrich Multilingual Lexicons: the Case of Lexical Gaps in Kinship , 2022, LREC.

[17] G. Neumann,et al. Few-Shot Cross-lingual Transfer for Coarse-grained De-identification of Code-Mixed Clinical Texts , 2022, BIONLP.

[18] E. Mosca,et al. “That Is a Suspicious Reaction!”: Interpreting Logits Variation to Detect NLP Adversarial Attacks , 2022, ACL.

[19] Andrew M. Dai,et al. PaLM: Scaling Language Modeling with Pathways , 2022, J. Mach. Learn. Res.

[20] Lu Wang,et al. Efficient Argument Structure Extraction with Transfer Learning and Active Learning , 2022, FINDINGS.

[21] Lisa Anne Hendricks,et al. Training Compute-Optimal Large Language Models , 2022, ArXiv.

[22] Dilek Z. Hakkani-Tür,et al. What is wrong with you?: Leveraging User Sentiment for Automatic Dialog Evaluation , 2022, FINDINGS.

[23] Dipankar Das,et al. Can Unsupervised Knowledge Transfer from Social Discussions Help Argument Mining? , 2022, ACL.

[24] Alessandro Sordoni,et al. Better Language Model with Hypernym Class Prediction , 2022, ACL.

[25] Seong Jae Hwang,et al. The Change that Matters in Discourse Parsing: Estimating the Impact of Domain Shift on Parser Error , 2022, FINDINGS.

[26] Tal Linzen,et al. Coloring the Blank Slate: Pre-training Imparts a Hierarchical Inductive Bias to Sequence-to-sequence Models , 2022, FINDINGS.

[27] Reut Tsarfaty,et al. Morphological Reinflection with Multiple Arguments: An Extended Annotation schema and a Georgian Case Study , 2022, ACL.

[28] M. Shoeybi,et al. Multi-Stage Prompting for Knowledgeable Dialogue Generation , 2022, FINDINGS.

[29] Orhan Firat,et al. Multilingual Mix: Example Interpolation Improves Multilingual Neural Machine Translation , 2022, ACL.

[30] P. Blunsom,et al. Revisiting the Compositional Generalization Abilities of Neural Sequence Models , 2022, ACL.

[31] Shafiq R. Joty,et al. Continual Few-shot Relation Learning via Embedding Space Regularization and Data Augmentation , 2022, ACL.

[32] Tao Shen,et al. ClarET: Pre-training a Correlation-Aware Context-To-Event Transformer for Event-Centric Generation and Classification , 2022, ACL.

[33] Peter A. Cholak,et al. Overcoming a Theoretical Limitation of Self-Attention , 2022, ACL.

[34] Yixin Cao,et al. Prompt for Extraction? PAIE: Prompting Argument Interaction for Event Argument Extraction , 2022, ACL.

[35] Matt Gardner,et al. Impact of Pretraining Term Frequencies on Few-Shot Reasoning , 2022, ArXiv.

[36] Alexander M. Rush,et al. PromptSource: An Integrated Development Environment and Repository for Natural Language Prompts , 2022, ACL.

[37] Reza Yazdani Aminabadi,et al. Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B, A Large-Scale Generative Language Model , 2022, ArXiv.

[38] Zhilin Yang,et al. ZeroPrompt: Scaling Prompt-Based Pretraining to 1,000 Tasks Improves Zero-Shot Generalization , 2022, EMNLP.

[39] Dragomir R. Radev,et al. UnifiedSKG: Unifying and Multi-Tasking Structured Knowledge Grounding with Text-to-Text Language Models , 2022, EMNLP.

[40] Yuri Burda,et al. Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets , 2022, ArXiv.

[41] Zoey Liu,et al. Data-driven Model Generalizability in Crosslinguistic Low-resource Morphological Segmentation , 2022, TACL.

[42] Xi Victoria Lin,et al. Few-shot Learning with Multilingual Generative Language Models , 2021, EMNLP.

[43] Xi Victoria Lin,et al. Efficient Large Scale Language Modeling with Mixtures of Experts , 2021, EMNLP.

[44] Po-Sen Huang,et al. Scaling Language Models: Methods, Analysis & Insights from Training Gopher , 2021, ArXiv.

[45] Jane A. Yu,et al. Quantifying Adaptability in Pre-trained Language Models with 500 Tasks , 2021, NAACL.

[46] Sanket Vaibhav Mehta,et al. ExT5: Towards Extreme Multi-Task Scaling for Transfer Learning , 2021, ArXiv.

[47] Edward Grefenstette,et al. A Survey of Zero-shot Generalisation in Deep Reinforcement Learning , 2021, J. Artif. Intell. Res.

[48] Dawn Song,et al. Grounded Graph Decoding Improves Compositional Generalization in Question Answering , 2021, EMNLP.

[49] Jacob Andreas,et al. How Do Neural Sequence Models Generalize? Local and Global Cues for Out-of-Distribution Prediction , 2021, EMNLP.

[50] Daniel Khashabi,et al. Hey AI, Can You Solve Complex Tasks by Talking to Agents? , 2021, FINDINGS.

[51] Sanket Vaibhav Mehta,et al. Improving Compositional Generalization with Self-Training for Data-to-Text Generation , 2021, ACL.

[52] H. Mobahi,et al. Sharpness-Aware Minimization Improves Language Model Generalization , 2021, ACL.

[53] Bin Ma,et al. A Unified Speaker Adaptation Approach for ASR , 2021, EMNLP.

[54] Phu Mon Htut,et al. BBQ: A hand-built bias benchmark for question answering , 2021, FINDINGS.

[55] Alexander M. Rush,et al. Multitask Prompted Training Enables Zero-Shot Task Generalization , 2021, ICLR.

[56] Greg Durrett,et al. ASPECTNEWS: Aspect-Oriented Summarization of News Documents , 2021, ACL.

[57] Alexander M. Fraser,et al. Why don’t people use character-level machine translation? , 2021, FINDINGS.

[58] Ashwin Srinivasan,et al. Zero-Shot Dense Retrieval with Momentum Adversarial Domain Invariant Representations , 2021, FINDINGS.

[59] Dzmitry Bahdanau,et al. LAGr: Label Aligned Graphs for Better Systematic Generalization in Semantic Parsing , 2021, ACL.

[60] Stanislas Dehaene,et al. Causal Transformers Perform Below Chance on Recursive Nested Constructions, Unlike Humans , 2021, ArXiv.

[61] Dzmitry Bahdanau,et al. Compositional Generalization in Dependency Parsing , 2021, ACL.

[62] Chenhao Tan,et al. Investigating the Effect of Natural Language Explanations on Out-of-Distribution Generalization in Few-shot NLI , 2021, INSIGHTS.

[63] Mirella Lapata,et al. Disentangled Sequence to Sequence Learning for Compositional Generalization , 2021, ACL.

[64] Zhengyuan Liu,et al. DMRST: A Joint Framework for Document-Level Multilingual RST Discourse Segmentation and Parsing , 2021, CODI.

[65] Zhengyuan Liu,et al. Improving Multi-Party Dialogue Discourse Parsing via Domain Integration , 2021, CODI.

[66] Mark O. Riedl,et al. Situated Dialogue Learning through Procedural Environment Generation , 2021, ACL.

[67] David Restrepo Amariles,et al. JuriBERT: A Masked-Language Model Adaptation for French Legal Text , 2021, NLLP.

[68] Aleksandr Drozd,et al. Generalization in NLI: Ways (Not) To Go Beyond Simple Heuristics , 2021, INSIGHTS.

[69] D. Katz,et al. LexGLUE: A Benchmark Dataset for Legal Language Understanding in English , 2021, ACL.

[70] Mohit Bansal,et al. Inducing Transformer’s Compositional Generalization Ability via Auxiliary Sequence Prediction Tasks , 2021, EMNLP.

[71] Dan Friedman,et al. Single-dataset Experts for Multi-dataset Question Answering , 2021, EMNLP.

[72] Kai-Wei Chang,et al. Relation-Guided Pre-Training for Open-Domain Question Answering , 2021, EMNLP.

[73] Kevin Gimpel,et al. On Generalization in Coreference Resolution , 2021, CRAC.

[74] Kazuma Hashimoto,et al. RNG-KBQA: Generation Augmented Iterative Ranking for Knowledge Base Question Answering , 2021, ACL.

[75] Marco Luca Sbodio,et al. Neural Unification for Logic Reasoning over Natural Language , 2021, EMNLP.

[76] Sarkar Snigdha Sarathi Das,et al. CONTaiNER: Few-Shot Named Entity Recognition via Contrastive Learning , 2021, ACL.

[77] Ellie Pavlick,et al. Frequency Effects on Syntactic Rule Learning in Transformers , 2021, EMNLP.

[78] I. Augenstein,et al. How Does Counterfactually Augmented Data Impact Models for Social Computing Constructs? , 2021, EMNLP.

[79] Xianpei Han,et al. Honey or Poison? Solving the Trigger Curse in Few-shot Event Detection via Causal Intervention , 2021, EMNLP.

[80] Albert Y.S. Lam,et al. Effectiveness of Pre-training for Few-shot Intent Classification , 2021, EMNLP.

[81] Michael J.Q. Zhang,et al. SituatedQA: Incorporating Extra-Linguistic Contexts into QA , 2021, EMNLP.

[82] Songfang Huang,et al. Raise a Child in Large Language Model: Towards Effective and Generalizable Fine-tuning , 2021, EMNLP.

[83] Matthew Purver,et al. Exploring Underexplored Limitations of Cross-Domain Text-to-SQL Generalization , 2021, EMNLP.

[84] Nikhil Ramesh,et al. Entity-Based Knowledge Conflicts in Question Answering , 2021, EMNLP.

[85] Mari Ostendorf,et al. DIALKI: Knowledge Identification in Conversational Systems through Dialogue-Document Contextualization , 2021, EMNLP.

[86] Zhou Yu,et al. Zero-Shot Dialogue State Tracking via Cross-Task Transfer , 2021, EMNLP.

[87] Qi Zhang,et al. Low-Resource Dialogue Summarization with Domain-Agnostic Multi-Source Pretraining , 2021, EMNLP.

[88] Eric Nyberg,et al. Exploring Strategies for Generalizable Commonsense Reasoning with Pre-trained Models , 2021, EMNLP.

[89] Miguel Ballesteros,et al. How much pretraining data do language models need to learn syntax? , 2021, EMNLP.

[90] Jonathan Herzig,et al. Finding needles in a haystack: Sampling Structurally-diverse Training Sets from Synthetic Data for Compositional Generalization , 2021, EMNLP.

[91] M Saiful Bari,et al. Nearest Neighbour Few-Shot Learning for Cross-lingual Classification , 2021, EMNLP.

[92] Quoc V. Le,et al. Finetuned Language Models Are Zero-Shot Learners , 2021, ICLR.

[93] S. Riedel,et al. Challenges in Generalization in Open Domain Question Answering , 2021, NAACL-HLT.

[94] Xu Sun,et al. Text AutoAugment: Learning Compositional Augmentation Policy for Text Classification , 2021, EMNLP.

[95] Einat Minkov,et al. Fight Fire with Fire: Fine-tuning Hate Detectors using Large Samples of Generated Hate Speech , 2021, EMNLP.

[96] Jin Yong Yoo,et al. Towards Improving Adversarial Training of NLP Models , 2021, EMNLP.

[97] Peng Cui,et al. Towards Out-Of-Distribution Generalization: A Survey , 2021, ArXiv.

[98] Xiaoxi Mao,et al. LOT: A Story-Centric Benchmark for Evaluating Chinese Long Text Understanding and Generation , 2021, TACL.

[99] J. Schmidhuber,et al. The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers , 2021, EMNLP.

[100] Elia Bruni,et al. The Paradox of the Compositionality of Natural Language: A Neural Machine Translation Case Study , 2021, ACL.

[101] Reut Tsarfaty,et al. (Un)solving Morphological Inflection: Lemma Overlap Artificially Inflates Models’ Performance , 2021, ACL.

[102] J. Ainslie,et al. Making Transformers Solve Compositional Tasks , 2021, ACL.

[103] Luke Zettlemoyer,et al. Noisy Channel Language Model Prompting for Few-Shot Text Classification , 2021, ACL.

[104] Olivier Bonami,et al. Not quite there yet: Combining analogical patterns and encoder-decoder networks for cognitively plausible inflection , 2021, SIGMORPHON.

[105] Panagiotis Kouris,et al. Abstractive Text Summarization: Enhancing Sequence-to-Sequence Models Using Word Sense Disambiguation and Semantic Content Generalization , 2021, CL.

[106] Ramón Fernández Astudillo,et al. Structural Guidance for Transformer Language Models , 2021, ACL.

[107] Emmanuele Chersoni,et al. Did the Cat Drink the Coffee? Challenging Transformers with Generalized Event Knowledge , 2021, STARSEM.

[108] Y. Gal,et al. Shifts: A Dataset of Real Distributional Shift Across Multiple Large-Scale Tasks , 2021, NeurIPS Datasets and Benchmarks.

[109] Kyle Lo,et al. FLEX: Unifying Evaluation for Few-Shot NLP , 2021, NeurIPS.

[110] Mingyue Han,et al. Doing Good or Doing Right? Exploring the Weakness of Commonsense Causal Reasoning Models , 2021, ACL.

[111] E. Kharitonov,et al. Can Transformers Jump Around Right in Natural Language? Assessing Performance Transfer from SCAN , 2021, BLACKBOXNLP.

[112] He He,et al. An Investigation of the (In)effectiveness of Counterfactually Augmented Data , 2021, ACL.

[113] Hongxia Jin,et al. Enhancing the generalization for Intent Classification and Out-of-Domain Detection in SLU , 2021, ACL.

[114] Rifat Shahriyar,et al. XL-Sum: Large-Scale Multilingual Abstractive Summarization for 44 Languages , 2021, FINDINGS.

[115] Matthew Richardson,et al. KaggleDBQA: Realistic Evaluation of Text-to-SQL Parsers , 2021, ACL.

[116] Pradeep Ravikumar,et al. Improving Compositional Generalization in Classification Tasks via Structure Annotations , 2021, ACL.

[117] Vivek Srikumar,et al. X-Fact: A New Benchmark Dataset for Multilingual Fact Checking , 2021, ACL.

[118] Nurul Lubis,et al. Domain-independent User Simulation with Transformers for Task-oriented Dialogue Systems , 2021, SIGDIAL.

[119] Marco Baroni. On the proper role of linguistically-oriented deep net analysis in linguistic theorizing , 2021, ArXiv.

[120] Dilek Z. Hakkani-Tür,et al. Generative Conversational Networks , 2021, SIGDIAL.

[121] Dietrich Klakow,et al. Modeling Profanity and Hate Speech in Social Media with Semantic Subspaces , 2021, WOAH.

[122] Sebastian Ruder,et al. Parameter-efficient Multi-task Fine-tuning for Transformers via Shared Hypernetworks , 2021, ACL.

[123] Marco Damonte,et al. One Semantic Parser to Parse Them All: Sequence to Sequence Multi-Task Learning on Semantic Parsing Datasets , 2021, STARSEM.

[124] Megha Srivastava,et al. Question Generation for Adaptive Education , 2021, ACL.

[125] Kenny Smith,et al. Meta-Learning to Compositionally Generalize , 2021, ACL.

[126] Jacob Andreas,et al. Lexicon Learning for Few Shot Sequence Modeling , 2021, ACL.

[127] Pascale Fung,et al. X2Parser: Cross-Lingual and Cross-Domain Framework for Task-Oriented Compositional Semantic Parsing , 2021, REPL4NLP.

[128] Ethan Gotlieb Wilcox,et al. A Targeted Assessment of Incremental Processing in Neural Language Models and Humans , 2021, ACL.

[129] Kai-Wei Chang,et al. Syntax-augmented Multilingual BERT for Cross-lingual Transfer , 2021, ACL.

[130] Milad Shokouhi,et al. A Dataset and Baselines for Multilingual Reply Suggestion , 2021, ACL.

[131] Prateek Yadav,et al. multiPRover: Generating Multiple Proofs for Improved Interpretability in Rule Reasoning , 2021, NAACL.

[132] Marten van Schijndel,et al. Uncovering Constraint-Based Behavior in Neural Models via Targeted Fine-Tuning , 2021, ACL.

[133] Chinmay Choudhary,et al. Improving the Performance of UDify with Linguistic Typology Knowledge , 2021, SIGTYP.

[134] Sarah Ita Levitan,et al. Detecting Multilingual COVID-19 Misinformation on Social Media via Contextualized Embeddings , 2021, NLP4IF.

[135] Marco Brambilla,et al. Content-based Stance Classification of Tweets about the 2020 Italian Constitutional Referendum , 2021, SOCIALNLP.

[136] Ekaterina Vylomova,et al. Anlirika: An LSTM–CNN Flow Twister for Spoken Language Identification , 2021, SIGTYP.

[137] Francis M. Tyers,et al. Do RNN States Encode Abstract Phonological Alternations? , 2021, NAACL.

[138] Ahmed Khoumsi,et al. Domain Adaptation for Arabic Cross-Domain and Cross-Dialect Sentiment Analysis from Contextualized Word Embedding , 2021, NAACL.

[139] Jacob Andreas,et al. Compositional Generalization for Neural Semantic Parsing via Span-level Supervised Attention , 2021, NAACL.

[140] Yongjing Yin,et al. On Compositional Generalization of Neural Machine Translation , 2021, ACL.

[141] Constantin Orasan,et al. An Exploratory Analysis of Multilingual Word-Level Quality Estimation with Cross-Lingual Transformers , 2021, ACL.

[142] Diyi Yang,et al. HiddenCut: Simple Data Augmentation for Natural Language Understanding with Better Generalizability , 2021, ACL.

[143] Ce Zhang,et al. Exploration and Exploitation: Two Ways to Improve Chinese Spelling Correction Models , 2021, ACL.

[144] Jakub Szymanik,et al. Language Models Use Monotonicity to Assess NPI Licensing , 2021, FINDINGS.

[145] Xiaodong Liu,et al. Super Tickets in Pre-Trained Language Models: From Model Compression to Improving Generalization , 2021, ACL.

[146] Douwe Kiela,et al. True Few-Shot Learning with Language Models , 2021, NeurIPS.

[147] Minlie Huang,et al. OpenMEVA: A Benchmark for Evaluating Open-ended Story Generation Metrics , 2021, ACL.

[148] Mingxuan Wang,et al. Learning Language Specific Sub-network for Multilingual Machine Translation , 2021, ACL.

[149] Haitao Zheng,et al. Few-NERD: A Few-shot Named Entity Recognition Dataset , 2021, ACL.

[150] Gholamreza Haffari,et al. Neural-Symbolic Commonsense Reasoner with Relation Predictors , 2021, ACL.

[151] Kathleen McKeown,et al. Adversarial Learning for Zero-Shot Stance Detection on Social Media , 2021, NAACL.

[152] I. Kobayashi,et al. OCHADAI-KYOTO at SemEval-2021 Task 1: Enhancing Model Generalization and Robustness for Lexical Complexity Prediction , 2021, SEMEVAL.

[153] Zili Zhou,et al. Encoding Explanatory Knowledge for Zero-shot Science Question Answering , 2021, IWCS.

[154] Gerasimos Lampouras,et al. Generalising Multilingual Concept-to-Text NLG with Language Agnostic Delexicalisation , 2021, ACL.

[155] Juho Lee,et al. Learning to Perturb Word Embeddings for Out-of-distribution QA , 2021, ACL.

[156] Kentaro Inui,et al. Learning to Learn to be Right for the Right Reasons , 2021, NAACL.

[157] Xiang Zhou,et al. Hidden Biases in Unreliable News Detection Datasets , 2021, EACL.

[158] Xiang Ren,et al. X-METRA-ADA: Cross-lingual Meta-Transfer learning Adaptation to Natural Language Understanding and Question Answering , 2021, NAACL.

[159] S. Riedel,et al. Fantastically Ordered Prompts and Where to Find Them: Overcoming Few-Shot Prompt Order Sensitivity , 2021, ACL.

[160] Ngoc Thang Vu,et al. AmericasNLI: Evaluating Zero-shot Natural Language Understanding of Pretrained Multilingual Models in Truly Low-resource Languages , 2021, ACL.

[161] Hannaneh Hajishirzi,et al. Cross-Task Generalization via Natural Language Crowdsourcing Instructions , 2021, ACL.

[162] Bill Yuchen Lin,et al. Learn Continually, Generalize Rapidly: Lifelong Knowledge Accumulation for Few-shot Learning , 2021, EMNLP.

[163] S. Riedel,et al. Improving Question Answering Model Robustness with Synthetic Adversarial Data Generation , 2021, EMNLP.

[164] Xiang Ren,et al. CrossFit: A Few-shot Learning Challenge for Cross-task Generalization in NLP , 2021, EMNLP.

[165] Nanyun Peng,et al. Improving Zero-Shot Cross-Lingual Transfer Learning via Robust Training , 2021, EMNLP.

[166] Oyvind Tafjord,et al. Explaining Answers with Entailment Trees , 2021, EMNLP.

[167] Marek Rei,et al. Memorisation versus Generalisation in Pre-trained Language Models , 2021, ACL.

[168] Dan Roth,et al. Back to Square One: Artifact Detection, Training and Commonsense Disentanglement in the Winograd Schema , 2021, EMNLP.

[169] Diyi Yang,et al. Structure-Aware Abstractive Conversation Summarization via Discourse and Action Graphs , 2021, NAACL.

[170] Jinlan Fu,et al. XTREME-R: Towards More Challenging and Nuanced Multilingual Evaluation , 2021, EMNLP.

[171] Jason Weston,et al. Retrieval Augmentation Reduces Hallucination in Conversation , 2021, EMNLP.

[172] Shrey Desai,et al. Span Pointer Networks for Non-Autoregressive Task-Oriented Semantic Parsing , 2021, EMNLP.

[173] Douwe Kiela,et al. Masked Language Modeling and the Distributional Hypothesis: Order Word Matters Pre-training for Little , 2021, EMNLP.

[174] Snigdha Chaturvedi,et al. Is Everything in Order? A Simple Way to Order Sentences , 2021, EMNLP.

[175] Mans Hulden,et al. Can a Transformer Pass the Wug Test? Tuning Copying Bias in Neural Morphological Inflection Models , 2021, ACL.

[176] Xuezhi Wang,et al. Continual Learning for Text Classification with Information Disentanglement Based Regularization , 2021, NAACL.

[177] Wenpeng Yin,et al. Learning to Synthesize Data for Semantic Parsing , 2021, NAACL.

[178] T. Zhao,et al. Adversarial Regularization as Stackelberg Game: An Unrolled Optimization Approach , 2021, EMNLP.

[179] Dan Klein,et al. Adapting Language Models for Zero-shot Learning by Meta-tuning on Dataset and Prompt Collections , 2021, EMNLP.

[180] Kai Yu,et al. ShadowGNN: Graph Projection Neural Network for Text-to-SQL Parser , 2021, NAACL.

[181] Tharindu Ranasinghe,et al. TransWiC at SemEval-2021 Task 2: Transformer-based Multilingual and Cross-lingual Word-in-Context Disambiguation , 2021, SEMEVAL.

[182] Zhiyi Ma,et al. Dynabench: Rethinking Benchmarking in NLP , 2021, NAACL.

[183] Erenay Dayanik,et al. Disentangling Document Topic and Author Gender in Multiple Languages: Lessons for Adversarial Debiasing , 2021, WASSA.

[184] Graham Neubig,et al. MasakhaNER: Named Entity Recognition for African Languages , 2021, TACL.

[185] Pascale Fung,et al. AdaptSum: Towards Low-Resource Domain Adaptation for Abstractive Summarization , 2021, NAACL.

[186] Timothy Baldwin,et al. Evaluating Document Coherence Modeling , 2021, TACL.

[187] David Ifeoluwa Adelani,et al. The Effect of Domain and Diacritics in Yoruba–English Neural Machine Translation , 2021, MTSUMMIT.

[188] Sanjeev Khudanpur,et al. Learning Feature Weights using Reward Modeling for Denoising Parallel Corpora , 2021, WMT.

[189] Franck Dernoncourt,et al. Towards Interpreting and Mitigating Shortcut Learning Behavior of NLU models , 2021, NAACL.

[190] Emily M. Bender,et al. On the Dangers of Stochastic Parrots: Can Language Models Be Too Big? 🦜 , 2021, FAccT.

[191] Diana Inkpen,et al. Conditional Adversarial Networks for Multi-Domain Text Classification , 2021, ADAPTNLP.

[192] Phil Blunsom,et al. Mind the Gap: Assessing Temporal Generalization in Neural Language Models , 2021, NeurIPS.

[193] Karin Verspoor,et al. Memorization vs. Generalization: Quantifying Data Leakage in NLP Performance Evaluation , 2021, EACL.

[194] Lucas Weber,et al. Language Modelling as a Multi-Task Problem , 2021, EACL.

[195] Hitomi Yanaka,et al. Exploring Transitivity in Neural NLI Models through Veridicality , 2021, EACL.

[196] Sonal Gupta,et al. Muppet: Massive Multi-task Representations with Pre-Finetuning , 2021, EMNLP.

[197] Roi Reichart,et al. Model Compression for Domain Adaptation through Causal Effect Estimation , 2021, TACL.

[198] Xiang Ren,et al. Learning to Generate Task-Specific Adapters from Task Description , 2021, ACL.

[199] Valentin Hofmann,et al. Superbizarre Is Not Superb: Derivational Morphology Improves BERT’s Interpretation of Complex Words , 2021, ACL.

[200] Jackie Chi Kit Cheung,et al. Optimizing Deeper Transformers on Small Datasets , 2020, ACL.

[201] Jianfeng Gao,et al. RADDLE: An Evaluation Benchmark and Analysis Platform for Robust Task-oriented Dialog Systems , 2020, ACL.

[202] Mohit Bansal,et al. I like fish, especially dolphins: Addressing Contradictions in Dialogue Modeling , 2020, ACL.

[203] Magdalena Biesialska,et al. Continual Lifelong Learning in Natural Language Processing: A Survey , 2020, COLING.

[204] Pang Wei Koh,et al. WILDS: A Benchmark of in-the-Wild Distribution Shifts , 2020, ICML.

[205] Yoav Goldberg,et al. Amnesic Probing: Behavioral Explanation with Amnesic Counterfactuals , 2020, TACL.

[206] Nathan Schneider,et al. Supertagging the Long Tail with Tree-Structured Decoding of Complex Categories , 2020, TACL.

[207] Jiafeng Guo,et al. Event Coreference Resolution with their Paraphrases and Argument-aware Embeddings , 2020, COLING.

[208] Valeria de Paiva,et al. Hy-NLI: a Hybrid system for Natural Language Inference , 2020, COLING.

[209] Eduard Hovy,et al. On the Systematicity of Probing Contextualized Word Representations: The Case of Hypernymy in BERT , 2020, STARSEM.

[210] Miikka Silfverberg,et al. Noise Isn’t Always Negative: Countering Exposure Bias in Sequence-to-Sequence Inflection Models , 2020, COLING.

[211] Robert Frank,et al. Sequence-to-Sequence Networks Learn the Meaning of Reflexive Anaphora , 2020, CRAC.

[212] Kentaro Inui,et al. Efficient Estimation of Influence of a Training Instance , 2020, SUSTAINLP.

[213] Matt Gardner,et al. Learning from Task Descriptions , 2020, EMNLP.

[214] Ramesh Nallapati,et al. Unsupervised Domain Adaptation for Cross-lingual Text Labeling , 2020, FINDINGS.

[215] Daniel Gillick,et al. Entity Linking in 100 Languages , 2020, EMNLP.

[216] Alexander Rush,et al. Sequence-level Mixed Sample Data Augmentation , 2020, EMNLP.

[217] Coleman Haley,et al. This is a BERT. Now there are several of them. Can they generalize to novel words? , 2020, BLACKBOXNLP.

[218] Roger Levy,et al. Investigating Novel Verb Learning in BERT: Selectional Preference Classes and Alternation-Based Syntactic Generalization , 2020, BLACKBOXNLP.

[219] Mark Dredze,et al. Do Models of Mental Health Based on Social Media Data Generalize? , 2020, FINDINGS.

[220] Richard Tobin,et al. Not a cute stroke: Analysis of Rule- and Neural Network-based Information Extraction Systems for Brain Radiology Reports , 2020, LOUHI.

[221] Yaohui Jin,et al. Modeling Content Importance for Summarization with Pre-trained Language Models , 2020, EMNLP.

[222] Yejin Choi,et al. Social Chemistry 101: Learning to Reason about Social and Moral Norms , 2020, EMNLP.

[223] Caiming Xiong,et al. The Thieves on Sesame Street Are Polyglots — Extracting Multilingual Models from Monolingual APIs , 2020, EMNLP.

[224] Saptarashmi Bandyopadhyay,et al. Natural Language Response Generation from SQL with Generalization and Back-translation , 2020, INTEXSEMPAR.

[225] Swaroop Mishra,et al. Do We Need to Create Big Datasets to Learn a Task? , 2020, SUSTAINLP.

[226] Quan Wang,et al. Event Extraction as Multi-turn Question Answering , 2020, FINDINGS.

[227] Kaiyu Huang,et al. A Joint Multiple Criteria Model in Transfer Learning for Cross-domain Chinese Word Segmentation , 2020, EMNLP.

[228] Han Wang,et al. Enhancing Generalization in Natural Language Inference by Syntax , 2020, FINDINGS.

[229] Ayan Sengupta,et al. DATAMAFIA at WNUT-2020 Task 2: A Study of Pre-trained Language Models along with Regularization Techniques for Downstream Tasks , 2020, WNUT.

[230] Yonatan Belinkov,et al. Findings of the WMT 2020 Shared Task on Machine Translation Robustness , 2020, WMT.

[231] Timothy Baldwin,et al. Target Word Masking for Location Metonymy Resolution , 2020, COLING.

[232] Jia Deng,et al. Strongly Incremental Constituency Parsing with Graph Neural Networks , 2020, NeurIPS.

[233] Dan Roth,et al. Temporal Reasoning on Implicit Events from Distant Supervision , 2020, NAACL.

[234] Greg Durrett,et al. Effective Distant Supervision for Temporal Relation Extraction , 2020, ADAPTNLP.

[235] Ming-Wei Chang,et al. Compositional Generalization and Natural Language Variation: Can a Semantic Parsing Approach Handle Both? , 2020, ACL.

[236] Xiaodong Liu,et al. Posterior Differential Regularization with f-divergence for Improving Model Robustness , 2020, NAACL.

[237] Mirella Lapata,et al. Meta-Learning for Domain Generalization in Semantic Parsing , 2020, NAACL.

[238] Jimmy J. Lin,et al. Scientific Claim Verification with VerT5erini , 2020, LOUHI.

[239] Jungo Kasai,et al. XOR QA: Cross-lingual Open-Retrieval Question Answering , 2020, NAACL.

[240] Mirella Lapata,et al. Compositional Generalization via Semantic Tagging , 2020, EMNLP.

[241] Sungjin Lee,et al. Self-Supervised Contrastive Learning for Efficient User Satisfaction Prediction in Conversational Agents , 2020, NAACL.

[242] Siva Reddy,et al. Explicitly Modeling Syntax in Language Models with Incremental Parsing and a Dynamic Oracle , 2020, NAACL.

[243] Holger Schwenk,et al. Beyond English-Centric Multilingual Machine Translation , 2020, J. Mach. Learn. Res.

[244] Xiaocheng Feng,et al. Incorporating Commonsense Knowledge into Abstractive Dialogue Summarization via Heterogeneous Graph Networks , 2020, CCL.

[245] Shrey Desai,et al. Compressive Summarization with Plausibility and Salience Modeling , 2020, EMNLP.

[246] Helen Yannakoudakis,et al. Grammatical Error Correction in Low Error Density Domains: A New Benchmark and Analyses , 2020, EMNLP.

[247] Benjamin Newman,et al. The EOS Decision and Length Extrapolation , 2020, BLACKBOXNLP.

[248] Svetlana Kiritchenko,et al. On Cross-Dataset Generalization in Automatic Detection of Online Abuse , 2020, ALW.

[249] Alessandro Raganato,et al. XL-WiC: A Multilingual Benchmark for Evaluating Semantic Contextualization , 2020, EMNLP.

[250] Roger Levy,et al. Structural Supervision Improves Few-Shot Learning and Syntactic Generalization in Neural Language Models , 2020, EMNLP.

[251] Tal Linzen,et al. COGS: A Compositional Generalization Challenge Based on Semantic Interpretation , 2020, EMNLP.

[252] Jonathan Berant,et al. Improving Compositional Generalization in Semantic Parsing , 2020, FINDINGS.

[253] Xuanjing Huang,et al. An Empirical Study of Cross-Dataset Evaluation for Neural Summarization Systems , 2020, FINDINGS.

[254] Siddharth Dalmia,et al. On Long-Tailed Phenomena in Neural Machine Translation , 2020, FINDINGS.

[255] Samuel R. Bowman,et al. Counterfactually-Augmented SNLI Training Data Does Not Yield Better Generalization Than Unaugmented Data , 2020, INSIGHTS.

[256] Kathleen McKeown,et al. Zero-Shot Stance Detection: A Dataset and Model Using Generalized Topic Representations , 2020, EMNLP.

[257] Claire Cardie,et al. WikiLingua: A New Benchmark Dataset for Multilingual Abstractive Summarization , 2020, FINDINGS.

[258] Asish Ghoshal,et al. Low-Resource Domain Adaptation for Compositional Task-Oriented Semantic Parsing , 2020, EMNLP.

[259] Iryna Gurevych,et al. Improving QA Generalization by Concurrent Modeling of Multiple Biases , 2020, FINDINGS.

[260] Yoav Goldberg,et al. Exposing Shallow Heuristics of Relation Extraction Models with Challenge Data , 2020, EMNLP.

[261] Dragomir R. Radev,et al. Universal Natural Language Processing with Limited Annotations: Try Few-shot Textual Entailment as a Start , 2020, EMNLP.

[262] Wenhu Chen,et al. KGPT: Knowledge-Grounded Pre-Training for Data-to-Text Generation , 2020, EMNLP.

[263] Ralph Weischedel,et al. Learning to Generalize for Sequential Decision Making , 2020, FINDINGS.

[264] Anette Frank,et al. X-SRL: A Parallel Cross-Lingual Semantic Role Labeling Dataset , 2020, EMNLP.

[265] Siva Reddy,et al. Measuring Systematic Generalization in Neural Proof Generation with Transformers , 2020, NeurIPS.

[266] M. Choudhury,et al. TaxiNLI: Taking a Ride up the NLU Hill , 2020, CONLL.

[267] Tiancheng Zhao,et al. SPARTA: Efficient Open-Domain Question Answering via Sparse Transformer Matching Retrieval , 2020, NAACL.

[268] Sameer Singh,et al. Paired Examples as Indirect Supervision in Latent Decision Models , 2020, EMNLP.

[269] Yejin Choi,et al. Dataset Cartography: Mapping and Diagnosing Datasets with Training Dynamics , 2020, EMNLP.

[270] Philip S. Yu,et al. Composed Variational Natural Language Generation for Few-shot Intents , 2020, FINDINGS.

[271] Kyunghyun Cho,et al. SSMBA: Self-Supervised Manifold Based Data Augmentation for Improving Out-of-Domain Robustness , 2020, EMNLP.

[272] Kumud Chauhan,et al. NEU at WNUT-2020 Task 2: Data Augmentation To Tell BERT That Death Is Not Necessarily Informative , 2020, WNUT.

[273] Christopher DuBois,et al. On the Transferability of Minimal Prediction Preserving Inputs in Question Answering , 2020, NAACL.

[274] Trapit Bansal,et al. Self-Supervised Meta-Learning for Few-Shot Natural Language Classification Tasks , 2020, EMNLP.

[275] Gregor Betz,et al. Critical Thinking for Language Models , 2020, IWCS.

[276] Jonathan Berant,et al. Span-based Semantic Parsing for Compositional Generalization , 2020, ACL.

[277] Haoran Li,et al. MTOP: A Comprehensive Multilingual Task-Oriented Semantic Parsing Benchmark , 2020, EACL.

[278] Sebastian Riedel,et al. Question and Answer Test-Train Overlap in Open-Domain Question Answering Datasets , 2020, EACL.

[279] Joachim Daiber,et al. MKQA: A Linguistically Diverse Benchmark for Multilingual Open Domain Question Answering , 2020, Transactions of the Association for Computational Linguistics.

[280] Lifu Tu,et al. An Empirical Study on Robustness to Spurious Correlations using Pre-trained Language Models , 2020, Transactions of the Association for Computational Linguistics.

[281] Tao Yu,et al. DART: Open-Domain Structured Data Record to Text Generation , 2020, NAACL.

[282] Franck Dernoncourt,et al. Exploiting the Syntax-Model Consistency for Neural Relation Extraction , 2020, ACL.

[283] Cornelia Caragea,et al. Cross-Lingual Disaster-related Multi-label Tweet Classification with Manifold Mixup , 2020, ACL.

[284] Shruti Rijhwani,et al. Temporally-Informed Analysis of Named Entity Recognition , 2020, ACL.

[285] Ming-Wei Chang,et al. Exploring Unexplored Generalization Challenges for Cross-Database Semantic Parsing , 2020, ACL.

[286] Deniz Yuret,et al. Joint Training with Semantic Role Labeling for Better Generalization in Natural Language Inference , 2020, REPL4NLP.

[287] Yoshua Bengio,et al. Compositional Generalization by Factorizing Alignment and Translation , 2020, ACL.

[288] Ryan Cotterell,et al. SIGMORPHON 2020 Shared Task 0: Typologically Diverse Morphological Inflection , 2020, SIGMORPHON.

[289] M. Marelli,et al. Mechanisms for handling nested dependencies in neural-network language models and humans , 2020, Cognition.

[290] Percy Liang,et al. Selective Question Answering under Domain Shift , 2020, ACL.

[291] Alvaro Soto,et al. Translating Natural Language Instructions for Behavioral Robot Navigation with a Multi-Head Attention Mechanism , 2020, WINLP.

[292] Rajaswa Patil,et al. LRG at SemEval-2020 Task 7: Assessing the Ability of BERT and Derivative Models to Perform Short-Edits Based Humor Grading , 2020, SEMEVAL.

[293] Mark Chen,et al. Language Models are Few-Shot Learners , 2020, NeurIPS.

[294] Uri Shalit,et al. CausaLM: Causal Model Explanation Through Counterfactual Language Models , 2020, CL.

[295] Aakanksha Naik,et al. Towards Open Domain Event Trigger Identification using Adversarial Domain Adaptation , 2020, ACL.

[296] Mihir Kale,et al. Text-to-Text Pre-Training for Data-to-Text Tasks , 2020, INLG.

[297] Adam Lopez,et al. Inflecting When There’s No Majority: Limitations of Encoder-Decoder Neural Networks as Cognitive Models for German Plurals , 2020, ACL.

[298] Peter Szolovits,et al. Entity-Enriched Neural Models for Clinical Question Answering , 2020, BIONLP.

[299] Dilek Z. Hakkani-Tür,et al. Schema-Guided Natural Language Generation , 2020, INLG.

[300] Koustuv Sinha,et al. Probing Linguistic Systematicity , 2020, ACL.

[301] Sameer Singh,et al. Beyond Accuracy: Behavioral Testing of NLP Models with CheckList , 2020, ACL.

[302] Roger P. Levy,et al. A Systematic Assessment of Syntactic Generalization in Neural Language Models , 2020, ACL.

[303] Tal Linzen,et al. How Can We Accelerate Progress Towards Human-like Linguistic Generalization? , 2020, ACL.

[304] Bill Yuchen Lin,et al. RICA: Evaluating Robust Inference Capabilities Based on Commonsense Axioms , 2020, EMNLP.

[305] Hannaneh Hajishirzi,et al. UnifiedQA: Crossing Format Boundaries With a Single QA System , 2020, FINDINGS.

[306] Emily Denton,et al. Social Biases in NLP Models as Barriers for Persons with Disabilities , 2020, ACL.

[307] Jianfeng Gao,et al. RMM: A Recursive Mental Model for Dialog Navigation , 2020, FINDINGS.

[308] Xiang Ren,et al. Teaching Machine Comprehension with Compositional Explanations , 2020, FINDINGS.

[309] Anders Søgaard,et al. We Need To Talk About Random Splits , 2020, EACL.

[310] Piotr Rybak,et al. KLEJ: Comprehensive Benchmark for Polish Language Understanding , 2020, ACL.

[311] Xiang Yue,et al. Clinical Reading Comprehension: A Thorough Analysis of the emrQA Dataset , 2020, ACL.

[312] Bernard J. Jansen,et al. A Multi-Platform Arabic News Comment Dataset for Offensive Language Detection , 2020, LREC.

[313] A. Korhonen,et al. XCOPA: A Multilingual Dataset for Causal Commonsense Reasoning , 2020, EMNLP.

[314] Arzucan Özgür,et al. Analyzing ELMo and DistilBERT on Socio-political News Classification , 2020, AESPEN.

[315] Afroz Ahamad,et al. AccentDB: A Database of Non-Native English Accents to Assist Neural Speech Recognition , 2020, LREC.

[316] Michael J. Paul,et al. Why Overfitting Isn’t Always Bad: Retrofitting Cross-Lingual Word Embeddings to Dictionaries , 2020, ACL.

[317] Chitta Baral,et al. Self-Supervised Knowledge Triplet Learning for Zero-shot Question Answering , 2020, EMNLP.

[318] Greg Durrett,et al. Robust Question Answering Through Sub-part Alignment , 2020, NAACL.

[319] Nathan Schneider,et al. Lexical Semantic Recognition , 2020, MWE.

[320] Dan Jurafsky,et al. Learning Music Helps You Read: Using Transfer to Study Linguistic Structure in Language Models , 2020, EMNLP.

[321] Sylvain Lamprier,et al. MLSUM: The Multilingual Summarization Corpus , 2020, EMNLP.

[322] Michael A. Lepori,et al. Representations of Syntax [MASK] Useful: Effects of Constituency and Dependency Structure in Recursive LSTMs , 2020, ACL.

[323] Rudolf Rosa,et al. Universal Dependencies according to BERT: both more specific and more general , 2020, FINDINGS.

[324] Christopher Potts,et al. Neural Natural Language Inference Models Partially Embed Theories of Lexical Entailment and Negation , 2020, BLACKBOXNLP.

[325] Veselin Stoyanov,et al. General Purpose Text Embeddings from Pre-trained Language Models for Scalable Inference , 2020, FINDINGS.

[326] Rico Sennrich,et al. Improving Massively Multilingual Neural Machine Translation and Zero-Shot Translation , 2020, ACL.

[327] R. Thomas McCoy,et al. Syntactic Data Augmentation Increases Robustness to Inference Heuristics , 2020, ACL.

[328] Yu Hong,et al. DuReader_robust: A Chinese Dataset Towards Evaluating Robustness and Generalization of Machine Reading Comprehension in Real-World Applications , 2020, ACL.

[329] Doug Downey,et al. Don’t Stop Pretraining: Adapt Language Models to Domains and Tasks , 2020, ACL.

[330] Sampo Pyysalo,et al. Universal Dependencies v2: An Evergrowing Multilingual Treebank Collection , 2020, LREC.

[331] Xiao Huang,et al. TriggerNER: Learning with Entity Triggers as Explanations for Named Entity Recognition , 2020, ACL.

[332] Tim Rocktäschel,et al. There is Strength in Numbers: Avoiding the Hypothesis-Only Bias in Natural Language Inference via Ensemble Adversarial Training , 2020, ArXiv.

[333] Mohit Bansal,et al. Adversarial Augmentation Policy Search for Domain and Cross-Lingual Generalization in Reading Comprehension , 2020, FINDINGS.

[334] Dilek Z. Hakkani-Tür,et al. From Machine Reading Comprehension to Dialogue State Tracking: Bridging the Gap , 2020, NLP4CONVAI.

[335] Dawn Song,et al. Pretrained Transformers Improve Out-of-Distribution Robustness , 2020, ACL.

[336] Tatsuya Kawahara,et al. Designing Precise and Robust Dialogue Response Evaluators , 2020, ACL.

[337] Zhiyuan Liu,et al. More Data, More Relations, More Context and More Openness: A Review and Outlook for Relation Extraction , 2020, AACL.

[338] Noah A. Smith,et al. Evaluating Models’ Local Decision Boundaries via Contrast Sets , 2020, FINDINGS.

[339] Xiujun Li,et al. Optimus: Organizing Sentences via Pre-trained Modeling of a Latent Space , 2020, EMNLP.

[340] Xiaodong Fan,et al. XGLUE: A New Benchmark Dataset for Cross-lingual Pre-training, Understanding and Generation , 2020, EMNLP.

[341] Kentaro Inui,et al. Do Neural Models Learn Systematicity of Monotonicity Inference in Natural Language? , 2020, ACL.

[342] Weijia Xu,et al. End-to-End Slot Alignment and Recognition for Cross-Lingual NLU , 2020, EMNLP.

[343] Orhan Firat,et al. XTREME: A Massively Multilingual Multi-task Benchmark for Evaluating Cross-lingual Generalization , 2020, ICML.

[344] Elena Kochkina,et al. Cost-Sensitive BERT for Generalisable Sentence Classification on Imbalanced Data , 2020, EMNLP.

[345] Armando Solar-Lezama,et al. Learning Compositional Rules via Neural Program Synthesis , 2020, NeurIPS.

[346] Eunsol Choi,et al. TyDi QA: A Benchmark for Information-Seeking Question Answering in Typologically Diverse Languages , 2020, Transactions of the Association for Computational Linguistics.

[347] Jianfeng Gao,et al. Few-shot Natural Language Generation for Task-Oriented Dialog , 2020, FINDINGS.

[348] Pasquale Minervini,et al. Undersensitivity in Neural Reading Comprehension , 2020, FINDINGS.

[349] Bill Yuchen Lin,et al. CommonGen: A Constrained Text Generation Challenge for Generative Commonsense Reasoning , 2020, FINDINGS.

[350] Sebastian Riedel,et al. Beat the AI: Investigating Adversarial Human Annotation for Reading Comprehension , 2020, Transactions of the Association for Computational Linguistics.

[351] Ryan Cotterell,et al. Parameter Space Factorization for Zero-Shot Learning across Tasks and Languages , 2020, Transactions of the Association for Computational Linguistics.

[352] Timo Schick,et al. Exploiting Cloze-Questions for Few-Shot Text Classification and Natural Language Inference , 2020, EACL.

[353] R. Thomas McCoy,et al. Does Syntax Need to Grow on Trees? Sources of Hierarchical Inductive Bias in Sequence-to-Sequence Networks , 2020, TACL.

[354] Yoav Goldberg,et al. oLMpics - On What Language Model Pre-training Captures , 2019, Transactions of the Association for Computational Linguistics.

[355] Xiao Wang,et al. Measuring Compositional Generalization: A Comprehensive Method on Realistic Data , 2019, ICLR.

[356] Benjamin Van Durme,et al. Reading the Manual: Event Extraction as Definition Comprehension , 2019, SPNLP.

[357] Samuel R. Bowman,et al. BLiMP: The Benchmark of Linguistic Minimal Pairs for English , 2019, Transactions of the Association for Computational Linguistics.

[358] Frank F. Xu,et al. How Can We Know What Language Models Know? , 2019, Transactions of the Association for Computational Linguistics.

[359] Khalil Mrini,et al. Rethinking Self-Attention: Towards Interpretability in Neural Parsing , 2019, FINDINGS.

[360] A. McCallum,et al. Learning to Few-Shot Learn Across Diverse Natural Language Classification Tasks , 2019, COLING.

[361] Elia Bruni,et al. Location Attention for Extrapolation to Longer Sequences , 2019, ACL.

[362] Xiaodong Liu,et al. RAT-SQL: Relation-Aware Schema Encoding and Linking for Text-to-SQL Parsers , 2019, ACL.

[363] Jianfeng Gao,et al. SMART: Robust and Efficient Fine-Tuning for Pre-trained Natural Language Models through Principled Regularized Optimization , 2019, ACL.

[364] R. Thomas McCoy,et al. BERTs of a feather do not generalize together: Large variability in generalization across models with similar test set performance , 2019, BLACKBOXNLP.

[365] Florian Metze,et al. On Compositionality in Neural Machine Translation , 2019, ArXiv.

[366] François Yvon,et al. Generic and Specialized Word Embeddings for Multi-Domain Machine Translation , 2019, IWSLT.

[367] Yaser Al-Onaizan,et al. Robustness to Capitalization Errors in Named Entity Recognition , 2019, EMNLP.

[368] Tomoki Taniguchi,et al. CLER: Cross-task Learning with Expert Representation to Generalize Reading and Understanding , 2019, EMNLP.

[369] Haoyang Huang,et al. Improving the Robustness of Deep Reading Comprehension Models by Leveraging Syntax Prior , 2019, EMNLP.

[370] Christopher Potts,et al. Posing Fair Generalization Tasks for Natural Language Inference , 2019, EMNLP.

[371] Yan Xu,et al. Generalizing Question Answering System with Pre-trained Language Model Fine-tuning , 2019, EMNLP.

[372] Fenglin Liu,et al. Self-Adaptive Scaling for Learnable Residual Structure , 2019, CoNLL.

[373] Maria Chang,et al. Graph Enhanced Cross-Domain Text-to-SQL Generation , 2019, EMNLP.

[374] Mikel Artetxe,et al. On the Cross-lingual Transferability of Monolingual Representations , 2019, ACL.

[375] Ryan Cotterell,et al. The SIGMORPHON 2019 Shared Task: Morphological Analysis in Context and Cross-Lingual Transfer for Inflection , 2019, Proceedings of the 16th Workshop on Computational Research in Phonetics, Phonology, and Morphology.

[376] Peter J. Liu,et al. Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer , 2019, J. Mach. Learn. Res.

[377] Danqi Chen,et al. MRQA 2019 Shared Task: Evaluating Generalization in Reading Comprehension , 2019, EMNLP.

[378] Donggyu Kim,et al. Domain-agnostic Question-Answering with Adversarial Training , 2019, EMNLP.

[379] Ido Dagan,et al. Diversify Your Datasets: Analyzing Generalization via Controlled Variance in Adversarial Datasets , 2019, CoNLL.

[380] Holger Schwenk,et al. MLQA: Evaluating Cross-lingual Extractive Question Answering , 2019, ACL.

[381] Liang Zhao,et al. Compositional Generalization for Primitive Substitutions , 2019, EMNLP.

[382] Roi Blanco,et al. Book QA: Stories of Challenges and Opportunities , 2019, EMNLP.

[383] Florian Schmidt. Generalization in Generation: A closer look at Exposure Bias , 2019, EMNLP.

[384] Frédéric Béchet,et al. Robust Semantic Parsing with Adversarial Learning for Domain Generalization , 2019, NAACL.

[385] Tom M. Mitchell,et al. Look-up and Adapt: A One-shot Semantic Parser , 2019, EMNLP.

[386] Hal Daumé,et al. Global Voices: Crossing Borders in Automatic News Summarization , 2019, EMNLP.

[387] Graham Neubig,et al. Domain Differential Adaptation for Neural Machine Translation , 2019, EMNLP.

[388] Marcus Bishop,et al. Learning Invariant Representations of Social Media Users , 2019, EMNLP.

[389] Zachary Chase Lipton,et al. Learning the Difference that Makes a Difference with Counterfactually-Augmented Data , 2019, ICLR.

[390] Antonia Baumann,et al. Multilingual Language Models for Named Entity Recognition in German and English , 2019, RANLP.

[391] Rabeeh Karimi Mahabadi,et al. End-to-End Bias Mitigation by Modelling Biases in Corpora , 2019, ACL.

[392] Jason Weston,et al. Finding Generalizable Evidence by Learning to Convince Q&A Models , 2019, EMNLP.

[393] Mirella Lapata,et al. Learning Semantic Parsers from Denotations with Latent Structured Alignments and Abstract Programs , 2019, EMNLP.

[394] Hung-yi Lee,et al. LAMOL: LAnguage MOdeling for Lifelong Language Learning , 2019, ICLR.

[395] Ryan Cotterell,et al. Don’t Forget the Long Tail! A Comprehensive Analysis of Morphological Generalization in Bilingual Lexicon Induction , 2019, EMNLP.

[396] Shikha Bordia,et al. Investigating BERT’s Knowledge of Language: Five Analysis Methods with NPIs , 2019, EMNLP.

[397] Nanyun Peng,et al. The Woman Worked as a Babysitter: On Biases in Language Generation , 2019, EMNLP.

[398] Noah A. Smith,et al. Topics to Avoid: Demoting Latent Confounds in Text Classification , 2019, EMNLP.

[399] Jason Baldridge,et al. Learning Dense Representations for Entity Retrieval , 2019, CoNLL.

[400] Todor Mihaylov,et al. Discourse-Aware Semantic Self-Attention for Narrative Reading Comprehension , 2019, EMNLP.

[401] Emiel Krahmer,et al. Neural data-to-text generation: A comparison between pipeline and end-to-end architectures , 2019, EMNLP.

[402] Elia Bruni,et al. Compositionality Decomposed: How do Neural Networks Generalise? , 2019, J. Artif. Intell. Res.

[403] Yoav Goldberg,et al. Are We Modeling the Task or the Annotator? An Investigation of Annotator Bias in Natural Language Understanding Datasets , 2019, EMNLP.

[404] Fei Liu,et al. MoverScore: Text Generation Evaluating with Contextualized Embeddings and Earth Mover Distance , 2019, EMNLP.

[405] Jason Baldridge,et al. PAWS-X: A Cross-lingual Adversarial Dataset for Paraphrase Identification , 2019, EMNLP.

[406] Hazem M. Hajj,et al. Improved Generalization of Arabic Text Classifiers , 2019, WANLP@ACL 2019.

[407] Yang Yu,et al. Out-of-Domain Detection for Low-Resource Text Classification Tasks , 2019, EMNLP.

[408] José Manuél Gómez-Pérez,et al. An Empirical Study on Pre-trained Embeddings and Language Models for Bot Detection , 2019, RepL4NLP@ACL.

[409] Hazem M. Hajj,et al. hULMonA: The Universal Language Model in Arabic , 2019, WANLP@ACL 2019.

[410] Pascale Fung,et al. Learning Multilingual Meta-Embeddings for Code-Switching Named Entity Recognition , 2019, RepL4NLP@ACL.

[411] Joelle Pineau,et al. CLUTRR: A Diagnostic Benchmark for Inductive Reasoning from Text , 2019, EMNLP.

[412] Omer Levy,et al. RoBERTa: A Robustly Optimized BERT Pretraining Approach , 2019, ArXiv.

[413] Dan Klein,et al. Cross-Domain Generalization of Neural Constituency Parsers , 2019, ACL.

[414] Yan Song,et al. Knowledge-aware Pronoun Coreference Resolution , 2019, ACL.

[415] Dan Roth,et al. Zero-Shot Open Entity Typing as Type-Compatible Grounding , 2019, EMNLP.

[416] Youmna Farag,et al. Multi-Task Learning for Coherence Modeling , 2019, ACL.

[417] Marie-Catherine de Marneffe,et al. Do You Know That Florence Is Packed with Visitors? Evaluating State-of-the-art Models of Speaker Commitment , 2019, ACL.

[418] Rick Siow Mong Goh,et al. Dual Adversarial Neural Transfer for Low-Resource Named Entity Recognition , 2019, ACL.

[419] Tong Zhang,et al. Reinforced Training Data Selection for Domain Adaptation , 2019, ACL.

[420] Michael J. Paul,et al. Neural Temporality Adaptation for Document Classification: Diachronic Word Embeddings and Domain Adaptation Models , 2019, ACL.

[421] Kyle Gorman,et al. We Need to Talk about Standard Splits , 2019, ACL.

[422] Maosong Sun,et al. XQA: A Cross-lingual Open-domain Question Answering Dataset , 2019, ACL.

[423] Goran Glavas,et al. Multilingual and Cross-Lingual Graded Lexical Entailment , 2019, ACL.

[424] Partha Pratim Talukdar,et al. Zero-shot Word Sense Disambiguation using Sense Definition Embeddings , 2019, ACL.

[425] Eser Kandogan,et al. HEIDL: Learning Linguistic Expressions with Deep Learning and Human-in-the-Loop , 2019, ACL.

[426] Charibeth Cheng,et al. Localization of Fake News Detection via Multitask Transfer Learning , 2019, LREC.

[427] Ming-Wei Chang,et al. Zero-Shot Entity Linking by Reading Entity Descriptions , 2019, ACL.

[428] Johan Bos,et al. Can Neural Networks Understand Monotonicity Reasoning? , 2019, BlackboxNLP@ACL.

[429] Lonneke van der Plas,et al. Learning to Predict Novel Noun-Noun Compounds , 2019, MWE-WN@ACL.

[430] Andrew McCallum,et al. Energy and Policy Considerations for Deep Learning in NLP , 2019, ACL.

[431] A. Korhonen,et al. Are we there yet? Encoder-decoder neural networks as cognitive models of English past tense inflection , 2019, ACL.

[432] Elia Bruni,et al. Transcoding Compositionally: Using Attention to Find More Generalizable Solutions , 2019, BlackboxNLP@ACL.

[433] Eva Schlinger,et al. How Multilingual is Multilingual BERT? , 2019, ACL.

[434] Jaime Carbonell,et al. Domain Adaptation of Neural Machine Translation by Lexicon Induction , 2019, ACL.

[435] Mathijs Mul,et al. Siamese recurrent networks learn first-order logic reasoning and exhibit zero-shot compositional generalization , 2019, ArXiv.

[436] Dragomir R. Radev,et al. SParC: Cross-Domain Semantic Parsing in Context , 2019, ACL.

[437] Dan Roth,et al. Improving Generalization in Coreference Resolution via Adversarial Training , 2019, *SEM.

[438] Jonathan Berant,et al. MultiQA: An Empirical Investigation of Generalization and Transfer in Reading Comprehension , 2019, ACL.

[439] Jackie Chi Kit Cheung,et al. A Cross-Domain Transferable Neural Coherence Model , 2019, ACL.

[440] Samuel R. Bowman,et al. Human vs. Muppet: A Conservative Estimate of Human Performance on the GLUE Benchmark , 2019, ACL.

[441] Marco Baroni,et al. CNNs found to jump around more skillfully than RNNs: Compositional Generalization in Seq2seq Convolutional Networks , 2019, ACL.

[442] Marcelo Finger,et al. A logical-based corpus for cross-lingual evaluation , 2019, EMNLP.

[443] Omer Levy,et al. SuperGLUE: A Stickier Benchmark for General-Purpose Language Understanding Systems , 2019, NeurIPS.

[444] William Yang Wang,et al. Few-Shot NLG with Pre-Trained Language Model , 2019, ACL.

[445] Mark Dredze,et al. Beto, Bentz, Becas: The Surprising Cross-Lingual Effectiveness of BERT , 2019, EMNLP.

[446] Jack Hessel,et al. Something’s Brewing! Early Prediction of Controversy-causing Posts from Discussion Features , 2019, NAACL.

[447] Graham Neubig,et al. Density Matching for Bilingual Word Embedding , 2019, NAACL.

[448] Ankur P. Parikh,et al. Consistency by Agreement in Zero-Shot Neural Machine Translation , 2019, NAACL.

[449] Mai ElSherief,et al. Learning to Decipher Hate Symbols , 2019, NAACL.

[450] Pushmeet Kohli,et al. Analysing Mathematical Reasoning Abilities of Neural Models , 2019, ICLR.

[451] Jason Baldridge,et al. PAWS: Paraphrase Adversaries from Word Scrambling , 2019, NAACL.

[452] Ryan Cotterell,et al. A Probabilistic Generative Model of Linguistic Typology , 2019, NAACL.

[453] Marco Baroni,et al. The emergence of number and syntax units in LSTM language models , 2019, NAACL.

[454] Lucy Vasserman,et al. Nuanced Metrics for Measuring Unintended Bias with Real Data for Text Classification , 2019, WWW.

[455] Roger Levy,et al. Structural Supervision Improves Learning of Non-Local Grammatical Dependencies , 2019, NAACL.

[456] Orhan Firat,et al. Massively Multilingual Neural Machine Translation , 2019, NAACL.

[457] Jian Sun,et al. Induction Networks for Few-Shot Text Classification , 2019, EMNLP.

[458] R. Thomas McCoy,et al. Right for the Wrong Reasons: Diagnosing Syntactic Heuristics in Natural Language Inference , 2019, ACL.

[459] Armand Joulin,et al. Cooperative Learning of Disjoint Syntax and Semantics , 2019, NAACL.

[460] Trevor Cohn,et al. Massively Multilingual Transfer for NER , 2019, ACL.

[461] Heike Adel,et al. Adversarial Training for Satire Detection: Controlling for Confounding Variables , 2019, NAACL.

[462] Lei Yu,et al. Learning and Evaluating General Linguistic Intelligence , 2019, ArXiv.

[463] Lucy Vasserman,et al. Measuring and Mitigating Unintended Bias in Text Classification , 2018, AIES.

[464] Alfio Gliozzo,et al. Learning Relational Representations by Analogy using Hierarchical Siamese Networks , 2018, NAACL.

[465] Lu Chen,et al. DIAG-NRE: A Neural Pattern Diagnosis Framework for Distantly Supervised Neural Relation Extraction , 2018, ACL.

[466] Samuel R. Bowman,et al. Sentence Encoders on STILTs: Supplementary Training on Intermediate Labeled-data Tasks , 2018, ArXiv.

[467] Graeme Hirst,et al. Using context to identify the language of face-saving , 2018, ArgMining@EMNLP.

[468] Stergios Chatzikyriakidis,et al. Testing the Generalization Power of Neural Network Models across NLI Benchmarks , 2018, BlackboxNLP@ACL.

[469] Omer Levy,et al. pair2vec: Compositional Word-Pair Embeddings for Cross-Sentence Inference , 2018, NAACL.

[470] Ngoc Thang Vu,et al. Sequence-to-Sequence Models for Data-to-Text Natural Language Generation: Word- vs. Character-based Processing and Output Diversity , 2018, INLG.

[471] Inioluwa Deborah Raji,et al. Model Cards for Model Reporting , 2018, FAT.

[472] Jan Snajder,et al. Cross-Domain Detection of Abusive Language Online , 2018, ALW.

[473] Anders Søgaard,et al. Sentiment analysis under temporal shift , 2018, WASSA@EMNLP.

[474] Tao Yu,et al. Spider: A Large-Scale Human-Labeled Dataset for Complex and Cross-Domain Semantic Parsing and Text-to-SQL Task , 2018, EMNLP.

[475] Guillaume Lample,et al. XNLI: Evaluating Cross-lingual Sentence Representations , 2018, EMNLP.

[476] Jason Weston,et al. Jump to better conclusions: SCAN both left and right , 2018, BlackboxNLP@EMNLP.

[477] Marilyn A. Walker,et al. Can Neural Generators for Dialogue Learn Sentence Planning and Discourse Structuring? , 2018, INLG.

[478] Hwee Tou Ng,et al. Adaptive Semi-supervised Learning for Cross-domain Sentiment Classification , 2018, EMNLP.

[479] Graham Neubig,et al. MTNT: A Testbed for Machine Translation of Noisy Text , 2018, EMNLP.

[480] Dieuwke Hupkes,et al. Do Language Models Understand Anything? On the Ability of LSTMs to Understand Negative Polarity Items , 2018, BlackboxNLP@EMNLP.

[481] José Camacho-Collados,et al. WiC: the Word-in-Context Dataset for Evaluating Context-Sensitive Meaning Representations , 2018, NAACL.

[482] Jaime G. Carbonell,et al. Adapting Word Embeddings to New Languages with Morphological and Phonological Subword Representations , 2018, EMNLP.

[483] Pascale Fung,et al. Reducing Gender Bias in Abusive Language Detection , 2018, EMNLP.

[484] Yejin Choi,et al. SWAG: A Large-Scale Adversarial Dataset for Grounded Commonsense Inference , 2018, EMNLP.

[485] Florian Mohnert,et al. Under the Hood: Using Diagnostic Classifiers to Investigate and Improve how Language Models Track Agreement Information , 2018, BlackboxNLP@EMNLP.

[486] Zachary C. Lipton,et al. How Much Reading Does Reading Comprehension Require? A Critical Investigation of Popular Benchmarks , 2018, EMNLP.

[487] Ralf Krestel,et al. Aggression Identification Using Deep Learning and Data Augmentation , 2018, TRAC@COLING 2018.

[488] Marco Baroni,et al. Rearranging the Familiar: Testing Compositional Generalization in Recurrent Networks , 2018, BlackboxNLP@EMNLP.

[489] Gerard de Melo,et al. A Helping Hand: Transfer Learning for Deep Sentiment Analysis , 2018, ACL.

[490] Ryan Cotterell,et al. Recurrent Neural Networks in Linguistic Theory: Revisiting Pinker and Prince (1988) and the Past Tense Debate , 2018, TACL.

[491] J. Tenenbaum. Building Machines that Learn and Think Like People , 2018, AAMAS.

[492] Ananth Balashankar,et al. RECIPE: Applying Open Domain Question Answering to Privacy Policies , 2018, QA@ACL.

[493] Isabelle Augenstein,et al. Character-level Supervision for Low-resource POS Tagging , 2018, DeepLo@ACL.

[494] Michael J. Paul,et al. Examining Temporality in Document Classification , 2018, ACL.

[495] Gerhard Weikum,et al. diaNED: Time-Aware Named Entity Disambiguation for Diachronic Corpora , 2018, ACL.

[496] Jianxin Li,et al. Time-evolving Text Classification with Deep Neural Networks , 2018, IJCAI.

[497] Richard Socher,et al. The Natural Language Decathlon: Multitask Learning as Question Answering , 2018, ArXiv.

[498] James Henderson,et al. GILE: A Generalized Input-Label Embedding for Text Classification , 2018, TACL.

[499] Iryna Gurevych,et al. A Retrospective Analysis of the Fake News Challenge Stance-Detection Task , 2018, COLING.

[500] Rui Wang,et al. A Survey of Domain Adaptation for Neural Machine Translation , 2018, COLING.

[501] Mari Ostendorf,et al. Estimating Linguistic Complexity for Science Texts , 2018, BEA@NAACL-HLT.

[502] Ryan Cotterell,et al. Are All Languages Equally Hard to Language-Model? , 2018, NAACL.

[503] Dragomir R. Radev,et al. Improving Text-to-SQL Evaluation Methodology , 2018, ACL.

[504] Joachim Bingel,et al. Cross-lingual complex word identification with multitask learning , 2018, BEA@NAACL-HLT.

[505] Pushpak Bhattacharyya,et al. Leveraging Orthographic Similarity for Multilingual Neural Transliteration , 2018, TACL.

[506] Yen-Chun Chen,et al. Fast Abstractive Summarization with Reinforce-Selected Sentence Rewriting , 2018, ACL.

[507] Cécile Paris,et al. Cross-Target Stance Classification with Self-Attention Networks , 2018, ACL.

[508] Sebastian Riedel,et al. Behavior Analysis of NLI Models: Uncovering the Influence of Three Factors on Robustness , 2018, NAACL.

[509] Matthias Grabmair,et al. Towards Inference-Oriented Reading Comprehension: ParallelQA , 2018, ArXiv.

[510] Ido Dagan,et al. Paraphrase to Explicate: Revealing Implicit Noun-Compound Relations , 2018, ACL.

[511] Yoav Goldberg,et al. Breaking NLI Systems with Sentences that Require Simple Lexical Inferences , 2018, ACL.

[512] Niranjan Balasubramanian,et al. The Fine Line between Linguistic Generalization and Failure in Seq2Seq-Attention Models , 2018, ArXiv.

[513] Rachel Rudinger,et al. Hypothesis Only Baselines in Natural Language Inference , 2018, *SEM.

[514] Timothy Baldwin,et al. What’s in a Domain? Learning Domain-Robust Text Representations using Adversarial Training , 2018, NAACL.

[515] Maxine Eskénazi,et al. Zero-Shot Dialog Generation with Cross-Domain Latent Actions , 2018, SIGDIAL Conference.

[516] Ari Rappoport,et al. Multitask Parsing Across Semantic Representations , 2018, ACL.

[517] Samuel R. Bowman,et al. GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding , 2018, BlackboxNLP@EMNLP.

[518] Sharon Goldwater,et al. Evaluating Historical Text Normalization Systems: How Well Do They Generalize? , 2018, NAACL.

[519] Dan Roth,et al. End-Task Oriented Textual Entailment via Deep Explorations of Inter-Sentence Interactions , 2018, ACL.

[520] Zhong Zhou,et al. Massively Parallel Cross-Lingual Learning in Low-Resource Target Language Translation , 2018, WMT.

[521] Jonathan Berant,et al. Decoupling Structure and Lexicon for Zero-Shot Semantic Parsing , 2018, EMNLP.

[522] Edouard Grave,et al. Colorless Green Recurrent Networks Dream Hierarchically , 2018, NAACL.

[523] Timnit Gebru,et al. Datasheets for datasets , 2018, Commun. ACM.

[524] Omer Levy,et al. Annotation Artifacts in Natural Language Inference Data , 2018, NAACL.

[525] Marco Baroni,et al. Memorize or generalize? Searching for a compositional RNN in a haystack , 2018, ArXiv.

[526] Ambedkar Dukkipati,et al. Instance-based Inductive Deep Transfer Learning by Cross-Dataset Querying with Locality Sensitive Hashing , 2018, EMNLP.

[527] Nitish Gupta,et al. Neural Compositional Denotational Semantics for Question Answering , 2018, EMNLP.

[528] Luke S. Zettlemoyer,et al. Deep Contextualized Word Representations , 2018, NAACL.

[529] Ruslan Salakhutdinov,et al. Investigating the Working of Text Classifiers , 2018, COLING.

[530] Sebastian Ruder,et al. Universal Language Model Fine-tuning for Text Classification , 2018, ACL.

[531] Gary Marcus,et al. Deep Learning: A Critical Appraisal , 2018, ArXiv.

[532] Dan Roth,et al. Mapping to Declarative Knowledge for Word Problem Solving , 2017, TACL.

[533] Willem H. Zuidema,et al. Visualisation and 'diagnostic classifiers' reveal how recurrent and recursive neural networks process hierarchical structure , 2017, J. Artif. Intell. Res.

[534] Marco Baroni,et al. Generalization without Systematicity: On the Compositional Skills of Sequence-to-Sequence Recurrent Networks , 2017, ICML.

[535] Brendan T. O'Connor,et al. A Dataset and Classifier for Recognizing Social Media English , 2017, NUT@EMNLP.

[536] Lemao Liu,et al. Instance Weighting for Neural Machine Translation Domain Adaptation , 2017, EMNLP.

[537] Benno Stein,et al. Unit Segmentation of Argumentative Texts , 2017, ArgMining@EMNLP.

[538] Mark A. Finlayson,et al. A Simpler and More Generalizable Story Detector using Verb and Character Features , 2017, EMNLP.

[539] Robert Malouf,et al. Abstractive morphological learning with a recurrent neural network , 2017 .

[540] Zhen-Hua Ling,et al. Recurrent Neural Network-Based Sentence Encoder with Gated Attention for Natural Language Inference , 2017, RepEval@EMNLP.

[541] Michael Strube,et al. Using Linguistic Features to Improve the Generalization Capability of Neural Coreference Resolvers , 2017, EMNLP.

[542] Percy Liang,et al. Adversarial Examples for Evaluating Reading Comprehension Systems , 2017, EMNLP.

[543] Young-Bum Kim,et al. Domain Attention with an Ensemble of Experts , 2017, ACL.

[544] Masao Utiyama,et al. Sentence Embedding for Neural Machine Translation Domain Adaptation , 2017, ACL.

[545] Omer Levy,et al. Zero-Shot Relation Extraction via Reading Comprehension , 2017, CoNLL.

[546] Le-Minh Nguyen,et al. Natural Language Generation for Spoken Dialogue System using RNN Encoder-Decoder Networks , 2017, CoNLL.

[547] Stefan Riezler,et al. Bandit Structured Prediction for Neural Sequence-to-Sequence Learning , 2017, ACL.

[548] Samuel R. Bowman,et al. A Broad-Coverage Challenge Corpus for Sentence Understanding through Inference , 2017, NAACL.

[549] Bonnie L. Webber,et al. Detecting negation scope is easy, except when it isn’t , 2017, EACL.

[550] Michael Strube,et al. Lexical Features in Coreference Resolution: To be Used With Caution , 2017, ACL.

[551] Gary Geunbae Lee,et al. Neural sentence embedding using only in-domain sentences for out-of-domain sentence detection in dialog systems , 2017, Pattern Recognit. Lett.

[552] Markus Freitag,et al. Fast Domain Adaptation for Neural Machine Translation , 2016, ArXiv.

[553] Kalina Bontcheva,et al. Broad Twitter Corpus: A Diverse Named Entity Recognition Resource , 2016, COLING.

[554] Fan Yang,et al. Leveraging Multiple Domains for Sentiment Classification , 2016, COLING.

[555] Emmanuel Dupoux,et al. Assessing the Ability of LSTMs to Learn Syntax-Sensitive Dependencies , 2016, TACL.

[556] Roi Reichart,et al. Neural Structural Correspondence Learning for Domain Adaptation , 2016, CoNLL.

[557] Richard Socher,et al. Pointer Sentinel Mixture Models , 2016, ICLR.

[558] Anette Frank,et al. Modal Sense Classification At Large: Paraphrase-Driven Sense Projection, Semantically Enriched Classification Models and Cross-Genre Evaluations , 2016, LILT.

[559] Barbara Plank,et al. What to do about non-standard (or non-canonical) language in NLP , 2016, KONVENS.

[560] Nanyun Peng,et al. Multi-task Domain Adaptation for Sequence Tagging , 2016, Rep4NLP@ACL.

[561] Brendan T. O'Connor,et al. Demographic Dialectal Variation in Social Media: A Case Study of African-American English , 2016, EMNLP.

[562] Nathanael Chambers,et al. A Corpus and Cloze Evaluation for Deeper Understanding of Commonsense Stories , 2016, NAACL.

[563] Nadir Durrani,et al. How to Avoid Unwanted Pregnancies: Domain Adaptation using Neural Network Models , 2015, EMNLP.

[564] Christopher Potts,et al. A large annotated corpus for learning natural language inference , 2015, EMNLP.

[565] Christopher Potts,et al. Tree-Structured Composition in Neural Networks without Tree-Structured Architectures , 2015, CoCo@NIPS.

[566] Dirk Hovy,et al. Crowdsourcing and annotating NER for Twitter #drift , 2014, LREC.

[567] Marco Marelli,et al. A SICK cure for the evaluation of compositional distributional semantic models , 2014, LREC.

[568] Jianfeng Gao,et al. Domain Adaptation via Pseudo In-Domain Data Selection , 2011, EMNLP.

[569] Federico Sangati,et al. Accurate Parsing with Compact Tree-Substitution Grammars: Double-DOP , 2011, EMNLP.

[570] Marcello Federico,et al. Domain Adaptation for Statistical Machine Translation with Monolingual Resources , 2009, WMT@EACL.

[571] Jason Weston,et al. A unified architecture for natural language processing: deep neural networks with multitask learning , 2008, ICML '08.

[572] F.C.K. Wong,et al. Generalisation towards Combinatorial Productivity in Language Acquisition by Simple Recurrent Networks , 2007, 2007 International Conference on Integration of Knowledge Intensive Multi-Agent Systems.

[573] John Blitzer,et al. Biographies, Bollywood, Boom-boxes and Blenders: Domain Adaptation for Sentiment Classification , 2007, ACL.

[574] Hal Daumé,et al. Frustratingly Easy Domain Adaptation , 2007, ACL.

[575] Dan Klein,et al. Improved Inference for Unlexicalized Parsing , 2007, NAACL.

[576] Satoshi Nakamura,et al. Out-of-Domain Utterance Detection Using Classification Confidences of Multiple Topics , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[577] John Blitzer,et al. Domain Adaptation with Structural Correspondence Learning , 2006, EMNLP.

[578] Gary F. Marcus,et al. Connectionism: with or without rules? Response to J.L. McClelland and D.C. Plaut (1999) , 1999, Trends in Cognitive Sciences.

[579] D. Plaut,et al. Does generalization in infant learning implicate abstract algebra-like rules? , 1999, Trends in Cognitive Sciences.

[580] G. Marcus. Rethinking Eliminative Connectionism , 1998, Cognitive Psychology.

[581] Ronald Rosenfeld,et al. A maximum entropy approach to adaptive statistical language modelling , 1996, Comput. Speech Lang.

[582] Michael Collins,et al. A New Statistical Parser Based on Bigram Lexical Dependencies , 1996, ACL.

[583] Gary F. Marcus,et al. German Inflection: The Exception That Proves the Rule , 1995, Cognitive Psychology.

[584] Hermann Ney,et al. Improved backing-off for M-gram language modeling , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[585] David M. Magerman. Statistical Decision-Tree Models for Parsing , 1995, ACL.

[586] Beatrice Santorini,et al. Building a Large Annotated Corpus of English: The Penn Treebank , 1993, CL.

[587] J. Fodor,et al. Connectionism and cognitive architecture: A critical analysis , 1988, Cognition.

[588] László Dezsö,et al. Universal Grammar , 1981, Certainty in Action.

[589] J. Berko. The Child's Learning of English Morphology , 1958 .

[590] Emmanouil Antonios Platanios,et al. Bridging the Generalization Gap in Text-to-SQL Parsing with Schema Expansion , 2022, ACL.

[591] Shima Asaadi,et al. Knowledge Distillation Meets Few-Shot Learning: An Approach for Few-Shot Intent Classification Within and Across Domains , 2022, NLP4CONVAI.

[592] Dinh Q. Phung,et al. Domain Generalisation of NMT: Fusing Adapters with Leave-One-Domain-Out Training , 2022, FINDINGS.

[593] Jey Han Lau,et al. Cloze Evaluation for Deeper Understanding of Commonsense Stories in Indonesian , 2022, CSRR.

[594] Lyle Ungar,et al. Measuring the Language of Self-Disclosure across Corpora , 2022, FINDINGS.

[595] M. Fomicheva,et al. Bias Mitigation in Machine Translation Quality Estimation , 2022, ACL.

[596] Shizhu He,et al. Leveraging Explicit Lexico-logical Alignments in Text-to-SQL Parsing , 2022, ACL.

[597] A. A. Krizhanovsky,et al. SIGMORPHON–UniMorph 2022 Shared Task 0: Generalization and Typologically Diverse Morphological Inflection , 2022, SIGMORPHON.

[598] E. Hobley,et al. Improving Generalization of Hate Speech Detection Systems to Novel Target Groups via Domain Adaptation , 2022, WOAH.

[599] Xiaojie Wang,et al. Learn to Adapt for Generalized Zero-Shot Text Classification , 2022, ACL.

[600] Yangqiu Song,et al. Rare and Zero-shot Word Sense Disambiguation using Z-Reweighting , 2022, ACL.

[601] Noah A. Smith,et al. Benchmarking Generalization via In-Context Instructions on 1,600+ Language Tasks , 2022, ArXiv.

[602] Xuanjing Huang,et al. Flooding-X: Improving BERT’s Resistance to Adversarial Attacks via Loss-Restricted Fine-Tuning , 2022, ACL.

[603] Rik Koncel-Kedziorski,et al. Cross-Lingual GenQA: Open-Domain Question Answering with Answer Sentence Generation , 2022 .

[604] David Jurgens,et al. Classification without (Proper) Representation: Political Heterogeneity in Social Media and Its Implications for Classification and Behavioral Analysis , 2022, FINDINGS.

[605] Alexander I. Rudnicky,et al. An Empirical study to understand the Compositional Prowess of Neural Dialog Models , 2022, INSIGHTS.

[606] Di Wu,et al. Challenges to Open-Domain Constituency Parsing , 2022, FINDINGS.

[607] Tiansi Dong,et al. How Can Cross-lingual Knowledge Contribute Better to Fine-Grained Entity Typing? , 2022, FINDINGS.

[608] Roberto Zamparelli,et al. Multilingualism Encourages Recursion: a Transfer Study with mBERT , 2022, SIGTYP.

[609] Mohit Bansal,et al. GraDA: Graph Generative Data Augmentation for Commonsense Reasoning , 2022, DLG4NLP.

[610] Matt Gardner,et al. Impact of Pretraining Term Frequencies on Few-Shot Numerical Reasoning , 2022, EMNLP.

[611] Shafiq R. Joty,et al. Effective Fine-Tuning Methods for Cross-lingual Adaptation , 2021, EMNLP.

[612] Y. Taya,et al. Multi-Layer Random Perturbation Training for improving Model Generalization Efficiently , 2021, BLACKBOXNLP.

[613] Snigdha Chaturvedi,et al. How Helpful is Inverse Reinforcement Learning for Table-to-Text Generation? , 2021, ACL.

[614] Pawan Goyal,et al. Attribute Value Generation from Product Title using Language Models , 2021, ECNLP.

[615] Adina Williams,et al. Generalising to German Plural Noun Classes, from the Perspective of a Recurrent Neural Network , 2021, CONLL.

[616] Jing Jiang,et al. Cross-Topic Rumor Detection using Topic-Mixtures , 2021, EACL.

[617] Hao He,et al. Diagnosing the First-Order Logical Reasoning Ability Through LogicNLI , 2021, EMNLP.

[618] Cane Wing-ki Leung,et al. Improving Model Generalization: A Chinese Named Entity Recognition Case Study , 2021, ACL.

[619] Luheng He,et al. QA-Driven Zero-shot Slot Filling with Weak Supervision Pretraining , 2021, ACL.

[620] E. Hinrichs,et al. Automatic Classification of Attributes in German Adjective-Noun Phrases , 2021, IWCS.

[621] Senja Pollak,et al. Zero-shot Cross-lingual Content Filtering: Offensive Language and Hate Speech Detection , 2021, HACKASHOP.

[622] Ulf Leser,et al. Extend, don’t rebuild: Phrasing conditional graph modification as autoregressive sequence labelling , 2021, EMNLP.

[623] Nigel Collier,et al. Synthetic Examples Improve Cross-Target Generalization: A Study on Stance Detection on a Twitter corpus , 2021, WASSA.

[624] Akhil Kedia,et al. Keep Learning: Self-supervised Meta-learning for Learning from Inference , 2021, EACL.

[625] Colin Wilson,et al. Were We There Already? Applying Minimal Generalization to the SIGMORPHON-UniMorph Shared Task on Cognitively Plausible Morphological Inflection , 2021, Proceedings of the 18th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology.

[626] Yohan Lee,et al. Improving End-to-End Task-Oriented Dialog System with A Simple Auxiliary Task , 2021, EMNLP.

[627] Vaibhava Goel,et al. CNNBiF: CNN-based Bigram Features for Named Entity Recognition , 2021, EMNLP.

[628] Victor Petrén Bach Hansen,et al. Guideline Bias in Wizard-of-Oz Dialogues , 2021, BPPF.

[629] Gerhard Heyer,et al. On Classifying whether Two Texts are on the Same Side of an Argument , 2021, EMNLP.

[630] Johan Bos,et al. Evaluating Text Generation from Discourse Representation Structures , 2021, GEM.

[631] I. Kobayashi,et al. Towards a Language Model for Temporal Commonsense Reasoning , 2021, RANLP.

[632] J. Piskorski,et al. Fine-grained Event Classification in News-like Text Snippets - Shared Task 2, CASE 2021 , 2021, CASE.

[633] Hinrich Schütze,et al. Multidomain Pretrained Language Models for Green NLP , 2021, ADAPTNLP.

[634] Wenbin Hu,et al. BanditMTL: Bandit-based Multi-task Learning for Text Classification , 2021, ACL.

[635] Marcello Federico,et al. A Statistical Extension of Byte-Pair Encoding , 2021, IWSLT.

[636] Hitomi Yanaka,et al. Assessing the Generalization Capacity of Pre-trained Language Models through Japanese Adversarial Natural Language Inference , 2021, BLACKBOXNLP.

[637] Francis M. Tyers,et al. SIGMORPHON 2021 Shared Task on Morphological Reinflection: Generalization Across Languages , 2021, Proceedings of the 18th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology.

[638] Timothy J. Hazen,et al. Increasing Robustness to Spurious Correlations using Forgettable Examples , 2021, EACL.

[639] Proceedings of the Fourth Workshop on NLP for Internet Freedom: Censorship, Disinformation, and Propaganda , 2021 .

[640] Ali Ghodsi,et al. How to Select One Among All? An Empirical Study Towards the Robustness of Knowledge Distillation in Natural Language Understanding , 2021, EMNLP.

[641] Judith Yue Li,et al. Semi-supervised Meta-learning for Cross-domain Few-shot Intent Classification , 2021, METANLP.

[642] Natalie Schluter,et al. MassiveSumm: a very large-scale, very multilingual, news summarisation dataset , 2021, EMNLP.

[643] Dragomir R. Radev,et al. Testing Cross-Database Semantic Parsers With Canonical Utterances , 2021, EVAL4NLP.

[644] Baolin Peng,et al. Few-Shot Named Entity Recognition: An Empirical Baseline Study , 2021, EMNLP.

[645] Junlan Feng,et al. Counterfactual Matters: Intrinsic Probing For Dialogue State Tracking , 2021, EANCS.

[646] Edward Grefenstette,et al. A Survey of Generalisation in Deep Reinforcement Learning , 2021, ArXiv.

[647] Maria Barrett,et al. Spurious Correlations in Cross-Topic Argument Mining , 2021, STARSEM.

[648] David,et al. On Learning the Past Tenses of English Verbs , 2021 .

[649] Jose G. Moreno,et al. Using a Frustratingly Easy Domain and Tagset Adaptation for Creating Slavic Named Entity Recognition Systems , 2021, BSNLP.

[650] Martha Palmer,et al. Predicate Representations and Polysemy in VerbNet Semantic Parsing , 2021, IWCS.

[651] Xing Han,et al. Multi-Pair Text Style Transfer for Unbalanced Data via Task-Adaptive Meta-Learning , 2021, METANLP.

[652] Minh Le Nguyen,et al. Learning Cross-lingual Representations for Event Coreference Resolution with Multi-view Alignment and Optimal Transport , 2021, MRL.

[653] Wei Xu,et al. WIKIBIAS: Detecting Multi-Span Subjective Biases in Language , 2021, EMNLP.

[654] Nicholas Andrews,et al. Learning Universal Authorship Representations , 2021, EMNLP.

[655] Adam Ek,et al. Training Strategies for Neural Multilingual Morphological Inflection , 2021, Proceedings of the 18th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology.

[656] Pavel Pecina,et al. Solving SCAN Tasks with Data Augmentation and Input Embeddings , 2021, RANLP.

[657] Nigel Collier,et al. Adversarial Training for News Stance Detection: Leveraging Signals from a Multi-Genre Corpus , 2021, HACKASHOP.

[658] Itzik Malkiel,et al. Maximal Multiverse Learning for Promoting Cross-Task Generalization of Fine-Tuned Language Models , 2021, EACL.

[659] Sarvnaz Karimi,et al. Combining Shallow and Deep Representations for Text-Pair Classification , 2021, ALTA.

[660] Chunyan Miao,et al. MulDA: A Multilingual Data Augmentation Framework for Low-Resource Cross-Lingual NER , 2021, ACL.

[661] Frankie Robertson,et al. Word Discriminations for Vocabulary Inventory Prediction , 2021, RANLP.

[662] Zornitsa Kozareva,et al. Few-shot Learning with Multilingual Language Models , 2021, ArXiv.

[663] Pararth Shah,et al. Multi-Action Dialog Policy Learning with Interactive Human Teaching , 2020, SIGDIAL.

[664] Roger Levy,et al. Cloze Distillation: Improving Neural Language Models with Human Next-Word Prediction , 2020, CoNLL.

[665] A. Waibel,et al. Supervised Adaptation of Sequence-to-Sequence Speech Recognition Systems using Batch-Weighting , 2020, LIFELONGNLP.

[666] S. Chatzikyriakidis,et al. How does Punctuation Affect Neural Models in Natural Language Inference , 2020, PAM.

[667] Ming-Wei Chang,et al. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[668] Anna Feldman. Proceedings of the Second Workshop on Natural Language Processing for Internet Freedom: Censorship, Disinformation, and Propaganda , 2019 .

[669] Ilya Sutskever,et al. Language Models are Unsupervised Multitask Learners , 2019 .

[670] Yejin Choi,et al. An Adversarial Winograd Schema Challenge at Scale , 2019 .

[671] Joachim Bingel,et al. Bridging the Gaps: Multi Task Learning for Domain Transfer of Hate Speech Detection , 2018 .

[672] Mans Hulden,et al. A Neural Morphological Analyzer for Arapaho Verbs Learned from a Finite State Transducer , 2018 .

[673] Gary Geunbae Lee,et al. Out-of-domain Detection based on Generative Adversarial Network , 2018, EMNLP.

[674] Joe Pater,et al. Seq2Seq Models with Dropout can Learn Generalizable Reduplication , 2018 .

[675] Heng Ji,et al. Cross-lingual Name Tagging and Linking for 282 Languages , 2017, ACL.

[676] Chenhui Chu,et al. An Empirical Comparison of Domain Adaptation Methods for Neural Machine Translation , 2017, ACL.

[677] Paul Cook,et al. Supervised and unsupervised approaches to measuring usage similarity , 2017 .

[678] Ines Rehbein,et al. Authorship Attribution with Convolutional Neural Networks and POS-Eliding , 2017 .

[679] Antal van den Bosch,et al. Sarcastic Soulmates: Intimacy and irony markers in social media messaging , 2016, LILT.

[680] Willem H. Zuidema,et al. Diagnostic Classifiers Revealing how Neural Networks Process Hierarchical Structure , 2016, CoCo@NIPS.

[681] Christopher D. Manning,et al. Stanford Neural Machine Translation Systems for Spoken Language Domains , 2015, IWSLT.

[682] Francisco Herrera,et al. A unifying view on dataset shift in classification , 2012, Pattern Recognit.

[683] J. Scott. Istituto Dalle Molle di Studi Sull’Intelligenza Artificiale (IDSIA) | USI-SUPSI , 2010 .

[684] Neil D. Lawrence,et al. When Training and Test Sets Are Different: Characterizing Learning Transfer , 2009 .

[685] G. Marcus. The Algebraic Mind: Integrating Connectionism and Cognitive Science , 2001 .

[686] R. Rosenfeld. A Maximum Entropy Approach to Adaptive Statistical Language Modeling , 2001 .

[687] Jürgen Schmidhuber. Towards Compositional Learning in Dynamic Networks , 1990 .

[688] Emily M. Bender. On Achieving and Evaluating Language-Independence in NLP , 2022, Linguistic Issues in Language Technology (LiLT).

[689] Cees G. M. Snoek,et al. Meta-Learning with Variational Semantic Memory for Word Sense Disambiguation , 2021, ACL.

[690] M. de Rijke,et al. Learning to Ask Conversational Questions by Optimizing Levenshtein Distance , 2022 .