论文信息 - Harnessing the linguistic signal to predict scalar inferences

Harnessing the linguistic signal to predict scalar inferences

Pragmatic inferences often subtly depend on the presence or absence of linguistic features. For example, the presence of a partitive construction (of the) increases the strength of a so-called scalar inference: listeners perceive the inference that Chris did not eat all of the cookies to be stronger after hearing "Chris ate some of the cookies" than after hearing the same utterance without a partitive, "Chris ate some cookies." In this work, we explore to what extent neural network sentence encoders can learn to predict the strength of scalar inferences. We first show that an LSTM-based sentence encoder trained on an English dataset of human inference strength ratings is able to predict ratings with high accuracy (r=0.78). We then probe the model's behavior using manually constructed minimal sentence pairs and corpus data. We find that the model inferred previously established associations between linguistic features and inference strength, suggesting that the model learns to use linguistic features to predict pragmatic inferences.

[1] Yang Liu,et al. Visualizing and Understanding Neural Machine Translation , 2017, ACL.

[2] Christopher Potts,et al. A large annotated corpus for learning natural language inference , 2015, EMNLP.

[3] Christopher Potts,et al. Colors in Context: A Pragmatic Neural Model for Grounded Language Understanding , 2017, TACL.

[4] Judith Degen,et al. Investigating the distribution of some (but not all ) implicatures using corpora and web-based methods , 2015 .

[5] Jeffrey Pennington,et al. GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[6] Yonatan Belinkov,et al. Linguistic Knowledge and Transferability of Contextual Representations , 2019, NAACL.

[7] John J. Godfrey,et al. SWITCHBOARD: telephone speech corpus for research and development , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[8] Christopher Potts,et al. Pragmatically Informative Image Captioning with Character-Level Inference , 2018, NAACL.

[9] Marie-Catherine de Marneffe,et al. Do You Know That Florence Is Packed with Visitors? Evaluating State-of-the-art Models of Speaker Commitment , 2019, ACL.

[10] A. Feeney,et al. When some is actually all: Scalar inferences in face-threatening contexts , 2009, Cognition.

[11] Luke S. Zettlemoyer,et al. Deep Contextualized Word Representations , 2018, NAACL.

[12] Greg Carlson,et al. A unified analysis of the English bare plural , 1977 .

[13] Dan Klein,et al. Reasoning about Pragmatics with Neural Listeners and Speakers , 2016, EMNLP.

[14] Jiqiang Guo,et al. Stan: A Probabilistic Programming Language. , 2017, Journal of statistical software.

[15] Roger Levy,et al. What Syntactic Structures block Dependencies in RNN Language Models? , 2019, CogSci.

[16] Ming-Wei Chang,et al. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[17] Rachel Rudinger,et al. Neural Models of Factuality , 2018, NAACL.

[18] Noah D. Goodman,et al. Knowledge and implicature: Modeling language understanding as social cognition , 2012, CogSci.

[19] S. A. Chowdhury,et al. RNN Simulations of Grammaticality Judgments on Long-distance Dependencies , 2018, COLING.