Finding Fuzziness in Neural Network Models of Language Processing

Humans often communicate by using imprecise language, suggesting that fuzzy concepts with unclear boundaries are prevalent in language use. In this paper, we test the extent to which models trained to capture the distributional statistics of language show correspondence to fuzzy-membership patterns. Using the task of natural language inference, we test a recent state of the art model on the classical case of temperature, by examining its mapping of temperature data to fuzzy-perceptions such as cool, hot, etc. We find the model to show patterns that are similar to classical fuzzy-set theoretic formulations of linguistic hedges, albeit with a substantial amount of noise, suggesting that models trained solely on language show promise in encoding fuzziness.

[1]  Benjamin Van Durme,et al.  Probing Neural Language Models for Human Tacit Assumptions , 2020, CogSci.

[2]  M. McCloskey,et al.  Natural categories: Well defined or fuzzy sets? , 1978 .

[3]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[4]  Kees van Deemter Not Exactly: In Praise of Vagueness , 2010 .

[5]  Ilya Sutskever,et al.  Language Models are Unsupervised Multitask Learners , 2019 .

[6]  L. Zadeh A Fuzzy-Set-Theoretic Interpretation of Linguistic Hedges , 1972 .

[7]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[8]  Lotfi A. Zadeh,et al.  Fuzzy logic = computing with words , 1996, IEEE Trans. Fuzzy Syst..

[9]  J. R. Firth,et al.  A Synopsis of Linguistic Theory, 1930-1955 , 1957 .

[10]  Sebastian Riedel,et al.  Language Models as Knowledge Bases? , 2019, EMNLP.

[11]  E. Rosch Cognitive Representations of Semantic Categories. , 1975 .

[12]  Christopher Potts,et al.  A large annotated corpus for learning natural language inference , 2015, EMNLP.

[13]  Lukasz Kaiser,et al.  Attention is All you Need , 2017, NIPS.

[14]  George Lakoff,et al.  Hedges: A Study In Meaning Criteria And The Logic Of Fuzzy Concepts , 1973 .

[15]  Samuel R. Bowman,et al.  A Broad-Coverage Challenge Corpus for Sentence Understanding through Inference , 2017, NAACL.

[16]  Omer Levy,et al.  RoBERTa: A Robustly Optimized BERT Pretraining Approach , 2019, ArXiv.

[17]  Fernando Batista,et al.  A critical survey on the use of Fuzzy Sets in Speech and Natural Language Processing , 2012, 2012 IEEE International Conference on Fuzzy Systems.

[18]  Omer Levy,et al.  GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding , 2018, BlackboxNLP@EMNLP.

[19]  B. Winterhalder,et al.  Encyclopedia of Cognitive Science , 2006 .

[20]  Omer Levy,et al.  SuperGLUE: A Stickier Benchmark for General-Purpose Language Understanding Systems , 2019, NeurIPS.

[21]  Robert LIN,et al.  NOTE ON FUZZY SETS , 2014 .

[22]  Martine De Cock,et al.  Linguistic Hedges: a Quantifier Based Approach , 2002, HIS.