Visual resemblance and communicative context constrain the emergence of graphical conventions

From photorealistic sketches to schematic diagrams, drawing provides a versatile medium for communicating about the visual world. How do images spanning such a broad range of appearances reliably convey meaning? Do viewers understand drawings based solely on their ability to resemble the entities they refer to (i.e., as images), or do they understand drawings based on shared but arbitrary associations with these entities (i.e., as symbols)? In this paper, we provide evidence for a cognitive account of pictorial meaning in which both visual and social information is integrated to support effective visual communication. To evaluate this account, we used a communication task where pairs of participants used drawings to repeatedly communicate the identity of a target object among multiple distractor objects. We manipulated social cues across three experiments and a full internal replication, finding pairs of participants develop referent-specific and interaction-specific strategies for communicating more efficiently over time, going beyond what could be explained by either task practice or a pure resemblance-based account alone. Using a combination of modelbased image analyses and crowdsourced sketch annotations, we further determined that drawings did not drift toward “arbitrariness,” as predicted by a pure convention-based account, but systematically preserved those visual features that were most distinctive of the target object. Taken together, these findings advance theories of pictorial meaning and have implications for how successful graphical conventions emerge via complex interactions between visual perception, communicative experience, and social context.

[1]  Li Fei-Fei,et al.  ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[2]  Thomas L. Griffiths,et al.  Evaluating (and Improving) the Correspondence Between Deep Neural Networks and Human Representations , 2017, Cogn. Sci..

[3]  Simon Kirby,et al.  Iconicity and the Emergence of Combinatorial Structure in Language , 2016, Cogn. Sci..

[4]  Patrick Cavanagh,et al.  What Line Drawings Reveal About the Visual Brain , 2011, Front. Hum. Neurosci..

[5]  H. H. Clark,et al.  Understanding by addressees and overhearers , 1989, Cognitive Psychology.

[6]  M. Tamariz Experimental Studies on the Cultural Evolution of Language , 2017 .

[7]  Barbara Tversky,et al.  Creating visual explanations improves learning , 2016, Cognitive research: principles and implications.

[8]  Ichiro Umata,et al.  Graphical Language Games: Interactional Constraints on Representational Form , 2007, Cogn. Sci..

[9]  Kenny Smith,et al.  Cultural Evolution and Perpetuation of Arbitrary Communicative Conventions in Experimental Microsocieties , 2012, PloS one.

[10]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[11]  Bruno Galantucci,et al.  An Experimental Study of the Emergence of Human Communication Systems , 2005, Cogn. Sci..

[12]  Gary Lupyan,et al.  Iconicity can ground the creation of vocal symbols , 2015, Royal Society Open Science.

[13]  Bradley Walker,et al.  Can iterated learning explain the emergence of graphical symbols , 2010 .

[14]  I. J. Gelb A study of writing , 1954 .

[15]  D. M. Hutton,et al.  Numerical Notation: A Comparative History , 2012 .

[16]  K. Goedert,et al.  Clock drawing in spatial neglect: a comprehensive analysis of clock perimeter, placement, and accuracy. , 2012, Journal of neuropsychology.

[17]  Heinrich René Liesefeld,et al.  Combining speed and accuracy to control for speed-accuracy trade-offs(?) , 2018, Behavior Research Methods.

[18]  Jonas Kubilius,et al.  Deep Neural Networks as a Computational Model for Human Shape Sensitivity , 2016, PLoS Comput. Biol..

[19]  Simon Garrod,et al.  Experimental Semiotics: A Review , 2010, Front. Hum. Neurosci..

[20]  Jon Oberlander,et al.  Foundations of Representation: Where Might Graphical Symbol Systems Come From? , 2007, Cogn. Sci..

[21]  Mike Frank,et al.  Convention-formation in iterated reference games , 2017, CogSci.

[22]  Nicolas Fay,et al.  Iconicity: From sign to system in human communication and language , 2014 .

[23]  Susan Goldin-Meadow,et al.  Creating a communication system from scratch: gesture beats vocalization hands down , 2014, Front. Psychol..

[24]  Simon Garrod,et al.  The Interactive Evolution of Human Communication Systems , 2010, Cogn. Sci..

[25]  Jordan Gunn,et al.  Relating Visual Production and Recognition of Objects in Human Visual Cortex , 2019, The Journal of Neuroscience.

[26]  A. Voltolini A Syncretistic Theory of Depiction , 2015 .

[27]  T. Mark Ellison,et al.  The Cultural Evolution of Human Communication Systems in Different Sized Populations: Usability Trumps Learnability , 2013, PloS one.

[28]  Daniel L. K. Yamins,et al.  Common Object Representations for Visual Production and Recognition , 2018, Cogn. Sci..

[29]  R. Krauss,et al.  Changes in reference phrases as a function of frequency of usage in social interaction: a preliminary study , 1964 .

[30]  Judith E. Fan,et al.  Improvised Numerals Rely on 1-to-1 Correspondence , 2021 .

[31]  Philip R. Cohen,et al.  Referring as a Collaborative Process , 2003 .

[32]  J. Deręgowski Real space and represented space: Cross-cultural perspectives , 1989, Behavioral and Brain Sciences.

[33]  T. Rogers,et al.  A duck with four legs: Investigating the structure of conceptual knowledge using picture drawing in semantic dementia , 2003, Cognitive neuropsychology.

[34]  Natalia Gimelshein,et al.  PyTorch: An Imperative Style, High-Performance Deep Learning Library , 2019, NeurIPS.

[35]  Masayuki Tanaka,et al.  Recognition of pictorial representations by chimpanzees (Pan troglodytes) , 2007, Animal Cognition.

[36]  Lori Markson,et al.  Intention and Analogy in Children's Naming of Pictorial Representations , 1998 .

[37]  S Goldin-Meadow,et al.  Silence is liberating: removing the handcuffs on grammatical expression in the manual modality. , 1996, Psychological review.

[38]  Michael C. Frank,et al.  Characterizing the dynamics of learning in repeated reference games , 2020, Cogn. Sci..

[39]  J. Gibson The Ecological Approach to Visual Perception , 1979 .

[40]  Cave Art , 2021, Encyclopedic Dictionary of Archaeology.

[41]  Karen B. Schloss,et al.  Color inference in visual communication: the meaning of colors in recycling , 2018, Cognitive research: principles and implications.

[42]  Riccardo Fusaroli,et al.  The evolution of early symbolic behavior in Homo sapiens , 2020, Proceedings of the National Academy of Sciences.

[43]  R. J. Miller Cross-cultural research in the perception of pictorial materials. , 1973, Psychological bulletin.

[44]  Simon Kirby,et al.  Systematicity and arbitrariness in novel communication systems , 2010 .

[45]  J. Hochberg,et al.  Pictorial recognition as an unlearned ability: a study of one child's performance. , 1962, The American journal of psychology.

[46]  S. Garrod,et al.  How to Create Shared Symbols. , 2018, Cognitive science.

[47]  M. Hagen,et al.  Cultural Effects on Pictorial Perception: How Many Words Is One Picture Really Worth? , 1978 .

[48]  Robert X D Hawkins,et al.  Conducting real-time multiplayer experiments on the web , 2014, Behavior Research Methods.

[49]  N. Goodman,et al.  Languages of Art: An Approach to a Theory of Symbols , 1971 .

[50]  Mike Wu,et al.  Pragmatic Inference and Visual Abstraction Enable Contextual Flexibility During Visual Communication , 2019, Computational Brain & Behavior.

[51]  W. Hudson Pictorial Depth Perception in Sub-Cultural Groups in Africa , 1960 .

[52]  John M. Kennedy,et al.  Outline Picture Perception by the Songe of Papua , 1975 .