The Role of Hierarchy in Learning to Categorize Images

The Role of Hierarchy in Learning to Categorize Images Reza Shahbazi (rs689@cornell.edu) Uris Hall, Dept. of Psychology, Cornell University Ithaca, NY, 14853, USA David Field (djf3@cornell.edu) Uris Hall, Dept. of Psychology, Cornell University Ithaca, NY, 14853, USA Shimon Edelman (se37@cornell.edu) Uris Hall, Dept. of Psychology, Cornell University Ithaca, NY, 14853, USA Abstract Converging evidence from anatomical studies (Maunsell, 1983) and functional analyses (Hubel & Wisesel, 1968) of the nervous system suggests that the feed-forward pathway of the mammalian perceptual system follows a largely hierarchic organization scheme. This may be because hierarchic structures are intrinsically more viable and thus more likely to evolve (Simon, 2002). But it may also be because objects in our environment have a hierarchic structure and the perceptual system has evolved to match it. We conducted a behavioral experiment to investigate the effect of the degree of hierarchy of the generative probabilistic structure in categorization. We generated one set of stimuli using a hierarchic underlying probability distribution, and another set according to a non-hierarchic one. Participants were instructed to categorize these images into one of the two possible categories a. Our results suggest that participants perform more accurately in the case of hierarchically structured stimuli. Keywords: Hierarchy, Statistical Learning, Vision, Bayes, Probabilistic. Regarding hierarchies The anatomy of the primate visual system suggests that the retinal input progresses through several stages of processing that form an approximate hierarchy. In the visual system, a large number of photoreceptors project to one ganglion cell, several of which converge onto a single LGN cell; then come the cortical areas V1, V2, IT, etc. (Kaiser & Hilgetag, 2010; Kandel, 2000; Modha & Singh, 2010). The impression of hierarchy is further strengthened by evidence from functional analysis of the neuronal circuits. For instance, in V1 several simple cells send their axons to one complex cell whose preferred stimulus is constructed by the preferred stimuli of its input simple cells (Hubel & Wiesel, 1968). Moreover, starting from the retina and going up to higher cortical areas, the complexity of the features that each stage of this hierarchy responds best to increases (Gross, 1972). There exist at least three different definitions of hierarchy in the literature. According to the most parsimonious of them, a hierarchy is any system of items where no item is superior to itself. Furthermore, there needs to be one hierarch, an item which is superior to all other items (Dawkins, 1976). This definition emphasizes that aspect of hierarchy that differentiates it from a heterarchy (McCulloch, 1945). According to McCulloch, heterarchy is a structure with a certain circularity. This circularity results in the possibility of members of the system being superior to themselves. Because of the paradoxes that it may engender, heterarchy is an unlikely structure to be observed in our everyday lives, hence the name (heterarchy is Greek for “under the governance of an alien”; Goldammer, 2003). Another definition of hierarchy comes from algebra, where hierarchies are defined in terms of partially ordered sets (posets; Lehmann, 1996). The third definition is the one advocated by Herbert Simon (1974 ), the pioneering figure of hierarchy theory. While the three definitions are not in disagreement with each other, the third one seems to be best suited for the present discussion. According to Simon, a hierarchy is a nested collection of items where each item contains another set of subcollections. He uses the analogy of Chinese boxes, in which each box contains several smaller boxes while it is itself contained, together with other boxes, in a larger one. Graphically, this resembles the structure of a tree where vertices represent items and edges indicate containment. At least since the mid twentieth century, hierarchies have been believed to be the appropriate structure for the organization of complex systems in various domains including sociology, biology, computer science, and cognitive science (Simon, 1974; Hirtle, 1985; Holling, 2001). In cognitive science, neuroanatomical data are one source of the evidence for the hierarchic structure of the visual system. Another line of evidence come from computational considerations. The problem of inferring the state of the environment from the sensory input is an ill posed problem (Chater, Tenenbaum, and Yuille, 2006; Edelman, 2008). The normative approach to this problem is to rely on the

[1]  Howard Hunt Pattee,et al.  Hierarchy Theory: The Challenge of Complex Systems , 1973 .

[2]  John H. R. Maunsell,et al.  The connections of the middle temporal visual area (MT) and their relationship to a cortical hierarchy in the macaque monkey , 1983, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[3]  J. Tenenbaum,et al.  Special issue on “Probabilistic models of cognition , 2022 .

[4]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[5]  Fritz Lehmann,et al.  Big Posets of Participatings and Thematic Roles , 1996, ICCS.

[6]  D H Brainard,et al.  The Psychophysics Toolbox. , 1997, Spatial vision.

[7]  Illtyd Trethowan Causality , 1938 .

[8]  R. Hinde,et al.  Growing Points in Ethology , 1976 .

[9]  D. Modha,et al.  Network architecture of the long-distance pathways in the macaque brain , 2010, Proceedings of the National Academy of Sciences.

[10]  S. Edelman Computing the mind , 2008 .

[11]  D. B. Bender,et al.  Visual properties of neurons in inferotemporal cortex of the Macaque. , 1972, Journal of neurophysiology.

[12]  Marcus Kaiser,et al.  Hierarchy and Dynamics of Neural Networks , 2010, Front. Neuroinform..

[13]  J. Jonides,et al.  Evidence of hierarchies in cognitive maps , 1985, Memory & cognition.

[14]  C. S. Holling Understanding the Complexity of Economic, Ecological, and Social Systems , 2001, Ecosystems.

[15]  Elisabeth Dévière,et al.  Analyzing linguistic data: a practical introduction to statistics using R , 2009 .

[16]  Jan Khre,et al.  The Mathematical Theory of Information , 2012 .

[17]  W. McCulloch The heterarchy of values determined by the topology of nervous nets. , 1945, The Bulletin of mathematical biophysics.

[18]  D. Hubel,et al.  Receptive fields and functional architecture of monkey striate cortex , 1968, The Journal of physiology.

[19]  Austin F. Frank,et al.  Analyzing linguistic data: a practical introduction to statistics using R , 2010 .

[20]  Richard N Aslin,et al.  Bayesian learning of visual chunks by human observers , 2008, Proceedings of the National Academy of Sciences.

[21]  Cristian S. Calude The mathematical theory of information , 2007 .

[22]  M. Alexander,et al.  Principles of Neural Science , 1981 .

[23]  H. Simon,et al.  Near decomposability and the speed of evolution , 2002 .