论文信息 - The learning of action sequences through social transmission

The learning of action sequences through social transmission

Previous empirical work on animal social learning has found that many species lack the ability to learn entire action sequences solely through reliance on social information. Conversely, acquiring action sequences through asocial learning can be difficult due to the large number of potential sequences arising from even a small number of base actions. In spite of this, several studies report that some primates use action sequences in the wild. We investigate how social information can be integrated with asocial learning to facilitate the learning of action sequences. We formalize this problem by examining how learners using temporal difference learning, a widely applicable model of reinforcement learning, can combine social cues with their own experiences to acquire action sequences. The learning problem is modeled as a Markov decision process. The learning of nettle processing by mountain gorillas serves as a focal example. Through simulations, we find that the social facilitation of component actions can combine with individual learning to facilitate the acquisition of action sequences. Our analysis illustrates that how even simple forms of social learning, combined with asocial learning, generate substantially faster learning of action sequences compared to asocial processes alone, and that the benefits of social information increase with the length of the action sequence and the number of base actions.

K. Laland | D. Cownden | A. Whalen

[1] E. Thorndike. “Animal Intelligence” , 1898, Nature.

[2] R. R. Bush,et al. A Mathematical Model for Simple Learning , 1951 .

[3] J. Goodall,et al. Tool-Using and Aimed Throwing in a Community of Free-Living Chimpanzees , 1964, Nature.

[4] R. Rescorla,et al. A theory of Pavlovian conditioning : Variations in the effectiveness of reinforcement and nonreinforcement , 1972 .

[5] C. Boesch,et al. Optimisation of Nut-Cracking With Natural Hammers By Wild Chimpanzees , 1983 .

[6] C. Lumsden. Culture and the Evolutionary Process, Robert Boyd, Peter J. Richerson. University of Chicago Press, Chicago & London (1985), viii, +301. Price $29.95 , 1986 .

[7] S. Walker. Social Learning: Psychological and Biological Perspectives, Thomas R. Zentall, Bennet G. Galef Jr. (Eds.). Lawrence Erlbaum, Hillsdale, New Jersey (1988), xi , 1988 .

[8] Alan R. Rogers,et al. Does Biology Constrain Culture , 1988 .

[9] Richard S. Sutton,et al. Time-Derivative Models of Pavlovian Reinforcement , 1990 .

[10] M. Gabriel,et al. Learning and Computational Neuroscience: Foundations of Adaptive Networks , 1990 .

[11] B. Galef. The question of animal culture , 1992, Human nature.

[12] Richard W. Byrne,et al. Complex leaf‐gathering skills of mountain gorillas (Gorilla g. beringei): Variability and standardization , 1993, American journal of primatology.

[13] R. A. Preston,et al. Delay reduction: current status. , 1993, Journal of the experimental analysis of behavior.

[14] R. Grace. A contextual model of concurrent-chains choice. , 1994, Journal of the experimental analysis of behavior.

[15] C. Heyes,et al. SOCIAL LEARNING IN ANIMALS: CATEGORIES AND MECHANISMS , 1994, Biological reviews of the Cambridge Philosophical Society.

[16] M. Tomasello,et al. Use of social information in the problem solving of orangutans (Pongo pygmaeus) and human children (Homo sapiens). , 1995, Journal of comparative psychology.

[17] Joseph Terkel,et al. Cultural Transmission of Feeding Behavior in the Black Rat (Rattus rattus) , 1996 .

[18] C. Heyes,et al. Social learning in animals : the roots of culture , 1996 .

[19] A. Whiten. Imitation of the sequential structure of actions by chimpanzees (Pan troglodytes). , 1998, Journal of comparative psychology.

[20] R. Boyd,et al. The evolution of conformist transmission and the emergence of between-group differences. , 1998 .

[21] R. Byrne,et al. Priming primates: Human and otherwise , 1998, Behavioral and Brain Sciences.

[22] Andrew Whiten,et al. Social Learning of an Artificial Fruit Task in Capuchin Monkeys (Cebus apella) , 1999 .

[23] A. Whiten,et al. Cultures in chimpanzees , 1999, Nature.

[24] Martin D. Buhmann,et al. Radial Basis Functions , 2021, Encyclopedia of Mathematical Geosciences.

[25] A. Whiten,et al. Testing for social learning in the "artificial fruit" processing of wildborn orangutans (Pongo pygmaeus), Tanjung Puting, Indonesia , 2001, Animal Cognition.

[26] A. K. Reid,et al. The development of functional response units: the role of demarcating stimuli. , 2001, Journal of the experimental analysis of behavior.

[27] R. Byrne. Imitation as behaviour parsing. , 2003, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[28] A. Whiten,et al. Social learning by orangutans (Pongo abelii and Pongo pygmaeus) in a simulated food-processing task. , 2003, Journal of comparative psychology.

[29] D. Biro,et al. Cultural innovation and transmission of tool use in wild chimpanzees: evidence from field experiments , 2003, Animal Cognition.

[30] Peter Dayan,et al. Temporal difference models describe higher-order learning in humans , 2004, Nature.

[31] Herbert S. Terrace,et al. The simultaneous chain: a new approach to serial learning , 2005, Trends in Cognitive Sciences.

[32] W. Pan,et al. Dopamine Cells Respond to Predicted Events during Classical Conditioning: Evidence for Eligibility Traces in the Reward-Learning Network , 2005, The Journal of Neuroscience.

[33] M. Brass,et al. Imitation: is cognitive neuroscience solving the correspondence problem? , 2005, Trends in Cognitive Sciences.

[34] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[35] R. Grace,et al. Initial-link duration and acquisition of preference in concurrent chains , 2006, Learning & behavior.

[36] Magnus Enquist,et al. Social Learning : A Solution to Rogers ’ s Paradox of Nonadaptive Culture , 2007 .

[37] Magnus Enquist,et al. CRITICAL POINTS IN CURRENT THEORY OF CONFORMIST SOCIAL LEARNING , 2007 .

[38] K. Laland,et al. Response facilitation in the domestic fowl , 2007, Animal Behaviour.

[39] Kevin N. Laland,et al. Chapter 3 Social Processes Influencing Learning in Animals: A Review of the Evidence , 2008 .

[40] M. Tomasello,et al. An experimental study of nettle feeding in captive gorillas , 2008, American journal of primatology.

[41] P. Izar,et al. Capuchin monkey tool use: Overview and implications , 2008 .

[42] P. Dayan,et al. Reinforcement learning: The Good, The Bad and The Ugly , 2008, Current Opinion in Neurobiology.

[43] J. Call,et al. Design complexity in termite-fishing tools of chimpanzees (Pan troglodytes) , 2009, Biology Letters.

[44] Cecilia Heyes,et al. Evolution, development and intentional control of imitation , 2009, Philosophical Transactions of the Royal Society B: Biological Sciences.

[45] P. Glimcher. Understanding dopamine and reinforcement learning: The dopamine reward prediction error hypothesis , 2011, Proceedings of the National Academy of Sciences.

[46] Dennis Garlick,et al. Pigeon and human performance in a multi-armed bandit task in response to changes in variable interval schedules , 2011, Learning & behavior.

[47] K. Laland,et al. Social Learning: An Introduction to Mechanisms, Methods, and Models , 2013 .

[48] Bernhard Voelkl,et al. Social learning. an introduction to mechanisms, methods, and models William Hoppitt Kevin N. Laland , 2014, Animal Behaviour.

[49] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.