论文信息 - The Bayesian Superorganism I: collective probability estimation

The Bayesian Superorganism I: collective probability estimation

Superorganisms such as social insect colonies are very successful relative to their non-social counterparts. Powerful emergent information processing capabilities would seem to contribute to the abundance of such ‘swarm’ systems, as they effectively explore and exploit their environment collectively. We develop a Bayesian model of collective information processing in a decision-making task: choosing a nest site (a ‘multi-armed bandit’ problem). House-hunting Temnothorax ants are adept at discovering and choosing the best available nest site for their colony: we propose that this is possible via rapid, decentralized estimation of the probability that each choice is best. Viewed this way, their behavioral algorithm can be understood as a statistical method that anticipates recent advances in mathematics. Our nest finding model in-corporates insights from approximate Bayesian computation as a model of colony-level behavior; and particle filtering as a model of Temnothorax ‘tandem running’. Our framework suggests that the mechanisms of complex collective behavior can sometimes be explained as a spatial enactment of Bayesian inference. It facilitates the generation of quantitative hypotheses regarding individual and collective movement behaviors when collective decisions must be made. It also points to the potential for bioinspired statistical techniques. Finally, it suggests simple mechanisms for collective decision-making in engineered systems, such as robot swarms.

[1] J. Gittins. Bandit processes and dynamic allocation indices , 1979 .

[2] T. Sherratt. THE OPTIMAL SAMPLING STRATEGY FOR UNFAMILIAR PREY , 2011, Evolution; international journal of organic evolution.

[3] Christian M. Ernst,et al. Multi-armed Bandit Allocation Indices , 1989 .

[4] W. R. Thompson. ON THE LIKELIHOOD THAT ONE UNKNOWN PROBABILITY EXCEEDS ANOTHER IN VIEW OF THE EVIDENCE OF TWO SAMPLES , 1933 .

[5] S. Graham,et al. Honey Bee Nest Thermoregulation: Diversity Promotes Stability , 2004, Science.

[6] David S. Leslie,et al. Optimistic Bayesian Sampling in Contextual-Bandit Problems , 2012, J. Mach. Learn. Res..

[7] Richard P Mann. Collective decision making by rational individuals , 2018, Proceedings of the National Academy of Sciences.

[8] S. Garnier,et al. Decision-making without a brain: how an amoeboid organism solves the two-armed bandit , 2016, Journal of The Royal Society Interface.

[9] Carolina Doran,et al. Economic investment by ant colonies in searches for better homes , 2013, Biology Letters.

[10] P. Whittle. Multi‐Armed Bandits and the Gittins Index , 1980 .

[11] Shipra Agrawal,et al. Analysis of Thompson Sampling for the Multi-armed Bandit Problem , 2011, COLT.

[12] Thorsten Joachims,et al. The K-armed Dueling Bandits Problem , 2012, COLT.

[13] Kevin D. Glazebrook,et al. Multi-Armed Bandit Allocation Indices: Gittins/Multi-Armed Bandit Allocation Indices , 2011 .

[14] P. Taylor,et al. Test of optimal sampling by foraging great tits , 1978 .

[15] Erol Sahin,et al. Swarm Robotics: From Sources of Inspiration to Domains of Application , 2004, Swarm Robotics.

[16] Daniel A. Braun,et al. A Minimum Relative Entropy Principle for Learning and Acting , 2008, J. Artif. Intell. Res..

[17] Shipra Agrawal,et al. Further Optimal Regret Bounds for Thompson Sampling , 2012, AISTATS.

[18] Anna Dornhaus,et al. Reconnaissance and latent learning in ants , 2007, Proceedings of the Royal Society B: Biological Sciences.

[19] E. Robinson,et al. Can ant colonies choose a far-and-away better nest over an in-the-way poor one? , 2008, Animal Behaviour.

[20] Nigel R. Franks,et al. Ants estimate area using Buffon's needle , 2000, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[21] I. Couzin,et al. Emergent Sensing of Complex Environments by Mobile Animal Groups , 2013, Science.

[22] Sara J. Shettleworth,et al. Time horizons of pigeons on a two-armed bandit , 1989, Animal Behaviour.

[23] S. Pratt,et al. A context-dependent alarm signal in the ant Temnothorax rugatulus , 2014, Journal of Experimental Biology.

[24] F. Roces,et al. Carbon dioxide concentrations and nest ventilation in nests of the leaf-cutting ant Atta vollenweideri , 2000, Insectes Sociaux.

[25] D. Dennett,et al. The Extended Phenotype: The Long Reach of the Gene , 2008 .

[26] Deborah M. Gordon,et al. The effect of individual variation on the structure and function of interaction networks in harvester ants , 2011, Journal of The Royal Society Interface.

[27] Alex Kacelnik,et al. The Three-Spined Stickleback and the Two-Armed Bandit , 1985 .

[28] Naoki Masuda,et al. Bayesian Decision Making in Human Collectives with Binary Choices , 2015, PloS one.

[29] Ola Olsson,et al. Bayes' theorem and its applications in animal behaviour , 2006 .

[30] Naoki Masuda,et al. Computational model of collective nest selection by ants with heterogeneous acceptance thresholds , 2015, Royal Society Open Science.

[31] A. Dornhaus,et al. Decision making by small and large house-hunting ant colonies: one size fits all , 2006, Animal Behaviour.

[32] A. Pérez-Escudero,et al. A common rule for decision making in animal collectives across species , 2012, Proceedings of the National Academy of Sciences.

[33] H. Robbins. Some aspects of the sequential design of experiments , 1952 .

[34] L. Levin,et al. Biodiversity on the Rocks: Macrofauna Inhabiting Authigenic Carbonate at Costa Rica Methane Seeps , 2015, PloS one.

[35] Anna Dornhaus,et al. Speed versus accuracy in collective decision making , 2003, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[36] Thomas J. Valone,et al. Are animals capable of Bayesian updating? An empirical review , 2006 .

[37] S. Pratt. Quorum sensing by encounter rates in the ant Temnothorax albipennis , 2005 .

[38] Tze Leung Lai,et al. Incomplete learning from endogenous data in dynamic allocation , 1999 .

[39] Simon Garnier,et al. Estimation models describe well collective decisions among three options , 2013, Proceedings of the National Academy of Sciences.

[40] Maarten Speekenbrink,et al. A tutorial on particle filters , 2016 .

[41] Lihong Li,et al. An Empirical Evaluation of Thompson Sampling , 2011, NIPS.

[42] Edmund R. Hunt,et al. The Bayesian superorganism: externalized memories facilitate distributed sampling , 2018, bioRxiv.

[43] E. Robinson,et al. Do ants make direct comparisons? , 2009, Proceedings of the Royal Society B: Biological Sciences.

[44] Eamonn B. Mallon,et al. Quorum sensing, recruitment, and collective decision-making during colony emigration by the ant Leptothorax albipennis , 2002, Behavioral Ecology and Sociobiology.

[45] S. Pratt,et al. Ant colonies outperform individuals when a sensory discrimination task is difficult but not when it is easy , 2013, Proceedings of the National Academy of Sciences.

[46] Edmund R. Hunt,et al. Phenotypic Plasticity Provides a Bioinspiration Framework for Minimal Field Swarm Robotics , 2020, Frontiers in Robotics and AI.

[47] Steven L. Scott,et al. A modern Bayesian look at the multi-armed bandit , 2010 .

[48] Guy Theraulaz,et al. Self-Organization in Biological Systems , 2001, Princeton studies in complexity.

[49] Jeremy Wyatt,et al. Exploration and inference in learning from reinforcement , 1998 .

[50] Theodore P. Pavlic,et al. Division of labour promotes the spread of information in colony emigrations by the ant Temnothorax rugatulus , 2020, Proceedings of the Royal Society B.

[51] Ofer Feinerman,et al. Experience, corpulence and decision making in ant foraging , 2012, Journal of Experimental Biology.

[52] T. Seeley,et al. Nest-site selection in honey bees: how well do swarms implement the "best-of-N" decision rule? , 2001, Behavioral Ecology and Sociobiology.

[53] Brandon M. Turner,et al. A tutorial on approximate Bayesian computation , 2012 .

[54] Thomas T. Hills,et al. Exploration versus exploitation in space, mind, and society , 2015, Trends in Cognitive Sciences.

[55] Y. Ishii,et al. The mechanism underlying the regulation of work-related behaviors in the monomorphic ant, Myrmica kotokui , 2012, Journal of Ethology.

[56] Lars Chittka,et al. Speed-accuracy tradeoffs in animal decision making. , 2009, Trends in ecology & evolution.

[57] Malcolm J. A. Strens,et al. A Bayesian Framework for Reinforcement Learning , 2000, ICML.

[58] Takao Sasaki,et al. Parallel vs. comparative evaluation of alternative options by colonies and individuals of the ant Temnothorax rugatulus , 2018, Scientific Reports.

[59] E. Wilson,et al. The Superorganism: The Beauty, Elegance, and Strangeness of Insect Societies , 2008 .

[60] Edmund R. Hunt,et al. The Bayesian Superorganism II: optimal foraging and the information theory of gambling , 2018, bioRxiv.

[61] A. Pérez-Escudero,et al. Adversity magnifies the importance of social information in decision-making , 2016, bioRxiv.

[62] Tamar Keasar,et al. Bees in two-armed bandit situations: foraging choices and possible decision mechanisms , 2002 .

[63] Maarten Speekenbrink,et al. Uncertainty and Exploration in a Restless Bandit Problem , 2015, Top. Cogn. Sci..

[64] J. Bather,et al. Multi‐Armed Bandit Allocation Indices , 1990 .

[65] Eamonn B. Mallon,et al. Strategies for choosing between alternatives with different attributes: exemplified by house-hunting ants , 2003, Animal Behaviour.

[66] A. Dornhaus,et al. Ants move to improve: colonies of Leptothorax albipennis emigrate whenever they find a superior nest site , 2004, Animal Behaviour.

[67] Alfonso Pérez-Escudero,et al. Collective Animal Behavior from Bayesian Estimation and Probability Matching , 2011, PLoS Comput. Biol..