Many Paths to the Same Goal: Balancing Exploration and Exploitation during Probabilistic Route Planning

Abstract During self-guided behaviors, animals identify constraints of the problems they face and adaptively employ appropriate strategies (Marsh, 2002). In the case of foraging, animals must balance sensory-guided exploration of an environment with memory-guided exploitation of known resource locations. Here, we show that animals adaptively shift cognitive resources between sensory and memory systems during foraging to optimize route planning under uncertainty. We demonstrate this using a new, laboratory-based discovery method to define the strategies used to solve a difficult route optimization scenario, the probabilistic “traveling salesman” problem (Raman and Gill, 2017; Fuentes et al., 2018; Mukherjee et al., 2019). Using this system, we precisely manipulated the strength of prior information as well as the complexity of the problem. We find that rats are capable of efficiently solving this route-planning problem, even under conditions with unreliable prior information and a large space of possible solutions. Through analysis of animals’ trajectories, we show that they shift the balance between exploiting known locations and searching for new locations of rewards based on the predictability of reward locations. When compared with a Bayesian search, we found that animal performance is consistent with an approach that adaptively allocates cognitive resources between sensory processing and memory, enhancing sensory acuity and reducing memory load under conditions in which prior information is unreliable. Our findings establish new approaches to understand neural substrates of natural behavior as well as the rational development of biologically inspired approaches for complex real-world optimization.

[1]  I. Whishaw,et al.  Variation in visual acuity within pigmented, and between pigmented and albino rat strains , 2002, Behavioural Brain Research.

[2]  Cheng-Fa Tsai,et al.  A new hybrid heuristic approach for solving large traveling salesman problem , 2004, Inf. Sci..

[3]  Ken G. Smith,et al.  The interplay between exploration and exploitation. , 2006 .

[4]  M. Ernst,et al.  Humans integrate visual and haptic information in a statistically optimal fashion , 2002, Nature.

[5]  Ian H. Stevenson,et al.  Spatially Distributed Local Fields in the Hippocampus Encode Rat Position , 2014, Science.

[6]  R. Douglas,et al.  Behavioral assessment of visual acuity in mice and rats , 2000, Vision Research.

[7]  Vikrant Kapoor,et al.  Mice Develop Efficient Strategies for Foraging and Navigation Using Complex Natural Stimuli , 2016, Current Biology.

[8]  D. Kramer,et al.  Exploration versus exploitation: a field study of time allocation to environmental tracking by foraging chipmunks , 1991, Animal Behaviour.

[9]  C. Drea,et al.  Cooperative problem solving in a social carnivore , 2009, Animal Behaviour.

[10]  P. Dayan,et al.  Goals and Habits in the Brain , 2013, Neuron.

[11]  Denise Manahan-Vaughan,et al.  Spatial Olfactory Learning Contributes to Place Field Formation in the Hippocampus , 2013, Cerebral cortex.

[12]  E.S. Hernández Gress,et al.  Solution to travelling salesman problem by clusters and a modified multi-restart iterated local search metaheuristic , 2018, PloS one.

[13]  Nathaniel N. Urban,et al.  Mouse Navigation Strategies for Odor Source Localization , 2019, bioRxiv.

[14]  Laurent Itti,et al.  A Bayesian model for efficient visual search and recognition , 2010, Vision Research.

[15]  Matthias Bethge,et al.  Using DeepLabCut for 3D markerless pose estimation across species and behaviors , 2018 .

[16]  Burton Slotnick,et al.  Animal cognition and the rat olfactory system , 2001, Trends in Cognitive Sciences.

[17]  Manoranjan Maiti,et al.  Constrained covering solid travelling salesman problems in uncertain environment , 2019, J. Ambient Intell. Humaniz. Comput..

[18]  P. Taylor,et al.  Test of optimal sampling by foraging great tits , 1978 .

[19]  Rachel E. Blaser,et al.  Route selection by rats and humans in a navigational traveling salesman problem , 2011, Animal Cognition.

[20]  B. Menguc,et al.  Balancing exploration and exploitation: The moderating role of competitive intensity , 2005 .

[21]  N. Vickers Mechanisms of animal navigation in odor plumes. , 2000, The Biological bulletin.

[22]  J. Clark,et al.  Behavioral assessment. , 1977, Oral health.

[23]  Mohammad F. Tariq,et al.  Using low-cost alcohol sensors to monitor real-time olfactory information during odor-guided navigation , 2019 .

[24]  A. Kacelnik,et al.  Behavioural ecology: Tool manufacture by naive juvenile crows , 2005, Nature.

[25]  Nasib Singh Gill,et al.  Review of different heuristic algorithms for solving Travelling Salesman Problem , 2017 .

[26]  Min Whan Jung,et al.  A computer vision-based automated Figure-8 maze for working memory test in rodents , 2006, Journal of Neuroscience Methods.

[27]  Ben R. Newell,et al.  Unpacking the Exploration–Exploitation Tradeoff: A Synthesis of Human and Animal Literatures , 2015 .

[28]  Timo Leipälä,et al.  On the solutions of stochastic traveling salesman problems , 1978 .

[29]  Julian N. Marewski,et al.  Strategy selection: An introduction to the modeling challenge. , 2014, Wiley interdisciplinary reviews. Cognitive science.

[30]  David S. Johnson,et al.  Local Optimization and the Traveling Salesman Problem , 1990, ICALP.

[31]  Feng Jin,et al.  Bioinspired engineering of honeycomb structure – Using nature to inspire human innovation , 2015 .

[32]  Russell P. Balda,et al.  Coadaptations of the Clark's nutcracker and the pinon pine for efficient seed harvest and dispersal. , 1977 .

[33]  Ola Olsson,et al.  Bayes' theorem and its applications in animal behaviour , 2006 .

[34]  O. Martin,et al.  The Stochastic Traveling Salesman Problem: Finite Size Scaling and the Cavity Prediction , 1998, cond-mat/9802295.

[35]  Barnaby Marsh Do Animals Use Heuristics? , 2002 .

[36]  Robin Milner,et al.  On Observing Nondeterminism and Concurrency , 1980, ICALP.