Enhanced Equivalence Projective Simulation: A Framework for Modeling Formation of Stimulus Equivalence Classes

The formation of stimulus equivalence classes has recently been modeled through equivalence projective simulation (EPS), a modified version of the projective simulation (PS) learning agent. PS is endowed with an episodic memory that resembles the brain's internal representations and the concept of cognitive maps. The flexibility and interpretability of PS enable the EPS model and, consequently, the model we explore in this letter to simulate a broad range of behaviors in matching-to-sample experiments. The episodic memory, which is the basis for the agent's decision making, is formed during the training phase. In the EPS model, derived relations, which are not trained directly but can be established via the network's connections, are computed on demand during test-phase trials by likelihood reasoning. In this letter, we investigate the formation of derived relations in the EPS model using network enhancement (NE), an iterative diffusion process that yields an offline approach to the agent's decision making in the test phase. The NE process is applied after the training phase to denoise the memory network, so that derived relations are formed in the memory network and then retrieved during the test phase. During the NE phase, indirect relations are enhanced and the structure of the episodic memory changes. This approach can also be interpreted as the agent's replay after the training phase, which is in line with recent findings in behavioral and neuroscience studies. Compared with EPS, our model captures the formation of derived relations and other features, such as the nodal effect, in a more intrinsic manner. Decision making in the test phase is not an ad hoc computational method but rather a process of retrieving and updating cached relations from the memory network based on the test trial. To study the role of parameters in agent performance, we simulate the proposed model and discuss the results across various experimental settings.
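The idea of forming derived relations offline by diffusion can be illustrated with a toy sketch. The code below is not the NE update used in the letter or in the original NE method; it is a simplified diffusion chosen for illustration. The function name `diffuse`, the parameters `alpha` and `n_iter`, the added self-loops, and the three-stimulus network (A-B and B-C trained, A-C never trained) are all assumptions made for this example.

```python
import numpy as np


def diffuse(adjacency, alpha=0.8, n_iter=20):
    """Toy iterative diffusion on a weighted graph.

    Illustrative only: a simplified stand-in for a network-enhancement-style
    process, not the published update rule.
    """
    a = np.asarray(adjacency, dtype=float)
    # Assumption: add self-loops so that paths of any length can contribute.
    a = a + np.eye(a.shape[0])
    # Row-normalize to obtain a random-walk transition matrix.
    p = a / a.sum(axis=1, keepdims=True)
    w = p.copy()
    for _ in range(n_iter):
        # Spread each weight to neighboring pairs, then mix the
        # one-step structure back in.
        w = alpha * p @ w @ p.T + (1.0 - alpha) * p
    return w


# Hypothetical memory network after training: A-B and B-C were trained,
# A-C was not.
A, B, C = 0, 1, 2
trained = np.array(
    [
        [0.0, 1.0, 0.0],
        [1.0, 0.0, 1.0],
        [0.0, 1.0, 0.0],
    ]
)

enhanced = diffuse(trained)
print("A-C before:", trained[A, C])
print("A-B after: ", enhanced[A, B])
print("A-C after: ", enhanced[A, C])
```

In this toy setting the untrained A-C entry becomes nonzero after diffusion while remaining weaker than the directly trained A-B entry, which loosely mirrors the derived relations and nodal effect described above. A test-phase decision could then read the cached weights from `enhanced` instead of computing them on demand.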
