The importance of recurrent top-down synaptic connections for the anticipation of dynamic emotions

Different studies have shown the efficiency of a feed-forward neural network in categorizing basic emotional facial expressions. However, recent findings in psychology and cognitive neuroscience suggest that visual recognition is not a pure bottom-up process but likely involves top-down recurrent connectivity. In the present computational study, we compared the performances of a pure bottom-up neural network (a standard multi-layer perceptron, MLP) with a neural network involving recurrent top-down connections (a simple recurrent network, SRN) in the anticipation of emotional expressions. In two complementary simulations, results revealed that the SRN outperformed the MLP for ambiguous intensities in the temporal sequence, when the emotions were not fully depicted but when sufficient contextual information (related to previous time frames) was provided. Taken together, these results suggest that, despite the cost of recurrent connections in terms of energy and processing time for biological organisms, they can provide a substantial advantage for the fast recognition of uncertain visual signals.

[1]  Garrison W. Cottrell,et al.  Organization of face and object recognition in modular neural network models , 1999, Neural Networks.

[2]  Karl J. Friston,et al.  A theory of cortical responses , 2005, Philosophical Transactions of the Royal Society B: Biological Sciences.

[3]  Martial Mermillod,et al.  Coarse scales are sufficient for efficient categorization of emotional facial expressions: Evidence from neural computation , 2010, Neurocomputing.

[4]  Garrison W Cottrell,et al.  Central and peripheral vision for scene recognition: A neurocomputational modeling exploration. , 2017, Journal of vision.

[5]  Martial Mermillod,et al.  The importance of low spatial frequency information for recognising fearful facial expressions , 2009, Connect. Sci..

[6]  Bo Sun,et al.  Affect recognition from facial movements and body gestures by hierarchical deep spatio-temporal features and fusion strategy , 2017, Neural Networks.

[7]  Jeffrey L. Elman,et al.  Finding Structure in Time , 1990, Cogn. Sci..

[8]  Louise Kauffmann,et al.  Effective connectivity in the neural network underlying coarse-to-fine categorization of visual scenes. A dynamic causal modeling study , 2015, Brain and Cognition.

[9]  Nathalie Guyader,et al.  The Neural Bases of the Semantic Interference of Spatial Frequency-based Information in Scenes , 2015, Journal of Cognitive Neuroscience.

[10]  M. Mermillod,et al.  Is eye contact the key to the social brain? , 2010, Behavioral and Brain Sciences.

[11]  Tetsuo Furukawa,et al.  Tensor SOM and tensor GTM: Nonlinear tensor analysis by topographic mappings , 2016, Neural Networks.

[12]  Huaguang Zhang,et al.  Robust Exponential Stability of Recurrent Neural Networks With Multiple Time-Varying Delays , 2007, IEEE Transactions on Circuits and Systems II: Express Briefs.

[13]  Stefan Wermter,et al.  Multimodal emotional state recognition using sequence-dependent deep hierarchical features , 2015, Neural Networks.

[14]  Martial Mermillod,et al.  Are Coarse Scales Sufficient for Fast Detection of Visual Threat? , 2010, Psychological science.

[15]  G. Cottrell,et al.  EMPATH: A Neural Network that Categorizes Facial Expressions , 2002, Journal of Cognitive Neuroscience.

[16]  Catherine Auxiette,et al.  Contraintes perceptives et temporelles dans l’exploration du modèle de Ledoux , 2011, L’Année psychologique.

[17]  Yoshua Bengio,et al.  A Neural Probabilistic Language Model , 2003, J. Mach. Learn. Res..

[18]  Matthew H Tong,et al.  Why is the fusiform face area recruited for novel categories of expertise? A neurocomputational investigation , 2008, Brain Research.

[19]  P. Ekman Pictures of Facial Affect , 1976 .

[20]  Isabel Gauthier,et al.  Are Face and Object Recognition Independent? A Neurocomputational Modeling Exploration , 2016, Journal of Cognitive Neuroscience.

[21]  Martial Mermillod,et al.  Evidence of Rapid Modulation by Social Information of Subjective, Physiological, and Neural Responses to Emotional Expressions , 2018, Front. Behav. Neurosci..

[22]  M. Bar Visual objects in context , 2004, Nature Reviews Neuroscience.

[23]  R W Guillery,et al.  The role of the thalamus in the flow of information to the cortex. , 2002, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[24]  D. Hubel,et al.  Receptive fields and functional architecture of monkey striate cortex , 1968, The Journal of physiology.

[25]  Martial Mermillod,et al.  Enhanced embodied response following ambiguous emotional processing , 2012, Cognitive Processing.

[26]  M. Bar,et al.  Top-down predictions in the cognitive brain , 2007, Brain and Cognition.

[27]  M. Bar,et al.  Prediction, context, and competition in visual recognition , 2015, Annals of the New York Academy of Sciences.

[28]  J. P. Jones,et al.  The two-dimensional spatial structure of simple receptive fields in cat striate cortex. , 1987, Journal of neurophysiology.

[29]  Martial Mermillod,et al.  Reduction of interference effect by low spatial frequency information priming in an emotional Stroop task. , 2015, Journal of vision.

[30]  Martial Mermillod,et al.  Neural computation as a tool to differentiate perceptual from emotional processes: The case of anger superiority effect , 2009, Cognition.

[31]  Huaguang Zhang,et al.  Global Asymptotic Stability of Recurrent Neural Networks With Multiple Time-Varying Delays , 2008, IEEE Transactions on Neural Networks.

[32]  C. Summerfield,et al.  Expectation (and attention) in visual cognition , 2009, Trends in Cognitive Sciences.

[33]  E. Halgren,et al.  Top-down facilitation of visual recognition. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[34]  Rajesh P. N. Rao,et al.  Predictive coding in the visual cortex: a functional interpretation of some extra-classical receptive-field effects. , 1999 .

[35]  Moshe Bar,et al.  See it with feeling: affective predictions during object perception , 2009, Philosophical Transactions of the Royal Society B: Biological Sciences.

[36]  James M. Shine,et al.  Predictions penetrate perception: Converging insights from brain, behaviour and disorder , 2017, Consciousness and Cognition.

[37]  M. Bar,et al.  Magnocellular Projections as the Trigger of Top-Down Facilitation in Recognition , 2007, The Journal of Neuroscience.

[38]  Louise Kauffmann,et al.  The neural bases of spatial frequency processing during scene perception , 2014, Front. Integr. Neurosci..

[39]  Hanna Damasio,et al.  Single-neuron responses to emotional visual stimuli recorded in human ventral prefrontal cortex , 2001, Nature Neuroscience.

[40]  J. Bullier Integrated model of visual processing , 2001, Brain Research Reviews.

[41]  M. Bar The proactive brain: using analogies and associations to generate predictions , 2007, Trends in Cognitive Sciences.

[42]  Huaguang Zhang,et al.  A Comprehensive Review of Stability Analysis of Continuous-Time Recurrent Neural Networks , 2014, IEEE Transactions on Neural Networks and Learning Systems.

[43]  Skyler T. Hawk,et al.  Moving faces, looking places: validation of the Amsterdam Dynamic Facial Expression Set (ADFES). , 2011, Emotion.