A Multimodal Approach to Investigate the Neural Mechanisms of Real World Social Vision

Eye tracking and other behavioral measurements collected from patient-participants in their hospital rooms afford a unique opportunity to study immersive natural behavior for basic and clinical-translational research, and also requires addressing important logistical, technical, and ethical challenges. Hospital rooms provide the opportunity to richly capture both clinically relevant and ordinary natural behavior. As clinical settings, they add the potential to study the relationship between behavior and physiology by collecting physiological data synchronized to behavioral measures from participants. Combining eye-tracking, other behavioral measures, and physiological measurements enables clinical-translational research into understanding the participants' disorders and clinician-patient interactions, as well as basic research into natural, real world behavior as participants eat, read, converse with friends and family, etc. Here we describe a paradigm in individuals undergoing surgical treatment for epilepsy who spend 1-2 weeks in the hospital with electrodes implanted in their brain to determine the source of their seizures. This provides the unique opportunity to record behavior using eye tracking glasses customized to address clinically-related ergonomic concerns, synchronized direct neural recordings, use computer vision to assist with video annotation, and apply multivariate machine learning analyses to multimodal data encompassing hours of natural behavior. We discuss the acquisition, quality control, annotation, and analysis pipelines to study the neural basis of real world social and affective perception during natural conversations with friends and family in participants with epilepsy. We also discuss clinical, logistical, and ethical and privacy considerations that must be addressed to acquire high quality multimodal data in this setting.

[1]  A. Schwerdtfeger,et al.  Physiological linkage during interactions between doctors and cancer patients. , 2021, Social science & medicine.

[2]  Anna L. Gert,et al.  Wild lab: A naturalistic free viewing experiment reveals previously unknown EEG signatures of face processing , 2021, bioRxiv.

[3]  K. Ueda,et al.  Contribution of Eye-Tracking to Study Cognitive Impairments Among Clinical Populations , 2021, Frontiers in Psychology.

[4]  Mackenzie W. Mathis,et al.  Multi-animal pose estimation and tracking with DeepLabCut , 2021, bioRxiv.

[5]  E. Liebenthal,et al.  Computational analysis of spoken language in acute psychosis and mania , 2021, Schizophrenia Research.

[6]  D. Melcher,et al.  The behavioural preview effect with faces is susceptible to statistical regularities: Evidence for predictive processing across the saccade , 2021, Scientific reports.

[7]  Michael J. Ward,et al.  Multiple Adjoining Word- and Face-Selective Regions in Ventral Temporal Cortex Exhibit Distinct Dynamics , 2020, The Journal of Neuroscience.

[8]  Cory S. Inman,et al.  Boundary-anchored neural mechanisms of location-encoding for self and others , 2020, Nature.

[9]  Michal Muszynski,et al.  Depression Severity Assessment for Adolescents at High Risk of Mental Disorders , 2020, ICMI.

[10]  Roberto A. Gulli,et al.  The application of noninvasive, restraint-free eye-tracking methods for use with nonhuman primates , 2020, Behavior Research Methods.

[11]  Gijs A. Holleman,et al.  The ‘Real-World Approach’ and Its Problems: A Critique of the Term Ecological Validity , 2020, Frontiers in Psychology.

[12]  Francisco Cercas,et al.  Physiological and Behavior Monitoring Systems for Smart Healthcare Environments: A Review , 2020, Sensors.

[13]  Itzhak Fried,et al.  Wireless Programmable Recording and Stimulation of Deep Brain Activity in Freely Moving Humans , 2020, Neuron.

[14]  Pavel Korshunov,et al.  Pyannote.Audio: Neural Building Blocks for Speaker Diarization , 2019, ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[15]  Leibny Paola García-Perera,et al.  End-to-End Domain-Adversarial Voice Activity Detection , 2019, INTERSPEECH.

[16]  Matthias Bethge,et al.  Pretraining boosts out-of-domain robustness for pose estimation , 2019, 2021 IEEE Winter Conference on Applications of Computer Vision (WACV).

[17]  Saskia Haegens,et al.  Dynamic Modulation of Cortical Excitability during Visual Active Sensing , 2019, Cell reports.

[18]  Xu Tan,et al.  Almost Unsupervised Text to Speech and Automatic Speech Recognition , 2019, ICML.

[19]  David Melcher,et al.  Post-Saccadic Face Processing Is Modulated by Pre-Saccadic Preview: Evidence from Fixation-Related Potentials , 2019, The Journal of Neuroscience.

[20]  J. Erichsen,et al.  The potential and value of objective eye tracking in the ophthalmology clinic , 2019, Eye.

[21]  K. Bales,et al.  Non-invasive Eye Tracking Methods for New World and Old World Monkeys , 2019, Front. Behav. Neurosci..

[22]  Catherine Perrodin,et al.  Are We Ready for Real-world Neuroscience? , 2019, Journal of Cognitive Neuroscience.

[23]  Yaser Sheikh,et al.  OpenPose: Realtime Multi-Person 2D Pose Estimation Using Part Affinity Fields , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[24]  Matthias Bethge,et al.  Using DeepLabCut for 3D markerless pose estimation across species and behaviors , 2018, Nature Protocols.

[25]  Bruno Rossion,et al.  The inferior occipital gyrus is a major cortical source of the face‐evoked N170: Evidence from simultaneous scalp and intracerebral human recordings , 2018, Human brain mapping.

[26]  Benjamin W Tatler,et al.  Gaze in a real-world social interaction: A dual eye-tracking study , 2018, Quarterly journal of experimental psychology.

[27]  Claude Barras,et al.  Neural Speech Turn Segmentation and Affinity Propagation for Speaker Diarization , 2018, INTERSPEECH.

[28]  U. Schnyder,et al.  Emotion Transfer, Emotion Regulation, and Empathy-Related Processes in Physician-Patient Interactions and Their Association With Physician Well-Being: A Theoretical Model , 2018, Front. Psychiatry.

[29]  Kevin M. Cury,et al.  DeepLabCut: markerless pose estimation of user-defined body parts with deep learning , 2018, Nature Neuroscience.

[30]  Manfredo Atzori,et al.  Head-mounted eye gaze tracking devices: An overview of modern devices and recent advances , 2018, Journal of rehabilitation and assistive technologies engineering.

[31]  Ingrid R. Olson,et al.  The Original Social Network: White Matter and Social Cognition , 2018, Trends in Cognitive Sciences.

[32]  Louis-Philippe Morency,et al.  OpenFace 2.0: Facial Behavior Analysis Toolkit , 2018, 2018 13th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2018).

[33]  Björn Schuller,et al.  Deep Learning for Environmentally Robust Speech Recognition: An Overview of Recent Developments , 2017 .

[34]  Colin Camerer,et al.  Differences in Behavior and Brain Activity during Hypothetical and Real Choices , 2017, Trends in Cognitive Sciences.

[35]  Gil G. Rosenthal,et al.  What artifice can and cannot tell us about animal behavior , 2016, Current zoology.

[36]  Bernt Schiele,et al.  DeeperCut: A Deeper, Stronger, and Faster Multi-person Pose Estimation Model , 2016, ECCV.

[37]  Daniel C. Richardson,et al.  Breaking the Fourth Wall of Cognitive Science , 2016 .

[38]  G. Kuhn,et al.  Don’t be fooled! Attentional responses to social cues in a face-to-face and video magic trick reveals greater top-down control for overt than covert attention , 2016, Cognition.

[39]  Ming Hu,et al.  A wireless transmission neural interface system for unconstrained non-human primates , 2015, Journal of neural engineering.

[40]  Ali Farhadi,et al.  You Only Look Once: Unified, Real-Time Object Detection , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[41]  Jia Liu,et al.  Neural decoding reveals impaired face configural processing in the right fusiform face area of individuals with developmental prosopagnosia. , 2015, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[42]  Erich Elsen,et al.  Deep Speech: Scaling up end-to-end speech recognition , 2014, ArXiv.

[43]  John A. Pyles,et al.  Dynamic Encoding of Face Information in the Human Fusiform Gyrus , 2014, Nature Communications.

[44]  James J. Gibson,et al.  The Ecological Approach to Visual Perception: Classic Edition , 2014 .

[45]  L. Astolfi,et al.  Social neuroscience and hyperscanning techniques: Past, present and future , 2014, Neuroscience & Biobehavioral Reviews.

[46]  Saif Mohammad,et al.  Sentiment Analysis of Short Informal Texts , 2014, J. Artif. Intell. Res..

[47]  K. Grill-Spector,et al.  Electrical Stimulation of Human Fusiform Face-Selective Regions Distorts Face Perception , 2012, The Journal of Neuroscience.

[48]  Christopher J. Honey,et al.  Future trends in Neuroimaging: Neural processes as expressed within real-life contexts , 2012, NeuroImage.

[49]  Mohamed Chetouani,et al.  Interpersonal Synchrony: A Survey of Evaluation Methods across Disciplines , 2012, IEEE Transactions on Affective Computing.

[50]  Evan F. Risko,et al.  Social attention with real versus reel stimuli: toward an empirical approach to concerns about ecological validity , 2012, Front. Hum. Neurosci..

[51]  Xiaoqin Wang,et al.  Wireless multi-channel single unit recording in freely moving and vocalizing primates , 2012, Journal of Neuroscience Methods.

[52]  J. Hietanen,et al.  Does it make a difference if I have an eye contact with you or with your picture? An ERP study. , 2011, Social cognitive and affective neuroscience.

[53]  Richard M. Leahy,et al.  Brainstorm: A User-Friendly Application for MEG/EEG Analysis , 2011, Comput. Intell. Neurosci..

[54]  Yadin Dudai,et al.  Fear Thou Not: Activity of Frontal and Temporal Circuits in Moments of Real-Life Courage , 2010, Neuron.

[55]  Nick F. Ramsey,et al.  Automated electrocorticographic electrode localization on individually rendered brain surfaces , 2010, Journal of Neuroscience Methods.

[56]  K. Ochsner,et al.  The Need for a Cognitive Neuroscience of Naturalistic Social Cognition , 2009, Annals of the New York Academy of Sciences.

[57]  A. Finset,et al.  The medical consultation viewed as a value chain: a neurobehavioral approach to emotion regulation in doctor-patient interaction. , 2009, Patient education and counseling.

[58]  Doris Y. Tsao,et al.  Mechanisms of face perception. , 2008, Annual review of neuroscience.

[59]  S. Shiffman,et al.  Ecological momentary assessment. , 2008, Annual review of clinical psychology.

[60]  Anders M. Dale,et al.  An automated labeling system for subdividing the human cerebral cortex on MRI scans into gyral based regions of interest , 2006, NeuroImage.

[61]  Suramya Tomar,et al.  Converting video formats with FFmpeg , 2006 .

[62]  D. Kiesler,et al.  Optimal matches of patient preferences for information, decision-making and interpersonal behavior: evidence, models and interventions. , 2006, Patient education and counseling.

[63]  Frank H. Wilhelm,et al.  Continuous electronic data capture of physiology, behavior and experience in real life: towards ecological momentary assessment of emotion , 2006, Interact. Comput..

[64]  Lynn Clemow,et al.  Anxiety and outcome expectations predict the white-coat effect , 2005, Blood pressure monitoring.

[65]  Andrew Jackson,et al.  An autonomous implantable computer for neural recording and stimulation in unrestrained primates , 2005, Journal of Neuroscience Methods.

[66]  Thomas G. Pickering,et al.  What is the white-coat effect and how should it be measured? , 2002, Blood pressure monitoring.

[67]  L. Parker,et al.  INFORMATION(AL) MATTERS: BIOETHICS AND THE BOUNDARIES OF THE PUBLIC AND THE PRIVATE , 2002, Social Philosophy and Policy.

[68]  Allan Aasbjerg Nielsen,et al.  Multiset canonical correlations analysis and multispectral, truly multitemporal remote sensing data , 2002, IEEE Trans. Image Process..

[69]  J. Keenan,et al.  Lesions of the fusiform face area impair perception of facial configuration in prosopagnosia , 2002, Neurology.

[70]  L. Parker,et al.  Trial Design and Informed Consent for a Clinic-Based Study With a Treatment as Usual Control Arm , 2002, Ethics & behavior.

[71]  N. Kanwisher Domain specificity in face perception , 2000, Nature Neuroscience.

[72]  T. Allison,et al.  Electrophysiological studies of human face perception. I: Potentials generated in occipitotemporal cortex by face and non-face stimuli. , 1999, Cerebral cortex.

[73]  P. Johnson,et al.  Remote Continuous Physiological Monitoring in the Home , 1996, Journal of telemedicine and telecare.

[74]  B. Korsch,et al.  Gaps in doctor-patient communication. 1. Doctor-patient interaction and patient satisfaction. , 1968, Pediatrics.

[75]  Brian R. Baucom,et al.  Behavioral, Affective, and Physiological Monitoring , 2016 .

[76]  Judith A. Hall,et al.  Studies of doctor-patient interaction. , 1989, Annual review of public health.

[77]  G. F. Judisch,et al.  Informed Consent. Legal Theory and Clinical Practice , 1988 .

[78]  P. Ekman,et al.  Facial action coding system: a technique for the measurement of facial movement , 1978 .