Robust is not necessarily reliable: From within-subjects fMRI contrasts to between-subjects comparisons

&NA; Advances in cognitive and affective neuroscience come largely from within‐subjects comparisons, in which the functional significance of neural activity is determined by contrasting two or more experimental conditions. Clinical and social neuroscience studies have attempted to leverage between‐subject variability in such condition differences to better understand psychopathology and other individual differences. Shifting from within‐to between‐subjects comparisons requires that measures have adequate internal consistency to function as individual difference variables. This is particularly relevant for difference scores—which have lower reliability. The field has assumed reasonable internal consistency of neural measures based on consistent findings across studies (i.e., if a within‐subject difference in neural activity is robust, then it must be reliable). Using one of the most common fMRI paradigms in the clinical neuroscience literature (i.e., a face‐ and shape‐matching task), in a large sample of adolescents (N = 139) we replicate a robust finding: amygdala activation is greater for faces than shapes. Moreover, we demonstrate that the internal consistency of the amygdala in face and shape blocks was excellent (Spearman‐Brown corrected reliability [SB] > .94). However, the internal consistency of the activation difference between faces and shapes was nearly zero (SB = −.06). This reflected the fact that the amygdala response to faces and shapes was highly correlated (r = .97) across individuals. Increased neural activation to faces versus shapes could not possibly function as an individual difference measure in these data—illustrating how neural activation can be robust within subjects, but unreliable as an individual difference measure. Strong and reproducible condition differences in neural activity are not necessarily well‐suited for individual differences research—and neuroimaging studies should always report the internal consistency of, and correlations between, activations used in individual differences research. HighlightsRobust within‐subject differences are often used as individual difference measures.A robust within‐subject difference may not be internally consistent.Difference scores are problematic when constituent measures are highly correlated.Subtraction‐based measures should report the correlation between the two variables.Internal consistencies should be reported for difference and constituent scores

[1]  Thomas E. Nichols,et al.  Thresholding of Statistical Maps in Functional Neuroimaging Using the False Discovery Rate , 2002, NeuroImage.

[2]  E. Pitman A NOTE ON NORMAL CORRELATION , 1939 .

[3]  Daniel R. Weinberger,et al.  Variation of Human Amygdala Response During Threatening Stimuli as a Function of 5′HTTLPR Genotype and Personality Style , 2005, Biological Psychiatry.

[4]  Jean-Luc Anton,et al.  Region of interest analysis using an SPM toolbox , 2010 .

[5]  Marko Wilke,et al.  Assessment of spatial normalization of whole‐brain magnetic resonance images in children , 2002, Human brain mapping.

[6]  E. Leibenluft,et al.  The NIMH Child Emotional Faces Picture Set (NIMH‐ChEFS): a new set of children's facial emotion stimuli , 2011, International journal of methods in psychiatric research.

[7]  Karine Sergerie,et al.  The role of the amygdala in emotional processing: A quantitative meta-analysis of functional neuroimaging studies , 2008, Neuroscience & Biobehavioral Reviews.

[8]  Francesco Fera,et al.  The Amygdala Response to Emotional Stimuli: A Comparison of Faces and Scenes , 2002, NeuroImage.

[9]  R. Kotov,et al.  Psychometrics and the Neuroscience of Individual Differences: Internal Consistency Limits Between-Subjects Effects , 2017, Journal of abnormal psychology.

[10]  Frederic M. Lord,et al.  The Measurement of Growth , 1956 .

[11]  Andreas Meyer-Lindenberg,et al.  Evidence that altered amygdala activity in schizophrenia is related to clinical state and not genetic risk. , 2009, The American journal of psychiatry.

[12]  Francesco Fera,et al.  Dopamine Modulates the Response of the Human Amygdala: A Study in Parkinson's Disease , 2002, The Journal of Neuroscience.

[13]  M. D’Esposito,et al.  The Influence of Working-Memory Demand and Subject Performance on Prefrontal Cortical Activity , 2002, Journal of Cognitive Neuroscience.

[14]  Isabel Gauthier,et al.  Reliability of composite-task measurements of holistic face processing , 2015, Behavior research methods.

[15]  L. Cronbach,et al.  How we should measure "change": Or should we? , 1970 .

[16]  Greg Hajcak,et al.  Considering ERP difference scores as individual difference measures: Issues with subtraction and alternative approaches. , 2017, Psychophysiology.

[17]  Moriah E. Thomason,et al.  The stimuli drive the response: An fMRI study of youth processing adult or child emotional face stimuli , 2013, NeuroImage.

[18]  Geraldine Dawson,et al.  Association between amygdala response to emotional faces and social anxiety in autism spectrum disorders , 2010, Neuropsychologia.

[19]  Joseph E LeDoux The Emotional Brain, Fear, and the Amygdala , 2003, Cellular and Molecular Neurobiology.

[20]  J. Willett Chapter 9: Questions and Answers in the Measurement of Change , 1988 .

[21]  Michael F Egan,et al.  A susceptibility gene for affective disorders and the response of the human amygdala. , 2005, Archives of general psychiatry.

[22]  W. A. Morgan TEST FOR THE SIGNIFICANCE OF THE DIFFERENCE BETWEEN THE TWO VARIANCES IN A SAMPLE FROM A NORMAL BIVARIATE POPULATION , 1939 .

[23]  C. Patrick,et al.  RDoC: Translating promise into progress. , 2016, Psychophysiology.

[24]  Justin S. Feinstein,et al.  Increased amygdala and insula activation during emotion processing in anxiety-prone subjects. , 2007, The American journal of psychiatry.

[25]  John B. Willett,et al.  Questions and Answers in the Measurement of Change , 1988 .

[26]  A. Hariri,et al.  Individual Differences in Typical Reappraisal Use Predict Amygdala and Prefrontal Responses , 2009, Biological Psychiatry.

[27]  Annchen R. Knodt,et al.  The reliability paradox: Why robust cognitive tasks do not produce reliable individual differences , 2017, Behavior Research Methods.

[28]  P. Jezzard,et al.  Correction for geometric distortion in echo planar images from B0 field variations , 1995, Magnetic resonance in medicine.