r Human Brain Mapping 29:958–972 (2008) r Test–Retest and Between-Site Reliability in a Multicenter fMRI Study

In the present report, estimates of test–retest and between‐site reliability of fMRI assessments were produced in the context of a multicenter fMRI reliability study (FBIRN Phase 1, www.nbirn.net). Five subjects were scanned on 10 MRI scanners on two occasions. The fMRI task was a simple block design sensorimotor task. The impulse response functions to the stimulation block were derived using an FIR‐deconvolution analysis with FMRISTAT. Six functionally‐derived ROIs covering the visual, auditory and motor cortices, created from a prior analysis, were used. Two dependent variables were compared: percent signal change and contrast‐to‐noise‐ratio. Reliability was assessed with intraclass correlation coefficients derived from a variance components analysis. Test–retest reliability was high, but initially, between‐site reliability was low, indicating a strong contribution from site and site‐by‐subject variance. However, a number of factors that can markedly improve between‐site reliability were uncovered, including increasing the size of the ROIs, adjusting for smoothness differences, and inclusion of additional runs. By employing multiple steps, between‐site reliability for 3T scanners was increased by 123%. Dropping one site at a time and assessing reliability can be a useful method of assessing the sensitivity of the results to particular sites. These findings should provide guidance toothers on the best practices for future multicenter studies. Hum Brain Mapp, 2008. © 2007 Wiley‐Liss, Inc.

[1]  M. R. Novick,et al.  Statistical Theories of Mental Test Scores. , 1971 .

[2]  Donald B. Rubin,et al.  The Dependability of Behavioral Measurements: Theory of Generalizability for Scores and Profiles. , 1974 .

[3]  Eva Nick,et al.  The dependability of behavioral measurements: theory of generalizability for scores and profiles , 1973 .

[4]  J. Fleiss,et al.  Intraclass correlations: uses in assessing rater reliability. , 1979, Psychological bulletin.

[5]  D. Cicchetti,et al.  Developing criteria for establishing interrater reliability of specific items: applications to assessment of adaptive behavior. , 1981, American journal of mental deficiency.

[6]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[7]  A. Damasio,et al.  Improving functional imaging techniques: the dream of a single image for a single mental event. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[8]  R W Cox,et al.  AFNI: software for analysis and visualization of functional magnetic resonance neuroimages. , 1996, Computers and biomedical research, an international journal.

[9]  M. Lowe,et al.  Spatially filtering functional magnetic resonance imaging data , 1997, Magnetic resonance in medicine.

[10]  T H Le,et al.  Methods for assessing accuracy and reliability in functional MRI , 1997, NMR in biomedicine.

[11]  C. Genovese,et al.  Estimating test‐retest reliability in functional MR imaging I: Statistical methodology , 1997, Magnetic resonance in medicine.

[12]  D C Noll,et al.  Estimating test‐retest reliability in functional MR imaging II: Application to motor and cognitive activation studies , 1997, Magnetic resonance in medicine.

[13]  S. Walter,et al.  Sample size and optimal designs for reliability studies. , 1998, Statistics in medicine.

[14]  Karl J. Friston,et al.  Multisubject fMRI Studies and Conjunction Analyses , 1999, NeuroImage.

[15]  M S Cohen,et al.  Stability, repeatability, and the expression of signal magnitude in functional magnetic resonance imaging , 1999, Journal of magnetic resonance imaging : JMRI.

[16]  R. Charter,et al.  Sample size requirements for precise estimates of reliability, generalizability, and validity coefficients. , 1999, Journal of clinical and experimental neuropsychology.

[17]  Alan C. Evans,et al.  A general statistical analysis for fMRI data , 2000, NeuroImage.

[18]  D. Cicchetti Methodological Commentary The Precision of Reliability and Validity Estimates Re-Visited: Distinguishing Between Clinical and Statistical Significance of Sample Size Requirements , 2001 .

[19]  B. Giraudeau,et al.  Planning a reproducibility study: how many subjects and how many replicates per subject for an expected width of the 95 per cent confidence interval of the intraclass correlation coefficient , 2001, Statistics in medicine.

[20]  S. Rauch,et al.  Test-retest reliability of a functional MRI working memory paradigm in normal and schizophrenic subjects. , 2001, The American journal of psychiatry.

[21]  Ranjan Maitra,et al.  Test‐retest reliability estimation of functional MRI data , 2002, Magnetic resonance in medicine.

[22]  N Jon Shah,et al.  Assessment of reliability in functional imaging studies , 2003, Journal of magnetic resonance imaging : JMRI.

[23]  Javier Quintana,et al.  Prefrontal–posterior parietal networks in schizophrenia: primary dysfunctions and secondary compensations , 2003, Biological Psychiatry.

[24]  Graham Dunn,et al.  Statistical Evaluation of Measurement Errors: Design and Analysis of Reliability Studies , 2004 .

[25]  Charles R. G. Guttmann,et al.  Functional MRI of auditory verbal working memory: long-term reproducibility analysis , 2004, NeuroImage.

[26]  A. Kelly,et al.  Human functional neuroimaging of brain changes associated with practice. , 2005, Cerebral cortex.

[27]  Gregory G. Brown,et al.  Reproducibility of functional MR imaging: preliminary results of prospective multi-institutional study performed by Biomedical Informatics Research Network. , 2005, Radiology.

[28]  Russell A. Poldrack,et al.  Long-term test–retest reliability of functional MRI in a classification learning task , 2006, NeuroImage.

[29]  Gary H. Glover,et al.  Reducing interscanner variability of activation in a multicenter fMRI study: Controlling for signal-to-fluctuation-noise-ratio (SFNR) differences , 2006, NeuroImage.

[30]  Gary H. Glover,et al.  Reducing inter-scanner variability of activation in a multicenter fMRI study: Role of smoothness equalization , 2006, NeuroImage.

[31]  John A. D. Aston,et al.  A method for generating reproducible evidence in fMRI studies , 2006, NeuroImage.

[32]  Randy L. Gollub,et al.  Test–retest study of fMRI signal change evoked by electroacupuncture stimulation , 2007, NeuroImage.