Longitudinal ComBat: A method for harmonizing longitudinal multi-scanner imaging data

While aggregation of neuroimaging datasets from multiple sites and scanners can yield increased statistical power, it also presents challenges due to systematic scanner effects. This unwanted technical variability can introduce noise and bias into estimation of biological variability of interest. We propose a method for harmonizing longitudinal multi-scanner imaging data based on ComBat, a method originally developed for genomics and later adapted to cross-sectional neuroimaging data. Using longitudinal cortical thickness measurements from 663 participants in the Alzheimer’s Disease Neuroimaging Initiative (ADNI) study, we demonstrate the presence of additive and multiplicative scanner effects in various brain regions. We compare estimates of the association between diagnosis and change in cortical thickness over time using three versions of the ADNI data: unharmonized data, data harmonized using cross-sectional ComBat, and data harmonized using longitudinal ComBat. In simulation studies, we show that longitudinal ComBat is more powerful for detecting longitudinal change than cross-sectional ComBat and controls the type I error rate better than unharmonized data with scanner included as a covariate. The proposed method would be useful for other types of longitudinal data requiring harmonization, such as genomic data, or neuroimaging studies of neurodevelopment, psychiatric disorders, or other neurological diseases.
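The additive and multiplicative scanner effects mentioned above can be illustrated with a small location-scale adjustment. The sketch below is not the authors' implementation, and the abstract does not spell out the model, so everything here is an assumption made for illustration: a single simulated feature, made-up scanner shifts (`gamma`) and scalings (`delta`), subject dummy variables standing in for the subject-specific random intercepts of a mixed model, and no empirical Bayes shrinkage of the scanner parameters. The function names `simulate_longitudinal` and `harmonize_location_scale` are hypothetical.

```python
import numpy as np


def simulate_longitudinal(n_subjects=100, n_visits=8, seed=0):
    """Simulate one feature with made-up additive and multiplicative scanner effects."""
    rng = np.random.default_rng(seed)
    gamma = np.array([-0.3, 0.0, 0.3])  # additive scanner shifts (illustrative values)
    delta = np.array([0.5, 1.0, 2.0])   # multiplicative scanner scalings (illustrative values)
    subject = np.repeat(np.arange(n_subjects), n_visits)
    time = np.tile(np.arange(n_visits, dtype=float), n_subjects)
    scanner = rng.integers(0, gamma.size, size=subject.size)
    eta = rng.normal(0.0, 0.2, n_subjects)    # subject-specific intercepts
    eps = rng.normal(0.0, 0.1, subject.size)  # visit-level noise
    y = 2.5 - 0.02 * time + eta[subject] + gamma[scanner] + delta[scanner] * eps
    return y, time, subject, scanner


def harmonize_location_scale(y, time, subject, scanner):
    """Remove estimated scanner shift and scale from one feature (no empirical Bayes)."""
    n = y.size
    subj_ids, subj_idx = np.unique(subject, return_inverse=True)
    scan_ids, scan_idx = np.unique(scanner, return_inverse=True)

    # One-hot indicator matrices for subjects and scanners.
    S = np.zeros((n, subj_ids.size))
    S[np.arange(n), subj_idx] = 1.0
    B = np.zeros((n, scan_ids.size))
    B[np.arange(n), scan_idx] = 1.0

    # Design: subject intercepts, time slope, scanner dummies (first scanner as reference).
    X = np.column_stack([S, time, B[:, 1:]])
    coef, *_ = np.linalg.lstsq(X, y, rcond=None)
    fitted = X @ coef
    resid = y - fitted

    # Re-center scanner shifts so their sample-size-weighted sum is zero.
    gamma_ref = np.concatenate([[0.0], coef[subj_ids.size + 1:]])
    counts = B.sum(axis=0)
    gamma_hat = gamma_ref - np.sum(counts * gamma_ref) / n

    # Scanner-specific residual spread relative to the pooled residual spread.
    sigma = resid.std()
    delta_hat = np.array(
        [resid[scan_idx == k].std() for k in range(scan_ids.size)]
    ) / sigma

    # Keep subject and time structure, drop the scanner shift, rescale the residuals.
    y_harm = fitted - gamma_hat[scan_idx] + resid / delta_hat[scan_idx]
    return y_harm, gamma_hat, delta_hat


y, time, subject, scanner = simulate_longitudinal()
y_harm, gamma_hat, delta_hat = harmonize_location_scale(y, time, subject, scanner)
```

In this toy setup `gamma_hat` and `delta_hat` play the role of the additive and multiplicative scanner effects. A fuller treatment would fit a linear mixed model per feature and pool information across features when estimating the scanner parameters, which this sketch deliberately leaves out.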
