EROICA: exploring regions of interest with cluster analysis in large functional magnetic resonance imaging data sets

We describe a divide and conquer strategy for an exploratory data analysis (EDA) of large functional magnetic resonance imaging (fMRI) data sets. The need for an EDA to precede and complement a confirmatory model-based analysis is now well established. For complex fMRI experiments, where a prior model of the expected response cannot be posited, the sole option is to conduct an initial EDA. An EDA often discovers unanticipated behavior, allowing the experimenter to augment or even change the original hypothesis. In addition, the gross artifact behavior that EDA makes evident may aid the experimenter in deciding whether the data set is even usable, some additional preprocessing step is required, or the one used has introduced spurious effects. The proposed strategy, named EROICA for exploring regions of interest with cluster analysis, evolved from an empirical observation that a typical cluster of activation or artifact time series can be partitioned into three subsets: time series corrupted by significant trends and time series above and below some noise level. Moreover, the sought after common temporal behavior among the cluster time series can be extracted in an uncorrupted form from the above noise level time series alone. Thus, the key feature of EROICA is the initial partition of the data set into trendy and below the noise level time series, followed by the fuzzy cluster analysis (FCA) of the above the noise level time series to extract common cluster behavior patterns (centroids). The initial partition is based on a test statistic in the power spectrum domain. This step has significant ramifications: it greatly speeds up the FCA because of the much smaller number of time series to cluster; it makes the clustering results more robust because they are no longer affected by the trendy and noisy time series: the above the noise level time series can be further grouped according to the location of the spectral peak on the frequency axis, and these groups can be used to create a subset of initial centroids that greatly improves the convergence rate of the FCA; and the group of below the noise level time series (referred to as the noise pool) can be used as a data-driven representation of the underlying noise source. In the final step, each time series is modeled as a linear combination of the closest centroid plus noise. The noise pool is very convenient for obtaining thresholds when testing the significance of the model parameter without having to model or assume the distributional properties of the underlying noise source. To limit the number of false positives in the activation maps, the significance test also tests the time series power spectrum values at the frequency locations determined by the cluster centroid. EROICA is one of the analysis options offered by the general image-processing package EvIdent ® .

[1]  Ray L. Somorjai,et al.  Exploratory analysis of fMRI data by fuzzy clustering: philosophy, strategy, tactics, implementation , 2003 .

[2]  Ray L. Somorjai,et al.  Exploring regions of interest with cluster analysis (EROICA) using a spectral peak statistic for selecting and testing the significance of fMRI activation time-series , 2002, Artif. Intell. Medicine.

[3]  Alan C. Evans,et al.  A General Statistical Analysis for fMRI Data , 2000, NeuroImage.

[4]  Emery N. Brown,et al.  Locally Regularized Spatiotemporal Modeling and Model Comparison for Functional MRI , 2001, NeuroImage.

[5]  R. Somorjai,et al.  Purifying heterogeneous clusters using multiple correlation clustering , 2001, NeuroImage.

[6]  W. Richter,et al.  Assessing coactivation in fMRI by minimum spanning tree ordering , 2001, NeuroImage.

[7]  Richard Baumgartner,et al.  A new statistical inference test for fMRI time-series , 2001, NeuroImage.

[8]  Emery N. Brown,et al.  A signal estimation approach to functional MRI , 2001, IEEE Transactions on Medical Imaging.

[9]  Brian D. Ripley,et al.  A New Statistical Approach to Detecting Significant Activation in Functional MRI , 2000, NeuroImage.

[10]  L Ryner,et al.  Novelty indices: identifiers of potentially interesting time-courses in functional MRI data. , 2000, Magnetic resonance imaging.

[11]  W. Richter,et al.  Performance comparison of novelty indices used in blind source separation for preprocessing fMRI time-series , 2000, NeuroImage.

[12]  Ravi S. Menon,et al.  Motor Area Activity During Mental Rotation Studied by Time-Resolved Single-Trial fMRI , 2000, Journal of Cognitive Neuroscience.

[13]  R Baumgartner,et al.  Comparison of two exploratory data analysis methods for fMRI: fuzzy clustering vs. principal component analysis. , 2000, Magnetic resonance imaging.

[14]  R Baumgartner,et al.  Assessment of cluster homogeneity in fMRI data using Kendall's coefficient of concordance. , 1999, Magnetic resonance imaging.

[15]  L. K. Hansen,et al.  On Clustering fMRI Time Series , 1999, NeuroImage.

[16]  L. Ryner Comparison of Exploratory Data Analysis Methods in fMR 1 : Fuzzy Clustering and Principal Component Analysis , 1999 .

[17]  L. Ryner,et al.  Retrograde cerebral perfusion results in flow distribution abnormalities and neuronal damage. A magnetic resonance imaging and histopathological study in pigs. , 1998, Circulation.

[18]  K Ugurbil,et al.  Human hippocampal long‐term sustained response during word memory processing , 1998, Neuroreport.

[19]  R. Deslauriers,et al.  Does simultaneous antegrade/retrograde cardioplegia improve myocardial perfusion in the areas at risk? A magnetic resonance perfusion imaging study in isolated pig hearts. , 1998, The Journal of thoracic and cardiovascular surgery.

[20]  C. Windischberger,et al.  Quantification in functional magnetic resonance imaging: fuzzy clustering vs. correlation analysis. , 1998, Magnetic resonance imaging.

[21]  R. Weisskoff,et al.  Effect of temporal autocorrelation due to physiological noise and stimulus paradigm on voxel‐level false‐positive rates in fMRI , 1998, Human brain mapping.

[22]  R Baumgartner,et al.  Fuzzy clustering of gradient‐echo functional MRI in the human visual cortex. Part II: Quantification , 1997, Journal of magnetic resonance imaging : JMRI.

[23]  R Baumgartner,et al.  Fuzzy clustering of gradient‐echo functional MRI in the human visual cortex. Part I: Reproducibility , 1997, Journal of magnetic resonance imaging : JMRI.

[24]  R. Deslauriers,et al.  Retrograde cerebral perfusion provides limited distribution of blood to the brain: a study in pigs. , 1997, The Journal of thoracic and cardiovascular surgery.

[25]  R L Somorjai,et al.  Fuzzy C-means clustering and principal component analysis of time series from near-infrared imaging of forearm ischemia. , 1997, Computerized medical imaging and graphics : the official journal of the Computerized Medical Imaging Society.

[26]  Rajesh N. Davé,et al.  Robust clustering methods: a unified view , 1997, IEEE Trans. Fuzzy Syst..

[27]  M. D’Esposito,et al.  Empirical Analyses of BOLD fMRI Statistics , 1997, NeuroImage.

[28]  Scott L. Zeger,et al.  Non‐linear Fourier Time Series Analysis for Human Brain Mapping by Functional Magnetic Resonance Imaging , 1997 .

[29]  R. Somorjai,et al.  The utility of fuzzy clustering in identifying diverse activations in fMRI , 1996, NeuroImage.

[30]  R. Somorjai,et al.  Activation and deactivation in functional brain images , 1996, NeuroImage.

[31]  N Lange Statistical approaches to human brain mapping by functional magnetic resonance imaging. , 1996, Statistics in medicine.

[32]  E. Bullmore,et al.  Statistical methods of estimation and inference for functional MR image analysis , 1996, Magnetic resonance in medicine.

[33]  Karl J. Friston,et al.  Analysis of fMRI Time-Series Revisited—Again , 1995, NeuroImage.

[34]  R. Turner,et al.  Characterizing Evoked Hemodynamics with fMRI , 1995, NeuroImage.

[35]  Karl J. Friston,et al.  Analysis of fMRI Time-Series Revisited , 1995, NeuroImage.

[36]  Karl J. Friston,et al.  Statistical parametric maps in functional imaging: A general linear approach , 1994 .

[37]  P. Brockwell,et al.  Time Series: Theory and Methods , 2013 .

[38]  M. Braga,et al.  Exploratory Data Analysis , 2018, Encyclopedia of Social Network Analysis and Mining. 2nd Ed..

[39]  B. Silverman,et al.  Some Aspects of the Spline Smoothing Approach to Non‐Parametric Regression Curve Fitting , 1985 .

[40]  D. B. Preston Spectral Analysis and Time Series , 1983 .

[41]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[42]  Bruno O. Shubert,et al.  Random variables and stochastic processes , 1979 .

[43]  R. Fisher Tests of significance in harmonic analysis , 1929 .