The Quantitative Evaluation of Functional Neuroimaging Experiments: Mutual Information Learning Curves

Learning curves are presented as an unbiased means for evaluating the performance of models for neuroimaging data analysis. The learning curve measures the predictive performance in terms of the generalization or prediction error as a function of the number of independent examples (e.g., subjects) used to determine the parameters in the model. Cross-validation resampling is used to obtain unbiased estimates of a generic multivariate Gaussian classifier, for training set sizes from 2 to 16 subjects. We apply the framework to four different activation experiments, in this case [(15)O]water data sets, although the framework is equally valid for multisubject fMRI studies. We demonstrate how the prediction error can be expressed as the mutual information between the scan and the scan label, measured in units of bits. The mutual information learning curve can be used to evaluate the impact of different methodological choices, e.g., classification label schemes, preprocessing choices. Another application for the learning curve is to examine the model performance using bias/variance considerations enabling the researcher to determine if the model performance is limited by statistical bias or variance. We furthermore present the sensitivity map as a general method for extracting activation maps from statistical models within the probabilistic framework and illustrate relationships between mutual information and pattern reproducibility as derived in the NPAIRS framework described in a companion paper.

[1]  D. A. Rottenberg,et al.  PET Studies of Perceptuomotor Learning in a Mirror-reversal Paradigm , 1998, NeuroImage.

[2]  Stephen C. Strother,et al.  Effects of Changes in Experimental Design on PET Studies of Isometric Force , 2001, NeuroImage.

[3]  L.K. Hansen,et al.  Design and evaluation of neural classifiers application to skin lesion classification , 1995, Proceedings of 1995 IEEE Workshop on Neural Networks for Signal Processing.

[4]  Stephen C. Strother,et al.  Penalized Discriminant Analysis of [15O]-water PET Brain Images with Prediction Error Selection of Smoothness and Regularization , 2001, IEEE Trans. Medical Imaging.

[5]  Lars Kai Hansen,et al.  Massive Weight Sharing: A Cure For Extremely Ill-Posed Problems , 1994 .

[6]  S C Strother,et al.  Improved resolution for PET volume imaging through three-dimensional iterative reconstruction. , 1997, Journal of nuclear medicine : official publication, Society of Nuclear Medicine.

[7]  R. Woods,et al.  Principal Component Analysis and the Scaled Subprofile Model Compared to Intersubject Averaging and Statistical Parametric Mapping: I. “Functional Connectivity” of the Human Motor System Studied with [15O]Water PET , 1995, Journal of cerebral blood flow and metabolism : official journal of the International Society of Cerebral Blood Flow and Metabolism.

[8]  Rafal Kustra,et al.  Statistical analysis of medical images with applications to neuroimaging , 2000 .

[9]  David G. Stork,et al.  Pattern Classification , 1973 .

[10]  Karl J. Friston,et al.  Spatial registration and normalization of images , 1995 .

[11]  N. L. Johnson,et al.  Multivariate Analysis , 1958, Nature.

[12]  R. Tibshirani,et al.  Penalized Discriminant Analysis , 1995 .

[13]  Tom Heskes,et al.  Bias/Variance Decompositions for Likelihood-Based Estimators , 1998, Neural Computation.

[14]  Heekuck Oh,et al.  Neural Networks for Pattern Recognition , 1993, Adv. Comput..

[15]  Karl J. Friston,et al.  Statistical parametric maps in functional imaging: A general linear approach , 1994 .

[16]  Lars Kai Hansen,et al.  Enhancing the Multivariate Signal of [15O] water PET Studies with a New Non-Linear Neuroanatomical Registration Algorithm , 1999, IEEE Trans. Medical Imaging.

[17]  Lars Kai Hansen,et al.  Nonlinear versus Linear Models in Functional Neuroimaging: Learning Curves and Generalization Crossover , 1997, IPMI.

[18]  Gene H. Golub,et al.  Matrix computations , 1983 .

[19]  William H. Press,et al.  Numerical recipes , 1990 .

[20]  S. C. Strother,et al.  Generalization performance of nonlinear vs. Linear models for [15O]water PET functional activation studies , 1996, NeuroImage.

[21]  Barry J. Richmond,et al.  Information spectroscopy of single neurons , 1995 .

[22]  L. K. Hansen,et al.  The Quantitative Evaluation of Functional Neuroimaging Experiments: The NPAIRS Data Analysis Framework , 2000, NeuroImage.

[23]  Karl J. Friston,et al.  Characterizing the Response of PET and fMRI Data Using Multivariate Linear Models , 1997, NeuroImage.

[24]  Lars Kai Hansen,et al.  A Multivariate Approach to Functional Neuro Modeling , 1998 .

[25]  S. Strother,et al.  Penalized discriminant analysis of [/sup 15/O]-water PET brain images with prediction error selection of smoothness and regularization hyperparameters , 2001, IEEE Transactions on Medical Imaging.

[26]  Karl J. Friston,et al.  A multivariate analysis of PET activation studies , 1996, Human brain mapping.

[27]  Thomas E. Nichols,et al.  Threshold determination using the false discovery rate , 2001, NeuroImage.

[28]  S. C. Strother,et al.  Multidimensional state-spaces for fMRI and PET activation studies , 1996, NeuroImage.

[29]  Christopher M. Bishop,et al.  Bayesian PCA , 1998, NIPS.

[30]  Elie Bienenstock,et al.  Neural Networks and the Bias/Variance Dilemma , 1992, Neural Computation.

[31]  John C. Gore,et al.  ROC Analysis of Statistical Methods Used in Functional MRI: Individual Subjects , 1999, NeuroImage.

[32]  Lars Kai Hansen,et al.  Generalizable Singular Value Decomposition for Ill-posed Datasets , 2000, NIPS.

[33]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[34]  L. K. Hansen,et al.  Generalizable Patterns in Neuroimaging: How Many Principal Components? , 1999, NeuroImage.

[35]  Stephen C. Strother,et al.  Multivariate Predictive Relationship between Kinematic and Functional Activation Patterns in a PET Study of Visuomotor Learning , 2000, NeuroImage.

[36]  Shigeo Abe DrEng Pattern Classification , 2001, Springer London.

[37]  Robert A. Lordo,et al.  Learning from Data: Concepts, Theory, and Methods , 2001, Technometrics.