Using Multiparametric Data with Missing Features for Learning Patterns of Pathology

The paper presents a method for learning multimodal classifiers from datasets in which not all subjects have data from all modalities. Usually, subjects with a severe form of pathology are the ones failing to satisfactorily complete the study, especially when it consists of multiple imaging modalities. A classifier capable of handling subjects with unequal numbers of modalities prevents discarding any subjects, as is traditionally done, thereby broadening the scope of the classifier to more severe pathology. It also allows design of the classifier to include as much of the available information as possible and facilitates testing of subjects with missing modalities over the constructed classifier. The presented method employs an ensemble based approach where several subsets of complete data are formed and trained using individual classifiers., The output from these classifiers is fused using a weighted aggregation step giving an optimal probabilistic score for each subject. The method is applied to a spatio-temporal dataset for autism spectrum disorders (ASD) (96 patients with ASD and 42 typically developing controls) that consists of functional features from magnetoencephalography (MEG) and structural connectivity features from diffusion tensor imaging (DTI). A clear distinction between ASD and controls is obtained with an average 5-fold accuracy of 83.3% and testing accuracy of 88.4%. The fusion classifier performance is superior to the classification achieved using single modalities as well as multimodal classifier using only complete data (78.3%). The presented multimodal classifier framework is applicable to all modality combinations.

[1]  Arthur W. Toga,et al.  Atlas-based whole brain white matter analysis using large deformation diffeomorphic metric mapping: Application to normal elderly and Alzheimer's disease participants , 2009, NeuroImage.

[2]  Luke Bloy,et al.  Diffusion based abnormality markers of pathology: Toward learned diagnostic prediction of ASD , 2011, NeuroImage.

[3]  Roderick J. A. Little,et al.  Statistical Analysis with Missing Data: Little/Statistical Analysis with Missing Data , 2002 .

[4]  Ming Dong,et al.  Selection-fusion approach for classification of datasets with missing values , 2010, Pattern Recognit..

[5]  Jessica Brian,et al.  Magnetoencephalography identifies rapid temporal processing deficit in autism and language impairment , 2005, Neuroreport.

[6]  David B. Dunson,et al.  Classification with Incomplete Data Using Dirichlet Process Priors , 2010, J. Mach. Learn. Res..

[7]  Robert T. Schultz,et al.  White matter atlas generation using HARDI based automated parcellation , 2012, NeuroImage.

[8]  Dinggang Shen,et al.  COMPARE: Classification of Morphological Patterns Using Adaptive Regional Elements , 2007, IEEE Transactions on Medical Imaging.

[9]  Nicole A. Lazar,et al.  Statistical Analysis With Missing Data , 2003, Technometrics.

[10]  Lisa Blaskey,et al.  MEG detection of delayed auditory evoked responses in autism spectrum disorders: towards an imaging biomarker for autism , 2010, Autism research : official journal of the International Society for Autism Research.

[11]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[12]  Aníbal R. Figueiras-Vidal,et al.  Pattern classification with missing data: a review , 2010, Neural Computing and Applications.

[13]  Daoqiang Zhang,et al.  Multimodal classification of Alzheimer's disease and mild cognitive impairment , 2011, NeuroImage.

[14]  Ben Taskar,et al.  Regularized Tensor Factorization for Multi-Modality Medical Image Classification , 2011, MICCAI.

[15]  Mark Jenkinson,et al.  Non-local Shape Descriptor: A New Similarity Metric for Deformable Multi-modal Registration , 2011, MICCAI.