A Localized MKL Method for Brain Classification with Known Intra-class Variability

Automatic decisional systems based on pattern classification methods are becoming very important to support medical diagnosis. In general, the overall objective is to classify between healthy subjects and patients affected by a certain disease. To reach this aim, significant efforts have been spent in finding reliable biomarkers which are able to robustly discriminate between the two populations (i.e., patients and controls). However, in real medical scenarios there are many factors, like the gender or the age, which make the source data very heterogeneous. This introduces a large intra-class variation by affecting the performance of the classification procedure. In this paper we exploit how to use the knowledge on heterogeneity factors to improve the classification accuracy. We propose a Clustered Localized Multiple Kernel Learning (CLMKL) algorithm by encoding in the classication model the information on the clusters of apriory known stratifications.

[1]  Ethem Alpaydin,et al.  Localized multiple kernel learning , 2008, ICML '08.

[2]  Michael I. Jordan,et al.  Multiple kernel learning, conic duality, and the SMO algorithm , 2004, ICML.

[3]  David S. Doermann,et al.  Selection of classifiers for the construction of multiple classifier systems , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[4]  Ethem Alpaydin,et al.  Multiple Kernel Learning Algorithms , 2011, J. Mach. Learn. Res..

[5]  Robert P. W. Duin,et al.  Dissimilarity-Based Detection of Schizophrenia , 2010, ICPR 2010.

[6]  Geoffrey E. Hinton,et al.  Adaptive Mixtures of Local Experts , 1991, Neural Computation.

[7]  Vikas Singh,et al.  Predictive markers for AD in a multi-modality framework: An analysis of MCI progression in the ADNI population , 2011, NeuroImage.

[8]  Roman Filipovych,et al.  Multi-Kernel Classification for Integration of Clinical and Imaging Data: Application to Prediction of Cognitive Decline in Older Adults , 2011, MLMI.

[9]  Kevin W. Bowyer,et al.  Combination of Multiple Classifiers Using Local Accuracy Estimates , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[10]  Subhash C. Bagui,et al.  Combining Pattern Classifiers: Methods and Algorithms , 2005, Technometrics.

[11]  Karl J. Friston,et al.  Statistical parametric mapping , 2013 .

[12]  A. Versace,et al.  Decreased entorhinal cortex volumes in schizophrenia , 2008, Schizophrenia Research.

[13]  Ludmila I. Kuncheva,et al.  Combining Pattern Classifiers: Methods and Algorithms , 2004 .

[14]  Teemu Roos,et al.  Discriminative Learning of Bayesian Networks via Factorized Conditional Log-Likelihood , 2011, J. Mach. Learn. Res..

[15]  John Ashburner,et al.  A fast diffeomorphic image registration algorithm , 2007, NeuroImage.

[16]  Yves Grandvalet,et al.  Y.: SimpleMKL , 2008 .

[17]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[18]  Ferath Kherif,et al.  Multivariate voxel-based morphometry successfully differentiates schizophrenia patients from healthy controls , 2007, NeuroImage.

[19]  Nello Cristianini,et al.  Learning the Kernel Matrix with Semidefinite Programming , 2002, J. Mach. Learn. Res..

[20]  Yongyi Yang,et al.  Machine Learning in Medical Imaging , 2010, IEEE Signal Processing Magazine.

[21]  Vince D. Calhoun,et al.  Characterization of groups using composite kernels and multi-source fMRI analysis data: Application to schizophrenia , 2011, NeuroImage.

[22]  Nello Cristianini,et al.  An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .