Deep multi-view representation learning for multi-modal features of the schizophrenia and schizo-affective disorder

This work is originated from the MLSP 2014 Classification Challenge which tries to automatically detect subjects with schizophrenia and schizo-affective disorder by analyzing multi-modal features derived from magnetic resonance imaging (MRI) data. We employ Deep Neural Network (DNN)-based multi-view representation learning for combining multimodal features. The DNN-based multi-view models include deep canonical correlation analysis (DCCA) and deep canonically correlated auto-encoders (DCCAE). In addition, support vector machine with Gaussian kernel is used to conduct classification with the compact bottleneck features learned by the deep multi-view models. Our experiments on the dataset provided by the MLSP Classification Challenge show that bottleneck features learned via deep multi-view models obtain better results than the trimming features used in the baseline system in terms of the receiver operating characteristic (ROC) area under the curve (AUC).

[1]  Alexander V. Lebedev,et al.  The 10th annual MLSP competition: Second place , 2014, 2014 IEEE International Workshop on Machine Learning for Signal Processing (MLSP).

[2]  Juhan Nam,et al.  Multimodal Deep Learning , 2011, ICML.

[3]  Geoffrey E. Hinton,et al.  Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[4]  H. Hotelling Relations Between Two Sets of Variates , 1936 .

[5]  Raman Arora,et al.  Kernel CCA for multi-view learning of acoustic features using articulatory measurements , 2012, MLSLP.

[6]  A. Vita,et al.  Schizophrenia , 2016, The Lancet.

[7]  Christopher Potts,et al.  Recursive Deep Models for Semantic Compositionality Over a Sentiment Treebank , 2013, EMNLP.

[8]  Vince D. Calhoun,et al.  Thalamus and posterior temporal lobe show greater inter-network connectivity at rest and across sensory paradigms in schizophrenia , 2014, NeuroImage.

[9]  Jeff A. Bilmes,et al.  Deep Canonical Correlation Analysis , 2013, ICML.

[10]  J. Hanley,et al.  The meaning and use of the area under a receiver operating characteristic (ROC) curve. , 1982, Radiology.

[11]  Dong Yu,et al.  Context-Dependent Pre-Trained Deep Neural Networks for Large-Vocabulary Speech Recognition , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[12]  Rex E. Jung,et al.  Neuroinformatics Original Research Article Correspondence between Structure and Function in the Human Brain at Rest , 2022 .

[13]  Bill Bynum,et al.  Lancet , 2015, The Lancet.

[14]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[15]  Jeff A. Bilmes,et al.  On Deep Multi-View Representation Learning , 2015, ICML.

[16]  Sarah A. J. Reading,et al.  Neurobiology of Schizophrenia , 2006, Neuron.

[17]  Shotaro Akaho,et al.  A kernel method for canonical correlation analysis , 2006, ArXiv.