Truncated Gaussian-Mixture Variational AutoEncoder

Variation Autoencoder (VAE) has become a powerful tool in modeling the non-linear generative process of data from a low-dimensional latent space. Recently, several studies have proposed to use VAE for unsupervised clustering by using mixture models to capture the multi-modal structure of latent representations. This strategy, however, is ineffective when there are outlier data samples whose latent representations are meaningless, yet contaminating the estimation of key major clusters in the latent space. This exact problem arises in the context of resting-state fMRI (rs-fMRI) analysis, where clustering major functional connectivity patterns is often hindered by heavy noise of rs-fMRI and many minor clusters (rare connectivity patterns) of no interest to analysis. In this paper we propose a novel generative process, in which we use a Gaussian-mixture to model a few major clusters in the data, and use a non-informative uniform distribution to capture the remaining data. We embed this truncated Gaussian-Mixture model in a Variational AutoEncoder framework to obtain a general joint clustering and outlier detection approach, called tGM-VAE. We demonstrated the applicability of tGM-VAE on the MNIST dataset and further validated it in the context of rs-fMRI connectivity analysis.

[1]  A. Belger,et al.  Dynamic functional connectivity analysis reveals transient states of dysconnectivity in schizophrenia , 2014, NeuroImage: Clinical.

[2]  Torsten Rohlfing,et al.  Influences of Age, Sex, and Moderate Alcohol Drinking on the Intrinsic Functional Architecture of Adolescent Brains , 2018, Cerebral cortex.

[3]  Padhraic Smyth,et al.  Stick-Breaking Variational Autoencoders , 2016, ICLR.

[4]  Bhiksha Raj,et al.  Hidden Markov Model Variational Autoencoder for Acoustic Unit Discovery , 2017, INTERSPEECH.

[5]  Huachun Tan,et al.  Variational Deep Embedding: An Unsupervised and Generative Approach to Clustering , 2016, IJCAI.

[6]  Mikkel N. Schmidt,et al.  Nonparametric Modeling of Dynamic Functional Connectivity in fMRI Data , 2016, NIPS 2015.

[7]  Alfred O. Hero,et al.  Shrinkage Algorithms for MMSE Covariance Estimation , 2009, IEEE Transactions on Signal Processing.

[8]  Jalil Taghia,et al.  Bayesian switching factor analysis for estimating time-varying functional connectivity in fMRI , 2017, NeuroImage.

[9]  Michael I. Jordan,et al.  Variational inference for Dirichlet process mixtures , 2006 .

[10]  Eswar Damaraju,et al.  Tracking whole-brain connectivity dynamics in the resting state. , 2014, Cerebral cortex.

[11]  Christopher Burgess,et al.  beta-VAE: Learning Basic Visual Concepts with a Constrained Variational Framework , 2016, ICLR 2016.

[12]  Kilian M. Pohl,et al.  A Riemannian Framework for Longitudinal Analysis of Resting-State Functional Connectivity , 2018, MICCAI.

[13]  Yuanyuan Chen,et al.  Age-Related Decline in the Variation of Dynamic Functional Connectivity: A Resting State Analysis , 2017, Front. Aging Neurosci..

[14]  Murray Shanahan,et al.  Deep Unsupervised Clustering with Gaussian Mixture Variational Autoencoders , 2016, ArXiv.

[15]  Fenna M. Krienen,et al.  Opportunities and limitations of intrinsic functional connectivity MRI , 2013, Nature Neuroscience.

[16]  Leonardo L. Gollo,et al.  Time-resolved resting-state brain networks , 2014, Proceedings of the National Academy of Sciences.

[17]  Adolf Pfefferbaum,et al.  The SRI24 multichannel atlas of normal adult human brain structure , 2009, Human brain mapping.

[18]  Anton van den Hengel,et al.  Infinite Variational Autoencoder for Semi-Supervised Learning , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[19]  Max Welling,et al.  Auto-Encoding Variational Bayes , 2013, ICLR.

[20]  Max Welling,et al.  Semi-supervised Learning with Deep Generative Models , 2014, NIPS.

[21]  Adolf Pfefferbaum,et al.  Altered Brain Developmental Trajectories in Adolescents After Initiating Drinking. , 2017, The American journal of psychiatry.

[22]  Hao He,et al.  Assessing dynamic brain graphs of time-varying connectivity in fMRI data: Application to healthy controls and patients with schizophrenia , 2015, NeuroImage.