Variational auto-encoder (VAE) acquires not only a low dimensional representation of the data, but also the probability distribution of the data, in its latent space. Therefore, if an input data is not from the trained category, it fails to find an appropriate point in the latent space, which leads to a poor reconstruction. Thus, it can be used for data abnormality detection. We use VAE to detect a phenotype abnormality in a biological system. When each of lethal genes in early embryogenesis is knocked down by RNAi technique, an observed abnormality can be linked to the corresponding gene function. We train VAE by the wild type (WT) data without any gene manipulation, and use to characterize a phenotype of RNAi embryo in two cell stage. Abnormality is defined by a data reconstruction error, and several genes are found whose absence causes the abnormality, including some already known genes.
[1]
Sergey Ioffe,et al.
Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift
,
2015,
ICML.
[2]
Yoshua Bengio,et al.
Gradient-based learning applied to document recognition
,
1998,
Proc. IEEE.
[3]
Geoffrey E. Hinton,et al.
Reducing the Dimensionality of Data with Neural Networks
,
2006,
Science.
[4]
Yoko Suzuki,et al.
WDDD: Worm Developmental Dynamics Database
,
2012,
Nucleic Acids Res..
[5]
Graham W. Taylor,et al.
Deconvolutional networks
,
2010,
2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.
[6]
Max Welling,et al.
Auto-Encoding Variational Bayes
,
2013,
ICLR.