Probabilistic Modeling of Imaging, Genetics and Diagnosis

We propose a unified Bayesian framework for detecting genetic variants associated with disease by exploiting image-based features as an intermediate phenotype. The use of imaging data for examining genetic associations promises new directions of analysis, but currently the most widely used methods make sub-optimal use of the richness that these data types can offer. Currently, image features are most commonly selected based on their relevance to the disease phenotype. Then, in a separate step, a set of genetic variants is identified to explain the selected features. In contrast, our method performs these tasks simultaneously in order to jointly exploit information in both data types. The analysis yields probabilistic measures of clinical relevance for both imaging and genetic markers. We derive an efficient approximate inference algorithm that handles the high dimensionality of image and genetic data. We evaluate the algorithm on synthetic data and demonstrate that it outperforms traditional models. We also illustrate our method on Alzheimer's Disease Neuroimaging Initiative data.

[1]  D. Bredesen,et al.  Netrin-1 interacts with amyloid precursor protein and regulates amyloid-β production , 2009, Cell Death and Differentiation.

[2]  Mert R. Sabuncu,et al.  Joint Modeling of Imaging and Genetics , 2013, IPMI.

[3]  Kenny Q. Ye,et al.  An integrated map of genetic variation from 1,092 human genomes , 2012, Nature.

[4]  A M Dale,et al.  Measuring the thickness of the human cerebral cortex from magnetic resonance images. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[5]  R. Ransohoff,et al.  Inflammatory cell trafficking across the blood–brain barrier: chemokine regulation and in vitro models , 2012, Immunological reviews.

[6]  Carl E. Rasmussen,et al.  Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.

[7]  Genevera I. Allen Automatic Feature Selection via Weighted Kernels and Regularization , 2013 .

[8]  T. J. Mitchell,et al.  Bayesian Variable Selection in Linear Regression , 1988 .

[9]  Manuel A. R. Ferreira,et al.  PLINK: a tool set for whole-genome association and population-based linkage analyses. , 2007, American journal of human genetics.

[10]  F. Deconinck,et al.  Information Processing in Medical Imaging , 1984, Springer Netherlands.

[11]  G. Abecasis,et al.  MaCH: using sequence and genotype data to estimate haplotypes and unobserved genotypes , 2010, Genetic epidemiology.

[12]  C. Hoggart,et al.  Simultaneous Analysis of All SNPs in Genome-Wide and Re-Sequencing Association Studies , 2008, PLoS genetics.

[13]  Tim Salimans,et al.  Fixed-Form Variational Posterior Approximation through Stochastic Linear Regression , 2012, ArXiv.

[14]  Andrew J. Saykin,et al.  Voxelwise genome-wide association study (vGWAS) , 2010, NeuroImage.

[15]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[16]  M. West,et al.  Bounded Approximations for Marginal Likelihoods , 2010 .

[17]  Ben Taskar,et al.  Generative-Discriminative Basis Learning for Medical Imaging , 2012, IEEE Transactions on Medical Imaging.

[18]  Shane J. Neph,et al.  Systematic Localization of Common Disease-Associated Variation in Regulatory DNA , 2012, Science.

[19]  Thomas E. Nichols,et al.  Anatomically-distinct genetic associations of APOE ɛ4 allele load with regional cortical atrophy in Alzheimer's disease , 2009, NeuroImage.

[20]  Nikos Makris,et al.  Automatically parcellating the human cerebral cortex. , 2004, Cerebral cortex.

[21]  Nir Friedman,et al.  Probabilistic Graphical Models - Principles and Techniques , 2009 .

[22]  M. Stephens,et al.  Scalable Variational Inference for Bayesian Variable Selection in Regression, and Its Accuracy in Genetic Association Studies , 2012 .

[23]  Mert R. Sabuncu,et al.  The Relevance Voxel Machine (RVoxM): A Bayesian Method for Image-Based Prediction , 2011, MICCAI.

[24]  Alexander Shapiro,et al.  Stochastic Approximation approach to Stochastic Programming , 2013 .

[25]  A. Dale,et al.  Cortical Surface-Based Analysis II: Inflation, Flattening, and a Surface-Based Coordinate System , 1999, NeuroImage.

[26]  O. Favorova,et al.  A Polygenic Approach to the Study 
of Polygenic Diseases , 2012, Acta naturae.

[27]  H. Jeffreys An invariant form for the prior probability in estimation problems , 1946, Proceedings of the Royal Society of London. Series A. Mathematical and Physical Sciences.

[28]  I. Veer,et al.  Strongly reduced volumes of putamen and thalamus in Alzheimer's disease: an MRI study , 2008, Brain : a journal of neurology.

[29]  Thomas E. Nichols,et al.  Discovering genetic associations with high-dimensional neuroimaging phenotypes: A sparse reduced-rank regression approach , 2010, NeuroImage.

[30]  R. Buckner,et al.  The association between a polygenic Alzheimer score and cortical thickness in clinically normal subjects. , 2012, Cerebral cortex.

[31]  Antonio Moreno,et al.  Significant correlation between a set of genetic polymorphisms and a functional brain network revealed by feature selection and sparse Partial Least Squares , 2012, NeuroImage.

[32]  A. Gelman Prior distributions for variance parameters in hierarchical models (comment on article by Browne and Draper) , 2004 .

[33]  Daphne Koller,et al.  Polarization of the Effects of Autoimmune and Neurodegenerative Risk Alleles in Leukocytes , 2014, Science.

[34]  Anders M. Dale,et al.  Cortical Surface-Based Analysis I. Segmentation and Surface Reconstruction , 1999, NeuroImage.

[35]  A. Dale,et al.  Whole Brain Segmentation Automated Labeling of Neuroanatomical Structures in the Human Brain , 2002, Neuron.

[36]  P. Thompson,et al.  Neuroimaging endophenotypes: Strategies for finding genes influencing brain structure and function , 2007, Human brain mapping.

[37]  Roger Baker Grosse,et al.  Model selection in compositional spaces , 2014 .

[38]  J. Coyle,et al.  Alzheimer's disease and senile dementia: loss of neurons in the basal forebrain. , 1982, Science.

[39]  A. Meyer-Lindenberg,et al.  Intermediate phenotypes and genetic mechanisms of psychiatric disorders , 2006, Nature Reviews Neuroscience.

[40]  Gary D. Bader,et al.  Pathway Commons, a web resource for biological pathway data , 2010, Nucleic Acids Res..

[41]  Nick C Fox,et al.  Meta-analysis of 74,046 individuals identifies 11 new susceptibility loci for Alzheimer's disease , 2013, Nature Genetics.

[42]  Manolis Kellis,et al.  Conserved epigenomic signals in mice and humans reveal immune basis of Alzheimer’s disease , 2015, Nature.

[43]  Daniel Mathalon,et al.  A genome-wide association study of schizophrenia using brain activation as a quantitative phenotype. , 2009, Schizophrenia bulletin.

[44]  P. Mehlen,et al.  Amyloid Precursor Protein Regulates Netrin-1-mediated Commissural Axon Outgrowth* , 2012, The Journal of Biological Chemistry.

[45]  M. Stephens,et al.  Bayesian variable selection regression for genome-wide association studies and other large-scale problems , 2011, 1110.6019.

[46]  Matthew J. Beal,et al.  The variational Bayesian EM algorithm for incomplete data: with application to scoring graphical model structures , 2003 .

[47]  T. Liao Interpreting Probability Models: Logit, Probit, and Other Generalized Linear Models , 1994 .

[48]  Paul M. Thompson,et al.  Sparse reduced-rank regression detects genetic associations with voxel-wise longitudinal phenotypes in Alzheimer's disease , 2012, NeuroImage.

[49]  R. Mayeux,et al.  Molecular drivers and cortical spread of lateral entorhinal cortex dysfunction in preclinical Alzheimer's disease , 2013, Nature Neuroscience.

[50]  Bronwen L. Aken,et al.  GENCODE: The reference human genome annotation for The ENCODE Project , 2012, Genome research.

[51]  S. Broadley,et al.  What Do Effective Treatments for Multiple Sclerosis Tell Us about the Molecular Mechanisms Involved in Pathogenesis? , 2012, International journal of molecular sciences.

[52]  C. Jack,et al.  Alzheimer's Disease Neuroimaging Initiative , 2008 .

[53]  Michael Q. Zhang,et al.  Integrative analysis of 111 reference human epigenomes , 2015, Nature.

[54]  Juha Karhunen,et al.  Approximate Riemannian Conjugate Gradient Learning for Fixed-Form Variational Bayes , 2010, J. Mach. Learn. Res..

[55]  Allan R. Jones,et al.  An anatomically comprehensive atlas of the adult human brain transcriptome , 2012, Nature.