Integrated Model of De Novo and Inherited Genetic Variants Yields Greater Power to Identify Risk Genes

De novo mutations affect risk for many diseases and disorders, especially those with early onset. Autism spectrum disorders (ASD) are one example. Four recent whole-exome sequencing (WES) studies of ASD families revealed a handful of novel risk genes, based on independent de novo loss-of-function (LoF) mutations falling in the same gene, and found that de novo LoF mutations occurred at a twofold higher rate than expected by chance. Successful as these studies were, they used only a small fraction of the data, excluding other types of de novo mutations and inherited rare variants. Moreover, such analyses cannot readily incorporate data from case-control studies. An important research challenge in gene discovery, therefore, is to develop statistical methods that accommodate a broader class of rare variation. We develop methods that can incorporate WES data on de novo mutations, inherited variants, and variants identified within cases and controls. TADA, for Transmission And De novo Association, integrates these data through a gene-based likelihood model involving parameters for allele frequencies and gene-specific penetrances. Inference is based on a hierarchical Bayes strategy that borrows information across all genes to infer parameters that would be difficult to estimate for individual genes. In addition to theoretical development, we validate TADA using realistic simulations mimicking rare, large-effect mutations affecting risk for ASD and show that it has dramatically better power than other common methods of analysis. Thus TADA's integration of various kinds of WES data can be a highly effective means of identifying novel risk genes. Indeed, application of TADA to WES data from subjects with ASD and their families, as well as from a study of ASD subjects and controls, revealed several novel and promising ASD candidate genes with strong statistical support.
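To make the idea of a gene-based likelihood with parameters shared across genes more concrete, here is a minimal Python sketch of how evidence from de novo counts alone might be scored for one gene. It is an illustration under stated assumptions, not the model specified in the abstract: we assume the number of de novo mutations in a gene is Poisson with mean `2 * n_trios * mu` under the null, that a risk gene multiplies this rate by a relative risk, and that the relative risk has a Gamma prior whose hyperparameters (`mean_rr`, `beta`) are common to all genes. The function names, parameter names, and example values are all hypothetical.

```python
import math


def log_poisson_pmf(x, rate):
    """Log of the Poisson probability of observing x events at the given rate."""
    return x * math.log(rate) - rate - math.lgamma(x + 1)


def log_gamma_poisson_marginal(x, shape, prior_rate, exposure):
    """Log marginal probability of x when x ~ Poisson(exposure * g) and
    g ~ Gamma(shape, rate=prior_rate). This is a negative binomial."""
    total = prior_rate + exposure
    return (math.lgamma(x + shape) - math.lgamma(shape) - math.lgamma(x + 1)
            + shape * math.log(prior_rate / total)
            + x * math.log(exposure / total))


def de_novo_bayes_factor(x, n_trios, mu, mean_rr, beta):
    """Bayes factor comparing 'risk gene' to 'non-risk gene' for one gene.

    x        observed de novo mutation count in the gene (assumed input)
    n_trios  number of sequenced families (assumed input)
    mu       per-chromosome mutation rate of the gene (assumed input)
    mean_rr  prior mean relative risk, shared across genes (assumption)
    beta     prior rate parameter controlling spread (assumption)
    """
    exposure = 2 * n_trios * mu  # expected count under the null
    log_alt = log_gamma_poisson_marginal(x, mean_rr * beta, beta, exposure)
    log_null = log_poisson_pmf(x, exposure)
    return math.exp(log_alt - log_null)


if __name__ == "__main__":
    # Hypothetical values chosen only to show the qualitative behaviour.
    for count in range(4):
        bf = de_novo_bayes_factor(count, n_trios=1000, mu=1e-6,
                                  mean_rr=20.0, beta=1.0)
        print(count, bf)
```

In this sketch a gene with no de novo events receives a Bayes factor slightly below 1, and each additional independent event in the same gene raises the evidence sharply, which mirrors the recurrence argument used in the earlier WES studies. An integrated analysis in the spirit of the abstract would combine a term like this with analogous terms for inherited and case-control variants.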
