New mutations, old statistical challenges

Based on targeted sequencing of 208 genes in 11,730 neurodevelopmental disorder cases, Stessman et al. report the identification of 91 genes associated (at a False Discovery Rate [FDR] of 0.1) with autism spectrum disorders (ASD), intellectual disability (ID), and developmental delay (DD)-including what they characterize as 38 novel genes, not previously reported as connected with these diseases. If true, this would represent a substantial step forward. Unfortunately, each of the two discovery analyses (1. De novo mutation analysis and, 2. a comparison of private mutations with public control data) contain critical statistical flaws. When one accounts for these problems, fewer than half of the genes - and very few, if any, of the novel findings - survive. These errors have implications for how future analyses should be conducted, for understanding the genetic basis of these disorders, and for genomic medicine.

[1]  Bradley P. Coe,et al.  Targeted sequencing identifies 91 neurodevelopmental disorder risk genes with autism and developmental disability biases , 2017, Nature Genetics.

[2]  Joan,et al.  Prevalence and architecture of de novo mutations in developmental disorders , 2017, Nature.

[3]  Deciphering Developmental Disorders Study,et al.  Prevalence and architecture of de novo mutations in developmental disorders , 2017, Nature.

[4]  James Y. Zou Analysis of protein-coding genetic variation in 60,706 humans , 2015, Nature.

[5]  Christopher S. Poultney,et al.  Insights into Autism Spectrum Disorder Genomic Architecture and Biology from 71 Risk Loci , 2015, Neuron.

[6]  David J Cutler,et al.  Population Genetics Identifies Challenges in Analyzing Rare Variants , 2015, Genetic epidemiology.

[7]  Tomas W. Fitzgerald,et al.  Large-scale discovery of novel genetic causes of developmental disorders , 2014, Nature.

[8]  Boris Yamrom,et al.  The contribution of de novo coding mutations to autism spectrum disorder , 2014, Nature.

[9]  Christopher S. Poultney,et al.  Synaptic, transcriptional, and chromatin genes disrupted in autism , 2014, Nature.

[10]  Kali T. Witherspoon,et al.  Refining analyses of copy number variation identifies specific genes associated with developmental delay , 2014, Nature Genetics.

[11]  Stephan J Sanders,et al.  A framework for the interpretation of de novo mutation in human disease , 2014, Nature Genetics.

[12]  A. Hoischen,et al.  Severe mental retardation with breathing abnormalities (Pitt-Hopkins syndrome) is caused by haploinsufficiency of the neuronal bHLH transcription factor TCF4. , 2007, Human molecular genetics.

[13]  Nathalie Boddaert,et al.  Mutations in TCF4, encoding a class I basic helix-loop-helix transcription factor, are responsible for Pitt-Hopkins syndrome, a severe epileptic encephalopathy associated with autonomic dysfunction. , 2007, American journal of human genetics.

[14]  Juliane Hoyer,et al.  Haploinsufficiency of TCF4 causes syndromal mental retardation with intermittent hyperventilation (Pitt-Hopkins syndrome). , 2007, American journal of human genetics.

[15]  I. Pe’er,et al.  Optimal two‐stage genotyping designs for genome‐wide association scans , 2006, Genetic epidemiology.

[16]  G. Abecasis,et al.  Joint analysis is more efficient than replication-based analysis for two-stage genome-wide association studies , 2006, Nature Genetics.

[17]  R. Elston,et al.  Optimal two‐stage genotyping in population‐based association studies , 2003, Genetic epidemiology.

[18]  E. Lander,et al.  Meta-analysis of genetic association studies supports a contribution of common variants to susceptibility to common disease , 2003, Nature Genetics.

[19]  J Blangero,et al.  Large upward bias in estimation of locus-specific effects from genomewide scans. , 2001, American journal of human genetics.

[20]  J. Ioannidis,et al.  Replication validity of genetic association studies , 2001, Nature Genetics.

[21]  R. Elston,et al.  One-stage versus two-stage strategies for genome scans. , 2001, Advances in genetics.

[22]  M. Daly,et al.  Linkage thresholds for two-stage genome scans. , 1998, American journal of human genetics.