Whole exome association of rare deletions in multiplex oral cleft families

By sequencing the exomes of distantly related individuals in multiplex families, rare mutational and structural changes to coding DNA can be characterized and their relationship to disease risk can be assessed. Recently, several rare single nucleotide variants (SNVs) were associated with an increased risk of nonsyndromic oral cleft, highlighting the importance of rare sequence variants in oral clefts and illustrating the strength of family‐based study designs. However, the extent to which rare deletions in coding regions of the genome occur and contribute to risk of nonsyndromic clefts is not well understood. To identify putative structural variants underlying risk, we developed a pipeline for rare hemizygous deletions in families from whole exome sequencing and statistical inference based on rare variant sharing. Among 56 multiplex families with 115 individuals, we identified 53 regions with one or more rare hemizygous deletions. We found 45 of the 53 regions contained rare deletions occurring in only one family member. Members of the same family shared a rare deletion in only eight regions. We also devised a scalable global test for enrichment of shared rare deletions.

[1]  Supporting information , 2010 .

[2]  N. Schork,et al.  Weighted Score Tests Implementing Model-Averaging Schemes in Detection of Rare Variants in Case-Control Studies , 2015, PloS one.

[3]  W. J. Kent,et al.  BLAT--the BLAST-like alignment tool. , 2002, Genome research.

[4]  R. Elston,et al.  Detecting rare and common variants for complex traits: sibpair and odds ratio weighted sum statistics (SPWSS, ORWSS) , 2011, Genetic epidemiology.

[5]  Hokeun Sun,et al.  Statistical Selection Strategy for Risk and Protective Rare Variants Associated with Complex Traits , 2015, J. Comput. Biol..

[6]  Eleftheria Zeggini,et al.  Rare variant association analysis methods for complex traits. , 2010, Annual review of genetics.

[7]  Peter Donnelly,et al.  Bayesian hierarchical mixture modeling to assign copy number from a targeted CNV array , 2011, Genetic epidemiology.

[8]  R. Ophoff,et al.  Genome-wide burden of deleterious coding variants increased in schizophrenia , 2015, Nature Communications.

[9]  R. Iyer,et al.  A complex 6p25 rearrangement in a child with multiple epiphyseal dysplasia , 2011, American journal of medical genetics. Part A.

[10]  Menachem Fromer,et al.  Using XHMM Software to Detect Copy Number Variation in Whole‐Exome Sequencing Data , 2014, Current protocols in human genetics.

[11]  Greg Gibson,et al.  Common genetic variation and performance on standardized cognitive tests , 2010, European Journal of Human Genetics.

[12]  Gabor T. Marth,et al.  A global reference for human genetic variation , 2015, Nature.

[13]  Sean M. Grimmond,et al.  The uniqueome: a mappability resource for short-tag sequencing , 2010, Bioinform..

[14]  Stéphane Robin,et al.  Joint segmentation, calling, and normalization of multiple CGH profiles. , 2011, Biostatistics.

[15]  S. Browning,et al.  A Groupwise Association Test for Rare Mutations Using a Weighted Sum Statistic , 2009, PLoS genetics.

[16]  Andrew F Pilon Midline orofacial cleft defects in association with type 1 Duane's retraction syndrome , 2009, Clinical & experimental optometry.

[17]  T. Beaty,et al.  Whole Exome Sequencing of Distant Relatives in Multiplex Families Implicates Rare Variants in Candidate Genes for Oral Clefts , 2014, Genetics.

[18]  E. Banks,et al.  Discovery and statistical genotyping of copy-number variation from whole-exome sequencing depth. , 2012, American journal of human genetics.

[19]  M. Wigler,et al.  Circular binary segmentation for the analysis of array-based DNA copy number data. , 2004, Biostatistics.

[20]  Holger Schwender,et al.  Fast detection of de novo copy number variants from SNP arrays for case-parent trios , 2012, BMC Bioinformatics.

[21]  S. Chern,et al.  A boy with cleft palate, hearing impairment, microcephaly, micrognathia and psychomotor retardation and a microdeletion in 6p25.3 involving the DUSP22 gene. , 2013, Genetic counseling.

[22]  Min A. Jhun,et al.  A statistical approach for rare-variant association testing in affected sibships. , 2015, American journal of human genetics.

[23]  S. Zöllner,et al.  Robust and Powerful Affected Sibpair Test for Rare Variant Association , 2015, Genetic epidemiology.

[24]  E. Cuppen,et al.  Systematic biases in DNA copy number originate from isolation procedures , 2013, Genome Biology.

[25]  Xihong Lin,et al.  Rare-variant association testing for sequencing data with the sequence kernel association test. , 2011, American journal of human genetics.

[26]  Francesco Cucca,et al.  Methods for Association Analysis and Meta‐Analysis of Rare Variants in Families , 2015, Genetic epidemiology.

[27]  Richard Durbin,et al.  Sequence analysis Fast and accurate short read alignment with Burrows – Wheeler transform , 2009 .

[28]  Tomas W. Fitzgerald,et al.  Breaking the waves: improved detection of copy number variation from microarray-based comparative genomic hybridization , 2007, Genome Biology.

[29]  Frederick E. Dewey,et al.  CLAMMS: a scalable algorithm for calling common and rare copy number variants from exome sequencing data , 2015, Bioinform..

[30]  J. Lupski,et al.  Exome Sequence Analysis Suggests that Genetic Burden Contributes to Phenotypic Variability and Complex Neuropathy. , 2015, Cell reports.

[31]  J Ragoussis,et al.  Evidence of a locus for orofacial clefting on human chromosome 6p24 and STS content map of the region. , 1995, Human molecular genetics.

[32]  Agus Salim,et al.  Statistical challenges associated with detecting copy number variations with next-generation sequencing , 2012, Bioinform..

[33]  Tomas W. Fitzgerald,et al.  A robust statistical method for case-control association testing with copy number variation , 2008, Nature Genetics.

[34]  Ingo Ruczinski,et al.  Inferring rare disease risk variants based on exact probabilities of sharing by multiple affected relatives , 2014, Bioinform..

[35]  David G. Knowles,et al.  Fast Computation and Applications of Genome Mappability , 2012, PloS one.

[36]  S. Chib Marginal Likelihood from the Gibbs Output , 1995 .

[37]  J. S. Marron,et al.  BlackOPs: increasing confidence in variant detection through mappability filtering , 2013, Nucleic acids research.

[38]  E. Wijsman The role of large pedigrees in an era of high-throughput sequencing , 2012, Human Genetics.

[39]  Bradley P. Coe,et al.  Copy number variation detection and genotyping from exome sequence data , 2012, Genome research.