The ExAC browser: displaying reference data information from over 60 000 exomes

Worldwide, hundreds of thousands of humans have had their genomes or exomes sequenced, and access to the resulting data sets can provide valuable information for variant interpretation and understanding gene function. Here, we present a lightweight, flexible browser framework to display large population datasets of genetic variation. We demonstrate its use for exome sequence data from 60 706 individuals in the Exome Aggregation Consortium (ExAC). The ExAC browser provides gene- and transcript-centric displays of variation, a critical view for clinical applications. Additionally, we provide a variant display, which includes population frequency and functional annotation data as well as short read support for the called variant. This browser is open-source, freely available at http://exac.broadinstitute.org, and has already been used extensively by clinical laboratories worldwide.

[1]  S. Letovsky,et al.  Exploring the landscape of pathogenic genetic variation in the ExAC population database: insights of relevance to variant classification , 2015, Genetics in Medicine.

[2]  S. Henikoff,et al.  Predicting the effects of coding non-synonymous variants on protein function using the SIFT algorithm , 2009, Nature Protocols.

[3]  Gabor T. Marth,et al.  A global reference for human genetic variation , 2015, Nature.

[4]  Alessandra Renieri,et al.  Targeted Next‐Generation Sequencing Analysis of 1,000 Individuals with Intellectual Disability , 2015, Human mutation.

[5]  Tom H. Pringle,et al.  The human genome browser at UCSC. , 2002, Genome research.

[6]  Jakob Grove,et al.  Genetic risk for autism spectrum disorders and neuropsychiatric variation in the general population , 2015, Nature Genetics.

[7]  Andrew J. Hill,et al.  Analysis of protein-coding genetic variation in 60,706 humans , 2015, bioRxiv.

[8]  Daniel Rios,et al.  Bioinformatics Applications Note Databases and Ontologies Deriving the Consequences of Genomic Variants with the Ensembl Api and Snp Effect Predictor , 2022 .

[9]  Jacob A. Tennessen,et al.  Evolution and Functional Impact of Rare Coding Variation from Deep Sequencing of Human Exomes , 2012, Science.

[10]  E. Boerwinkle,et al.  dbNSFP: A Lightweight Database of Human Nonsynonymous SNPs and Their Functional Predictions , 2011, Human mutation.

[11]  Jeffrey Heer,et al.  SpanningAspectRatioBank Easing FunctionS ArrayIn ColorIn Date Interpolator MatrixInterpola NumObjecPointI Rectang ISchedu Parallel Pause Scheduler Sequen Transition Transitioner Transiti Tween Co DelimGraphMLCon IData JSONCon DataField DataSc Dat DataSource Data DataUtil DirtySprite LineS RectSprite , 2011 .

[12]  F. Cunningham,et al.  The Ensembl Variant Effect Predictor , 2016, Genome Biology.

[13]  E. Boerwinkle,et al.  dbNSFP v2.0: A Database of Human Non‐synonymous SNVs and Their Functional Predictions and Annotations , 2013, Human mutation.

[14]  James Y. Zou Analysis of protein-coding genetic variation in 60,706 humans , 2015, Nature.

[15]  P. Bork,et al.  A method and server for predicting damaging missense mutations , 2010, Nature Methods.

[16]  Monkol Lek,et al.  Patterns of genic intolerance of rare copy number variation in 59,898 human exomes , 2016, Nature Genetics.

[17]  Stephan J Sanders,et al.  A framework for the interpretation of de novo mutation in human disease , 2014, Nature Genetics.

[18]  E. Banks,et al.  Discovery and statistical genotyping of copy-number variation from whole-exome sequencing depth. , 2012, American journal of human genetics.

[19]  Kenny Q. Ye,et al.  An integrated map of genetic variation from 1,092 human genomes , 2012, Nature.

[20]  C. Tyler-Smith,et al.  Ancient DNA and the rewriting of human history: be sparing with Occam’s razor , 2016, Genome Biology.

[21]  Roger E. Stevenson,et al.  Targeted Next Generation Sequencing Analysis of 1000 individuals with Intellectual Disability , 2015 .