unCOVERApp: an interactive graphical application for clinical assessment of sequence coverage at the base-pair level

Motivation Next Generation Sequencing (NGS) is increasingly adopted in the clinical practice largely thanks to concurrent advancements in bioinformatic tools for variant detection and annotation. Despite improvements in available approaches, the need to assess sequencing quality down to the base-pair level still poses challenges for diagnostic accuracy. One of the most popular quality parameters of diagnostic NGS is the percentage of targeted bases characterized by low depth of coverage (DoC). These regions potentially hide a clinically-relevant variant, but no annotation is usually returned for them. However, visualizing low-DoC data with their potential functional and clinical consequences may be useful to prioritize inspection of specific regions before re-sequencing all coverage gaps or making assertions about completeness of the diagnostic test. To meet this need we have developed unCOVERApp, an interactive application for graphical inspection and clinical annotation of low-DoC genomic regions containing genes. Results unCOVERApp is a suite of graphical and statistical tools to support clinical assessment of low-DoC regions. Its interactive plots allow to display gene sequence coverage down to the base-pair level, and functional and clinical annotations of sites below a user-defined DoC threshold can be downloaded in a user-friendly spreadsheet format. Moreover, unCOVERApp provides a simple statistical framework to evaluate if DoC is sufficient for the detection of somatic variants, where the usual 20x DoC threshold used for germline variants is not adequate. A maximum credible allele frequency calculator is also available allowing users to set allele frequency cut-offs based on assumptions about the genetic architecture of the disease instead of applying a general one (e.g. 5%). In conclusion, unCOVERApp is an original tool designed to identify sites of potential clinical interest that may be hidden in diagnostic sequencing data. Availability unCOVERApp is a freely available application written in R and developed with Shiny packages and available in GitHub.

[1]  Helga Thorvaldsdóttir,et al.  Integrative Genomics Viewer (IGV): high-performance genomics data visualization and exploration , 2012, Briefings Bioinform..

[2]  David Haussler,et al.  The UCSC Genome Browser database: 2018 update , 2017, Nucleic Acids Res..

[3]  Florian Hahne,et al.  Visualizing Genomic Data Using Gviz and Bioconductor , 2016, Statistical Genomics.

[4]  D. MacArthur,et al.  Using high-resolution variant frequencies to empower clinical genome interpretation , 2016, Genetics in Medicine.

[5]  E. Boerwinkle,et al.  dbNSFP v3.0: A One‐Stop Database of Functional Predictions and Annotations for Human Nonsynonymous and Splice‐Site SNVs , 2016, Human mutation.

[6]  Mahdi Sarmady,et al.  Characterizing reduced coverage regions through comparison of exome and genome sequencing data across ten centers , 2017, Genetics in Medicine.

[7]  Gert Matthijs,et al.  Guidelines for diagnostic next-generation sequencing , 2015, European Journal of Human Genetics.

[8]  Nikhil Wagle,et al.  Clinical Sequencing Exploratory Research Consortium: Accelerating Evidence-Based Practice of Genomic Medicine. , 2016, American journal of human genetics.

[9]  Bale,et al.  Standards and Guidelines for the Interpretation of Sequence Variants: A Joint Consensus Recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology , 2015, Genetics in Medicine.

[10]  Leslie G Biesecker,et al.  Diagnostic clinical genome and exome sequencing. , 2014, The New England journal of medicine.

[11]  S. Henikoff,et al.  Predicting the effects of coding non-synonymous variants on protein function using the SIFT algorithm , 2009, Nature Protocols.

[12]  Shashikant Kulkarni,et al.  Assuring the quality of next-generation sequencing in clinical laboratory practice , 2012, Nature Biotechnology.

[13]  Astrid Gall,et al.  Ensembl 2018 , 2017, Nucleic Acids Res..

[14]  J. Lehmann-Che,et al.  Resistance to therapy in acute promyelocytic leukemia. , 2014, The New England journal of medicine.