Diagnostic targETEd seQuencing adjudicaTion (DETEQT): Algorithms for Adjudicating Targeted Infectious Disease Next-Generation Sequencing Panels.

Next-generation sequencing (NGS) for infectious disease diagnostics is a relatively new and underdeveloped concept. If this technology is to become a regulatory-grade clinical diagnostic, standardization in the form of locked-down assays and firmly established underlying processes is necessary. Targeted sequencing, specifically by amplification of genomic signatures, has the potential to bridge the gap between PCR- and NGS-based diagnostics; however, existing NGS assay panels lack validated analytical techniques to adjudicate high background and error-prone NGS data. Herein, we present the Diagnostic targETEd seQuencing adjudicaTion (DETEQT) software, consisting of an intuitive bioinformatics pipeline entailing a set of algorithms to translate raw sequencing data into positive, negative, and indeterminate diagnostic determinations. After basic read filtering and mapping, the software compares abundance and quality metrics against heuristic and fixed thresholds. A novel generalized quality function provides an amalgamated quality score for the match between sequence reads of an assay and panel targets, rather than considering each component factor independently. When evaluated against numerous assay samples and parameters (mock clinical, human, and nonhuman primate clinical data sets; diverse amplification strategies; downstream applications; and sequence platforms), DETEQT demonstrated improved rejection of false positives and accuracies >95%. Finally, DETEQT was implemented in the user-friendly Empowering the Development of Genomics Expertise (EDGE) bioinformatics platform, providing a complete, end-to-end solution that can be operated by nonexperts in a clinical laboratory setting.

[1]  C. Boucher,et al.  Targeted Enrichment for Pathogen Detection and Characterization in Three Felid Species , 2017, Journal of Clinical Microbiology.

[2]  Heng Li Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM , 2013, 1303.3997.

[3]  Timothy D. Minogue,et al.  Targeted next-generation sequencing for the detection of ciprofloxacin resistance markers using molecular inversion probes , 2016, Scientific Reports.

[4]  V. Veer,et al.  Next-generation sequencing in clinical virology: Discovery of new viruses. , 2015, World journal of virology.

[5]  Sarah L. Westcott,et al.  Development of a Dual-Index Sequencing Strategy and Curation Pipeline for Analyzing Amplicon Sequence Data on the MiSeq Illumina Sequencing Platform , 2013, Applied and Environmental Microbiology.

[6]  Jonathan E. Allen,et al.  Targeted amplification for enhanced detection of biothreat agents by next-generation sequencing , 2015, BMC Research Notes.

[7]  Gustavo F. Palacios,et al.  Development and Evaluation of a Panel of Filovirus Sequence Capture Probes for Pathogen Detection by Next-Generation Sequencing , 2014, PloS one.

[8]  D. Kwiatkowski,et al.  Efficient Depletion of Host DNA Contamination in Malaria Clinical Sequencing , 2012, Journal of Clinical Microbiology.

[9]  M. Rowicka,et al.  Strategies for Achieving High Sequencing Accuracy for Low Diversity Samples and Avoiding Sample Bleeding Using Illumina Platform , 2015, PloS one.

[10]  Fiona J. Stewart,et al.  A Method for Selectively Enriching Microbial DNA from Contaminating Vertebrate Host DNA , 2013, PloS one.

[11]  Ryan R Wick,et al.  Deepbinner: Demultiplexing barcoded Oxford Nanopore reads with deep convolutional neural networks , 2018, bioRxiv.

[12]  John Chilton,et al.  The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2016 update , 2016, Nucleic Acids Res..

[13]  Puthen V. Jithesh,et al.  Depletion of Human DNA in Spiked Clinical Specimens for Improvement of Sensitivity of Pathogen Detection by Next-Generation Sequencing , 2016, Journal of Clinical Microbiology.

[14]  Gonçalo R. Abecasis,et al.  The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..

[15]  P. Roey,et al.  Clinical Sensitivity of Cystic Fibrosis Mutation Panels in a Diverse Population , 2016, Human mutation.

[16]  Heng Li,et al.  Minimap2: pairwise alignment for nucleotide sequences , 2017, Bioinform..

[17]  Po-E Li,et al.  Enabling the democratization of the genomics revolution with a fully integrated web-based bioinformatics platform , 2016, bioRxiv.

[18]  M. Borchert,et al.  Ebola Virus Disease Outbreak in Isiro, Democratic Republic of the Congo, 2012: Signs and Symptoms, Management and Outcomes , 2015, PloS one.

[19]  Po-E Li,et al.  Accurate read-based metagenome characterization using a hierarchical suite of unique signatures , 2015, Nucleic acids research.

[20]  Niranjan Nagarajan,et al.  Fast and sensitive mapping of nanopore sequencing reads with GraphMap , 2016, Nature Communications.

[21]  C. Ponting,et al.  Sequencing depth and coverage: key considerations in genomic analyses , 2014, Nature Reviews Genetics.