TIDDIT, an efficient and comprehensive structural variant caller for massive parallel sequencing data

Reliable detection of large structural variation (>1000 bp) is important in both rare and common genetic disorders. Whole genome sequencing (WGS) can identify a large proportion of the genomic structural variants (SVs) in an individual in a single experiment. Although SV callers have been used extensively in research to detect mutations, their use in routine clinical diagnostics is still limited. One well-known but poorly addressed problem is the large number of benign variants and reference errors present in the human genome, which further complicates analysis. And although a wide range of SV callers is available, relatively few can detect the entire spectrum of SVs at a low computational cost.