SynthEx: a synthetic-normal-based DNA sequencing tool for copy number alteration detection and tumor heterogeneity profiling

Changes in the quantity of genetic material, known as somatic copy number alterations (CNAs), can drive tumorigenesis. Many methods exist for assessing CNAs using microarrays, but considerable technical issues limit current CNA calling based upon DNA sequencing. We present SynthEx, a novel tool for detecting CNAs from whole exome and genome sequencing. SynthEx utilizes a “synthetic-normal” strategy to overcome technical and financial issues. In terms of accuracy and precision, SynthEx is highly comparable to array-based methods and outperforms sequencing-based CNA detection tools. SynthEx robustly identifies CNAs using sequencing data without the additional costs associated with matched normal specimens.

[1]  Matthew D. Wilkerson,et al.  ABRA: improved coding indel detection via assembly-based realignment , 2014, Bioinform..

[2]  T. Hubbard,et al.  A census of human cancer genes , 2004, Nature Reviews Cancer.

[3]  Jos Jonkers,et al.  CopywriteR: DNA copy number detection from off-target sequence data , 2015, Genome Biology.

[4]  Fred A. Wright,et al.  Integrated study of copy number states and genotype calls using high-density SNP arrays , 2009, Nucleic acids research.

[5]  Oliver Sieber,et al.  A statistical approach for detecting genomic aberrations in heterogeneous tumor samples from single nucleotide polymorphism genotyping data , 2010, Genome Biology.

[6]  Steven J. M. Jones,et al.  Comprehensive Molecular Portraits of Invasive Lobular Breast Cancer , 2015, Cell.

[7]  Christopher A. Miller,et al.  VarScan 2: somatic mutation and copy number alteration discovery in cancer by exome sequencing. , 2012, Genome research.

[8]  Steven J. M. Jones,et al.  Comprehensive molecular portraits of human breast tumours , 2013 .

[9]  V. Seshan,et al.  FACETS: allele-specific copy number and clonal heterogeneity analysis tool for high-throughput DNA sequencing , 2016, Nucleic acids research.

[10]  Tatiana Popova,et al.  Supplementary Methods , 2012, Acta Neuropsychiatrica.

[11]  Joshua M. Stuart,et al.  The Cancer Genome Atlas Pan-Cancer analysis project , 2013, Nature Genetics.

[12]  John Quackenbush,et al.  Exome sequencing-based copy-number variation and loss of heterozygosity detection: ExomeCNV , 2011, Bioinform..

[13]  Michael C. Schatz,et al.  Interactive analysis and assessment of single-cell copy-number variations , 2015, Nature Methods.

[14]  Hongyu Zhao,et al.  SomatiCA: Identifying, Characterizing and Quantifying Somatic Copy Number Aberrations from Cancer Genome Sequencing Data , 2013, PloS one.

[15]  Gabor T. Marth,et al.  Haplotype-based variant detection from short-read sequencing , 2012, 1207.3907.

[16]  E. Lander,et al.  Assessing the significance of chromosomal aberrations in cancer: Methodology and application to glioma , 2007, Proceedings of the National Academy of Sciences.

[17]  Michael L. Gatza,et al.  Cross-species DNA copy number analyses identifies multiple 1q21-q23 subtype-specific driver genes for breast cancer , 2015, Breast Cancer Research and Treatment.

[18]  Agus Salim,et al.  Statistical challenges associated with detecting copy number variations with next-generation sequencing , 2012, Bioinform..

[19]  M. Wigler,et al.  Circular binary segmentation for the analysis of array-based DNA copy number data. , 2004, Biostatistics.

[20]  C. Perou,et al.  Allele-specific copy number analysis of tumors , 2010, Proceedings of the National Academy of Sciences.

[21]  Sampsa Hautaniemi,et al.  Comparative analysis of methods for identifying somatic copy number alterations from deep sequencing data , 2015, Briefings Bioinform..

[22]  Richard Durbin,et al.  Sequence analysis Fast and accurate short read alignment with Burrows – Wheeler transform , 2009 .

[23]  Anders Isaksson,et al.  Patchwork: allele-specific copy number analysis of whole-genome sequenced tumor tissue , 2013, Genome Biology.

[24]  Nancy R. Zhang,et al.  Allele-specific copy number profiling by next-generation DNA sequencing , 2014, Nucleic acids research.

[25]  D. Hayes,et al.  Combined Targeted DNA Sequencing in Non-Small Cell Lung Cancer (NSCLC) Using UNCseq and NGScopy, and RNA Sequencing Using UNCqeR for the Detection of Genetic Aberrations in NSCLC , 2015, PloS one.

[26]  Steven J. M. Jones,et al.  Comprehensive molecular portraits of human breast tumors , 2012, Nature.

[27]  Steven J. M. Jones,et al.  Comprehensive genomic characterization of head and neck squamous cell carcinomas , 2015, Nature.

[28]  Nancy R. Zhang,et al.  CODEX: a normalization and copy number variation detection method for whole exome sequencing , 2015, Nucleic acids research.

[29]  Michael L. Gatza,et al.  An integrated genomics approach identifies drivers of proliferation in luminal subtype human breast cancer , 2014, Nature Genetics.

[30]  A. McKenna,et al.  Absolute quantification of somatic DNA alterations in human cancer , 2012, Nature Biotechnology.

[31]  Christopher R. Cabanski,et al.  Integrated RNA and DNA sequencing improves mutation detection in low purity tumors , 2014, Nucleic acids research.

[32]  S. Halgamuge,et al.  Inferring copy number and genotype in tumour exome data , 2014, BMC Genomics.