Joint detection of germline and somatic copy number events in matched tumor-normal sample pairs

MOTIVATION Whole genome sequencing (WGS) of tumor-normal sample pairs is a powerful approach for comprehensively characterizing germline copy number variations and somatic copy number alterations in cancer research and clinical practice. Existing computational approaches for detecting copy number events cannot detect germline copy number variations and somatic copy number alterations simultaneously, and yield low accuracy for somatic copy number alterations. RESULTS In this study, we developed TumorCNV, a novel approach for jointly detecting germline copy number variations and somatic copy number alterations from WGS data of the matched tumor-normal sample pair. We compared TumorCNV with existing copy number event detection approaches using the simulated data and real data for the COLO-829 melanoma cell line. The experimental results showed that TumorCNV achieved superior performance than existing approaches. AVAILABILITY The software TumorCNV is implemented using a combination of Java and R, and it is freely available from the website at https://github.com/yongzhuang/TumorCNV. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.

[1]  J. R. MacDonald,et al.  A Comprehensive Workflow for Read Depth-Based Identification of Copy-Number Variation from Whole-Genome Sequence Data. , 2018, American journal of human genetics.

[2]  C. Swanton Intratumor heterogeneity: evolution through space and time. , 2012, Cancer research.

[3]  Mark D. Johnson,et al.  Copy number variation detection in whole-genome sequencing data using the Bayesian information criterion , 2011, Proceedings of the National Academy of Sciences.

[4]  A global reference for human genetic variation , 2015, Nature.

[5]  Tatiana Popova,et al.  Supplementary Methods , 2012, Acta Neuropsychiatrica.

[6]  Joseph T. Glessner,et al.  PennCNV: an integrated hidden Markov model designed for high-resolution copy number variation detection in whole-genome SNP genotyping data. , 2007, Genome research.

[7]  A. Butte,et al.  Systematic pan-cancer analysis of tumour purity , 2015, Nature Communications.

[8]  Emmanuel Barillot,et al.  Control-free calling of copy number alterations in deep-sequencing data using GC-content normalization , 2010, Bioinform..

[9]  Weixin Wang,et al.  Integrative DNA copy number detection and genotyping from sequencing and array-based platforms , 2017, bioRxiv.

[10]  Derek Y. Chiang,et al.  The landscape of somatic copy-number alteration across human cancers , 2010, Nature.

[11]  Sohrab P. Shah,et al.  TITAN: inference of copy number architectures in clonal cell populations from tumor whole-genome sequence data , 2014, Genome research.

[12]  R. Kuiper,et al.  Germline copy number variation and cancer risk. , 2010, Current opinion in genetics & development.

[13]  D. Pellman,et al.  From polyploidy to aneuploidy, genome instability and cancer , 2004, Nature Reviews Molecular Cell Biology.

[14]  M. DePristo,et al.  A framework for variation discovery and genotyping using next-generation DNA sequencing data , 2011, Nature Genetics.

[15]  G. McVean,et al.  Integrating mapping-, assembly- and haplotype-based approaches for calling variants in clinical sequencing applications , 2014, Nature Genetics.

[16]  Steven J. M. Jones,et al.  A somatic reference standard for cancer genome sequencing , 2016, Scientific Reports.

[17]  A. Shlien,et al.  Copy number variations and cancer , 2009, Genome Medicine.

[18]  Mingming Jia,et al.  COSMIC: exploring the world's knowledge of somatic mutations in human cancer , 2014, Nucleic Acids Res..

[19]  Steve Lee,et al.  Canvas: versatile and scalable detection of copy number variants , 2016, bioRxiv.

[20]  Yadong Wang,et al.  Joint detection of copy number variations in parent-offspring trios , 2016, Bioinform..

[21]  Gabor T. Marth,et al.  An integrated map of structural variation in 2,504 human genomes , 2015, Nature.

[22]  Leping Li,et al.  ART: a next-generation sequencing read simulator , 2012, Bioinform..

[23]  Gabor T. Marth,et al.  A global reference for human genetic variation , 2015, Nature.

[24]  J. Troge,et al.  Tumour evolution inferred by single-cell sequencing , 2011, Nature.

[25]  S. Gabriel,et al.  Pan-cancer patterns of somatic copy-number alteration , 2013, Nature Genetics.

[26]  B. Yamrom,et al.  Rare de novo germline copy-number variation in testicular cancer. , 2012, American journal of human genetics.

[27]  Henry M. Wood,et al.  Correcting for cancer genome size and tumour cell content enables better estimation of copy number alterations from next-generation sequence data , 2012, Bioinform..

[28]  Gonçalo R. Abecasis,et al.  The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..

[29]  Michael C. Rusch,et al.  CREST maps somatic structural variation in cancer genomes with base-pair resolution , 2011, Nature Methods.

[30]  Qingguo Wang,et al.  Computational tools for copy number variation (CNV) detection using next-generation sequencing data: features and perspectives , 2013, BMC Bioinformatics.

[31]  E. Banks,et al.  Discovery and statistical genotyping of copy-number variation from whole-exome sequencing depth. , 2012, American journal of human genetics.

[32]  P. Park,et al.  Copy number analysis of whole-genome data using BIC-seq2 and its application to detection of cancer susceptibility variants , 2016, Nucleic acids research.