Determination of genomic copy number alteration emphasizing a restriction site-based strategy of genome re-sequencing

MOTIVATION: Copy number alteration (CNA) is a type of genomic aberration that is often induced by genome instability and is associated with diseases such as cancer. Determining the genome-wide CNA profile is an important step in identifying the underlying mutation mechanisms. Genomic data from next-generation sequencing are particularly well suited to producing high-quality CNA profiles. With the rapid growth of targeted and whole-genome sequencing strategies, it is timely to reevaluate the use of sequencing techniques for CNA analysis.

RESULTS: In this study, we compare the utility of several resequencing strategies for copy number determination, all applied to the same hepatocellular carcinoma sample. These strategies are whole-genome, exome and restriction site-associated DNA (RAD) sequencing. RAD sequencing is a targeted technique that involves cutting the genome with a restriction enzyme and isolating the targeted sequences. Our data demonstrate that RAD sequencing is an efficient and comprehensive strategy that allows cost-effective determination of CNAs. Further investigation of the RAD sequencing data showed that a precise measurement of allele frequency is a helpful complement to read depth in CNA analysis, for two reasons. First, knowing the allele frequency allows allele-specific copy numbers to be calculated, which in turn helps identify functionally important CNAs that are under natural selection on the parental alleles. Second, it enables deconvolution of CNA patterns in complex genomic regions.
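To illustrate why allele frequency complements read depth, here is a minimal sketch, not the authors' method. It assumes a pure tumor sample, a diploid matched normal, a normalized tumor/normal depth ratio, and an allele frequency measured at a site that is heterozygous in the normal. The function name `allele_specific_copy_numbers` and its parameters are hypothetical.

```python
def allele_specific_copy_numbers(depth_ratio, allele_freq, normal_ploidy=2):
    """Split a total copy number into two allele-specific counts.

    depth_ratio   -- normalized tumor/normal read depth ratio (assumed input)
    allele_freq   -- tumor frequency of one allele at a site that is
                     heterozygous in the normal (assumed input)
    normal_ploidy -- copy number of the normal genome (assumed diploid)

    Simplifying assumption: the tumor sample is pure (no normal cells mixed in).
    """
    # Read depth alone gives the total copy number.
    total = normal_ploidy * depth_ratio
    # Allele frequency apportions that total between the two parental alleles.
    copies_b = round(total * allele_freq)
    copies_a = round(total) - copies_b
    return copies_a, copies_b


# A single-copy gain: depth ratio 1.5, one allele at about 2/3 frequency.
print(allele_specific_copy_numbers(1.5, 0.667))  # (1, 2)

# Copy-neutral loss of heterozygosity: depth looks normal, one allele is gone.
print(allele_specific_copy_numbers(1.0, 0.0))    # (2, 0)

# A normal heterozygous site for comparison.
print(allele_specific_copy_numbers(1.0, 0.5))    # (1, 1)
```

The second and third calls share the same depth ratio, so read depth alone cannot tell them apart; only the allele frequency separates the two cases. Real data would also require correcting for tumor purity and sequencing noise, which this sketch ignores.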
