Identification of somatically acquired rearrangements in cancer using genome-wide massively parallel paired-end sequencing

Human cancers often carry many somatically acquired genomic rearrangements, some of which may be implicated in cancer development. However, conventional strategies for characterizing rearrangements are laborious and low-throughput and have low sensitivity or poor resolution. We used massively parallel sequencing to generate sequence reads from both ends of short DNA fragments derived from the genomes of two individuals with lung cancer. By investigating read pairs that did not align correctly with respect to each other on the reference human genome, we characterized 306 germline structural variants and 103 somatic rearrangements to the base-pair level of resolution. The patterns of germline and somatic rearrangement were markedly different. Many somatic rearrangements were from amplicons, although rearrangements outside these regions, notably including tandem duplications, were also observed. Some somatic rearrangements led to abnormal transcripts, including two from internal tandem duplications and two fusion transcripts created by interchromosomal rearrangements. Germline variants were predominantly mediated by retrotransposition, often involving AluY and LINE elements. The results demonstrate the feasibility of systematic, genome-wide characterization of rearrangements in complex human cancer genomes, raising the prospect of a new harvest of genes associated with cancer using this strategy.

[1]  L Corcoran,et al.  Variant (6;15) translocations in murine plasmacytomas involve a chromosome 15 locus at least 72 kb from the c‐myc oncogene. , 1985, The EMBO journal.

[2]  Nature Genetics , 1991, Nature.

[3]  K. Huppi,et al.  Chimeric transcripts with an open reading frame are generated as a result of translocation to the Pvt‐1 region in mouse B‐cell tumors , 1994, International journal of cancer.

[4]  J. Minna,et al.  NCI series of cell lines: An historical perspective , 1996, Journal of cellular biochemistry. Supplement.

[5]  F. Couch,et al.  17q23 amplifications in breast cancer involve the PAT1, RAD51C, PS6K, and SIGma1B genes. , 2000, Cancer research.

[6]  J. Mullikin,et al.  SSAHA: a fast search method for large DNA databases. , 2001, Genome research.

[7]  M. Batzer,et al.  Alu repeats and human genomic diversity , 2002, Nature Reviews Genetics.

[8]  K. Chin,et al.  End-sequence profiling: Sequence-based analysis of aberrant genomes , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[9]  T. Hubbard,et al.  A census of human cancer genes , 2004, Nature Reviews Cancer.

[10]  L. Stubbs,et al.  Two reciprocal translocations provide new clues to the high mutability of the Grid2 locus , 2004, Mammalian Genome.

[11]  B. Johansson,et al.  Fusion genes and rearranged genes as a linear function of chromosome aberrations in cancer , 2004, Nature Genetics.

[12]  F. Apiou,et al.  Characterization of a conserved aphidicolin-sensitive common fragile site at human 4q22 and mouse 6C1: possible association with an inherited disease and cancer , 2004, Oncogene.

[13]  J. Tchinda,et al.  Recurrent fusion of TMPRSS2 and ETS transcription factor genes in prostate cancer. , 2006, Science.

[14]  Carlos Caldas,et al.  Chromosome abnormalities in 10 lung cancer cell lines of the NCI-H series analyzed with spectral karyotyping. , 2005, Cancer genetics and cytogenetics.

[15]  F. E. Bertrand,et al.  The MLL partial tandem duplication in acute myeloid leukaemia , 2006, British journal of haematology.

[16]  J. Carney,et al.  Mechanisms of eukaryotic DNA double strand break repair. , 2006, Frontiers in bioscience : a journal and virtual library.

[17]  M. Caligiuri,et al.  Mll partial tandem duplication induces aberrant Hox expression in vivo via specific epigenetic alterations. , 2006, The Journal of clinical investigation.

[18]  Andrew Menzies,et al.  Architectures of somatic genomic rearrangement in human cancer amplicons at sequence-level resolution. , 2007, Genome research.

[19]  Philip M. Kim,et al.  Paired-End Mapping Reveals Extensive Structural Variation in the Human Genome , 2007, Science.

[20]  E. Birney,et al.  Patterns of somatic mutation in human cancer genomes , 2007, Nature.

[21]  H. Aburatani,et al.  Identification of the transforming EML4–ALK fusion gene in non-small-cell lung cancer , 2007, Nature.

[22]  S. Dhanasekaran,et al.  Distinct classes of chromosomal rearrangements create oncogenic ETS gene fusions in prostate cancer , 2007, Nature.

[23]  Atif Shahab,et al.  Fusion transcripts and transcribed retrotransposed loci discovered through comprehensive transcriptome analysis using Paired-End diTags (PETs). , 2007, Genome research.

[24]  E. S. Venkatraman,et al.  A faster circular binary segmentation algorithm for the analysis of array CGH data , 2007, Bioinform..

[25]  E. Birney,et al.  Patterns of somatic mutation in human cancer genomes , 2007, Nature.

[26]  V P Collins,et al.  Array painting reveals a high frequency of balanced translocations in breast cancer cell lines that break in cancer-relevant genes , 2008, Oncogene.