Simul-seq: combined DNA and RNA sequencing for whole-genome and transcriptome profiling

Paired DNA and RNA profiling is increasingly employed in genomics research to uncover molecular mechanisms of disease and to explore personal genotype and phenotype correlations. Here, we introduce Simul-seq, a technique for the production of high-quality whole-genome and transcriptome sequencing libraries from small quantities of cells or tissues. We apply the method to laser-capture-microdissected esophageal adenocarcinoma tissue, revealing a highly aneuploid tumor genome with extensive blocks of increased homozygosity and corresponding increases in allele-specific expression. Among this widespread allele-specific expression, we identify germline polymorphisms that are associated with response to cancer therapies. We further leverage this integrative data to uncover expressed mutations in several known cancer genes as well as a recurrent mutation in the motor domain of KIF3B that significantly affects kinesin–microtubule interactions. Simul-seq provides a new streamlined approach for generating comprehensive genome and transcriptome profiles from limited quantities of clinically relevant samples.

[1]  Yuchen Jiao,et al.  Comparative genomic analysis of esophageal adenocarcinoma and squamous cell carcinoma. , 2012, Cancer discovery.

[2]  Aviv Regev,et al.  Comprehensive comparative analysis of RNA sequencing methods for degraded or low input samples , 2013, Nature Methods.

[3]  Peter J. Campbell,et al.  Genomic catastrophes frequently arise in esophageal adenocarcinoma and drive tumorigenesis , 2014, Nature Communications.

[4]  Gonçalo R. Abecasis,et al.  The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..

[5]  S. Gabriel,et al.  Discovery and saturation analysis of cancer genes across 21 tumor types , 2014, Nature.

[6]  A. Sivachenko,et al.  Sensitive detection of somatic point mutations in impure and heterogeneous cancer samples , 2013, Nature Biotechnology.

[7]  A. Fersht,et al.  Quantitative analysis of residual folding and DNA binding in mutant p53 core domain: definition of mutant states for rescue in cancer therapy , 2000, Oncogene.

[8]  S. Groshen,et al.  Cyclin D1 and epidermal growth factor polymorphisms associated with survival in patients with advanced colorectal cancer treated with Cetuximab , 2006, Pharmacogenetics and genomics.

[9]  Alessandro Romanel,et al.  ASEQ: fast allele-specific studies from next-generation sequencing data , 2015, BMC Medical Genomics.

[10]  Wei Shi,et al.  featureCounts: an efficient general purpose program for assigning sequence reads to genomic features , 2013, Bioinform..

[11]  Tetsu Akiyama,et al.  Identification of a link between the tumour suppressor APC and the kinesin superfamily , 2002, Nature Cell Biology.

[12]  Sohrab P. Shah,et al.  JointSNVMix: a probabilistic model for accurate detection of somatic mutations in normal/tumour paired next-generation sequencing data , 2012, Bioinform..

[13]  T. Schumacher,et al.  Neoantigens in cancer immunotherapy , 2015, Science.

[14]  David R. Kelley,et al.  Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks , 2012, Nature Protocols.

[15]  P. Bork,et al.  A method and server for predicting damaging missense mutations , 2010, Nature Methods.

[16]  M. Korc,et al.  A variant epidermal growth factor receptor exhibits altered type alpha transforming growth factor binding and transmembrane signaling. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[17]  Ronald D Vale,et al.  Microtubule Interaction Site of the Kinesin Motor , 1997, Cell.

[18]  Ming-Huang Chen,et al.  Epidermal growth factor receptor R521K polymorphism shows favorable outcomes in KRAS wild‐type colorectal cancer patients treated with cetuximab‐based chemotherapy , 2012, Cancer science.

[19]  Pablo Cingolani,et al.  Using Drosophila melanogaster as a Model for Genotoxic Chemical Mutational Studies with a New Program, SnpSift , 2012, Front. Gene..

[20]  Siddharth S. Dey,et al.  Integrated genome and transcriptome sequencing from the same cell , 2014, Nature Biotechnology.

[21]  S. Henikoff,et al.  Predicting the effects of coding non-synonymous variants on protein function using the SIFT algorithm , 2009, Nature Protocols.

[22]  Richard Durbin,et al.  Sequence analysis Fast and accurate short read alignment with Burrows – Wheeler transform , 2009 .

[23]  C. Ponting,et al.  G&T-seq: parallel sequencing of single-cell genomes and transcriptomes , 2015, Nature Methods.

[24]  T. Hubbard,et al.  A census of human cancer genes , 2004, Nature Reviews Cancer.

[25]  Marcel Martin Cutadapt removes adapter sequences from high-throughput sequencing reads , 2011 .

[26]  Pedro G. Ferreira,et al.  Transcriptome and genome sequencing uncovers functional variation in humans , 2013, Nature.

[27]  Judith B. Zaugg,et al.  Genetic Control of Chromatin States in Humans Involves Local and Distal Chromosomal Interactions , 2015, Cell.

[28]  F. Bertucci,et al.  A polymorphism of EGFR extracellular domain is associated with progression free-survival in metastatic colorectal cancer patients receiving cetuximab-based treatment , 2008, BMC Cancer.

[29]  David I. Smith,et al.  Tumor Transcriptome Sequencing Reveals Allelic Expression Imbalances Associated with Copy Number Alterations , 2010, PloS one.

[30]  Z. Modrušan,et al.  Predicting immunogenic tumour mutations by combining mass spectrometry and exome sequencing , 2014, Nature.

[31]  A. Fersht,et al.  Structural basis for understanding oncogenic p53 mutations and designing rescue drugs , 2006, Proceedings of the National Academy of Sciences.

[32]  Alan R. Fersht,et al.  Small molecule induced reactivation of mutant p53 in cancer cells , 2013, Nucleic acids research.

[33]  S. Batalov,et al.  A gene atlas of the mouse and human protein-encoding transcriptomes. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[34]  C. Perou,et al.  Comparison of RNA-Seq by poly (A) capture, ribosomal RNA depletion, and DNA microarray for expression profiling , 2014, BMC Genomics.

[35]  Euan A Ashley,et al.  Performance comparison of whole-genome sequencing platforms , 2011, Nature Biotechnology.

[36]  Yue Yu,et al.  The role of kinesin family proteins in tumorigenesis and progression , 2010, Cancer.

[37]  Steven J. M. Jones,et al.  Comprehensive molecular characterization of urothelial bladder carcinoma , 2014, Nature.

[38]  Zhongming Zhao,et al.  TSGene 2.0: an updated literature-based knowledgebase for tumor suppressor genes , 2015, Nucleic Acids Res..

[39]  H. Lenz,et al.  The cyclin D1 (CCND1) rs9344 G>A polymorphism predicts clinical outcome in colon cancer patients treated with adjuvant 5-FU-based chemotherapy , 2013, The Pharmacogenomics Journal.

[40]  Steven J. M. Jones,et al.  Circos: an information aesthetic for comparative genomics. , 2009, Genome research.

[41]  Jimmy Lin,et al.  Mining Exomic Sequencing Data to Identify Mutated Antigens Recognized by Adoptively Transferred Tumor-reactive T cells , 2013, Nature Medicine.

[42]  Aaron R. Quinlan,et al.  BIOINFORMATICS APPLICATIONS NOTE , 2022 .

[43]  Emmanouil T. Dermitzakis,et al.  Putative cis-regulatory drivers in colorectal cancer , 2014, Nature.

[44]  Michael C. Rusch,et al.  CREST maps somatic structural variation in cancer genomes with base-pair resolution , 2011, Nature Methods.

[45]  Tomoatsu Hayashi,et al.  Role of the Kinesin-2 Family Protein, KIF3, during Mitosis* , 2006, Journal of Biological Chemistry.

[46]  Cole Trapnell,et al.  Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. , 2010, Nature biotechnology.

[47]  A. McKenna,et al.  Exome and whole genome sequencing of esophageal adenocarcinoma identifies recurrent driver events and mutational complexity , 2013, Nature Genetics.

[48]  G. Church,et al.  Genome-Wide Identification of Human RNA Editing Sites by Parallel DNA Capturing and Sequencing , 2009, Science.

[49]  Ken Chen,et al.  SomaticSniper: identification of somatic point mutations in whole genome sequencing data , 2012, Bioinform..

[50]  R. Wilson,et al.  INTEGRATE: gene fusion discovery using whole genome and transcriptome data , 2016, Genome research.

[51]  Christopher A. Miller,et al.  VarScan 2: somatic mutation and copy number alteration discovery in cancer by exome sequencing. , 2012, Genome research.

[52]  Irmtraud M. Meyer,et al.  The clonal and mutational evolution spectrum of primary triple-negative breast cancers , 2012, Nature.

[53]  Steven L Salzberg,et al.  Fast gapped-read alignment with Bowtie 2 , 2012, Nature Methods.

[54]  A. McKenna,et al.  Paired Exome Analysis of Barrett’s Esophagus and Adenocarcinoma , 2015, Nature Genetics.

[55]  The Cancer Genome Atlas Research Network,et al.  Comprehensive molecular characterization of urothelial bladder carcinoma , 2014, Nature.

[56]  R. Stahel,et al.  Cyclin D1 (CCND1) A870G gene polymorphism modulates smoking-induced lung cancer risk and response to platinum-based chemotherapy in non-small cell lung cancer (NSCLC) patients. , 2006, Lung cancer.

[57]  Andrew C. Adey,et al.  Rapid, low-input, low-bias construction of shotgun fragment libraries by high-density in vitro transposition , 2010, Genome Biology.

[58]  Gonçalo R. Abecasis,et al.  The variant call format and VCFtools , 2011, Bioinform..

[59]  M. F. Stock,et al.  Expression of kinesin in Escherichia coli. , 2001, Methods in molecular biology.

[60]  M. DePristo,et al.  A framework for variation discovery and genotyping using next-generation DNA sequencing data , 2011, Nature Genetics.

[61]  The External Rna Controls Consortium The External RNA Controls Consortium: a progress report , 2005 .

[62]  Wei Li,et al.  RSeQC: quality control of RNA-seq experiments , 2012, Bioinform..