Targeted or whole genome sequencing of formalin fixed tissue samples: potential applications in cancer genomics

Current genomic studies are limited by the poor availability of fresh-frozen tissue samples. Although formalin-fixed diagnostic samples are in abundance, they are seldom used in current genomic studies because of the concern of formalin-fixation artifacts. Better characterization of these artifacts will allow the use of archived clinical specimens in translational and clinical research studies. To provide a systematic analysis of formalin-fixation artifacts on Illumina sequencing, we generated 26 DNA sequencing data sets from 13 pairs of matched formalin-fixed paraffin-embedded (FFPE) and fresh-frozen (FF) tissue samples. The results indicate high rate of concordant calls between matched FF/FFPE pairs at reference and variant positions in three commonly used sequencing approaches (whole genome, whole exome, and targeted exon sequencing). Global mismatch rates and C·G > T·A substitutions were comparable between matched FF/FFPE samples, and discordant rates were low (<0.26%) in all samples. Finally, low-pass whole genome sequencing produces similar pattern of copy number alterations between FF/FFPE pairs. The results from our studies suggest the potential use of diagnostic FFPE samples for cancer genomic studies to characterize and catalog variations in cancer genomes.

[1]  H. Lehrach,et al.  Genome-Wide Massively Parallel Sequencing of Formaldehyde Fixed-Paraffin Embedded (FFPE) Tumor Tissues for Copy-Number- and Mutation-Analysis , 2009, PloS one.

[2]  G. Coetzee,et al.  5-Methylcytosine as an endogenous mutagen in the human LDL receptor and p53 genes. , 1990, Science.

[3]  M. Loda,et al.  DNA degradation test predicts success in whole-genome amplification from diverse clinical samples. , 2007, The Journal of molecular diagnostics : JMD.

[4]  M. Rubin,et al.  Targeted next-generation sequencing of advanced prostate cancer identifies potential therapeutic targets and disease heterogeneity. , 2013, European urology.

[5]  M. Stratton Exploring the Genomes of Cancer Cells: Progress and Promise , 2011, Science.

[6]  Lincoln D. Stein,et al.  Pancreatic cancer genomes reveal aberrations in axon guidance pathway genes , 2012, Nature.

[7]  Jesse J. Salk,et al.  Detection of ultra-rare mutations by next-generation sequencing , 2012, Proceedings of the National Academy of Sciences.

[8]  Henry M. Wood,et al.  Estimating optimal window size for analysis of low-coverage next-generation sequence data , 2014, Bioinform..

[9]  Pieter Wesseling,et al.  DNA copy number analysis of fresh and formalin-fixed specimens by shallow whole-genome sequencing with identification and exclusion of problematic regions in the genome assembly , 2014, Genome research.

[10]  Joshua M. Korn,et al.  Comprehensive genomic characterization defines human glioblastoma genes and core pathways , 2008, Nature.

[11]  R. Klopfleisch,et al.  Excavation of a buried treasure--DNA, mRNA, miRNA and protein analysis in formalin fixed, paraffin embedded tissues. , 2011, Histology and histopathology.

[12]  Jeffrey A. Hussmann,et al.  High-throughput DNA sequencing errors are reduced by orders of magnitude using circle sequencing , 2013, Proceedings of the National Academy of Sciences.

[13]  A. Sparks,et al.  The mutation spectrum revealed by paired genome sequences from a lung cancer patient , 2010, Nature.

[14]  R. Tibshirani,et al.  Comment on "The Consensus Coding Sequences of Human Breast and Colorectal Cancers" , 2007, Science.

[15]  Wessel N. van Wieringen,et al.  CGHregions: Dimension Reduction for Array CGH Data with Minimal Information Loss , 2007 .

[16]  F. Pontén,et al.  A high frequency of sequence alterations is due to formalin fixation of archival specimens. , 1999, The American journal of pathology.

[17]  J. Carpten,et al.  Deep Clonal Profiling of Formalin Fixed Paraffin Embedded Clinical Samples , 2012, PloS one.

[18]  J. Leamon,et al.  Combining highly multiplexed PCR with semiconductor-based sequencing for rapid cancer genotyping. , 2013, The Journal of molecular diagnostics : JMD.

[19]  W. Lam,et al.  DNA Extraction from Paraffin Embedded Material for Genetic and Epigenetic Analyses , 2011, Journal of visualized experiments : JoVE.

[20]  Peter Beyerlein,et al.  Robust gene expression and mutation analyses of RNA-sequencing of formalin-fixed diagnostic tumor samples , 2015, Scientific Reports.

[21]  Jill P. Mesirov,et al.  MEDULLOBLASTOMA EXOME SEQUENCING UNCOVERS SUBTYPE-SPECIFIC SOMATIC MUTATIONS , 2012, Nature.

[22]  Shengyue Wang,et al.  Exploring the cancer genome in the era of next-generation sequencing , 2012, Frontiers of Medicine.

[23]  C. Nusbaum,et al.  Large-scale identification, mapping, and genotyping of single-nucleotide polymorphisms in the human genome. , 1998, Science.

[24]  Ralf Herwig,et al.  Targeted high throughput sequencing in clinical cancer Settings: formaldehyde fixed-paraffin embedded (FFPE) tumor tissues, input amount and tumor heterogeneity , 2011, BMC Medical Genomics.

[25]  R. Zeillinger,et al.  Sensitive detection of KRAS mutations in archived formalin-fixed paraffin-embedded tissue using mutant-enriched PCR and reverse-hybridization. , 2009, The Journal of molecular diagnostics : JMD.

[26]  Wessel N. van Wieringen,et al.  CGHcall: Calling aberrations for array CGH tumor profiles. , 2008 .

[27]  S. Bonin,et al.  Multicentre validation study of nucleic acids extraction from FFPE tissues , 2010, Virchows Archiv.

[28]  Nikhil Wagle,et al.  High-throughput detection of actionable genomic alterations in clinical tumor samples by targeted, massively parallel sequencing. , 2012, Cancer discovery.

[29]  P. Green,et al.  Bayesian Markov chain Monte Carlo sequence analysis reveals varying neutral substitution patterns in mammalian evolution. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[30]  Anne-Mette K. Hein,et al.  Next-Generation Sequencing of RNA and DNA Isolated from Paired Fresh-Frozen and Formalin-Fixed Paraffin-Embedded Samples of Human Cancer and Normal Tissue , 2014, PloS one.

[31]  A. Sivachenko,et al.  SF3B1 and other novel cancer genes in chronic lymphocytic leukemia. , 2011, The New England journal of medicine.

[32]  K. Kinzler,et al.  Detection and quantification of rare mutations with massively parallel sequencing , 2011, Proceedings of the National Academy of Sciences.

[33]  A. Sivachenko,et al.  Sequence analysis of mutations and translocations across breast cancer subtypes , 2012, Nature.

[34]  S. Gabriel,et al.  Advances in understanding cancer genomes through second-generation sequencing , 2010, Nature Reviews Genetics.

[35]  S. Jewell,et al.  Copyright © American Society for Investigative Pathology Review Effect of Fixatives and Tissue Processing on the Content and Integrity of Nucleic Acids , 2022 .

[36]  J. Solassol,et al.  KRAS Mutation Detection in Paired Frozen and Formalin-Fixed Paraffin-Embedded (FFPE) Colorectal Cancer Tissues , 2011, International journal of molecular sciences.

[37]  Richard B. Schwab,et al.  Identification of high-confidence somatic mutations in whole genome sequence of formalin-fixed breast cancer specimens , 2012, Nucleic acids research.

[38]  Alexander Dobrovic,et al.  Dramatic reduction of sequence artefacts from DNA isolated from formalin-fixed cancer biopsies by treatment with uracil-DNA glycosylase , 2012, Oncotarget.

[39]  M. A. van de Wiel,et al.  CGHregions: Dimension Reduction for Array CGH Data with Minimal Information Loss , 2007, Cancer informatics.

[40]  K. Maclennan,et al.  Using next-generation sequencing for high resolution multiplex analysis of copy number variation from nanogram quantities of DNA from formalin-fixed paraffin-embedded specimens , 2010, Nucleic acids research.

[41]  Juliane C. Dohm,et al.  Whole-genome sequencing identifies recurrent mutations in chronic lymphocytic leukaemia , 2011, Nature.

[42]  P. Pevzner,et al.  Whole-genome analysis of Alu repeat elements reveals complex evolutionary history. , 2004, Genome research.