Sequence artifacts in DNA from formalin-fixed tissues: causes and strategies for minimization.

BACKGROUND: Precision medicine is dependent on identifying actionable mutations in tumors. Accurate detection of mutations is often problematic in formalin-fixed paraffin-embedded (FFPE) tissues. DNA extracted from formalin-fixed tissues is fragmented and also contains DNA lesions that are the sources of sequence artifacts. Sequence artifacts can be difficult to distinguish from true mutations, especially in the context of tumor heterogeneity, and are an increasing interpretive problem in this era of massively parallel sequencing. Understanding of the sources of sequence artifacts in FFPE tissues and implementation of preventative strategies are critical to improve the accurate detection of actionable mutations.

CONTENT: This mini-review focuses on DNA template lesions in FFPE tissues as the source of sequence artifacts in molecular analysis. In particular, fragmentation, base modification (including uracil and thymine deriving from cytosine deamination), and abasic sites are discussed as indirect or direct sources of sequence artifacts. We discuss strategies that can be implemented to minimize sequence artifacts and to distinguish true mutations from sequence artifacts. These strategies are applicable for the detection of actionable mutations in both single amplicon and massively parallel amplicon sequencing approaches.

SUMMARY: Because FFPE tissues are usually the only available material for DNA analysis, it is important to maximize the accurate informational content from FFPE DNA. Careful consideration of each step in the work flow is needed to minimize sequence artifacts. In addition, validation of actionable mutations either by appropriate experimental design or by orthogonal methods should be considered.
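
As an illustration of how the two ideas above (cytosine deamination as an artifact source, and validation through experimental design) might be combined in practice, the following is a minimal sketch and not a method taken from the review. It assumes that variant calls are available from two independent amplifications of the same FFPE DNA, that deamination artifacts appear as C>T substitutions (G>A when read from the opposite strand), and that artifacts tend to have low allele fractions. The `VariantCall` type, the `triage` function, the label names, and the 10% allele-fraction threshold are all hypothetical choices made for this example.

```python
from dataclasses import dataclass

# Substitutions expected from cytosine deamination:
# C>T on one strand, seen as G>A when reported on the opposite strand.
DEAMINATION_SUBSTITUTIONS = {("C", "T"), ("G", "A")}


@dataclass(frozen=True)
class VariantCall:
    """Hypothetical minimal record for a single-nucleotide variant call."""
    chrom: str
    pos: int
    ref: str
    alt: str
    allele_fraction: float  # fraction of reads supporting alt, between 0 and 1


def is_deamination_type(call):
    """Return True if the substitution is C>T or G>A."""
    return (call.ref.upper(), call.alt.upper()) in DEAMINATION_SUBSTITUTIONS


def triage(replicate_a, replicate_b, low_af_threshold=0.10):
    """Label each call in replicate_a using replicate_b as an independent check.

    The threshold is an assumed, illustrative value, not a recommendation.
    """
    seen_in_b = {(c.chrom, c.pos, c.ref, c.alt) for c in replicate_b}
    labels = {}
    for call in replicate_a:
        key = (call.chrom, call.pos, call.ref, call.alt)
        if key in seen_in_b:
            # Found in both independent amplifications.
            labels[key] = "replicated"
        elif is_deamination_type(call) and call.allele_fraction < low_af_threshold:
            # Unreplicated, low-level C>T or G>A change.
            labels[key] = "possible_deamination_artifact"
        else:
            labels[key] = "unreplicated"
    return labels
```

A call labeled `possible_deamination_artifact` or `unreplicated` is only flagged for follow-up, for example by an orthogonal method; the sketch does not by itself decide whether a change is a true mutation.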
