A novel use of random priming-based single-strand library preparation for whole genome sequencing of formalin-fixed paraffin-embedded tissue samples

Abstract

The desire to analyse scarce biological material, historic samples and rare cell populations has driven the need for efficient methods for whole genome sequencing (WGS) of small amounts of poor-quality DNA. Most protocols are designed to recover double-stranded DNA (dsDNA) by ligating sequencing adaptors to dsDNA, with or without subsequent polymerase chain reaction amplification of the library. While this is sufficient for many applications, limited DNA input requires a method that can recover both single-stranded DNA (ssDNA) and dsDNA. Here, we present a WGS library preparation method, called 'degraded DNA adaptor tagging' (DDAT), adapted from a protocol designed for whole genome bisulfite sequencing. This method uses two rounds of random primer extension to recover both ssDNA and dsDNA. We show that DDAT can generate WGS data from formalin-fixed paraffin-embedded (FFPE) samples using as little as 2 ng of highly degraded input DNA. Furthermore, for all FFPE samples tested, DDAT produced higher-quality WGS data than a standard WGS library preparation method. The DDAT method therefore has the potential to unlock WGS data from DNA previously considered impossible to sequence, broadening opportunities to understand the role of genetics in health and disease.
