Calibrating genomic and allelic coverage bias in single-cell sequencing

Artifacts introduced during whole-genome amplification (WGA) make it difficult to derive accurate genomic information from single cells and call for analytical strategies different from those used in bulk genome analysis. Here, we describe statistical methods to quantitatively assess the amplification bias resulting from whole-genome amplification of single-cell genomic DNA. Analysis of single-cell DNA libraries generated by different technologies revealed universal features of genome coverage bias, which is predominantly generated at the amplicon level (1–10 kb). The magnitude of coverage bias can be accurately calibrated from low-pass sequencing (~0.1×) to predict the depth-of-coverage yield of single-cell DNA libraries sequenced at arbitrary depths. We further provide a benchmark comparison of single-cell libraries generated by multi-strand displacement amplification (MDA) and multiple annealing and looping-based amplification cycles (MALBAC). Finally, we develop statistical models to calibrate allelic bias in single-cell whole-genome amplification and demonstrate a census-based strategy for efficient and accurate variant detection from low-input biopsy samples.
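To illustrate why coverage bias matters for depth-of-coverage yield, the following minimal sketch compares the fraction of the genome covered at a given minimum depth under two textbook models: uniform (Poisson) coverage and overdispersed (gamma-Poisson, i.e. negative binomial) coverage. This is an assumption-laden illustration, not the statistical method described here; the function names and the `shape` dispersion parameter are hypothetical, and the negative binomial is only a stand-in for amplification bias.

```python
import math


def poisson_fraction_covered(mean_depth, min_depth=1):
    """Fraction of positions with depth >= min_depth under uniform (Poisson) coverage."""
    if mean_depth <= 0:
        return 0.0
    # Sum P(X = i) for i < min_depth, computed in log space for stability.
    below = sum(
        math.exp(-mean_depth + i * math.log(mean_depth) - math.lgamma(i + 1))
        for i in range(min_depth)
    )
    return 1.0 - below


def nb_fraction_covered(mean_depth, shape, min_depth=1):
    """Fraction of positions with depth >= min_depth under negative binomial coverage.

    `shape` is a hypothetical dispersion parameter: small values mean strong
    bias (high variance); very large values approach the Poisson case.
    """
    if mean_depth <= 0:
        return 0.0
    p = shape / (shape + mean_depth)  # success probability giving the requested mean
    below = sum(
        math.exp(
            math.lgamma(i + shape) - math.lgamma(i + 1) - math.lgamma(shape)
            + shape * math.log(p) + i * math.log(1.0 - p)
        )
        for i in range(min_depth)
    )
    return 1.0 - below


if __name__ == "__main__":
    for depth in (0.1, 1.0, 10.0, 30.0):
        uniform = poisson_fraction_covered(depth)
        biased = nb_fraction_covered(depth, shape=0.5)
        print(f"mean depth {depth:>5}x  uniform {uniform:.3f}  biased {biased:.3f}")
```

Under these assumptions, the biased library always covers a smaller fraction of the genome than the uniform one at the same mean depth, which is why a dispersion estimate obtained at low depth is informative about the yield at higher depth.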
