Separating endogenous ancient DNA from modern day contamination in a Siberian Neandertal

Significance Strict laboratory precautions against present day human DNA contamination are standard in ancient DNA studies, but contamination is already present inside many ancient human fossils from previous handling without specific precautions. We designed a statistical framework to isolate endogenous ancient DNA sequences from contaminating sequences using postmortem degradation patterns and were able to reduce high-contamination fractions to negligible levels. We captured DNA sequences from a contaminated Neandertal bone from Okladnikov Cave in Siberia and used our method to assemble its mitochondrial genome sequence, which we find to be from a lineage basal to five of six previously published complete Neandertal mitochondrial genomes. Our method paves the way for the large-scale genetic analysis of contaminated human remains. One of the main impediments for obtaining DNA sequences from ancient human skeletons is the presence of contaminating modern human DNA molecules in many fossil samples and laboratory reagents. However, DNA fragments isolated from ancient specimens show a characteristic DNA damage pattern caused by miscoding lesions that differs from present day DNA sequences. Here, we develop a framework for evaluating the likelihood of a sequence originating from a model with postmortem degradation—summarized in a postmortem degradation score—which allows the identification of DNA fragments that are unlikely to originate from present day sources. We apply this approach to a contaminated Neandertal specimen from Okladnikov Cave in Siberia to isolate its endogenous DNA from modern human contaminants and show that the reconstructed mitochondrial genome sequence is more closely related to the variation of Western Neandertals than what was discernible from previous analyses. Our method opens up the potential for genomic analysis of contaminated fossil material.

[1]  Yong Wang,et al.  An Aboriginal Australian Genome Reveals Separate Human Dispersals into Asia , 2011, Science.

[2]  C. Lalueza-Fox,et al.  Tracking down human contamination in ancient human teeth. , 2006, Molecular biology and evolution.

[3]  Federico Sánchez-Quinto,et al.  Fragmentation of Contaminant and Endogenous DNA in Ancient Samples Determined by Shotgun Sequencing; Prospects for Human Palaeogenomics , 2011, PloS one.

[4]  B. Viola,et al.  Neanderthals in central Asia and Siberia , 2007, Nature.

[5]  D. Reich,et al.  Population Structure and Eigenanalysis , 2006, PLoS genetics.

[6]  Philip L. F. Johnson,et al.  Genetic history of an archaic hominin group from Denisova Cave in Siberia , 2010, Nature.

[7]  Matthias Meyer,et al.  Illumina sequencing library preparation for highly multiplexed target capture and sequencing. , 2010, Cold Spring Harbor protocols.

[8]  Martin Kircher,et al.  Analysis of high-throughput ancient DNA sequencing data. , 2012, Methods in molecular biology.

[9]  C. Lalueza-Fox,et al.  A highly divergent mtDNA sequence in a Neandertal individual from Italy , 2006, Current Biology.

[10]  A. Krogh,et al.  Ancient human genome sequence of an extinct Palaeo-Eskimo , 2010, Nature.

[11]  Philip L. F. Johnson,et al.  mapDamage2.0: fast approximate Bayesian estimates of ancient DNA damage parameters , 2013, Bioinform..

[12]  L. Orlando,et al.  Revisiting Neandertal diversity with a 100,000 year old mtDNA sequence , 2006, Current Biology.

[13]  Michael Inouye,et al.  Founder population-specific HapMap panel increases power in GWA studies through improved imputation accuracy and CNV tagging. , 2010, Genome research.

[14]  Philip L. F. Johnson,et al.  A Draft Sequence of the Neandertal Genome , 2010, Science.

[15]  Maxim Teslenko,et al.  MrBayes 3.2: Efficient Bayesian Phylogenetic Inference and Model Choice Across a Large Model Space , 2012, Systematic biology.

[16]  Natalie M. Myres,et al.  New insights into the Tyrolean Iceman's origin and phenotype as inferred by whole-genome sequencing , 2012, Nature Communications.

[17]  S. Pääbo Ancient DNA: extraction, characterization, molecular cloning, and enzymatic amplification. , 1989, Proceedings of the National Academy of Sciences of the United States of America.

[18]  M. Beaumont,et al.  Novel high-resolution characterization of ancient DNA reveals C > U-type base modification events as the sole cause of post mortem miscoding lesions , 2007, Nucleic acids research.

[19]  Qiaomei Fu,et al.  DNA analysis of an early modern human from Tianyuan Cave, China , 2013, Proceedings of the National Academy of Sciences.

[20]  Stephan C. Schuster,et al.  Response to Comment on "Whole-Genome Shotgun Sequencing of Mitochondria from Ancient Hair Shafts" , 2008, Science.

[21]  S. Pääbo,et al.  A view of Neandertal genetic diversity , 2000, Nature Genetics.

[22]  Svante Pääbo,et al.  Temporal Patterns of Nucleotide Misincorporations and DNA Fragmentation in Ancient DNA , 2012, PloS one.

[23]  Adrian W. Briggs,et al.  Analysis of one million base pairs of Neanderthal DNA , 2006, Nature.

[24]  Philip L. F. Johnson,et al.  A Complete Neandertal Mitochondrial Genome Sequence Determined by High-Throughput Sequencing , 2008, Cell.

[25]  S. Pääbo,et al.  Mitochondrial genome variation and the origin of modern humans , 2000, Nature.

[26]  N. Rohland,et al.  Comparison and optimization of ancient DNA extraction. , 2007, BioTechniques.

[27]  J. Arsuaga,et al.  Partial genetic turnover in neandertals: continuity in the East and population replacement in the West. , 2012, Molecular biology and evolution.

[28]  Mark Stoneking,et al.  Learning about human population history from ancient and modern genomes , 2011, Nature Reviews Genetics.

[29]  B. Sykes,et al.  Authenticating DNA Extracted From Ancient Skeletal Remains , 1995 .

[30]  Adrian W. Briggs,et al.  A High-Coverage Genome Sequence from an Archaic Denisovan Individual , 2012, Science.

[31]  Philip L. F. Johnson,et al.  Patterns of damage in genomic DNA sequences from a Neandertal , 2007, Proceedings of the National Academy of Sciences.

[32]  M. Stoneking,et al.  Neandertal DNA Sequences and the Origin of Modern Humans , 1997, Cell.

[33]  A. von Haeseler,et al.  DNA sequences from multiple amplifications reveal artifacts induced by cytosine deamination in ancient DNA. , 2001, Nucleic acids research.

[34]  Martin Kircher,et al.  Improved base calling for the Illumina Genome Analyzer using machine learning strategies , 2009, Genome Biology.

[35]  Qiaomei Fu,et al.  The complete mitochondrial DNA genome of an unknown hominin from southern Siberia , 2010, Nature.

[36]  M. Jakobsson,et al.  Origins and Genetic Legacy of Neolithic Farmers and Hunter-Gatherers in Europe , 2012, Science.

[37]  M. Feldman,et al.  Worldwide Human Relationships Inferred from Genome-Wide Patterns of Variation , 2008 .

[38]  James R. Knight,et al.  Genome sequencing in microfabricated high-density picolitre reactors , 2005, Nature.

[39]  Gonçalo R. Abecasis,et al.  The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..

[40]  Richard Durbin,et al.  Sequence analysis Fast and accurate short read alignment with Burrows – Wheeler transform , 2009 .

[41]  W. Goodwin,et al.  Molecular analysis of Neanderthal DNA from the northern Caucasus , 2000, Nature.

[42]  Feng Chen,et al.  Sequencing and Analysis of Neanderthal Genomic DNA , 2006, Science.

[43]  S. Pääbo,et al.  The retrieval of ancient human DNA sequences. , 1996, American journal of human genetics.

[44]  Sharon R Grossman,et al.  Integrating common and rare genetic variation in diverse human populations , 2010, Nature.

[45]  M. Jakobsson,et al.  Archaic human ancestry in East Asia , 2011, Proceedings of the National Academy of Sciences.

[46]  K. Stefánsson,et al.  A Statistical Approach to Identify Ancient Template DNA , 2007, Journal of Molecular Evolution.

[47]  Adrian W. Briggs,et al.  Targeted Retrieval and Analysis of Five Neandertal mtDNA Genomes , 2009, Science.

[48]  Adrian W. Briggs,et al.  The Neandertal genome and ancient DNA authenticity , 2009, The EMBO journal.

[49]  Federico Sánchez-Quinto,et al.  Genomic Affinities of Two 7,000-Year-Old Iberian Hunter-Gatherers , 2012, Current Biology.

[50]  S Rozen,et al.  Primer3 on the WWW for general users and for biologist programmers. , 2000, Methods in molecular biology.

[51]  Francesc Calafell,et al.  Mitochondrial DNA of an Iberian Neandertal suggests a population affinity with other European Neandertals , 2006, Current Biology.

[52]  N. Tuross,et al.  Ancient DNA analysis of human populations. , 2000, American journal of physical anthropology.

[53]  C. Lalueza-Fox,et al.  Neandertal evolutionary genetics: mitochondrial DNA data from the iberian peninsula. , 2005, Molecular biology and evolution.

[54]  H. Malmström,et al.  Extensive human DNA contamination in extracts from ancient dog bones and teeth. , 2005, Molecular biology and evolution.

[55]  J. Wall,et al.  Inconsistencies in Neanderthal Genomic DNA Sequences , 2007, PLoS genetics.

[56]  Robert C. Edgar,et al.  MUSCLE: multiple sequence alignment with high accuracy and high throughput. , 2004, Nucleic acids research.

[57]  Martin Kircher,et al.  A Complete mtDNA Genome of an Early Modern Human from Kostenki, Russia , 2010, Current Biology.

[58]  C. Wiuf,et al.  Statistical evidence for miscoding lesions in ancient DNA templates. , 2001, Molecular biology and evolution.