ATLAS: Analysis Tools for Low-depth and Ancient Samples

Summary Post-mortem damage (PMD) obstructs the proper analysis of ancient DNA samples. Currently, PMD can only be addressed by adjusting sequencing quality scores or by removing potentially damaged data. Here we present ATLAS, a suite of methods to analyze ancient samples that properly account for PMD. It works directly from raw BAM files and contains all necessary methods to infer patterns of PMD, recalibrate base quality scores and accurately genotype ancient DNA, along with many other useful tools. ATLAS enables the building of complete and customized pipelines for the analysis of ancient and low-depth samples in a very user-friendly way. Using simulations we show that, in the presence of PMD, a dedicated pipeline of ATLAS calls genotypes more accurately than the state of the art pipeline of GATK combined with mapDamage 2.0. Availability ATLAS is an open-source C++ program freely available at https://bitbucket.org/phaentu/atlas. Contact Daniel.Wegmann@unifr.ch Supplementary information Supplementary data are available at Bioinformatics online.

[1]  R. Durbin,et al.  Inference of human population history from individual whole-genome sequences. , 2011, Nature.

[2]  Philip L. F. Johnson,et al.  Patterns of damage in genomic DNA sequences from a Neandertal , 2007, Proceedings of the National Academy of Sciences.

[3]  M. Honma,et al.  Mutagenic consequences of cytosine alterations site-specifically embedded in the human genome , 2016, Genes and Environment.

[4]  Aaron R. Quinlan,et al.  BamTools: a C++ API and toolkit for analyzing and managing BAM files , 2011, Bioinform..

[5]  János Dani,et al.  Genome flux and stasis in a five millennium transect of European prehistory , 2014, Nature Communications.

[6]  Daniel L. Ayres,et al.  BEAGLE: An Application Programming Interface and High-Performance Computing Library for Statistical Phylogenetics , 2011, Systematic biology.

[7]  Adrian W. Briggs,et al.  Removal of deaminated cytosines and detection of in vivo methylation in ancient DNA , 2009, Nucleic acids research.

[8]  M. DePristo,et al.  A framework for variation discovery and genotyping using next-generation DNA sequencing data , 2011, Nature Genetics.

[9]  Joshua S. Paul,et al.  Genotype and SNP calling from next-generation sequencing data , 2011, Nature Reviews Genetics.

[10]  W. Krzyzosiak,et al.  Sequence-non-specific effects of RNA interference triggers and microRNA regulators , 2009, Nucleic acids research.

[11]  Heng Li,et al.  A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data , 2011, Bioinform..

[12]  Philip L. F. Johnson,et al.  mapDamage2.0: fast approximate Bayesian estimates of ancient DNA damage parameters , 2013, Bioinform..

[13]  Aurélien Ginolhac,et al.  Characterization of ancient and modern genomes by SNP detection and phylogenomic and metagenomic analysis using PALEOMIX , 2014, Nature Protocols.

[14]  Yoan Diekmann,et al.  Early farmers from across Europe directly descended from Neolithic Aegeans , 2015, Proceedings of the National Academy of Sciences.

[15]  M. Jakobsson,et al.  Separating endogenous ancient DNA from modern day contamination in a Siberian Neandertal , 2014, Proceedings of the National Academy of Sciences.

[16]  Insuk Lee,et al.  Systematic comparison of variant calling pipelines using gold standard personal exome variants , 2015, Scientific Reports.

[17]  Gonçalo R. Abecasis,et al.  The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..

[18]  M. Slatkin,et al.  Ancient DNA and human history , 2016, Proceedings of the National Academy of Sciences.

[19]  Daniel Wegmann,et al.  Inferring Heterozygosity from Ancient and Low Coverage Genomes , 2016, Genetics.