Sequencing an Ashkenazi reference panel supports population-targeted personal genomics and illuminates Jewish and European origins

The Ashkenazi Jewish (AJ) population is a genetic isolate close to European and Middle Eastern groups, with genetic diversity patterns conducive to disease mapping. Here we report high-depth sequencing of 128 complete genomes of AJ controls. Compared with European samples, our AJ panel has 47% more novel variants per genome and is eightfold more effective at filtering benign variants out of AJ clinical genomes. Our panel improves imputation accuracy for AJ SNP arrays by 28%, and covers at least one haplotype in ≈67% of any AJ genome with long, identical-by-descent segments. Reconstruction of recent AJ history from such segments confirms a recent bottleneck of merely ≈350 individuals. Modelling of ancient histories for AJ and European populations using their joint allele frequency spectrum determines AJ to be an even admixture of European and likely Middle Eastern origins. We date the split between the two ancestral populations to ≈12–25 Kyr, suggesting a predominantly Near Eastern source for the repopulation of Europe after the Last Glacial Maximum.

[1]  Jacob A. Tennessen,et al.  Evolution and Functional Impact of Rare Coding Variation from Deep Sequencing of Human Exomes , 2012, Science.

[2]  Zhaohui S. Qin,et al.  A second generation human haplotype map of over 3.1 million SNPs , 2007, Nature.

[3]  David Reich,et al.  Discerning the Ancestry of European Americans in Genetic Association Studies , 2007, PLoS genetics.

[4]  Karen Marder,et al.  Genome-Wide association study identifies candidate genes for Parkinson's disease in an Ashkenazi Jewish population , 2011, BMC Medical Genetics.

[5]  Christine Klein,et al.  LRRK2 G2019S as a cause of Parkinson's disease in Ashkenazi Jews. , 2006, The New England journal of medicine.

[6]  S. Warren,et al.  Signatures of founder effects, admixture, and selection in the Ashkenazi Jewish population , 2010, Proceedings of the National Academy of Sciences.

[7]  H. Ostrer,et al.  The population genetics of the Jewish people , 2012, Human Genetics.

[8]  W. Stephan,et al.  Analytical results on the neutral non-equilibrium allele frequency spectrum based on diffusion theory. , 2011, Theoretical population biology.

[9]  R. Villems,et al.  Mitochondrial DNA signals of late glacial recolonization of Europe from near eastern refugia. , 2012, American journal of human genetics.

[10]  Kenny Q. Ye,et al.  Distinguishing between longevity and buffered-deleterious genotypes for exceptional human longevity: the case of the MTP gene. , 2012, The journals of gerontology. Series A, Biological sciences and medical sciences.

[11]  C. Tyler-Smith,et al.  Genome-Wide Diversity in the Levant Reveals Recent Structuring by Culture , 2013, PLoS genetics.

[12]  C. Stringer,et al.  The earliest evidence for anatomically modern humans in northwestern Europe , 2011, Nature.

[13]  Ryan D. Hernandez,et al.  Inferring the Joint Demographic History of Multiple Populations from Multidimensional SNP Frequency Data , 2009, PLoS genetics.

[14]  R. Villems,et al.  Ancient DNA from European Early Neolithic Farmers Reveals Their Near Eastern Affinities , 2010, PLoS biology.

[15]  I. Pe’er,et al.  Length distributions of identity by descent reveal fine-scale demographic history. , 2012, American journal of human genetics.

[16]  Birgir Hrafnkelsson,et al.  An Icelandic example of the impact of population structure on association studies , 2005, Nature Genetics.

[17]  A. Need,et al.  A genome-wide genetic signature of Jewish ancestry perfectly separates individuals with and without full Jewish ancestry in a large random sample of European Americans , 2009, Genome Biology.

[18]  Evan E Eichler,et al.  Properties and rates of germline mutations in humans. , 2013, Trends in genetics : TIG.

[19]  Andreas Keller,et al.  Population Genomic Analysis of Ancient and Modern Genomes Yields New Insights into the Genetic Ancestry of the Tyrolean Iceman and the Genetic Structure of Europe , 2014, PLoS genetics.

[20]  Daniel F. Gudbjartsson,et al.  Nonsense mutation in the LGR4 gene is associated with several human diseases and other traits , 2013, Nature.

[21]  B. Levinson Genetic Disorders Among the Jewish People , 1980, The Yale Journal of Biology and Medicine.

[22]  J. Marchini,et al.  Fast and accurate genotype imputation in genome-wide association studies through pre-phasing , 2012, Nature Genetics.

[23]  P. Awadalla,et al.  Whole-Exome Sequencing Reveals a Rapid Change in the Frequency of Rare Functional Variants in a Founding Population of Humans , 2013, PLoS genetics.

[24]  J. Pritchard,et al.  The deleterious mutation load is insensitive to recent population history , 2013, Nature Genetics.

[25]  Ignazio Carbone,et al.  A calibrated human Y-chromosomal phylogeny based on resequencing , 2013, Genome research.

[26]  P. Donnelly,et al.  A Flexible and Accurate Genotype Imputation Method for the Next Generation of Genome-Wide Association Studies , 2009, PLoS genetics.

[27]  Manuel A. R. Ferreira,et al.  PLINK: a tool set for whole-genome association and population-based linkage analyses. , 2007, American journal of human genetics.

[28]  A. Clark,et al.  Recent Explosive Human Population Growth Has Resulted in an Excess of Rare Genetic Variants , 2012, Science.

[29]  Serafim Batzoglou,et al.  Identifying a High Fraction of the Human Genome to be under Selective Constraint Using GERP++ , 2010, PLoS Comput. Biol..

[30]  Saharon Rosset,et al.  The genome-wide structure of the Jewish people , 2010, Nature.

[31]  P. Bork,et al.  A method and server for predicting damaging missense mutations , 2010, Nature Methods.

[32]  Gabor T. Marth,et al.  Demographic history and rare allele sharing among human populations , 2011, Proceedings of the National Academy of Sciences.

[33]  D. Reich,et al.  Population Structure and Eigenanalysis , 2006, PLoS genetics.

[34]  Kenny Q. Ye,et al.  An integrated map of genetic variation from 1,092 human genomes , 2012, Nature.

[35]  Molly Przeworski,et al.  Determinants of mutation rate variation in the human germline. , 2014, Annual review of genomics and human genetics.

[36]  Karen Marder,et al.  Familial aggregation of early‐ and late‐onset Parkinson's disease , 2003, Annals of neurology.

[37]  Pablo Tamayo,et al.  Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[38]  R. Durbin,et al.  Revising the human mutation rate: implications for understanding human evolution , 2012, Nature Reviews Genetics.

[39]  M. Feldman,et al.  Worldwide Human Relationships Inferred from Genome-Wide Patterns of Variation , 2008 .

[40]  T. Lehtimäki,et al.  High Risk Population Isolate Reveals Low Frequency Variants Predisposing to Intracranial Aneurysms , 2014, PLoS genetics.

[41]  Mark Yandell,et al.  Global analysis of disease-related DNA sequence variation in 10 healthy individuals: Implications for whole genome-based clinical diagnostics , 2011, Genetics in Medicine.

[42]  M. Feldman,et al.  Genomic microsatellites identify shared Jewish ancestry intermediate between Middle Eastern and European populations , 2009, BMC Genetics.

[43]  Eleazar Eskin,et al.  Analysis of genetic variation in Ashkenazi Jews by high density SNP genotyping , 2008, BMC Genetics.

[44]  Jennifer Mulle,et al.  A Genome-Wide Scan of Ashkenazi Jewish Crohn's Disease Suggests Novel Susceptibility Loci , 2012, PLoS genetics.

[45]  M. Lyons,et al.  Low-Pass Genome-Wide Sequencing and Variant Inference Using Identity-by-Descent in an Isolated Human Population , 2012, Genetics.

[46]  O. Delaneau,et al.  Supplementary Information for ‘ Improved whole chromosome phasing for disease and population genetic studies ’ , 2012 .

[47]  Vladimir Vacic,et al.  The Variance of Identity-by-Descent Sharing in the Wright–Fisher Model , 2012, Genetics.

[48]  Tom H. Pringle,et al.  The human genome browser at UCSC. , 2002, Genome research.

[49]  Alexander Gusev,et al.  Whole population, genome-wide mapping of hidden relatedness. , 2009, Genome research.

[50]  Itsik Pe'er,et al.  Implications for health and disease in the genetic signature of the Ashkenazi Jewish population , 2012, Genome Biology.

[51]  M. Jakobsson,et al.  Origins and Genetic Legacy of Neolithic Farmers and Hunter-Gatherers in Europe , 2012, Science.

[52]  Ryan D. Hernandez,et al.  Proportionally more deleterious genetic variation in European than in African populations , 2008, Nature.

[53]  P. Hartge,et al.  The risk of cancer associated with specific mutations of BRCA1 and BRCA2 among Ashkenazi Jews. , 1997, The New England journal of medicine.

[54]  H. Ostrer,et al.  Contrasting patterns of Y chromosome variation in Ashkenazi Jewish and host non-Jewish European populations , 2004, Human Genetics.

[55]  Christopher R. Gignoux,et al.  Reconstructing Native American Migrations from Whole-Genome and Whole-Exome Data , 2013, PLoS genetics.

[56]  Annette Lee,et al.  Genome-wide association study implicates NDST3 in schizophrenia and bipolar disorder , 2013, Nature Communications.

[57]  H. Hakonarson,et al.  ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data , 2010, Nucleic acids research.

[58]  Robert B. Hartlage,et al.  This PDF file includes: Materials and Methods , 2009 .

[59]  Alexander Gusev,et al.  The architecture of long-range haplotypes shared within and across populations. , 2012, Molecular biology and evolution.

[60]  Bonnie Berger,et al.  Ancient human genomes suggest three ancestral populations for present-day Europeans , 2013, Nature.

[61]  Gary K. Chen,et al.  Fast and flexible simulation of DNA sequence data. , 2008, Genome research.

[62]  E. Zeggini,et al.  A rare functional cardioprotective APOC3 variant has risen in frequency in distinct population isolates , 2013, Nature Communications.

[63]  Joseph K. Pickrell,et al.  Ancient DNA Reveals Key Stages in the Formation of Central European Mitochondrial Genetic Diversity , 2013, Science.

[64]  Gonçalo Abecasis,et al.  Genotype-imputation accuracy across worldwide human populations. , 2009, American journal of human genetics.

[65]  J. Wakeley Coalescent Theory: An Introduction , 2008 .

[66]  Itsik Pe'er,et al.  Abraham's children in the genome era: major Jewish diaspora populations comprise distinct genetic clusters with shared Middle Eastern Ancestry. , 2010, American journal of human genetics.