Complete Khoisan and Bantu genomes from southern Africa

The genetic structure of the indigenous hunter-gatherer peoples of southern Africa, the oldest known lineage of modern human, is important for understanding human diversity. Studies based on mitochondrial and small sets of nuclear markers have shown that these hunter-gatherers, known as Khoisan, San, or Bushmen, are genetically divergent from other humans. However, until now, fully sequenced human genomes have been limited to recently diverged populations. Here we present the complete genome sequences of an indigenous hunter-gatherer from the Kalahari Desert and a Bantu from southern Africa, as well as protein-coding regions from an additional three hunter-gatherers from disparate regions of the Kalahari. We characterize the extent of whole-genome and exome diversity among the five men, reporting 1.3 million novel DNA differences genome-wide, including 13,146 novel amino acid variants. In terms of nucleotide substitutions, the Bushmen seem to be, on average, more different from each other than, for example, a European and an Asian. Observed genomic differences between the hunter-gatherers and others may help to pinpoint genetic adaptations to an agricultural lifestyle. Adding the described variants to current databases will facilitate inclusion of southern Africans in medical research efforts, particularly when family and medical histories can be correlated with genome-wide data.

[1]  I Litvan,et al.  Association of an extended haplotype in the tau gene with progressive supranuclear palsy. , 1999, Human molecular genetics.

[2]  D. Turnbull,et al.  Reanalysis and revision of the Cambridge reference sequence for human mitochondrial DNA , 1999, Nature Genetics.

[3]  John P. Hutchison,et al.  African Languages: An Introduction , 2000 .

[4]  M. Stoneking,et al.  Human mtDNA and Y-chromosome variation is correlated with matrilocal versus patrilocal residence , 2001, Nature Genetics.

[5]  J. Mullikin,et al.  The phusion assembler. , 2003, Genome research.

[6]  A. Cederbaum,et al.  CYP2E1: biochemistry, toxicology, regulation and function in ethanol-induced liver injury. , 2003, Current molecular medicine.

[7]  H. Stefánsson,et al.  A common inversion under selection in Europeans , 2005, Nature Genetics.

[8]  P. Donnelly,et al.  A Fine-Scale Map of Recombination Rates and Hotspots Across the Human Genome , 2005, Science.

[9]  D. Reich,et al.  Population Structure and Eigenanalysis , 2006, PLoS genetics.

[10]  Timothy B. Stockwell,et al.  The Diploid Genome Sequence of an Individual Human , 2007, PLoS biology.

[11]  Fernando A. Villanea,et al.  Diet and the evolution of human amylase gene copy number variation , 2007, Nature Genetics.

[12]  Holly M. Mortensen,et al.  Whole-mtDNA genome sequence analysis of ancient African lineages. , 2007, Molecular biology and evolution.

[13]  J. Lupski,et al.  The complete genome of an individual by massively parallel DNA sequencing , 2008, Nature.

[14]  Peter A Underhill,et al.  New binary polymorphisms reshape and increase resolution of the human Y chromosomal haplogroup tree. , 2008, Genome research.

[15]  Joshua M. Korn,et al.  Integrated detection and population-genetic analysis of SNPs and copy number variation , 2008, Nature Genetics.

[16]  Zhaoshi Jiang,et al.  Evolutionary toggling of the MAPT 17q21.31 inversion region , 2008, Nature Genetics.

[17]  Nancy F. Hansen,et al.  Accurate Whole Human Genome Sequencing using Reversible Terminator Chemistry , 2008, Nature.

[18]  R. Durbin,et al.  Mapping Quality Scores Mapping Short Dna Sequencing Reads and Calling Variants Using P

, 2022 .

[19]  Dawei Li,et al.  The diploid genome sequence of an Asian individual , 2008, Nature.

[20]  Zhaoshi Jiang,et al.  Characterization of six human disease-associated inversion polymorphisms , 2009, Human molecular genetics.

[21]  Shuichi Matsumura,et al.  Genetic Discontinuity Between Local Hunter-Gatherers and Central Europe’s First Farmers , 2009, Science.

[22]  Joseph K. Pickrell,et al.  The Role of Geography in Human Adaptation , 2009, PLoS genetics.

[23]  M. Nalls,et al.  Reduced Neutrophil Count in People of African Descent Is Due To a Regulatory Variant in the Duffy Antigen Receptor for Chemokines Gene , 2009, PLoS genetics.

[24]  J. Kitzman,et al.  Personalized Copy-Number and Segmental Duplication Maps using Next-Generation Sequencing , 2009, Nature Genetics.

[25]  Scott M. Williams,et al.  The Genetic Structure and History of Africans and African Americans , 2009, Science.

[26]  Mark George Thomas,et al.  Ancient DNA Reveals Lack of Continuity between Neolithic Hunter-Gatherers and Contemporary Scandinavians , 2009, Current Biology.

[27]  Sangsoo Kim,et al.  The first Korean genome sequence and analysis: full genome sequencing for a socio-ethnic group. , 2009, Genome research.

[28]  D. Comas,et al.  Genetic and demographic implications of the Bantu expansion: insights from human paternal lineages. , 2009, Molecular biology and evolution.

[29]  D. Reich,et al.  Genetic structure of a unique admixed population: implications for medical research. , 2010, Human molecular genetics.