Loci associated with skin pigmentation identified in African populations

African genomics and skin color Skin color varies among human populations and is thought to be under selection, with light skin maximizing vitamin D production at higher latitudes and dark skin providing UV protection in equatorial zones. To identify the genes that give rise to the palette of human skin tones, Crawford et al. applied genome-wide analyses across diverse African populations (see the Perspective by Tang and Barsh). Genetic variants were identified with likely function in skin phenotypes. Comparison to model organisms verified a conserved function of MFSD12 in pigmentation. A global genetic panel was used to trace how alleles associated with skin color likely moved across the globe as humans migrated, both within and out of Africa. Science, this issue p. eaan8433; see also p. 867 Genome-wide analysis of 2000 Africans identifies and functionally characterizes pigmentation loci. INTRODUCTION Variation in pigmentation among human populations may reflect local adaptation to regional light environments, because dark skin is more photoprotective, whereas pale skin aids the production of vitamin D. Although genes associated with skin pigmentation have been identified in European populations, little is known about the genetic basis of skin pigmentation in Africans. RATIONALE Genetically and phenotypically diverse African populations are informative for mapping genetic variants associated with skin pigmentation. Analysis of the genetics of skin pigmentation in Africans informs upon melanocyte biology and the evolution of skin pigmentation in humans. RESULTS We observe extensive variation in skin pigmentation in Africa, with lowest melanin levels observed in southern African San hunter-gatherers and highest levels in East African Nilo-Saharan pastoralists. A genome-wide association study (GWAS) of 1570 Africans identified variants significantly associated with skin pigmentation, which clustered in four genomic regions that together account for almost 30% of the phenotypic variation. The most significantly associated single-nucleotide polymorphisms were at SLC24A5, a gene associated with pigmentation in Europeans. We show that SLC24A5 was introduced into East Africa >5 thousand years ago (ka) and has risen to high frequency. The second most significantly associated region is near the gene MFSD12. Using in vitro and in vivo analyses, we show that MFSD12 codes for a lysosomal protein that modifies pigmentation in human melanocytes, with decreased MFSD12 expression associated with darker pigmentation. We also show that genetic knockout of Mfsd12 affects pigmentation in mice. A third highly associated region encompasses a cluster of genes that play a role in ultraviolet (UV) response and DNA damage repair. We find the strongest associations in a regulatory region upstream of DDB1, the gene encoding damage-specific DNA binding protein 1, and that these variants are associated with increased expression of DDB1. The alleles associated with light pigmentation swept to near fixation outside of Africa due to positive selection, and we show that these lineages coalesce ~60 ka, corresponding with the time of migration of modern humans out of Africa. The fourth significantly associated region encompasses the OCA2 and HERC2 loci. We identify previously uncharacterized variants at HERC2 associated with the expression of OCA2. These variants arose independently from eye and skin pigmentation–associated variants in non-Africans. We also identify variants at OCA2 that are correlated with alternative splicing; alleles associated with light pigmentation are correlated with a shorter transcript, which lacks a transmembrane domain. CONCLUSION We identify previously uncharacterized genes and variants associated with skin pigmentation in ethnically diverse Africans. These genes have diverse functions, from repairing UV damage to playing important roles in melanocyte biology. We show that both dark and light pigmentation alleles arose before the origin of modern humans and that both light and dark pigmented skin has continued to evolve throughout hominid history. We show that variants associated with dark pigmentation in Africans are identical by descent in South Asian and Australo-Melanesian populations. This study sheds light on the evolutionary history, and adaptive significance, of skin pigmentation in humans. GWAS and functional assays illuminate the genetic basis of pigmentation in Africa. A GWAS identified four genomic regions associated with skin pigmentation in Africa. Functional assays in melanocytes and mice characterized their impact on skin pigmentation. Evolutionary genetic analyses revealed that most derived variants evolved before the origin of modern humans. Ma, million years ago. Despite the wide range of skin pigmentation in humans, little is known about its genetic basis in global populations. Examining ethnically diverse African genomes, we identify variants in or near SLC24A5, MFSD12, DDB1, TMEM138, OCA2, and HERC2 that are significantly associated with skin pigmentation. Genetic evidence indicates that the light pigmentation variant at SLC24A5 was introduced into East Africa by gene flow from non-Africans. At all other loci, variants associated with dark pigmentation in Africans are identical by descent in South Asian and Australo-Melanesian populations. Functional analyses indicate that MFSD12 encodes a lysosomal protein that affects melanogenesis in mice, and that mutations in melanocyte-specific regulatory regions near DDB1/TMEM138 correlate with expression of ultraviolet response genes under selection in Eurasians.

[1]  R. Joshi,et al.  Identifying signatures of positive selection in pigmentation genes in two South Asian populations , 2017, American journal of human biology : the official journal of the Human Biology Council.

[2]  N. Jablonski,et al.  The colours of humanity: the evolution of pigmentation in the human lineage , 2017, Philosophical Transactions of the Royal Society B: Biological Sciences.

[3]  J. Hublin,et al.  The age of the hominin fossils from Jebel Irhoud, Morocco, and the origins of the Middle Stone Age , 2017, Nature.

[4]  Peter M Visscher,et al.  Inference on the Genetic Basis of Eye and Skin Color in an Admixed Population via Bayesian Linear Mixed Models , 2017, Genetics.

[5]  W. Pavan,et al.  TFAP2 paralogs regulate melanocyte differentiation in parallel with MITF , 2017, PLoS genetics.

[6]  Yun S. Song,et al.  Robust and scalable inference of population history from hundreds of unphased whole genomes , 2016, Nature Genetics.

[7]  S. Tishkoff,et al.  Inferences of African evolutionary history from genomic data. , 2016, Current opinion in genetics & development.

[8]  S. Burgess,et al.  A high-throughput functional genomics workflow based on CRISPR/Cas9-mediated targeted mutagenesis in zebrafish , 2016, Nature Protocols.

[9]  Søren Brunak,et al.  A genomic history of Aboriginal Australia , 2016, Nature.

[10]  Alan M. Kwong,et al.  Next-generation genotype imputation service and methods , 2016, Nature Genetics.

[11]  Michael C. Westaway,et al.  Genomic analyses inform on migration events during the peopling of Eurasia , 2016, Nature.

[12]  Ayellet V. Segrè,et al.  Colocalization of GWAS and eQTL Signals Detects Target Genes , 2016, bioRxiv.

[13]  Matthew Stephens,et al.  Estimating Time to the Common Ancestor for a Beneficial Allele , 2016, bioRxiv.

[14]  Yun S. Song,et al.  The Simons Genome Diversity Project: 300 genomes from 142 diverse populations , 2016, Nature.

[15]  Jonathan Scott Friedlaender,et al.  Excavating Neandertal and Denisovan DNA from the genomes of Melanesian individuals , 2016, Science.

[16]  S. Ozarkar,et al.  Skin pigmentation variation among populations of West Maharashtra, India , 2016, American journal of human biology : the official journal of the Human Biology Council.

[17]  David J. Arenillas,et al.  JASPAR 2016: a major expansion and update of the open-access database of transcription factor binding profiles , 2015, Nucleic Acids Res..

[18]  J. Michael Cherry,et al.  ENCODE data at the ENCODE portal , 2015, Nucleic Acids Res..

[19]  Marie-Claude Babron,et al.  High level of inbreeding in final phase of 1000 Genomes Project , 2015, Scientific Reports.

[20]  D. Reich,et al.  Genome-wide patterns of selection in 230 ancient Eurasians , 2015, Nature.

[21]  O. Troyanskaya,et al.  Predicting effects of noncoding variants with deep learning–based sequence model , 2015, Nature Methods.

[22]  Gabor T. Marth,et al.  A global reference for human genetic variation , 2015, Nature.

[23]  Eric Banks,et al.  Tools and best practices for data processing in allelic expression analysis , 2015, Genome Biology.

[24]  Blake Carrington,et al.  CRISPR-STAT: an easy and reliable PCR-based method to evaluate target-specific sgRNA activity , 2015, Nucleic acids research.

[25]  Guanjie Chen,et al.  Novel genomic signals of recent selection in an Ethiopian population , 2014, European Journal of Human Genetics.

[26]  Kevin Bishop,et al.  High-throughput gene targeting and phenotyping in zebrafish using CRISPR/Cas9 , 2015, Genome research.

[27]  R. Durbin,et al.  Tracing the Route of Modern Humans out of Africa by Using 225 Human Genome Sequences from Ethiopians and Egyptians , 2015, American journal of human genetics.

[28]  Benjamin J. Strober,et al.  A method to predict the impact of regulatory variants from DNA sequence , 2015, Nature Genetics.

[29]  Nicholas G. Martin,et al.  Genetics of skin color variation in Europeans: genome-wide association studies with functional follow-up , 2015, Human Genetics.

[30]  Jun S. Liu,et al.  The Genotype-Tissue Expression (GTEx) pilot analysis: Multitissue gene regulation in humans , 2015, Science.

[31]  S. Aerts,et al.  Transcription factor MITF and remodeller BRG1 define chromatin organisation at regulatory elements in melanoma cells , 2015, eLife.

[32]  Harald F Hess,et al.  Fixation-resistant photoactivatable fluorescent proteins for CLEM , 2015, Nature Methods.

[33]  Rafati Alireza,et al.  ASSOCIATION OF RS12913832 IN THE HERC2 GENE AFFECTING HUMAN IRIS COLOR VARIATION , 2015 .

[34]  Michael Q. Zhang,et al.  Integrative analysis of 111 reference human epigenomes , 2015, Nature.

[35]  Jody Hey,et al.  A Hidden Markov Model for Investigating Recent Positive Selection through Haplotype Structure , 2014, bioRxiv.

[36]  Carson C Chow,et al.  Second-generation PLINK: rising to the challenge of larger and richer datasets , 2014, GigaScience.

[37]  Li Teng,et al.  4DGenome: a comprehensive database of chromatin interactions , 2015, Bioinform..

[38]  M. Marks,et al.  An intracellular anion channel critical for pigmentation , 2014, eLife.

[39]  E. Oancea,et al.  Genome-wide transcriptome analysis of human epidermal melanocytes. , 2014, Genomics.

[40]  Ross M. Fraser,et al.  Defining the role of common variation in the genomic and biological architecture of adult human height , 2014, Nature Genetics.

[41]  D. Balding,et al.  Admixture in Latin America: Geographic Structure, Phenotypic Diversity and Self-Perception of Ancestry Based on 7,342 Individuals , 2014, PLoS genetics.

[42]  Eleazar Eskin,et al.  Identifying Causal Variants at Loci with Multiple Signals of Association , 2014, Genetics.

[43]  Heng Li,et al.  Toward better understanding of artifacts in variant calling from high-coverage samples , 2014, Bioinform..

[44]  Ross M. Fraser,et al.  A General Approach for Haplotype Phasing across the Full Spectrum of Relatedness , 2014, PLoS genetics.

[45]  Zachary A. Szpiech,et al.  selscan: An Efficient Multithreaded Program to Perform EHH-Based Scans for Positive Selection , 2014, Molecular biology and evolution.

[46]  Qing-Yuan Sun,et al.  CRL4 Complex Regulates Mammalian Oocyte Survival and Reprogramming by Activation of TET Proteins , 2013, Science.

[47]  Evan Z. Macosko,et al.  Our Fallen Genomes , 2013, Science.

[48]  R. Young,et al.  Super-Enhancers in the Control of Cell Identity and Disease , 2013, Cell.

[49]  C. Basu Mallick,et al.  The Light Skin Allele of SLC24A5 in South Asians and Europeans Shares Identity by Descent , 2013, PLoS genetics.

[50]  Anton J. Enright,et al.  The zebrafish reference genome sequence and its relationship to the human genome , 2013, Nature.

[51]  N. Yan,et al.  Evolutionary mix-and-match with MFS transporters , 2013, Proceedings of the National Academy of Sciences.

[52]  Nicholas A. Johnson,et al.  Genetic Architecture of Skin and Eye Color in an African-European Admixed Population , 2013, PLoS genetics.

[53]  Helga Thorvaldsdóttir,et al.  Integrative Genomics Viewer (IGV): high-performance genomics data visualization and exploration , 2012, Briefings Bioinform..

[54]  Thomas R. Gingeras,et al.  STAR: ultrafast universal RNA-seq aligner , 2013, Bioinform..

[55]  Mark D Shriver,et al.  The timing of pigmentation lightening in Europeans. , 2013, Molecular biology and evolution.

[56]  A. Xu,et al.  Transcriptome Analysis Reveals Markers of Aberrantly Activated Innate Immunity in Vitiligo Lesional and Non-Lesional Skin , 2012, PloS one.

[57]  Mattias Jakobsson,et al.  Genomic Variation in Seven Khoe-San Groups Reveals Adaptation and Complex African History , 2012, Science.

[58]  Thomas Zichner,et al.  DELLY: structural variant discovery by integrated paired-end and split-read analysis , 2012, Bioinform..

[59]  S. Oppenheimer,et al.  Skin Color Variation in Orang Asli Tribes of Peninsular Malaysia , 2012, PloS one.

[60]  Joseph K. Pickrell,et al.  The genetic prehistory of southern Africa , 2012, Nature Communications.

[61]  C. Tyler-Smith,et al.  Ethiopian genetic diversity reveals linguistic stratification and complex influences on the Ethiopian gene pool. , 2012, American journal of human genetics.

[62]  Lisa Helbling Chadwick,et al.  The NIH Roadmap Epigenomics Program data resource. , 2012, Epigenomics.

[63]  M. Stephens,et al.  Genome-wide Efficient Mixed Model Analysis for Association Studies , 2012, Nature Genetics.

[64]  V. Le Corre,et al.  The genetic differentiation at quantitative trait loci under local adaptation , 2012, Molecular ecology.

[65]  Pablo Cingolani,et al.  © 2012 Landes Bioscience. Do not distribute. , 2022 .

[66]  Robert-Jan Palstra,et al.  HERC2 rs12913832 modulates human pigmentation by attenuating chromatin-loop formation between a long-range enhancer and the OCA2 promoter. , 2012, Genome research.

[67]  Colin A. Johnson,et al.  Evolutionarily Assembled cis-Regulatory Module at a Human Ciliopathy Locus , 2012, Science.

[68]  O. Delaneau,et al.  A linear complexity phasing method for thousands of genomes , 2011, Nature Methods.

[69]  Manolis Kellis,et al.  ChromHMM: automating chromatin-state discovery and characterization , 2012, Nature Methods.

[70]  Raymond K. Auerbach,et al.  Extensive Promoter-Centered Chromatin Interactions Provide a Topological Basis for Transcription Regulation , 2012, Cell.

[71]  Daniel L. Ayres,et al.  BEAGLE: An Application Programming Interface and High-Performance Computing Library for Statistical Phylogenetics , 2011, Systematic biology.

[72]  Colin N. Dewey,et al.  RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome , 2011, BMC Bioinformatics.

[73]  Gonçalo R. Abecasis,et al.  The variant call format and VCFtools , 2011, Bioinform..

[74]  G. Babcock,et al.  Epidermal keratinocytes from light vs. dark skin exhibit differential degradation of melanosomes. , 2011, The Journal of investigative dermatology.

[75]  William Stafford Noble,et al.  FIMO: scanning for occurrences of a given motif , 2011, Bioinform..

[76]  P. Visscher,et al.  GCTA: a tool for genome-wide complex trait analysis. , 2011, American journal of human genetics.

[77]  Helga Thorvaldsdóttir,et al.  Integrative Genomics Viewer , 2011, Nature Biotechnology.

[78]  Philip L. F. Johnson,et al.  Genetic history of an archaic hominin group from Denisova Cave in Siberia , 2010, Nature.

[79]  A. Gylfason,et al.  Fine-scale recombination rate differences between sexes, populations and individuals , 2010, Nature.

[80]  G. Hooft Nothing to fear from mistakes , 2010, Nature.

[81]  K. Wakamatsu,et al.  Melanocortin 1 receptor genotype: an important determinant of the damage response of melanocytes to ultraviolet radiation , 2010, FASEB journal : official publication of the Federation of American Societies for Experimental Biology.

[82]  D. Altshuler,et al.  A map of human genome variation from population-scale sequencing , 2010, Nature.

[83]  M. DePristo,et al.  The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. , 2010, Genome research.

[84]  H. Hakonarson,et al.  ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data , 2010, Nucleic acids research.

[85]  P. Visscher,et al.  Common SNPs explain a large proportion of heritability for human height , 2011 .

[86]  Brian T. Naughton,et al.  Web-Based, Participant-Driven Studies Yield Novel Genetic Associations for Common Traits , 2010, PLoS genetics.

[87]  Cory Y. McLean,et al.  GREAT improves functional interpretation of cis-regulatory regions , 2010, Nature Biotechnology.

[88]  Leopold Parts,et al.  A Bayesian Framework to Account for Complex Non-Genetic Factors in Gene Expression Levels Greatly Increases Power in eQTL Studies , 2010, PLoS Comput. Biol..

[89]  H. Kang,et al.  Variance component model to account for sample structure in genome-wide association studies , 2010, Nature Genetics.

[90]  Aaron R. Quinlan,et al.  BIOINFORMATICS APPLICATIONS NOTE , 2022 .

[91]  A. Verkleij,et al.  AP-1 and KIF13A coordinate endosomal sorting and positioning during melanosome biogenesis , 2009, The Journal of cell biology.

[92]  David H. Alexander,et al.  Fast model-based estimation of ancestry in unrelated individuals. , 2009, Genome research.

[93]  Gonçalo R. Abecasis,et al.  The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..

[94]  Scott M. Williams,et al.  The Genetic Structure and History of Africans and African Americans , 2009, Science.

[95]  Mikael Bodén,et al.  MEME Suite: tools for motif discovery and searching , 2009, Nucleic Acids Res..

[96]  G. Perry,et al.  Evolution of the human pygmy phenotype. , 2009, Trends in ecology & evolution.

[97]  Gary K. Chen,et al.  Fast and flexible simulation of DNA sequence data. , 2008, Genome research.

[98]  X. Deng,et al.  Arabidopsis DDB1-CUL4 ASSOCIATED FACTOR1 Forms a Nuclear E3 Ubiquitin Ligase with DDB1 and CUL4 That Is Involved in Multiple Plant Developmental Processes[W] , 2008, The Plant Cell Online.

[99]  F. Hu,et al.  A Genome-Wide Association Study Identifies Novel Alleles Associated with Hair Color and Skin Pigmentation , 2008, PLoS genetics.

[100]  L. Quintana-Murci,et al.  Natural selection has driven population differentiation in modern humans , 2008, Nature Genetics.

[101]  Johan T den Dunnen,et al.  Three genome-wide association studies and a linkage analysis identify HERC2 as a human iris color gene. , 2008, American journal of human genetics.

[102]  Hans Eiberg,et al.  Blue eye color in humans may be caused by a perfectly associated founder mutation in a regulatory element located within the HERC2 gene inhibiting OCA2 expression , 2008, Human Genetics.

[103]  B. Browning,et al.  Rapid and accurate haplotype phasing and missing-data inference for whole-genome association studies by use of localized haplotype clustering. , 2007, American journal of human genetics.

[104]  R. Kittles,et al.  Genetic evidence for the convergent evolution of light skin in Europeans and East Asians. , 2006, Molecular biology and evolution.

[105]  D. Reich,et al.  Principal components analysis corrects for stratification in genome-wide association studies , 2006, Nature Genetics.

[106]  J. Pritchard,et al.  A Map of Recent Positive Selection in the Human Genome , 2006, PLoS biology.

[107]  AM. ZOOI. Ocist,et al.  Genetic Variations in the Fine Structure and Ontogeny of Mouse Melanin Granules , 2006 .

[108]  Keith C. Cheng,et al.  SLC24A5, a Putative Cation Exchanger, Affects Pigmentation in Zebrafish and Humans , 2005, Science.

[109]  Yun S. Song,et al.  Minimum Recombination Histories by Branch and Bound , 2005, WABI.

[110]  W. Oetting,et al.  P gene mutations associated with oculocutaneous albinism type II (OCA2) , 2005, Human mutation.

[111]  H. Yoshida,et al.  Drosophila Damaged DNA-Binding Protein 1 Is an Essential Factor for Development , 2004, Genetics.

[112]  I. Levin,et al.  The tomato homolog of the gene encoding UV-damaged DNA binding protein 1 (DDB1) underlined as the gene that causes the high pigment-1 mutant phenotype , 2004, Theoretical and Applied Genetics.

[113]  S. Wooding,et al.  Genetic Variation at the MC1R Locus and the Time since Loss of Human Body Hair1 , 2004 .

[114]  H. Walter,et al.  Some notes on the geographical distribution of the human red cell acid phosphatase phenotypes , 1972, Humangenetik.

[115]  Terrence S. Furey,et al.  The UCSC Table Browser data retrieval tool , 2004, Nucleic Acids Res..

[116]  Mark D Shriver,et al.  Comparing quantitative measures of erythema, pigmentation and skin response using reflectometry. , 2002, Pigment cell research.

[117]  L. Chin,et al.  p16(Ink4a) in melanocyte senescence and differentiation. , 2002, Journal of the National Cancer Institute.

[118]  Molly Przeworski,et al.  The signature of positive selection at randomly chosen loci. , 2002, Genetics.

[119]  S. Jesuthasan,et al.  Analysis of xanthophore and pterinosome biogenesis in zebrafish using methylene blue and pteridine autofluorescence. , 2002, Pigment cell research.

[120]  M. Marks,et al.  The melanosome: membrane dynamics in black and white , 2001, Nature Reviews Molecular Cell Biology.

[121]  M. Brilliant The mouse p (pink-eyed dilution) and human P genes, oculocutaneous albinism type 2 (OCA2), and melanosomal pH. , 2001, Pigment cell research.

[122]  G. Connah An African Classical Age: Eastern and Southern Africa in World History, 1000 B.C. to 400 A.D. (review) , 2001 .

[123]  G. Raposo,et al.  Distinct Protein Sorting and Localization to Premelanosomes, Melanosomes, and Lysosomes in Pigmented Melanocytic Cells✪ , 2001, The Journal of cell biology.

[124]  N. Jablonski,et al.  The evolution of human skin coloration. , 2000, Journal of human evolution.

[125]  I. Jackson,et al.  Evidence for variable selective pressures at MC1R. , 2000, American journal of human genetics.

[126]  M. Ramsay,et al.  Identification of P gene mutations in individuals with oculocutaneous albinism in sub‐Saharan Africa , 2000, Human mutation.

[127]  M. Marks,et al.  A Cytoplasmic Sequence in Human Tyrosinase Defines a Second Class of Di-leucine-based Sorting Signals for Late Endosomal and Lysosomal Delivery* , 1999, The Journal of Biological Chemistry.

[128]  A. Boyles,et al.  National Institute of Environmental Health Sciences , 2008 .

[129]  H. Bandelt,et al.  Median-joining networks for inferring intraspecific phylogenies. , 1999, Molecular biology and evolution.

[130]  C. Ehret,et al.  An African Classical Age: Eastern and Southern Africa in World History, 1000 BC to AD 400 , 1999 .

[131]  C. Nüsslein-Volhard,et al.  Zebrafish pigmentation mutations and the processes of neural crest development. , 1996, Development.

[132]  G. Imokawa,et al.  Modulation of melanogenic protein expression during the switch from eu- to pheomelanogenesis. , 1995, Journal of cell science.

[133]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[134]  F. Tajima Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. , 1989, Genetics.

[135]  G. Chu,et al.  Xeroderma pigmentosum group E cells lack a nuclear factor that binds to damaged DNA. , 1988, Science.

[136]  B. Weir,et al.  ESTIMATING F‐STATISTICS FOR THE ANALYSIS OF POPULATION STRUCTURE , 1984, Evolution; international journal of organic evolution.

[137]  'grizzled', A mutant in linkage group x of the mouse. , 1966 .

[138]  Frank H. Mover Genetic Variations in the Fine Structure and Ontogeny of Mouse Melanin Granules. , 1966 .

[139]  Genetic Variation at the MCiR Locus and the Time since Loss of Human Body Hair 1 , 2022 .