A Panel of Ancestry Informative Markers for the Complex Five-Way Admixed South African Coloured Population

Admixture is a well known confounder in genetic association studies. If genome-wide data is not available, as would be the case for candidate gene studies, ancestry informative markers (AIMs) are required in order to adjust for admixture. The predominant population group in the Western Cape, South Africa, is the admixed group known as the South African Coloured (SAC). A small set of AIMs that is optimized to distinguish between the five source populations of this population (African San, African non-San, European, South Asian, and East Asian) will enable researchers to cost-effectively reduce false-positive findings resulting from ignoring admixture in genetic association studies of the population. Using genome-wide data to find SNPs with large allele frequency differences between the source populations of the SAC, as quantified by Rosenberg et. al's -statistic, we developed a panel of AIMs by experimenting with various selection strategies. Subsets of different sizes were evaluated by measuring the correlation between ancestry proportions estimated by each AIM subset with ancestry proportions estimated using genome-wide data. We show that a panel of 96 AIMs can be used to assess ancestry proportions and to adjust for the confounding effect of the complex five-way admixture that occurred in the South African Coloured population.

[1]  A. Price,et al.  Genome-wide association study of ancestry-specific TB risk in the South African Coloured population. , 2014, Human molecular genetics.

[2]  Nicola J. Mulder,et al.  Determining Ancestry Proportions in Complex Admixture Scenarios in South Africa Using a Novel Proxy Ancestry Selection Method , 2013, PloS one.

[3]  Mattias Jakobsson,et al.  Genomic Variation in Seven Khoe-San Groups Reveals Adaptation and Complex African History , 2012, Science.

[4]  Christopher R. Gignoux,et al.  Development of a Panel of Genome-Wide Ancestry Informative Markers to Study Admixture Throughout the Americas , 2012, PLoS genetics.

[5]  A. Nebel,et al.  Polymorphisms in MC3R promoter and CTSZ 3′UTR are associated with tuberculosis susceptibility , 2011, European Journal of Human Genetics.

[6]  P. McKeigue,et al.  Genome-wide association study of type 2 diabetes in a sample from Mexico City and a meta-analysis of a Mexican-American sample from Starr County, Texas , 2011, Diabetologia.

[7]  Chengqing Wu,et al.  A Comparison of Association Methods Correcting for Population Stratification in Case–Control Studies , 2011, Annals of human genetics.

[8]  Kenneth K. Kidd,et al.  Hunter-gatherer genomic diversity suggests a southern African origin for modern humans , 2011, Proceedings of the National Academy of Sciences.

[9]  Eileen G. Hoal,et al.  Gene-gene interaction between tuberculosis candidate genes in a South African population , 2011, Mammalian Genome.

[10]  B. Beutler,et al.  How host defense is encoded in the mammalian genome , 2011, Mammalian Genome.

[11]  P. V. van Helden,et al.  Analysis of eight genes modulating interferon gamma and human genetic susceptibility to tuberculosis: a case-control association study , 2010, BMC infectious diseases.

[12]  C. Seoighe,et al.  Genome-wide analysis of the structure of the South African Coloured Population in the Western Cape , 2010, Human Genetics.

[13]  Lluis Quintana-Murci,et al.  Strong maternal Khoisan contribution to the South African coloured population: a case of gender-biased admixture. , 2010, American journal of human genetics.

[14]  M. Möller,et al.  Current findings, challenges and novel approaches in human genetic susceptibility to tuberculosis. , 2010, Tuberculosis.

[15]  Tom H. Pringle,et al.  Complete Khoisan and Bantu genomes from southern Africa , 2010, Nature.

[16]  A. Franke,et al.  A functional haplotype in the 3'untranslated region of TNFRSF1B is associated with tuberculosis in two African populations. , 2010, American journal of respiratory and critical care medicine.

[17]  D. Reich,et al.  Genetic structure of a unique admixed population: implications for medical research. , 2010, Human molecular genetics.

[18]  Stephen L. Hauser,et al.  Genome-wide patterns of population structure and admixture in West Africans and African Americans , 2009, Proceedings of the National Academy of Sciences.

[19]  David H. Alexander,et al.  Fast model-based estimation of ancestry in unrelated individuals. , 2009, Genome research.

[20]  Scott M. Williams,et al.  The Genetic Structure and History of Africans and African Americans , 2009, Science.

[21]  P. V. van Helden,et al.  Investigation of chromosome 17 candidate genes in susceptibility to TB in a South African population. , 2009, Tuberculosis.

[22]  Gabriel Silva,et al.  Ancestry informative marker sets for determining continental origin and admixture proportions in common populations in America , 2009, Human mutation.

[23]  B. Taylor,et al.  Assessing statistical power of SNPs for population structure and conservation studies , 2009, Molecular ecology resources.

[24]  Gabriel Silva,et al.  An ancestry informative marker set for determining continental origin: validation and extension using human genome diversity panels , 2009, BMC Genetics.

[25]  L. van der Merwe,et al.  COX-2 promoter polymorphisms and the association with prostate cancer risk in South African men. , 2008, Carcinogenesis.

[26]  Mark Shriver,et al.  A panel of ancestry informative markers for estimating individual biogeographical ancestry and admixture from four continents: utility and applications , 2008, Human mutation.

[27]  Á. Carracedo,et al.  Inferring ancestral origin using a single multiplex assay of ancestry-informative marker SNPs. , 2007, Forensic science international. Genetics.

[28]  Michael W. Mahoney,et al.  PCA-Correlated SNPs for Structure Identification in Worldwide Human Populations , 2007, PLoS genetics.

[29]  Manuel A. R. Ferreira,et al.  PLINK: a tool set for whole-genome association and population-based linkage analyses. , 2007, American journal of human genetics.

[30]  P. V. van Helden,et al.  Length Variation of DC-SIGN and L-SIGN Neck-Region has no Impact on Tuberculosis Susceptibility , 2006, Human Immunology.

[31]  D. Reich,et al.  Principal components analysis corrects for stratification in genome-wide association studies , 2006, Nature Genetics.

[32]  Manfred Kayser,et al.  Proportioning whole-genome single-nucleotide-polymorphism diversity for the identification of geographic population structure and genetic ancestry. , 2006, American journal of human genetics.

[33]  Noah A. Rosenberg Algorithms for Selecting Informative Marker Panels for Population Assignment , 2005, J. Comput. Biol..

[34]  M. Kotze,et al.  Analysis of the three common mutations in the CARD15 gene (R702W, G908R and 1007fs) in South African colored patients with inflammatory bowel disease. , 2005, Molecular and cellular probes.

[35]  C. Dandara,et al.  CYP3A5 genotypes and risk of oesophageal cancer in two South African populations. , 2005, Cancer letters.

[36]  Xiaofeng Zhu,et al.  Genetic Structure, Self-identified Race/ethnicity, and Confounding in Case-control Association Studies , 2022 .

[37]  P. Donnelly,et al.  The effects of human population structure on large genetic association studies , 2004, Nature Genetics.

[38]  Mark D Shriver,et al.  The genomic distribution of population substructure in four populations using 8,525 autosomal SNPs , 2004, Human Genomics.

[39]  J. Belmont,et al.  Mexican American ancestry-informative markers: examination of population structure and marker characteristics in European Americans, Mexican Americans, Amerindians and Asians , 2004, Human Genetics.

[40]  R. Ward,et al.  Informativeness of genetic markers for inference of ancestry. , 2003, American journal of human genetics.

[41]  L. Opie,et al.  A new mutation, R563Q, of the beta subunit of the epithelial sodium channel associated with low-renin, low-aldosterone hypertension , 2003, Journal of hypertension.

[42]  Michael J Bamshad,et al.  Human population genetic structure and inference of group membership. , 2003, American journal of human genetics.

[43]  Ting-kai Li,et al.  Alcohol dehydrogenase-2*2 allele is associated with decreased prevalence of fetal alcohol syndrome in the mixed-ancestry population of the Western Cape Province, South Africa. , 2001, Alcoholism, clinical and experimental research.

[44]  N. Rothman,et al.  Population stratification in epidemiologic studies of common genetic variants and cancer: quantification of bias. , 2000, Journal of the National Cancer Institute.

[45]  G Barbujani,et al.  An apportionment of human DNA diversity. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[46]  T. Stapleton,et al.  Colonial South Africa and the Origins of the Racial Order , 1997 .

[47]  J. Witte,et al.  Genetic dissection of complex traits , 1996, Nature Genetics.

[48]  W S Watkins,et al.  Origins and affinities of modern humans: a comparison of mitochondrial and nuclear genetic data. , 1995, American journal of human genetics.

[49]  H. Harpending African populations: the peoples of southern Africa and their affinities. , 1986, Science.

[50]  J. Weiner,et al.  The Peoples of Southern Africa and Their Affinities , 1986 .

[51]  R. Lewontin The Apportionment of Human Diversity , 1972 .

[52]  J. Dollard,et al.  Children of Bondage. , 1941 .

[53]  J. Dollard,et al.  Children of bondage , 1940 .