Empirical testing of a 23-AIMs panel of SNPs for ancestry evaluations in four major US populations

Abstract

Ancestry informative markers (AIMs) can be used to determine the population affiliation of the donors of forensic samples. To support ancestry evaluations for the four major populations in the USA, 23 highly informative AIMs were previously identified from the International HapMap Project. However, the efficacy of these 23 AIMs could not be fully evaluated in silico. In this study, the 23 SNPs were multiplexed to test their actual performance in ancestry evaluations. Genotype data were obtained from 189 individuals collected from four American populations. One SNP (rs12149261) on chromosome 16 was removed from the panel because it was duplicated on chromosome 1. The resulting 22-AIMs panel empirically resolved the four major populations, as it had in the in silico study. Eight individuals were assigned to a different group than the one indicated on their samples. The 22-AIMs assignments for these samples were consistent with the AIMs results from the ForenSeq™ panel. After removing the eight problematic samples, no departures from Hardy-Weinberg equilibrium (HWE) and no linkage disequilibrium (LD) were detected among the 22 SNPs in the four US populations. Principal component analysis (PCA) indicated that the remaining 181 individuals were assigned to their expected groups. These 22 SNPs can contribute to the pool of candidate AIMs for potential forensic identification purposes in major US populations.
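As an illustration of the HWE check mentioned above, the following is a minimal sketch of a chi-square goodness-of-fit test for a single biallelic SNP. The abstract does not state which test the authors used, so the choice of the chi-square test, the function name `hwe_chi_square`, and the genotype counts are all assumptions made for illustration only.

```python
import math


def hwe_chi_square(n_aa, n_ab, n_bb):
    """Chi-square test of Hardy-Weinberg equilibrium for one biallelic SNP.

    Hypothetical helper, not the authors' method. Takes observed counts of
    the three genotypes (AA, AB, BB) and returns (chi2, p_value), 1 d.f.
    """
    n = n_aa + n_ab + n_bb
    if n == 0:
        raise ValueError("at least one genotype must be observed")

    # Allele frequencies estimated from the genotype counts.
    p = (2 * n_aa + n_ab) / (2 * n)
    q = 1.0 - p

    # Genotype counts expected under HWE: p^2, 2pq, q^2.
    observed = (n_aa, n_ab, n_bb)
    expected = (n * p * p, 2 * n * p * q, n * q * q)

    # Skip classes with zero expectation (monomorphic SNP).
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected) if e > 0)

    # Survival function of a chi-square variable with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    return chi2, p_value


# Made-up counts for illustration; these are not data from the study.
chi2, p_value = hwe_chi_square(25, 50, 25)
print(f"chi2 = {chi2:.3f}, p = {p_value:.3f}")
```

With small samples or rare alleles an exact test is often preferred over the chi-square approximation, and testing 22 SNPs in four populations would normally call for a multiple-testing correction.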
