On the Inference of Ancestries in Admixed Populations

Inference of ancestral information in recently admixed populations, in which every individual has mixed ancestry (e.g., African Americans in the US), is a challenging problem. Several previous model-based approaches have used hidden Markov models (HMMs) to model the problem; however, the Markov chain Monte Carlo (MCMC) algorithms underlying these models converge slowly on realistic datasets. While retaining the HMM as a model, we show that combining an accurate, fast initialization with a local hill-climb in likelihood yields significantly improved estimates of ancestry. We study this approach in two scenarios: the inference of locus-specific ancestries in a population that is assumed to originate from two unknown ancestral populations, and the inference of allele frequencies in one ancestral population given those in another.
