Inference of Multiple-wave Admixtures by Length Distribution of Ancestral Tracks

The ancestral tracks in admixed genomes are of valuable information for population history inference. A few methods have been developed to infer admixture history based on ancestral tracks. Nonetheless, these methods suffered the same flaw that only population admixture history under some specific models can be inferred. In addition, the inference of history might be biased or even unreliable if the specific model is deviated from the real situation. To address this problem, we firstly proposed a general discrete admixture model to describe the admixture history with multiple ancestral populations and multiple-wave admixtures. We next deduced the length distribution of ancestral tracks under the general discrete admixture model. We further developed a new method, MultiWaver, to explore the multiple-wave admixture histories. Our method could automatically determine an optimal admixture model based on the length distribution of ancestral tracks, and estimate the corresponding parameters under this optimal model. Specifically, we used a likelihood ratio test (LRT) to determine the number of admixture waves, and implemented an expectation??maximization (EM) algorithm to estimate parameters. We used simulation studies to validate the reliability and effectiveness of our method. Finally, good performance was observed when our method was applied to real datasets of African Americans, Mexicans, Uyghurs, and Hazaras.

[1]  S. S. Wilks The Large-Sample Distribution of the Likelihood Ratio for Testing Composite Hypotheses , 1938 .

[2]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[3]  D. Cox,et al.  A genomewide admixture map for Latino populations. , 2007, American journal of human genetics.

[4]  Annette Lee,et al.  A genomewide single-nucleotide-polymorphism panel for Mexican American admixture mapping. , 2007, American journal of human genetics.

[5]  N. Freimer,et al.  Geographic Patterns of Genome Admixture in Latin American Mestizos , 2008, PLoS genetics.

[6]  Li Jin,et al.  Analysis of genomic admixture in Uyghur and its implication in mapping strategy. , 2008, American journal of human genetics.

[7]  M. Feldman,et al.  Worldwide Human Relationships Inferred from Genome-Wide Patterns of Variation , 2008 .

[8]  Shuhua Xu,et al.  A genome-wide analysis of admixture in Uyghurs and a high-density admixture map for disease-gene discovery. , 2008, American journal of human genetics.

[9]  Alkes L. Price,et al.  Reconstructing Indian Population History , 2009, Nature.

[10]  R. Nielsen,et al.  Inference of Historical Changes in Migration Rate From the Lengths of Migrant Tracts , 2009, Genetics.

[11]  D. Reich,et al.  Sensitive Detection of Chromosomal Segments of Distinct Ancestry in Admixed Populations , 2009, PLoS genetics.

[12]  Kirk E Lohmueller,et al.  Detecting ancient admixture and estimating demographic parameters in multiple human populations. , 2009, Molecular biology and evolution.

[13]  M. Stoneking,et al.  Dating the age of admixture via wavelet transform analysis of genome-wide data , 2011, Genome Biology.

[14]  Sharon R Grossman,et al.  Integrating common and rare genetic variation in diverse human populations , 2010, Nature.

[15]  H. Ostrer,et al.  The History of African Gene Flow into Southern Europeans, Levantines, and Jews , 2011, PLoS genetics.

[16]  Xiaofeng Zhu,et al.  The landscape of recombination in African Americans , 2011, Nature.

[17]  S. Gravel Population Genetics Models of Local Ancestry , 2012, Genetics.

[18]  Jake K. Byrnes,et al.  PCAdmix: Principal Components-Based Assignment of Ancestry Along Each Chromosome in Individuals with Admixed Ancestry from Two or More Populations , 2012, Human biology.

[19]  Li Jin,et al.  Exploring population admixture dynamics via empirical and simulated genome-wide distribution of ancestral chromosomal segments. , 2012, American journal of human genetics.

[20]  O. Delaneau,et al.  A linear complexity phasing method for thousands of genomes , 2011, Nature Methods.

[21]  Rong Chen,et al.  Population genetic inference from personal genome data: impact of ancestry and admixture on human genomic variation. , 2012, American journal of human genetics.

[22]  Jake K. Byrnes,et al.  Reconstructing the Population Genetic History of the Caribbean , 2013, PLoS genetics.

[23]  Joseph K. Pickrell,et al.  Inferring Admixture Histories of Human Populations Using Linkage Disequilibrium , 2012, Genetics.

[24]  Wenfei Jin,et al.  Distribution of ancestral chromosomal segments in admixed genomes and its implications for inferring population history and admixture mapping , 2013, European Journal of Human Genetics.

[25]  B. Berger,et al.  Reconstructing Austronesian population history in Island Southeast Asia , 2014, Nature communications.

[26]  D. Falush,et al.  A Genetic Atlas of Human Admixture History , 2014, Science.

[27]  R. Nielsen,et al.  The Lengths of Admixture Tracts , 2014, Genetics.

[28]  Philip L. F. Johnson,et al.  The complete genome sequence of a Neandertal from the Altai Mountains , 2013, Nature.

[29]  Bonnie Berger,et al.  Ancient west Eurasian ancestry in southern and eastern Africa , 2013, Proceedings of the National Academy of Sciences.

[30]  D. Reich,et al.  The landscape of Neandertal ancestry in present-day humans , 2014, Nature.

[31]  David Reich,et al.  The Genetic Ancestry of African Americans, Latinos, and European Americans across the United States , 2015, American journal of human genetics.

[32]  Length Distribution of Ancestral Tracks under a General Admixture Model and Its Applications in Population History Inference , 2016, Scientific reports.

[33]  X. Ni,et al.  AdmixSim: A Forward-Time Simulator for Various and Complex Scenarios of Population Admixture , 2016, bioRxiv.

[34]  Scott M. Williams,et al.  The Great Migration and African-American Genomic Diversity , 2015, bioRxiv.

[35]  M. Stoneking,et al.  The Complex Admixture History and Recent Southern Origins of Siberian Populations , 2015, bioRxiv.