Ancestral Inference in Population Genetics Models with Selection (with Discussion)

A new algorithm is presented for exact simulation from the conditional distribution of the genealogical history of a sample, given the composition of the sample, for population genetics models with general diploid selection. The method applies to the usual diffusion approximation of evolution at a single locus, in a randomly mating population of constant size, for mutation models in which the distribution of the type of a mutant does not depend on the type of the progenitor allele; this includes any model with only two alleles. The new method is applied to ancestral inference for the two‐allele case, both with genic selection and heterozygote advantage and disadvantage, where one of the alleles is assumed to have resulted from a unique mutation event. The paper describes how the method could be used for inference when data are also available at neutral markers linked to the locus under selection. It also informally describes and constructs the non‐neutral Fleming–Viot measure‐valued diffusion.

[1]  T. Ohta,et al.  The age of a neutral mutant persisting in a finite population. , 1973, Genetics.

[2]  T. Maruyama,et al.  The age of an allele in a finite population. , 1974, Genetical research.

[3]  Anthony Unwin,et al.  Reversibility and Stochastic Networks , 1980 .

[4]  E. Mayr Adaptation and selection , 1981 .

[5]  D. Hartl,et al.  Inference of selection and recombination from nucleotide sequence data * , 1991 .

[6]  Stewart N. Ethier,et al.  Fleming-Viot processes in population genetics , 1993 .

[7]  Jun S. Liu,et al.  Sequential Imputations and Bayesian Missing Data Problems , 1994 .

[8]  Stephen M. Krone,et al.  The genealogy of samples in models with selection. , 1997, Genetics.

[9]  Stephen M. Krone,et al.  Ancestral Processes with Selection , 1997, Theoretical population biology.

[10]  S. Tavaré,et al.  The age of a mutation in a general coalescent tree , 1998 .

[11]  P. Donnelly,et al.  Conditional genealogies and the age of a neutral mutant. , 1999, Theoretical population biology.

[12]  Peter Beerli,et al.  Likelihoods on coalescents: a Monte Carlo sampling approach to inferring parameters from population samples of molecular data , 1999 .

[13]  Peter Donnelly,et al.  Genealogical processes for Fleming-Viot models with selection and recombination , 1999 .

[14]  M. Stephens Times on trees, and the age of an allele. , 2000, Theoretical population biology.

[15]  P. F. Slade Simulation of selected genealogies. , 2000, Theoretical population biology.

[16]  P. Donnelly,et al.  Inference in molecular population genetics , 2000 .

[17]  P. F. Slade Most recent common ancestor probability distributions in gene genealogies under selection. , 2000, Theoretical population biology.

[18]  A. Etheridge,et al.  An introduction to superprocesses , 2000 .

[19]  J. Kingman Origins of the coalescent. 1974-1982. , 2000, Genetics.

[20]  P. Donnelly,et al.  Association mapping in structured populations. , 2000, American journal of human genetics.

[21]  C. Wiuf Rare alleles and selection. , 2001, Theoretical population biology.

[22]  P Joyce,et al.  Likelihoods and simulation methods for a class of nonneutral population genetics models. , 2001, Genetics.

[23]  David B. Goldstein,et al.  Population genetic structure of variable drug response , 2001, Nature Genetics.

[24]  M Slatkin,et al.  Simulating genealogies of selected alleles in a population of variable size. , 2001, Genetical research.

[25]  J. Cavanaugh,et al.  International collaboration provides convincing linkage replication in complex disease through analysis of a large pooled data set: Crohn disease and chromosome 16. , 2001, American journal of human genetics.

[26]  P. Fearnhead Perfect simulation from population genetic models with selection. , 2001, Theoretical population biology.

[27]  R. Nielsen Statistical tests of selective neutrality in the age of genomics , 2001, Heredity.

[28]  E A Thompson,et al.  Linkage disequilibrium mapping: the role of population history, size, and structure. , 2001, Advances in genetics.

[29]  J. Wall,et al.  Gene conversion and different population histories may explain the contrast between polymorphism and linkage disequilibrium levels. , 2001, American journal of human genetics.

[30]  S. R. Wilson,et al.  Epistasis and its possible effects on transmission disequilibrium tests. , 2001, Annals of human genetics.

[31]  The Common Ancestor At A Non-Neutral Locus , 2001 .

[32]  P. Fearnhead The common ancestor at a nonneutral locus , 2002, Journal of Applied Probability.

[33]  D. Balding,et al.  Approximate Bayesian computation in population genetics. , 2002, Genetics.

[34]  D J Balding,et al.  Fine-scale mapping of disease loci via shattered coalescent modeling of genealogies. , 2002, American journal of human genetics.

[35]  Peter Donnelly,et al.  A comparison of bayesian methods for haplotype reconstruction from population genotype data. , 2003, American journal of human genetics.

[36]  R. Griffiths,et al.  The frequency spectrum of a mutation, and its age, in a general diffusion model. , 2003, Theoretical population biology.

[37]  F. Rousset Inferences from Spatial Population Genetics , 2004 .

[38]  M. Stephens,et al.  Inference Under the Coalescent , 2004 .

[39]  Laurent Excoffier,et al.  Analysis of Population Subdivision , 2004 .

[40]  Jan M. Rabaey,et al.  Comparison of Methods , 2004 .