Non-equilibrium theory of the allele frequency spectrum.

A forward diffusion equation describing the evolution of the allele frequency spectrum is presented. The influx of mutations is accounted for by imposing a suitable boundary condition. For a Wright-Fisher diffusion with or without selection and varying population size, the boundary condition is lim(x downward arrow0)xf(x,t)=thetarho(t), where f(.,t) is the frequency spectrum of derived alleles at independent loci at time t and rho(t) is the relative population size at time t. When population size and selection intensity are independent of time, the forward equation is equivalent to the backwards diffusion usually used to derive the frequency spectrum, but this approach allows computation of the time dependence of the spectrum both before an equilibrium is attained and when population size and selection intensity vary with time. From the diffusion equation, a set of ordinary differential equations for the moments of f(.,t) is derived and the expected spectrum of a finite sample is expressed in terms of those moments. The use of the forward equation is illustrated by considering neutral and selected alleles in a highly simplified model of human history. For example, it is shown that approximately 30% of the expected total heterozygosity of neutral loci is attributable to mutations that arose since the onset of population growth in roughly the last 150,000 years.

[1]  S. Wright,et al.  The Distribution of Gene Frequencies Under Irreversible Mutation. , 1938, Proceedings of the National Academy of Sciences of the United States of America.

[2]  W. Ewens Mathematical Population Genetics : I. Theoretical Introduction , 2004 .

[3]  F. Tajima The effect of change in population size on DNA polymorphism. , 1989, Genetics.

[4]  S. Wooding,et al.  The matrix coalescent and an application to human single-nucleotide polymorphisms. , 2002, Genetics.

[5]  D. Hartl,et al.  Directional selection and the site-frequency spectrum. , 2001, Genetics.

[6]  Rory A. Fisher,et al.  XVII—The distribution of gene ratios for rare mutations , 1931 .

[7]  R. Griffiths,et al.  The frequency spectrum of a mutation, and its age, in a general diffusion model. , 2003, Theoretical population biology.

[8]  M. Kimura The number of heterozygous nucleotide sites maintained in a finite population due to steady flux of mutations. , 1969, Genetics.

[9]  S. Liu-Cordero,et al.  The discovery of single-nucleotide polymorphisms--and inferences about human demographic history. , 2001, American journal of human genetics.

[10]  D. Hartl,et al.  Population genetics of polymorphism and divergence. , 1992, Genetics.

[11]  F. Knight Essentials of Brownian Motion and Diffusion , 1981 .

[12]  Ryan D. Hernandez,et al.  Natural selection on protein-coding genes in the human genome , 2005, Nature.

[13]  C. Bustamante,et al.  Population Genetics of Polymorphism and Divergence for Diploid Selection Models With Arbitrary Dominance , 2004, Genetics.

[14]  I. M. Pyshik,et al.  Table of integrals, series, and products , 1965 .

[15]  R. Nielsen Estimation of population parameters and recombination rates from single nucleotide polymorphisms. , 2000, Genetics.

[16]  Ryan D. Hernandez,et al.  Simultaneous inference of selection and population growth from patterns of variation in the human genome , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[17]  E. Lander,et al.  On the allelic spectrum of human disease. , 2001, Trends in genetics : TIG.

[18]  W. Stephan,et al.  Detecting a local signature of genetic hitchhiking along a recombining chromosome. , 2002, Genetics.

[19]  W. Godwin Article in Press , 2000 .

[20]  W Stephan,et al.  The hitchhiking effect on the site frequency spectrum of DNA polymorphisms. , 1995, Genetics.

[21]  Justin C. Fay,et al.  Testing the neutral theory of molecular evolution with genomic data from Drosophila , 2002, Nature.

[22]  David M. Williams Path Decomposition and Continuity of Local Time for One‐Dimensional Diffusions, I , 1974 .

[23]  M. Kimmel,et al.  New explicit expressions for relative frequencies of single-nucleotide polymorphisms with application to statistical inference on population growth. , 2003, Genetics.

[24]  L. Rogers,et al.  Diffusions, Markov processes, and martingales , 1979 .

[25]  S. Wright,et al.  Evolution in Mendelian Populations. , 1931, Genetics.

[26]  M. Nei,et al.  THE BOTTLENECK EFFECT AND GENETIC VARIABILITY IN POPULATIONS , 1975, Evolution; international journal of organic evolution.

[27]  S. Tavaré,et al.  The age of a mutation in a general coalescent tree , 1998 .

[28]  Y. Svirezhev,et al.  Diffusion Models of Population Genetics , 1990 .

[29]  I. S. Gradshteyn,et al.  Table of Integrals, Series, and Products , 1976 .

[30]  M Kimura,et al.  SOLUTION OF A PROCESS OF RANDOM GENETIC DRIFT WITH A CONTINUOUS MODEL. , 1955, Proceedings of the National Academy of Sciences of the United States of America.

[31]  W. G. Hill,et al.  Effective size of populations with overlapping generations. , 1972, Theoretical population biology.

[32]  J. Pitman,et al.  A decomposition of Bessel Bridges , 1982 .