The Joint Allele-Frequency Spectrum in Closely Related Species

We develop the theory for computing the joint frequency spectra of alleles in two closely related species. We allow for arbitrary population growth in both species after they had a common ancestor. We focus on the case in which a single chromosome is sequenced from one of the species. We use classical diffusion theory to show that, if the ancestral species was at equilibrium under mutation and drift and a chromosome from one of the descendant species carries the derived allele, the frequency spectrum in the other species is uniform, independently of the demographic history of both species. We also predict the expected densities of segregating and fixed sites when the chromosome from the other species carries the ancestral allele. We compare the predictions of our model with the site-frequency spectra of SNPs in the four HapMap populations of humans when the nucleotide present in the Neanderthal DNA sequence is ancestral or derived, using the chimp genome as the outgroup.

[1]  Rory A. Fisher,et al.  XVII—The distribution of gene ratios for rare mutations , 1931 .

[2]  S. Wright,et al.  The Distribution of Gene Frequencies Under Irreversible Mutation. , 1938, Proceedings of the National Academy of Sciences of the United States of America.

[3]  M Kimura,et al.  SOLUTION OF A PROCESS OF RANDOM GENETIC DRIFT WITH A CONTINUOUS MODEL. , 1955, Proceedings of the National Academy of Sciences of the United States of America.

[4]  Irene A. Stegun,et al.  Handbook of Mathematical Functions. , 1966 .

[5]  M. Kimura The number of heterozygous nucleotide sites maintained in a finite population due to steady flux of mutations. , 1969, Genetics.

[6]  T. Ohta,et al.  The age of a neutral mutant persisting in a finite population. , 1973, Genetics.

[7]  Y. Svirezhev,et al.  Diffusion Models of Population Genetics , 1990 .

[8]  D. Hartl,et al.  Population genetics of polymorphism and divergence. , 1992, Genetics.

[9]  J. Wakeley,et al.  Estimating ancestral population parameters. , 1997, Genetics.

[10]  S. Tavaré,et al.  The age of a mutation in a general coalescent tree , 1998 .

[11]  R. Nielsen Estimation of population parameters and recombination rates from single nucleotide polymorphisms. , 2000, Genetics.

[12]  D. Hartl,et al.  Directional selection and the site-frequency spectrum. , 2001, Genetics.

[13]  S. Liu-Cordero,et al.  The discovery of single-nucleotide polymorphisms--and inferences about human demographic history. , 2001, American journal of human genetics.

[14]  Richard R. Hudson,et al.  Generating samples under a Wright-Fisher neutral model of genetic variation , 2002, Bioinform..

[15]  R. Griffiths,et al.  The frequency spectrum of a mutation, and its age, in a general diffusion model. , 2003, Theoretical population biology.

[16]  Carlos D Bustamante,et al.  Ascertainment bias in studies of human genome-wide polymorphism. , 2005, Genome research.

[17]  Ryan D. Hernandez,et al.  Simultaneous inference of selection and population growth from patterns of variation in the human genome , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[18]  Adrian W. Briggs,et al.  Analysis of one million base pairs of Neanderthal DNA , 2006, Nature.

[19]  Philip L. F. Johnson,et al.  Inference of population genetic parameters in metagenomics: a clean look at messy data. , 2006, Genome research.

[20]  Feng Chen,et al.  Sequencing and Analysis of Neanderthal Genomic DNA , 2006, Science.

[21]  J. Pritchard,et al.  A Map of Recent Positive Selection in the Human Genome , 2006, PLoS biology.

[22]  Steven N Evans,et al.  Non-equilibrium theory of the allele frequency spectrum. , 2006, Theoretical population biology.