Statistical method for testing the neutral mutation hypothesis by DNA polymorphism.

The relationship between the two estimates of genetic variation at the DNA level, namely the number of segregating sites and the average number of nucleotide differences estimated from pairwise comparison, is investigated. It is found that the correlation between these two estimates is large when the sample size is small, and decreases slowly as the sample size increases. Using the relationship obtained, a statistical method for testing the neutral mutation hypothesis is developed. This method needs only the data of DNA polymorphism, namely the genetic variation within population at the DNA level. A simple method of computer simulation, that was used in order to obtain the distribution of a new statistic developed, is also presented. Applying this statistical method to the five regions of DNA sequences in Drosophila melanogaster, it is found that large insertion/deletion (greater than 100 bp) is deleterious. It is suggested that the natural selection against large insertion/deletion is so weak that a large amount of variation is maintained in a population.

[1]  J. Crow,et al.  THE NUMBER OF ALLELES THAT CAN BE MAINTAINED IN A FINITE POPULATION. , 1964, Genetics.

[2]  M. Kimura Evolutionary Rate at the Molecular Level , 1968, Nature.

[3]  M. Kimura The number of heterozygous nucleotide sites maintained in a finite population due to steady flux of mutations. , 1969, Genetics.

[4]  W. Ewens The sampling theory of selectively neutral alleles. , 1972, Theoretical population biology.

[5]  T. Ohta Slightly Deleterious Mutant Substitutions in Evolution , 1973, Nature.

[6]  G. A. Watterson The sampling theory of selectively neutral alleles , 1974, Advances in Applied Probability.

[7]  T. Ohta Mutational pressure as the main cause of molecular evolution and polymorphism , 1974, Nature.

[8]  B. Lewin Units of transcription and translation: Sequence components of heterogeneous nuclear RNA and messenger RNA , 1975, Cell.

[9]  G. A. Watterson On the number of segregating sites in genetical models without recombination. , 1975, Theoretical population biology.

[10]  M. Kimura,et al.  The neutral theory of molecular evolution. , 1983, Scientific American.

[11]  M. Kimura Model of effectively neutral mutations in which selective constraint is incorporated. , 1979, Proceedings of the National Academy of Sciences of the United States of America.

[12]  Richard R. Hudson,et al.  TESTING THE CONSTANT‐RATE NEUTRAL ALLELE MODEL WITH PROTEIN SEQUENCE DATA , 1983, Evolution; international journal of organic evolution.

[13]  C. Aquadro,et al.  Human mitochondrial DNA variation and evolution: analysis of nucleotide sequences from seven individuals. , 1983, Genetics.

[14]  A. Brown Variation at the 87A heat shock locus in Drosophila melanogaster. , 1983, Proceedings of the National Academy of Sciences of the United States of America.

[15]  F. Tajima Evolutionary relationship of DNA sequences in finite populations. , 1983, Genetics.

[16]  C. Aquadro,et al.  Molecular population genetics of the alcohol dehydrogenase gene region of Drosophila melanogaster. , 1986, Genetics.

[17]  M. Aguadé,et al.  Excess polymorphism at the Adh locus in Drosophila melanogaster. , 1986, Genetics.

[18]  R. Hudson,et al.  A test of neutral molecular evolution based on nucleotide data. , 1987, Genetics.

[19]  N L Kaplan,et al.  The coalescent process in models with selection. , 1988, Genetics.

[20]  C. Langley,et al.  Molecular and phenotypic variation of the white locus region in Drosophila melanogaster. , 1988, Genetics.

[21]  N L Kaplan,et al.  The coalescent process in models with selection and recombination. , 1988, Genetics.