Application of Maximum Likelihood Methods to Population Genetic Data for the Estimation of Individual Fertilities

SUMMARY The fitness of plants or animals within a population is largely determined by the number of offspring they produce. In natural populations lacking familial structure either one or both parents are often unknown. To circumvent this problem, parents and offspring can be genotyped for a set of genetic markers. Likelihood models are proposed to estimate the fertility of either male or male and female parents in a population where fertility is defined broadly as a fraction of progeny in the population fathered or mothered by some individual. Models are developed for three cases: the mother is known and the male fertilities differ depending on the maternal parent; the mother is known and the male fertilities are the same for all maternal parents; neither parent is known and fertilities are estimated for parent pairs. It is established that a unique maximum likelihood solution exists under conditions that are commonly met. For situations in which these conditions are not met, a method is presented to determine estimates of a set of independent linear combinations of the parameters that can be uniquely estimated. Two algorithms are examined which can be used to find the parameter estimates. The variance of the estimator, likelihood ratio tests for constraints on the fertility parameters, and a goodness-of-fit test are developed. Finally, a worked example is presented.

[1]  B. Lindsay The Geometry of Mixture Likelihoods: A General Theory , 1983 .

[2]  A. F. Smith,et al.  Statistical analysis of finite mixture distributions , 1986 .

[3]  P. Smouse,et al.  The use of restriction fragment length polymorphisms in paternity analysis. , 1986, American journal of human genetics.

[4]  D. Schoen,et al.  VARIATION IN MALE REPRODUCTIVE INVESTMENT AND MALE REPRODUCTIVE SUCCESS IN WHITE SPRUCE , 1986, Evolution; international journal of organic evolution.

[5]  George C. Williams,et al.  Sex and evolution. , 1975, Monographs in population biology.

[6]  N. Ellstrand,et al.  Interpopulation Gene Flow by Pollen in Wild Radish, Raphanus sativus , 1985, The American Naturalist.

[7]  Regina C. Elandt-Johnson,et al.  Probability models and statistical methods in genetics , 1972 .

[8]  S. Selvin Probability of nonpaternity determined by multiple allele codominant systems. , 1980, American journal of human genetics.

[9]  T. Meagher Analysis of Paternity within a Natural Population of Chamaelirium luteum. 1. Identification of Most-Likely Male Parents , 1986, The American Naturalist.

[10]  Elizabeth A. Thompson,et al.  The relationship between single parent and parent pair genetic likelihoods in genealogy reconstruction , 1986 .

[11]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[12]  P. Sherman,et al.  Multiple paternity in Belding's ground squirrel litters. , 1981, Science.

[13]  F. Marriott,et al.  Barnard's Monte Carlo Tests: How Many Simulations? , 1979 .

[14]  D. Levin,et al.  Gene Flow in Seed Plants , 1974 .

[15]  R. Chakraborty,et al.  Exclusion of paternity: the current state of the art. , 1974, American journal of human genetics.

[16]  E A Thompson Likelihood inference of paternity. , 1986, American journal of human genetics.

[17]  S. Haberman Product Models for Frequency Tables Involving Indirect Observation , 1977 .

[18]  D. Varberg Convex Functions , 1973 .

[19]  S. R. Searle Linear Models , 1971 .

[20]  Dankmar Böhning,et al.  Numerical estimation of a probability measure , 1985 .