Computation of the Likelihood in Biallelic Diffusion Models Using Orthogonal Polynomials

In population genetics, parameters describing forces such as mutation, migration and drift are generally inferred from molecular data. Lately, approximate methods based on simulations and summary statistics have been widely applied for such inference, even though these methods waste information. In contrast, probabilistic methods of inference can be shown to be optimal, if their assumptions are met. In genomic regions where recombination rates are high relative to mutation rates, polymorphic nucleotide sites can be assumed to evolve independently from each other. The distribution of allele frequencies at a large number of such sites has been called “allele-frequency spectrum” or “site-frequency spectrum” (SFS). Conditional on the allelic proportions, the likelihoods of such data can be modeled as binomial. A simple model representing the evolution of allelic proportions is the biallelic mutation-drift or mutation-directional selection-drift diffusion model. With series of orthogonal polynomials, specifically Jacobi and Gegenbauer polynomials, or the related spheroidal wave function, the diffusion equations can be solved efficiently. In the neutral case, the product of the binomial likelihoods with the sum of such polynomials leads to finite series of polynomials, i.e., relatively simple equations, from which the exact likelihoods can be calculated. In this article, the use of orthogonal polynomials for inferring population genetic parameters is investigated.

[1]  James Stuart Tanton,et al.  Encyclopedia of Mathematics , 2005 .

[2]  Claus Vogl,et al.  The allele-frequency spectrum in a decoupled Moran model with mutation, drift, and directional selection, assuming small mutation rates , 2012, Theoretical population biology.

[3]  Anton Zettl,et al.  Sturm-Liouville theory , 2005 .

[4]  M. Slatkin,et al.  Using maximum likelihood to estimate population size from temporal changes in allele frequencies. , 1999, Genetics.

[5]  P. E. Falloon,et al.  Theory and computation of spheroidal wavefunctions , 2003 .

[6]  R. Punnett,et al.  The Genetical Theory of Natural Selection , 1930, Nature.

[7]  Yun S. Song,et al.  A Simple Method for Finding Explicit Analytic Transition Densities of Diffusion Processes with General Diploid Selection , 2012, Genetics.

[8]  木村 資生,et al.  Population genetics, molecular evolution, and the neutral theory : selected papers , 1994 .

[9]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[10]  W J Ewens,et al.  A note on the sampling theory for infinite alleles and infinite sites models. , 1974, Theoretical population biology.

[11]  Thorsten Gerber,et al.  Handbook Of Mathematical Functions , 2016 .

[12]  D. Hartl,et al.  Population genetics of polymorphism and divergence. , 1992, Genetics.

[13]  Steven N Evans,et al.  Non-equilibrium theory of the allele frequency spectrum. , 2006, Theoretical population biology.

[14]  R. Griffiths,et al.  A coalescent dual process in a Moran model with genic selection, and the lambda coalescent limit. , 2010, Theoretical population biology.

[15]  Franziska Wulf,et al.  Mathematical Population Genetics , 2016 .

[16]  A. Roychoudhury,et al.  Sufficiency of the number of segregating sites in the limit under finite-sites mutation. , 2010, Theoretical population biology.

[17]  M. Kimura Population Genetics, Molecular Evolution, and the Neutral Theory: Selected Papers , 1995 .

[18]  S. Wright,et al.  Evolution in Mendelian Populations. , 1931, Genetics.

[19]  E. Baake,et al.  Ancestral processes with selection: Branching and Moran models , 2007, q-bio/0702002.

[20]  C. Vogl Estimating the scaled mutation rate and mutation bias with site frequency data. , 2014, Theoretical population biology.

[21]  G. A. Watterson On the number of segregating sites in genetical models without recombination. , 1975, Theoretical population biology.

[22]  W. Stephan,et al.  Analytical results on the neutral non-equilibrium allele frequency spectrum based on diffusion theory. , 2011, Theoretical population biology.

[23]  C. Vogl Biallelic Mutation-Drift Diffusion in the Limit of Small Scaled Mutation Rates , 2014, 1409.2299.

[24]  John E. Hershey,et al.  Computation , 1991, Digit. Signal Process..

[25]  Ryan D. Hernandez,et al.  Inferring the Joint Demographic History of Multiple Populations from Multidimensional SNP Frequency Data , 2009, PLoS genetics.

[26]  C. Vogl,et al.  Evidence for complex selection on four‐fold degenerate sites in Drosophila melanogaster , 2012, Journal of evolutionary biology.

[27]  M Kimura,et al.  SOLUTION OF A PROCESS OF RANDOM GENETIC DRIFT WITH A CONTINUOUS MODEL. , 1955, Proceedings of the National Academy of Sciences of the United States of America.

[28]  D. Balding,et al.  Approximate Bayesian computation in population genetics. , 2002, Genetics.

[29]  R. Griffiths,et al.  A coalescent dual process in a Moran model with genic selection. , 2009, Theoretical population biology.

[30]  A J McKane,et al.  Singular solutions of the diffusion equation of population genetics. , 2007, Journal of theoretical biology.

[31]  George B. Arfken,et al.  Sturm-Liouville Theory , 2012 .

[32]  P. A. P. Moran,et al.  Random processes in genetics , 1958, Mathematical Proceedings of the Cambridge Philosophical Society.

[33]  J. Parsch,et al.  On the utility of short intron sequences as a reference for the detection of positive and negative selection in Drosophila. , 2010, Molecular biology and evolution.

[34]  C. Vogl,et al.  Unconstrained evolution in short introns? – An analysis of genome‐wide polymorphism and divergence data from Drosophila , 2012, Journal of evolutionary biology.

[35]  E. Thompson,et al.  Monte Carlo evaluation of the likelihood for N(e) from temporally spaced samples. , 2000, Genetics.

[36]  Jürgen Jost,et al.  An introduction to the mathematical structure of the Wright–Fisher model of population genetics , 2012, Theory in Biosciences.

[37]  William J. Thompson,et al.  Spheroidal wave functions , 1999, Comput. Sci. Eng..

[38]  T. Jukes,et al.  The neutral theory of molecular evolution. , 2000, Genetics.

[39]  P. Green,et al.  Probability and Mathematical Genetics: Papers in Honour of Sir John Kingman , 2010 .

[40]  R. Griffiths,et al.  Diffusion processes and coalescent trees , 2010, 1003.4650.

[41]  Carsten Wiuf,et al.  Gene Genealogies, Variation and Evolution - A Primer in Coalescent Theory , 2004 .

[42]  Lewei Li,et al.  COMPUTATIONS OF SPHEROIDAL HARMONICS WITH COMPLEX ARGUMENTS : A REVIEW WITH AN ALGORITHM , 1998 .

[43]  J. Meixner,et al.  Mathieusche Funktionen und Sphäroidfunktionen , 1954 .

[44]  Robert Kofler,et al.  Massive Habitat-Specific Genomic Response in D. melanogaster Populations during Experimental Evolution in Hot and Cold Environments , 2013, Molecular biology and evolution.