Modeling genome evolution with a diffusion approximation of a birth-and-death process

MOTIVATION In our previous studies, we developed discrete-space birth, death and innovation models (BDIMs) of genome evolution. These models explain the origin of the characteristic Pareto distribution of paralogous gene family sizes in genomes, and model parameters that provide for the evolution of these distributions within a realistic time frame have been identified. However, extracting the temporal dynamics of genome evolution from discrete-space BDIM was not technically feasible. We were interested in obtaining dynamic portraits of the genome evolution process by developing a diffusion approximation of BDIM. RESULTS The diffusion version of BDIM belongs to a class of continuous-state models whose dynamics is described by the Fokker-Plank equation and the stationary solution could be any specified Pareto function. The diffusion models have time-dependent solutions of a special kind, namely, generalized self-similar solutions, which describe the transition from one stationary distribution of the system to another; this provides for the possibility of examining the temporal dynamics of genome evolution. Analysis of the generalized self-similar solutions of the diffusion BDIM reveals a biphasic curve of genome growth in which the initial, relatively short, self-accelerating phase is followed by a prolonged phase of slow deceleration. This evolutionary dynamics was observed both when genome growth started from zero and proceeded via innovation (a potential model of primordial evolution), and when evolution proceeded from one stationary state to another. In biological terms, this regime of evolution can be tentatively interpreted as a punctuated-equilibrium-like phenomenon whereby evolutionary transitions are accompanied by rapid gene amplification and innovation, followed by slow relaxation to a new stationary state.

[1]  M. Huynen,et al.  The frequency distribution of gene family sizes in complete genomes. , 1998, Molecular biology and evolution.

[2]  P. Bork,et al.  Measuring genome evolution. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[3]  Irene A. Stegun,et al.  Handbook of Mathematical Functions. , 1966 .

[4]  E. Koonin,et al.  The structure of the protein universe and genome evolution , 2002, Nature.

[5]  E. Koonin,et al.  Birth and death of protein domains: A simple model of evolution explains power law behavior , 2002, BMC Evolutionary Biology.

[6]  N. Kampen,et al.  Stochastic processes in physics and chemistry , 1981 .

[7]  T. Gisiger Scale invariance in biology: coincidence or footprint of a universal mechanism? , 2001, Biological reviews of the Cambridge Philosophical Society.

[8]  Eugene V Koonin,et al.  Gene family evolution: an in-depth theoretical and simulation analysis of non-linear birth-death-innovation models , 2004, BMC Evolutionary Biology.

[9]  Niles Eldredge,et al.  Punctuated equilibria , 1997, Scholarpedia.

[10]  Eugene V. Koonin,et al.  Power Laws, Scale-Free Networks and Genome Biology , 2006 .

[11]  A. Barabasi,et al.  Network biology: understanding the cell's functional organization , 2004, Nature Reviews Genetics.

[12]  W. Ebeling Stochastic Processes in Physics and Chemistry , 1995 .

[13]  P. Underhill,et al.  The Evolutionary Fate and Consequences of Duplicate Genes , 2007 .

[14]  S. Gould The Structure of Evolutionary Theory , 2002 .

[15]  Vladimir A. Kuznetsov,et al.  Distribution Associated with Stochastic Processes of Gene Expression in a Single Eukaryotic Cell , 2001, EURASIP J. Adv. Signal Process..

[16]  Andrey Rzhetsky,et al.  Birth of scale-free molecular networks and the number of distinct DNA and protein domains per genome , 2001, Bioinform..

[17]  M. Gerstein,et al.  The dominance of the population by a selected few: power-law behaviour applies to a wide variety of genomic properties , 2002, Genome Biology.

[18]  Barry D Hughes,et al.  A model explaining the size distribution of gene and protein families. , 2004, Mathematical biosciences.

[19]  G. Karev,et al.  Bifurcations of self-similar solutions of the Fokker-Plank equations , 2005, q-bio/0606004.

[20]  Eugene V. Koonin,et al.  Simple stochastic birth andz death models of genome evolution: was there enough time for us to evolve? , 2003, Bioinform..

[21]  Rabi Bhattacharya,et al.  Stochastic processes with applications , 1990 .

[22]  D. Sherrington Stochastic Processes in Physics and Chemistry , 1983 .