Level-dependent QBD models for the evolution of a family of gene duplicates

Abstract A gene family is a set of evolutionarily related genes formed by duplication. Genes within a gene family can perform a range of different but possibly overlapping functions. The process of duplication produces a gene that has identical functions to the gene it was duplicated from with subsequent divergence over time. In this paper, we explore different models for the ongoing evolution of a gene family. First, we consider a detailed model with multi-dimensional state-space which consists of binary matrices where rows of a matrix correspond to genes, columns correspond to functions, and the ijth entries record whether or not gene i performs function j. The large state space of this model makes it unsuitable for numerical analysis, but by considering the behavior of this detailed model we can test the suitability of two alternative models with more tractable state-spaces. Next, we consider a quasi-birth-and-death process (QBD) with two-dimensional states (n,m). The state (n, m) records the number of genes in the family, and the number of so called redundant genes (which are permitted to be lost). We contrast this to a level-dependent QBD with three-dimensional states (n, m, k) that record additional information which affects the transition rates. We show that the model with two-dimensional states (n, m) is insufficient for meaningful analysis, while the model with the three-dimensional states (n, m, k) is able to capture the qualitative behavior of the detailed model. We illustrate the fit between the level-dependent QBD and the original, detailed model, with numerical examples.

[1]  Tugrul Dayar,et al.  Kronecker Modeling and Analysis of Multidimensional Markovian Systems , 2018 .

[2]  Tl Stark,et al.  Markov models for the evolution of duplicate genes, and microsatellites , 2018 .

[3]  D. Liberles,et al.  Analysis of a mechanistic Markov model for gene duplicates evolving under subfunctionalization , 2017, BMC Evolutionary Biology.

[4]  Jing Zhao,et al.  On Mechanistic Modeling of Gene Content Evolution: Birth-Death Models and Mechanisms of Gene Birth and Gene Retention , 2014, Comput..

[5]  Qi-Ming He,et al.  Fundamentals of Matrix-Analytic Methods , 2013, Springer New York.

[6]  Yutaka Takahashi,et al.  A simple algorithm for the rate matrices of level-dependent QBD processes , 2010, QTNA.

[7]  Hendrik Baumann,et al.  Numerical solution of level dependent quasi-birth-and-death processes , 2010, ICCS.

[8]  Bengt Sennblad,et al.  The gene evolution model and computing its associated probabilities , 2009, JACM.

[9]  Nigel G. Bean,et al.  Algorithms for the Laplace–Stieltjes Transforms of First Return Times for Stochastic Fluid Flows , 2008 .

[10]  Jianzhi Zhang Evolution by gene duplication: an update , 2003 .

[11]  Tom Burr,et al.  Introduction to Matrix Analytic Methods in Stochastic Modeling , 2001, Technometrics.

[12]  A. Force,et al.  Preservation of duplicate genes by complementary, degenerative mutations. , 1999, Genetics.

[13]  Vaidyanathan Ramaswami,et al.  Matrix analytic methods for stochastic fluid flows , 1999 .

[14]  Ward Whitt,et al.  Numerical Inversion of Laplace Transforms of Probability Distributions , 1995, INFORMS J. Comput..

[15]  Peter G. Taylor,et al.  Calculating the equilibrium distribution in level dependent quasi-birth-and-death processes , 1995 .

[16]  Dr. Susumu Ohno Evolution by Gene Duplication , 1970, Springer Berlin Heidelberg.