Cyclic and multilevel causation in evolutionary processes

Many models of evolution are implicitly causal processes. Features such as causal feedback between evolutionary variables and evolutionary processes acting at multiple levels, though, mean that conventional causal models miss important phenomena. We develop here a general theoretical framework for analyzing evolutionary processes drawing on recent approaches to causal modeling developed in the machine-learning literature, which have extended Pearl’s ‘do’-calculus to incorporate cyclic causal interactions and multilevel causation. We also develop information-theoretic notions necessary to analyze causal information dynamics in our framework, introducing a causal generalization of the Partial Information Decomposition framework. We show how our causal framework helps to clarify conceptual issues in the contexts of complex trait analysis and cancer genetics, including assigning variation in an observed trait to genetic, epigenetic and environmental sources in the presence of epigenetic and environmental feedback processes, and variation in fitness to mutation processes in cancer using a multilevel causal model respectively, as well as relating causally-induced to observed variation in these variables via information theoretic bounds. In the process, we introduce a general class of multilevel causal evolutionary processes which connect evolutionary processes at multiple levels via coarse-graining relationships. Further, we show how a range of ‘fitness models’ can be formulated in our framework, as well as a causal analog of Price’s equation (generalizing the probabilistic ‘Rice equation’), clarifying the relationships between realized/probabilistic fitness and direct/indirect selection. Finally, we consider the potential relevance of our framework to foundational issues in biology and evolution, including supervenience, multilevel selection and individuality. Particularly, we argue that our class of multilevel causal evolutionary processes, in conjunction with a minimum description length principle, provides a conceptual framework in which identification of multiple levels of selection may be reduced to a model selection problem.

[1]  Mark Gerstein,et al.  Estimating growth patterns and driver effects in tumor evolution from individual samples , 2019, Nature Communications.

[2]  Benjamin J. Raphael,et al.  Characterizing genetic intra-tumor heterogeneity across 2,658 human cancer genomes , 2020, Cell.

[3]  M. Nowak,et al.  Evolution of cooperation by multilevel selection. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[4]  P. Visscher,et al.  Integration of summary data from GWAS and eQTL studies predicts complex trait gene targets , 2016, Nature Genetics.

[5]  S. Frank Natural selection maximizes Fisher information , 2009, Journal of evolutionary biology.

[6]  S. Okasha The Relation between Kin and Multilevel Selection: An Approach Using Causal Graphs , 2016, The British Journal for the Philosophy of Science.

[7]  George Davey Smith,et al.  Mendelian randomization: Using genes as instruments for making causal inferences in epidemiology , 2008, Statistics in medicine.

[8]  Lucas Lochovsky,et al.  Passenger Mutations in More Than 2,500 Cancer Genomes: Overall Molecular Functional Impact and Consequences , 2020, Cell.

[9]  Johan Paulsson,et al.  Multileveled selection on plasmid replication. , 2002, Genetics.

[11]  M. Nowak,et al.  The evolution of eusociality , 2010, Nature.

[12]  Bernhard Schölkopf,et al.  Elements of Causal Inference: Foundations and Learning Algorithms , 2017 .

[13]  Joris M. Mooij,et al.  Foundations of structural causal models with cycles and latent variables , 2016, The Annals of Statistics.

[14]  David T. W. Jones,et al.  Signatures of mutational processes in human cancer , 2013, Nature.

[15]  Bernhard Schölkopf,et al.  Theoretical Aspects of Cyclic Structural Causal Models , 2016 .

[16]  B. Schuster-Böckler,et al.  The effects of mutational processes and selection on driver mutations across cancer types , 2017, Nature Communications.

[17]  Eckehard Olbrich,et al.  The information theory of individuality , 2014, Theory in Biosciences.

[18]  Benjamin A. Logsdon,et al.  Gene Expression Elucidates Functional Impact of Polygenic Risk for Schizophrenia , 2016, Nature Neuroscience.

[19]  Randall D. Beer,et al.  Nonnegative Decomposition of Multivariate Information , 2010, ArXiv.

[20]  Steve Awodey,et al.  Category Theory , 2006 .

[21]  S. Okasha Evolution and the Levels of Selection , 2007 .

[22]  Pietro Perona,et al.  Multi-Level Cause-Effect Systems , 2015, AISTATS.

[23]  Bernhard Schölkopf,et al.  From Ordinary Differential Equations to Structural Causal Models: the deterministic case , 2013, UAI.

[24]  A. Wagner Causal Drift, Robust Signaling, and Complex Disease , 2015, PloS one.

[25]  Olaf Sporns,et al.  Measuring information integration , 2003, BMC Neuroscience.

[26]  Erik P. Hoel When the Map Is Better Than the Territory , 2016, Entropy.

[27]  Kim Sterelny,et al.  The Major Transitions in Evolution Revisited , 2011 .

[28]  Eckehard Olbrich,et al.  Reconsidering unique information: Towards a multivariate information decomposition , 2014, 2014 IEEE International Symposium on Information Theory.

[29]  Bernhard Schölkopf,et al.  Causal Inference Using the Algorithmic Markov Condition , 2008, IEEE Transactions on Information Theory.

[30]  Bernhard Schölkopf,et al.  Estimating Causal Effects by Bounding Confounding , 2014, UAI.

[31]  D. Krakauer,et al.  Diversity, Dilemmas, and Monopolies of Niche Construction , 2008, The American Naturalist.

[32]  S. Omholt,et al.  Phenomics: the next challenge , 2010, Nature Reviews Genetics.

[33]  Eckehard Olbrich,et al.  Quantifying unique information , 2013, Entropy.

[34]  Munther A. Dahleh,et al.  Structure learning in causal cyclic networks , 2008 .

[35]  Moritz Grosse-Wentrup,et al.  Quantifying causal influences , 2012, 1203.6502.

[36]  Prashant S. Emani,et al.  Comprehensive functional genomic resource and integrative model for the human brain , 2018, Science.

[37]  Sean H. Rice,et al.  Evolutionary Theory: Mathematical and Conceptual Foundations , 2004 .

[38]  Daniel Polani,et al.  Information Flows in Causal Networks , 2008, Adv. Complex Syst..

[39]  Nir Friedman,et al.  Probabilistic Graphical Models - Principles and Techniques , 2009 .

[40]  Christof Koch,et al.  Quantifying synergistic mutual information , 2012, ArXiv.

[41]  Judea Pearl Causality by Judea Pearl , 2009 .

[42]  M. Gerstein,et al.  Estimating growth patterns and driver effects in tumor evolution from individual samples , 2019, bioRxiv.

[43]  Yuan Ji,et al.  Portraits of genetic intra-tumour heterogeneity and subclonal selection across cancer types , 2018, bioRxiv.

[44]  Bernhard Schölkopf,et al.  Causal Consistency of Structural Equation Models , 2017, UAI.

[45]  Karola Stotz,et al.  Measuring Causal Specificity , 2015, Philosophy of Science.

[46]  B. H. Miller,et al.  Evaluating genetic markers and neurobiochemical analytes for fluoxetine response using a panel of mouse inbred strains , 2011, Psychopharmacology.