Limit theorems for the site frequency spectrum of neutral mutations in an exponentially growing population.

The site frequency spectrum (SFS) is a widely used summary statistic of genomic data, offering a simple means of inferring the evolutionary history of a population. Motivated by recent evidence for the role of neutral evolution in cancer, we examine the SFS of neutral mutations in an exponentially growing population. Whereas recent work has focused on the mean behavior of the SFS in this scenario, here, we investigate the first-order asymptotics of the underlying stochastic process. Using branching process techniques, we show that the SFS of a Galton-Watson process evaluated at a fixed time converges almost surely to a random limit. We also show that the SFS evaluated at the stochastic time at which the population first reaches a certain size converges in probability to a constant. Finally, we illustrate how our results can be used to construct consistent estimators for the extinction probability and the effective mutation rate of a birth-death process.

[1]  Jason Schweinsberg,et al.  Asymptotics for the site frequency spectrum associated with the genealogy of a birth and death process , 2023, 2304.13851.

[2]  Jason Schweinsberg,et al.  Estimating single cell clonal dynamics in human blood using coalescent theory , 2023, bioRxiv.

[3]  H. Leman,et al.  Site frequency spectrum of a rescued population under rare resistant mutations , 2023, 2303.04069.

[4]  Kevin Leder,et al.  Exact site frequency spectra of neutrally evolving tumors: A transition between power laws reveals a signature of cell viability. , 2021, Theoretical population biology.

[5]  R. Durrett,et al.  Signatures of neutral evolution in exponentially growing tumors: A theoretical perspective , 2020, bioRxiv.

[6]  Khanh N. Dinh,et al.  Statistical inference for the evolutionary history of cancer genomes , 2019, bioRxiv.

[7]  T. Antal,et al.  Genetic composition of an exponentially growing cell population , 2019 .

[8]  A. Lambert The coalescent of a sample from a binary branching process. , 2018, Theoretical population biology.

[9]  T. Antal,et al.  Mutation frequencies in a birth–death branching process , 2017, The Annals of Applied Probability.

[10]  H. Ohtsuki,et al.  Forward and backward evolutionary processes and allele frequency spectrum in a cancer cell population. , 2017, Theoretical population biology.

[11]  S. Johnston The genealogy of Galton-Watson trees , 2017, Electronic Journal of Probability.

[12]  Nicholas Navin,et al.  Tumor evolution: Linear, branching, neutral or punctuated? , 2017, Biochimica et biophysica acta. Reviews on cancer.

[13]  Matthew I. Roberts,et al.  The coalescent structure of continuous-time Galton–Watson trees , 2017, The Annals of Applied Probability.

[14]  Martin A. Nowak,et al.  Quantifying Clonal and Subclonal Passenger Mutations in Cancer Evolution , 2016, PLoS Comput. Biol..

[15]  Marc J. Williams,et al.  Identification of neutral tumor evolution across cancer types , 2016, Nature Genetics.

[16]  Xuemei Lu,et al.  Extremely high genetic diversity in a single tumor points to prevalence of non-Darwinian cell evolution , 2015, Proceedings of the National Academy of Sciences.

[17]  R. Durrett Branching Process Models of Cancer , 2015 .

[18]  C. Curtis,et al.  A Big Bang model of human colorectal tumor growth , 2015, Nature Genetics.

[19]  K. Leder,et al.  Escape times for branching processes with random mutational fitness effects , 2014 .

[20]  Rick Durrett,et al.  POPULATION GENETICS OF NEUTRAL MUTATIONS IN EXPONENTIALLY GROWING CANCER CELL POPULATIONS. , 2013, The annals of applied probability : an official journal of the Institute of Mathematical Statistics.

[21]  G. Achaz Frequency Spectrum Neutrality Tests: One for All and All for One , 2009, Genetics.

[22]  A. Lambert The Allelic Partition for Coalescent Point Processes , 2008, 0804.2572.

[23]  Kai Zeng,et al.  Statistical Tests for Detecting Positive Selection by Utilizing High-Frequency Variants , 2006, Genetics.

[24]  Z. Yang,et al.  Probability models for DNA sequence evolution , 2004, Heredity.

[25]  Subramanian Venkatesan,et al.  Tumor Evolutionary Principles: How Intratumor Heterogeneity Influences Cancer Treatment and Outcome. , 2016, American Society of Clinical Oncology educational book. American Society of Clinical Oncology. Annual Meeting.

[26]  P. Balister,et al.  BRANCHING PROCESSES , 2006 .