Resampling: An improvement of importance sampling in varying population size models.

Sequential importance sampling algorithms have been defined to estimate likelihoods in models of ancestral population processes. However, these algorithms are based on features of models with constant population size and become inefficient when the population size varies over time, which makes likelihood-based inference difficult in many demographic situations. In this work, we modify a previous sequential importance sampling algorithm to improve the efficiency of likelihood estimation. Our procedure is still based on features of the constant-size model, but it adds a resampling step with a new resampling probability distribution that depends on the pairwise composite likelihood. We tested our algorithm, called sequential importance sampling with resampling (SISR), on data sets simulated under different demographic scenarios. In most cases, we halved the computational cost for the same accuracy of inference, and in some cases reduced it by a factor of one hundred. This study provides the first assessment of the impact of such resampling techniques on parameter inference using sequential importance sampling, and extends the range of situations in which likelihood inference can be easily performed.
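
The resampling step summarized above can be illustrated with a minimal Python sketch. It is not the implementation used in this work: the function names, the exponent alpha, and the aux_scores argument (standing in for a per-particle score such as a pairwise composite likelihood) are assumptions made for illustration only. Each particle is redrawn with probability proportional to its weight raised to alpha times its auxiliary score, and its weight is then divided by that probability so that the average weight is preserved in expectation.

```python
import random


def effective_sample_size(weights):
    """Common diagnostic for deciding when to resample: (sum w)^2 / sum w^2."""
    total = sum(weights)
    squares = sum(w * w for w in weights)
    return total * total / squares if squares > 0 else 0.0


def resample(particles, weights, aux_scores, alpha=1.0, rng=random):
    """Redraw particles with probability proportional to w**alpha * aux score.

    aux_scores is a hypothetical stand-in for a per-particle score such as a
    pairwise composite likelihood; it is an assumption of this sketch.
    Each redrawn particle gets weight w / (n * p) to correct for the redraw.
    """
    n = len(particles)
    scores = [(w ** alpha) * h for w, h in zip(weights, aux_scores)]
    total = sum(scores)
    if total <= 0:
        raise ValueError("all resampling scores are zero")
    probs = [s / total for s in scores]
    # Draw n indices with replacement according to the resampling distribution.
    drawn = rng.choices(range(n), weights=probs, k=n)
    new_particles = [particles[i] for i in drawn]
    new_weights = [weights[i] / (n * probs[i]) for i in drawn]
    return new_particles, new_weights
```

With alpha = 1 and constant auxiliary scores, this reduces to ordinary multinomial resampling, where every redrawn particle carries the mean of the previous weights. Other choices of alpha and of the auxiliary score change which partial histories are favoured, while the weight correction keeps the average weight unchanged in expectation.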
