Recombination hotspots as a point process

The variation of the recombination rate along chromosomal DNA is one of the important determinants of the patterns of linkage disequilibrium. A number of inferential methods have been developed which estimate the recombination rate and its variation from population genetic data. The majority of these methods are based on modelling the genealogical process underlying a sample of DNA sequences and thus explicitly include a model of the demographic process. Here we propose a different inferential procedure based on a previously introduced framework where recombination is modelled as a point process along a DNA sequence. The approach infers regions containing putative hotspots based on the inferred minimum number of recombination events; it thus depends only indirectly on the underlying population demography. A Poisson point process model with local rates is then used to infer patterns of recombination rate estimation in a fully Bayesian framework. We illustrate this new approach by applying it to several population genetic datasets, including a region with an experimentally confirmed recombination hotspot.

[1]  W. Ewens Mathematical Population Genetics , 1980 .

[2]  R. Hudson Two-locus sampling distributions and their application. , 2001, Genetics.

[3]  Sylvia Richardson,et al.  Markov Chain Monte Carlo in Practice , 1997 .

[4]  David B. Goldstein,et al.  Demography, Recombination Hotspot Intensity, and the Block Structure of Linkage Disequilibrium , 2003, Current Biology.

[5]  J. Hein,et al.  Recombination as a point process along sequences. , 1999, Theoretical population biology.

[6]  Refik Soyer,et al.  Bayesian Methods for Nonlinear Classification and Regression , 2004, Technometrics.

[7]  P. Donnelly,et al.  The Fine-Scale Structure of Recombination Rate Variation in the Human Genome , 2004, Science.

[8]  M. Stephens,et al.  Modeling linkage disequilibrium and identifying recombination hotspots using single-nucleotide polymorphism data. , 2003, Genetics.

[9]  Carsten Wiuf,et al.  Gene Genealogies, Variation and Evolution - A Primer in Coalescent Theory , 2004 .

[10]  D G Denison,et al.  Bayesian Partitioning for Estimating Disease Risk , 2001, Biometrics.

[11]  R. Hudson,et al.  Statistical properties of the number of recombination events in the history of a sample of DNA sequences. , 1985, Genetics.

[12]  R. Griffiths,et al.  Bounds on the minimum number of recombination events in a sample history. , 2003, Genetics.

[13]  Simon Tavaré,et al.  Linkage disequilibrium: what history has to tell us. , 2002, Trends in genetics : TIG.

[14]  P. Green,et al.  Hidden Markov Models and Disease Mapping , 2002 .

[15]  Adrian F. M. Smith,et al.  A Bayesian CART algorithm , 1998 .

[16]  L. Kelley,et al.  The extent and importance of intragenic recombination , 2004, Human Genomics.

[17]  P. Donnelly,et al.  Approximate likelihood methods for estimating local recombination rates , 2002 .

[18]  J. Pritchard,et al.  Linkage disequilibrium in humans: models and data. , 2001, American journal of human genetics.

[19]  P. Marjoram,et al.  Ancestral Inference from Samples of DNA Sequences with Recombination , 1996, J. Comput. Biol..

[20]  Carsten Wiuf,et al.  Simulating haplotype blocks in the human genome , 2003, Bioinform..

[21]  Carsten Wiuf,et al.  A coalescent model of recombination hotspots. , 2003, Genetics.

[22]  M. Stumpf Haplotype diversity and SNP frequency dependence in the description of genetic variation , 2004, European Journal of Human Genetics.

[23]  Christian P. Robert,et al.  Monte Carlo Statistical Methods , 2005, Springer Texts in Statistics.

[24]  P. Donnelly,et al.  Estimating recombination rates from population genetic data. , 2001, Genetics.

[25]  A. Jeffreys,et al.  Intensely punctate meiotic recombination in the class II region of the major histocompatibility complex , 2001, Nature Genetics.

[26]  P. Fearnhead,et al.  A coalescent-based method for detecting and estimating recombination from gene sequences. , 2002, Genetics.

[27]  Guido Consonni,et al.  A Bayesian Method for Combining Results from Several Binomial Experiments , 1995 .

[28]  P. Green Reversible jump Markov chain Monte Carlo computation and Bayesian model determination , 1995 .