Assessment of linkage disequilibrium by the decay of haplotype sharing, with application to fine-scale genetic mapping.

Linkage disequilibrium (LD) is of great interest for gene mapping and the study of population history. We propose a multilocus model for LD, based on the decay of haplotype sharing (DHS). The DHS model is most appropriate when the LD in which one is interested is due to the introduction of a variant on an ancestral haplotype, with recombinations in succeeding generations resulting in preservation of only a small region of the ancestral haplotype around the variant. This is generally the scenario of interest for gene mapping by LD. The DHS parameter is a measure of LD that can be interpreted as the expected genetic distance to which the ancestral haplotype is preserved, or, equivalently, 1/(time in generations to the ancestral haplotype). The method allows for multiple origins of alleles and for mutations, and it takes into account missing observations and ambiguities in haplotype determination, via a hidden Markov model. Whereas most commonly used measures of LD apply to pairs of loci, the DHS measure is designed for application to the densely mapped haplotype data that are increasingly available. The DHS method explicitly models the dependence among multiple tightly linked loci on a chromosome. When the assumptions about population structure are sufficiently tractable, the estimate of LD is obtained by maximum likelihood. For more-complicated models of population history, we find means and covariances based on the model and solve a quasi-score estimating equation. Simulations show that this approach works extremely well both for estimation of LD and for fine mapping. We apply the DHS method to published data sets for cystic fibrosis and progressive myoclonus epilepsy.

[1]  J. Bennett On the theory of random mating. , 1954, Annals of eugenics.

[2]  L. Baum,et al.  An inequality and associated maximization technique in statistical estimation of probabilistic functions of a Markov process , 1972 .

[3]  M. Slatkin,et al.  On treating the chromosome as the unit of selection. , 1972, Genetics.

[4]  H. Nevanlinna The Finnish population structure. A genetic and genealogical study. , 2009, Hereditas.

[5]  A. Lima-de-faria,et al.  Amplification of ribosomal DNA in Acheta. IV. The number of cistrons for 28S and 18S ribosomal RNA. , 2009, Hereditas.

[6]  R. W. Wedderburn Quasi-likelihood functions, generalized linear models, and the Gauss-Newton method , 1974 .

[7]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[8]  H. Rothschild Biocultural aspects of diseases. , 1981 .

[9]  P. McCullagh,et al.  Generalized Linear Models , 1992 .

[10]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[11]  L. Tsui,et al.  Erratum: Identification of the Cystic Fibrosis Gene: Genetic Analysis , 1989, Science.

[12]  M. Hodson,et al.  Identification of the cystic fibrosis gene. , 1990, BMJ.

[13]  A. Chapelle,et al.  Disease gene mapping in isolated human populations: the example of Finland. , 1993, Journal of medical genetics.

[14]  E. Foss,et al.  Chiasma interference as a function of genetic distance. , 1993, Genetics.

[15]  T. Speed,et al.  Statistical analysis of crossover interference using the chi-square model. , 1995, Genetics.

[16]  J. Terwilliger A powerful likelihood method for the analysis of linkage disequilibrium between trait loci and one or more polymorphic marker loci. , 1995, American journal of human genetics.

[17]  B S Weir,et al.  Likelihood methods for locating disease genes in nonequilibrium populations. , 1995, American journal of human genetics.

[18]  T P Speed,et al.  Modeling interference in genetic recombination. , 1995, Genetics.

[19]  Len A. Pennacchio,et al.  Mutations in the Gene Encoding Cystatin B in Progressive Myoclonus Epilepsy (EPM1) , 1996, Science.

[20]  R. Myers,et al.  Progressive myoclonus epilepsy EPM1 locus maps to a 175-kb interval in distal 21q. , 1996, American journal of human genetics.

[21]  A S Whittemore,et al.  Genome scanning for linkage: an overview. , 1996, American journal of human genetics.

[22]  K. Roeder,et al.  Disequilibrium mapping: composite likelihood for pairwise disequilibrium. , 1996, Genomics.

[23]  M. Xiong,et al.  Fine-scale genetic mapping based on linkage disequilibrium: theory and applications. , 1997, American journal of human genetics.

[24]  N J Cox,et al.  Allele-sharing models: LOD scores and accurate linkage tests. , 1997, American journal of human genetics.

[25]  L. Lazzeroni,et al.  Linkage disequilibrium and gene mapping: an empirical least-squares approach. , 1998, American journal of human genetics.

[26]  E A Thompson,et al.  Disequilibrium likelihoods for fine-scale mapping of a rare allele. , 1998, American journal of human genetics.

[27]  B Rannala,et al.  Likelihood analysis of disequilibrium mapping, and related problems. , 1998, American journal of human genetics.