Bayesian fine-scale mapping of disease loci, by hidden Markov models.

We present a new multilocus method for the fine-scale mapping of genes contributing to human diseases. The method is designed for use with multiple biallelic markers, in particular single-nucleotide polymorphisms, for which high-density genetic maps will soon be available. We model disease-marker association in a candidate region via a hidden Markov process and allow for correlation between linked marker loci. Using Markov chain Monte Carlo simulation methods, we obtain posterior distributions of the model parameters, including disease-gene location and the age of the disease-predisposing mutation. In addition, we allow for heterogeneity in recombination rates across the candidate region, to account for recombination hot and cold spots. We also obtain a posterior distribution for the ancestral marker haplotype; this feature is unique to our method and, unlike maximum-likelihood estimation, can properly account for uncertainty. We apply the method to data for cystic fibrosis and Huntington disease, for which mutations in disease genes have already been identified. The new method performs well compared with existing multilocus mapping methods.
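
To make the hidden Markov idea concrete, the following is a minimal sketch of a forward-algorithm likelihood for one disease haplotype. It is not the model of the paper: the two-state ancestry process, the decay term exp(-tau * d), and all names (`emission`, `transition`, `forward_loglik`, `eps`, `p_anc0`) are assumptions chosen for illustration. The hidden state at each marker records whether the haplotype still carries the ancestral segment, and markers are ordered moving away from the putative disease locus.

```python
import math

def emission(state, allele, anc_allele, freq1, eps):
    """P(observed allele | hidden state) at one biallelic marker.

    state 1: segment is ancestral, so the allele matches the ancestral
             allele except with small probability eps (assumed value).
    state 0: segment is not ancestral, so the allele is drawn from the
             population frequency freq1 of allele 1.
    """
    if state == 1:
        return 1.0 - eps if allele == anc_allele else eps
    return freq1 if allele == 1 else 1.0 - freq1

def transition(tau, d):
    """2x2 transition matrix between adjacent markers (rows = from-state).

    Assumption: ancestry survives across distance d (Morgans) for tau
    generations with probability exp(-tau * d), and is never regained.
    """
    stay = math.exp(-tau * d)
    return [[1.0, 0.0], [1.0 - stay, stay]]

def forward_loglik(alleles, anc, freqs, dists, tau, eps=0.01, p_anc0=0.9):
    """Log-likelihood of one haplotype via the scaled forward algorithm."""
    prior = [1.0 - p_anc0, p_anc0]
    alpha = [0.0, 0.0]
    loglik = 0.0
    for i in range(len(alleles)):
        if i == 0:
            pred = prior
        else:
            T = transition(tau, dists[i - 1])
            pred = [sum(alpha[r] * T[r][s] for r in (0, 1)) for s in (0, 1)]
        alpha = [pred[s] * emission(s, alleles[i], anc[i], freqs[i], eps)
                 for s in (0, 1)]
        scale = sum(alpha)          # rescale to avoid numerical underflow
        loglik += math.log(scale)
        alpha = [a / scale for a in alpha]
    return loglik
```

In a Bayesian analysis of the kind described above, a likelihood of this form would be evaluated inside a Markov chain Monte Carlo sampler that proposes new values for quantities such as the mutation age (`tau` here) and the ancestral haplotype (`anc` here); that sampler is omitted from this sketch.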