Sequential Imputation and Linkage Analysis

Multilocus calculations using all available information on all pedigree members are important for linkage analysis. Exact calculation methods in linkage analysis are limited in either the number of loci or the number of pedigree members they can handle. In this thesis, we propose a Monte Carlo method for linkage analysis based on sequential imputation. Unlike exact methods, sequential imputation can handle both a moderate number of loci and a large number of pedigree members. Sequential imputation does not have the problem of slow mixing encountered by Markov chain Monte Carlo methods because of high correlation between samples from pedigree data. This Monte Carlo method is an application of importance sampling in which we sequentially impute ordered genotypes locus by locus and then impute inheritance vectors conditioned on these genotypes. The resulting inheritance vectors together with the importance sampling weights are used to derive a consistent estimator of any linkage statistic of interest. The linkage statistic can be parametric or nonparametric; we focus on nonparametric linkage statistics. We showed that sequential imputation can produce accurate estimates within reasonable computing time. Then we performed a simulation study to illustrate the potential gain in power using our method for multilocus linkage analysis with large pedigrees. We also showed how sequential imputation can be used in haplotype reconstruction, an important step in genetic mapping. In all

[1]  S Lin,et al.  Relative efficiencies of the Chi‐square recombination models for gene mapping with human pedigree data , 1999, Annals of human genetics.

[2]  Probability functions on complex pedigrees , 1978 .

[3]  Pradip Tapadar,et al.  Haplotyping in Pedigrees via a Genetic Algorithm , 1999, Human Heredity.

[4]  Terence P. Speed,et al.  An algorithm for haplotype analysis , 1997, RECOMB '97.

[5]  F. Schnell Some general formulations of linkage effects in inbreeding. , 1961, Genetics.

[6]  William S. Klug,et al.  Concepts of Genetics , 1999 .

[7]  S Karlin,et al.  Classifications and comparisons of multilocus recombination distributions. , 1978, Proceedings of the National Academy of Sciences of the United States of America.

[8]  K. Lange,et al.  An algorithm for automatic genotype elimination. , 1987, American journal of human genetics.

[9]  Eric S. Lander,et al.  Faster Multipoint Linkage Analysis Using Fourier Transforms , 1998, J. Comput. Biol..

[10]  S Lin,et al.  Two‐Locus Modeling of Asthma in a Hutterite Pedigree via Markov Chain Monte Carlo , 2001, Genetic Epidemiology.

[11]  H. Cann,et al.  Centre d'etude du polymorphisme humain (CEPH): collaborative genetic mapping of the human genome. , 1990, Genomics.

[12]  D. Qian,et al.  Minimum-recombinant haplotyping in pedigrees. , 2002, American journal of human genetics.

[13]  G. Abecasis,et al.  Merlin—rapid analysis of dense genetic maps using sparse gene flow trees , 2002, Nature Genetics.

[14]  T. Speed,et al.  Statistical analysis of crossover interference using the chi-square model. , 1995, Genetics.

[15]  J. Haines,et al.  An index marker map of chromosome 9 provides strong evidence for positive interference. , 1993, American journal of human genetics.

[16]  Elizabeth A. Thompson,et al.  Statistical inference from genetic data on pedigrees , 2003 .

[17]  J. Ott Computer-simulation methods in human linkage analysis. , 1989, Proceedings of the National Academy of Sciences of the United States of America.

[18]  D E Weeks,et al.  Multipoint mapping under genetic interference. , 1993, Human heredity.

[19]  R C Elston,et al.  A faster and more general hidden Markov model algorithm for multipoint likelihood calculations. , 1997, Human heredity.

[20]  W. Foulkes,et al.  Prevalence and penetrance of BRCA1 and BRCA2 gene mutations in unselected Ashkenazi Jewish women with breast cancer. , 1999, Journal of the National Cancer Institute.

[21]  L Kruglyak,et al.  Parametric and nonparametric linkage analysis: a unified multipoint approach. , 1996, American journal of human genetics.

[22]  J. Nutt,et al.  A gene for episodic ataxia/myokymia maps to chromosome 12p13. , 1994, American journal of human genetics.

[23]  M S McPeek,et al.  Optimal allele‐sharing statistics for genetic mapping using affected relatives , 1999, Genetic epidemiology.

[24]  J. O’Connell,et al.  The VITESSE algorithm for rapid exact multilocus linkage analysis via genotype set–recoding and fuzzy inheritance , 1995, Nature Genetics.

[25]  J. O’Connell,et al.  An optimal algorithm for automatic genotype elimination. , 1999, American journal of human genetics.

[26]  E. Lander,et al.  Construction of multilocus genetic linkage maps in humans. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[27]  Niclas Bergman,et al.  Posterior Cramér-Rao Bounds for Sequential Estimation , 2001, Sequential Monte Carlo Methods in Practice.

[28]  R. Elston,et al.  A general model for the genetic analysis of pedigree data. , 1971, Human heredity.

[29]  M. Daly,et al.  Rapid multipoint linkage analysis of recessive traits in nuclear families, including homozygosity mapping. , 1995, American journal of human genetics.

[30]  R. Elston,et al.  The investigation of linkage between a quantitative trait and a marker locus , 1972, Behavior genetics.

[31]  T P Speed,et al.  Statistical analysis of chromatid interference. , 1995, Genetics.

[32]  S Lin,et al.  Incorporating crossover interference into pedigree analysis using the chi 2 model. , 1996, Human heredity.

[33]  Michael Isard,et al.  Statistical models of visual shape and motion , 1998, Philosophical Transactions of the Royal Society of London. Series A: Mathematical, Physical and Engineering Sciences.

[34]  E. Lander,et al.  Complete multipoint sib-pair analysis of qualitative and quantitative traits. , 1995, American journal of human genetics.

[35]  L Kruglyak,et al.  Efficient multipoint linkage analysis through reduction of inheritance space. , 2001, American journal of human genetics.

[36]  A. Whittemore,et al.  A class of tests for linkage using affected pedigree members. , 1994, Biometrics.

[37]  Daniel F. Gudbjartsson,et al.  Allegro, a new computer program for multipoint linkage analysis , 2000, Nature genetics.

[38]  A. Kong,et al.  Sequential imputation for multilocus linkage analysis. , 1994, Proceedings of the National Academy of Sciences of the United States of America.