A hidden Markov model approach to multilocus linkage analysis in a full-sib family

Statistical packages for constructing genetic linkage maps in inbred lines are well developed and applied extensively, while linkage analysis in outcrossing species faces some statistical challenges because of their complicated genetic structures. In this article, we present a multilocus linkage analysis via hidden Markov models for a linkage group of markers in a full-sib family. The advantage of this method is the simultaneous estimation of the recombination fractions between adjacent markers that possibly segregate in different ratios, and the calculation of likelihood for a certain order of the markers. When the number of markers decreases to two or three, the multilocus linkage analysis becomes traditional two-point or three-point linkage analysis, respectively. Monte Carlo simulations are performed to show that the recombination fraction estimates of multilocus linkage analysis are more accurate than those just using two-point linkage analysis and that the likelihood as an objective function for ordering maker loci is the most powerful method compared with other methods. By incorporating this multilocus linkage analysis, we have developed a Windows software, FsLinkageMap, for constructing genetic maps in a full-sib family. A real example is presented for illustrating linkage maps constructed by using mixed segregation markers. Our multilocus linkage analysis provides a powerful method for constructing high-density genetic linkage maps in some outcrossing plant species, especially in forest trees.

[1]  Falk Ct,et al.  A simple scheme for preliminary ordering of multiple loci: application to 45 CF families. , 1989 .

[2]  Rongling Wu,et al.  A multilocus likelihood approach to joint modeling of linkage, parental diplotype and gene order in a full-sib family , 2004, BMC Genetics.

[3]  E. Lander,et al.  Construction of multilocus genetic linkage maps in humans. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[4]  J. B. S. Haldane,et al.  The probable errors of calculated linkage values, and the most accurate method of determining gametic from certain zygotic series , 1919, Journal of Genetics.

[5]  D. D. Kosambi The estimation of map distances from recombination values. , 1943 .

[6]  Rongling Wu,et al.  Simultaneous maximum likelihood estimation of linkage and linkage phases in outcrossing species. , 2002, Theoretical population biology.

[7]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[8]  S. Luan,et al.  A rice quantitative trait locus for salt tolerance encodes a sodium transporter , 2005, Nature Genetics.

[9]  Susan R. Wilson,et al.  A major simplification in the preliminary ordering of linked loci , 1988, Genetic epidemiology.

[10]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[11]  T. C. Nesbitt,et al.  fw2.2: a quantitative trait locus key to the evolution of tomato fruit size. , 2000, Science.

[12]  T. Speed,et al.  Incorporating interference into linkage analysis for experimental crosses. , 2005, Biostatistics.

[13]  Richard G. F. Visser,et al.  RECORD: a novel method for ordering loci on a genetic linkage map , 2005, Theoretical and Applied Genetics.

[14]  T. Sang,et al.  Rice Domestication by Reducing Shattering , 2007 .

[15]  C T Falk,et al.  A simple scheme for preliminary ordering of multiple loci: application to 45 CF families. , 1989, Progress in clinical and biological research.

[16]  J M Lalouel,et al.  Linkage mapping from pair-wise recombination data , 1977, Heredity.

[17]  J. Jansen,et al.  Linkage analysis in a full-sib family of an outbreeding plant species: overview and consequences for applications , 1997 .

[18]  R. Sederoff,et al.  Genetic linkage maps of Eucalyptus grandis and Eucalyptus urophylla using a pseudo-testcross: mapping strategy and RAPD markers. , 1994, Genetics.

[19]  D E Weeks,et al.  Preliminary ranking procedures for multilocus ordering. , 1987, Genomics.

[20]  L. Baum,et al.  A Maximization Technique Occurring in the Statistical Analysis of Probabilistic Functions of Markov Chains , 1970 .

[21]  P. Stam,et al.  Construction of integrated genetic linkage maps by means of a new computer package: JOINMAP. , 1993 .

[22]  Monte Carlo simulations on marker grouping and ordering , 2003, Theoretical and Applied Genetics.

[23]  Roeland E. Voorrips,et al.  Software for the calculation of genetic linkage maps , 2001 .

[24]  R. Bressan,et al.  Unraveling salt tolerance in crops , 2005, Nature Genetics.

[25]  M. Daly,et al.  MAPMAKER: an interactive computer package for constructing primary genetic linkage maps of experimental and natural populations. , 1987, Genomics.

[26]  A. Brice,et al.  A QTL for flowering time in Arabidopsis reveals a novel allele of CRY2 , 2002, Nature Genetics.

[27]  J. H. Jørgensen,et al.  The barley chromosome 5 linkage map: II. Extension of the map with four loci , 2009 .

[28]  Richard G. F. Visser,et al.  RECORD: a novel method for ordering loci on a genetic linkage map , 2005, Theoretical and Applied Genetics.