Car Agene: Constructing and Joining Maximum Likelihood Genetic Maps*

Genetic mapping is an important step in the study of any organism. An accurate genetic map is extremely valuable for locating genes or more generally either qualitative or quantitative trait loci (QTL). This paper presents a new approach to two important problems in genetic mapping: automatically ordering markers to obtain a multipoint maximum likelihood map and building a multipoint maximum likelihood map using pooled data from several crosses. The approach is embodied in an hybrid algorithm that mixes the statistical optimization algorithm EM with local search techniques which have been developed in the artificial intelligence and operations research communities. An efficient implementation of the EM algorithm provides maximum likelihood recombination fractions, while the local search techniques look for orders that maximize this maximum likelihood. The specificity of the approach lies in the neighborhood structure used in the local search algorithms which has been inspired by an analogy between the marker ordering problem and the famous traveling salesman problem. The approach has been used to build joined maps for the wasp Trichogramma brassicae and on random pooled data sets. In both cases, it compares quite favorably with existing softwares as far as maximum likelihood is considered as a significant criteria.

[1]  Ronald L. Rivest,et al.  Introduction to Algorithms , 1990 .

[2]  Éric D. Taillard,et al.  Robust taboo search for the quadratic assignment problem , 1991, Parallel Comput..

[3]  Fred Glover,et al.  Tabu Search - Part II , 1989, INFORMS J. Comput..

[4]  E. Lander,et al.  Construction of multilocus genetic linkage maps in humans. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[5]  J. Ott,et al.  Multilocus linkage analysis in humans: detection of linkage and estimation of recombination. , 1985, American journal of human genetics.

[6]  Lon R. Cardon,et al.  Quantitative Trait Loci , 1995 .

[7]  M. Daly,et al.  MAPMAKER: an interactive computer package for constructing primary genetic linkage maps of experimental and natural populations. , 1987, Genomics.

[8]  K H Buetow,et al.  Multipoint gene mapping using seriation. I. General methods. , 1987, American journal of human genetics.

[9]  Bruce S. Weir,et al.  Constructing Genetic Maps by Rapid Chain Delineation , 1994 .

[10]  David S. Johnson,et al.  Local Optimization and the Traveling Salesman Problem , 1990, ICALP.

[11]  Brian W. Kernighan,et al.  An Effective Heuristic Algorithm for the Traveling-Salesman Problem , 1973, Oper. Res..

[12]  P. Stam,et al.  Construction of integrated genetic linkage maps by means of a new computer package: JOINMAP. , 1993 .

[13]  R. K. Shyamasundar,et al.  Introduction to algorithms , 1996 .

[14]  David Applegate,et al.  Finding Cuts in the TSP (A preliminary report) , 1995 .

[15]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[16]  Fred W. Glover,et al.  Tabu Search - Part I , 1989, INFORMS J. Comput..

[17]  J. Ott Analysis of Human Genetic Linkage , 1985 .

[18]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .