Automatic Selection ofLoop Breakers for GeneticLinkage Analysis

Pedigree loops pose a difficult computational challenge in genetic linkage analysis. The most popular linkage analysis package, LINKAGE, uses an algorithm that converts a looped pedigree into a loopless pedigree, which is traversed many times. The conversion is controlled by user selection of individuals to act as loop breakers. The selection of loop breakers has significant impact on the running time of the subsequent linkage analysis. We have automated the process of selecting loop breakers, implemented a hybrid algorithm for it in the FASTLINK version of LINKAGE, and tested it on many real pedigrees with excellent performance. We point out that there is no need to break each loop by a distinct individual because, with minor modification to the algorithms in LINKAGE/FASTLINK, a single individual that participates in multiple marriages can serve as a loop breaker for several loops. Our algorithm for finding loop breakers, called LOOPBREAKER, is a combination of: (1) a new algorithm that is guaranteed to be optimal in the special case of pedigrees with no multiple marriages and (2) an adaptation of a known algorithm for breaking loops in general graphs. The contribution of this work is the adaptation of abstract methods from computer science to a challenging problem in genetics.

[1]  A A Schäffer,et al.  Avoiding recomputation in linkage analysis. , 1994, Human heredity.

[2]  Judea Pearl,et al.  Fusion, Propagation, and Structuring in Belief Networks , 1986, Artif. Intell..

[3]  A A Schäffer,et al.  Localization of a gene for an autosomal recessive form of juvenile Parkinsonism to chromosome 6q25.2-27. , 1997, American journal of human genetics.

[4]  G. Lathrop,et al.  Easy calculations of lod scores and genetic risks on small computers. , 1984, American journal of human genetics.

[5]  Dan Geiger,et al.  A fast algorithm for finding junction trees 81 A sufficientlyfast algorithm for finding close to optimal junction trees , 1996 .

[6]  R. Elston,et al.  A general model for the genetic analysis of pedigree data. , 1971, Human heredity.

[7]  K. Lange,et al.  Extensions to pedigree analysis. V. Optimal calculation of Mendelian likelihoods. , 1983, Human heredity.

[8]  A A Schäffer,et al.  Faster linkage analysis computations for pedigrees with loops or unused alleles. , 1996, Human heredity.

[9]  J. Kruskal On the shortest spanning subtree of a graph and the traveling salesman problem , 1956 .

[10]  Dan Geiger,et al.  Approximation Algorithms for the Loop Cutset Problem , 1994, UAI.

[11]  Probability functions on complex pedigrees , 1978 .

[12]  Judea Pearl,et al.  A Constraint-Propagation Approach to Probabilistic Reasoning , 1985, UAI.

[13]  S. Daiger,et al.  Linkage mapping of autosomal dominant retinitis pigmentosa (RP1) to the pericentric region of human chromosome 8. , 1991, Genomics.

[14]  Dan Geiger,et al.  Identifying independence in bayesian networks , 1990, Networks.

[15]  J. O’Connell,et al.  The VITESSE algorithm for rapid exact multilocus linkage analysis via genotype set–recoding and fuzzy inheritance , 1995, Nature Genetics.

[16]  Gregory F. Cooper,et al.  Probabilistic inference in multiply connected belief networks using loop cutsets , 1990, Int. J. Approx. Reason..

[17]  Robert C. Elston,et al.  Extensions to Pedigree Analysis , 1975 .

[18]  J. Ott,et al.  Multilocus linkage analysis in humans: detection of linkage and estimation of recombination. , 1985, American journal of human genetics.

[19]  David J. Spiegelhalter,et al.  Local computations with probabilities on graphical structures and their application to expert systems , 1990 .

[20]  J. Ott Estimation of the recombination fraction in human pedigrees: efficient computation of the likelihood for human linkage studies. , 1974, American journal of human genetics.

[21]  Dan Geiger,et al.  A Practical Algorithm for Finding Optimal Triangulations , 1997, AAAI/IAAI.

[22]  Jurg Ott,et al.  Handbook of Human Genetic Linkage , 1994 .

[23]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems - networks of plausible inference , 1991, Morgan Kaufmann series in representation and reasoning.

[24]  Judea Pearl,et al.  A Computational Model for Causal and Diagnostic Reasoning in Inference Systems , 1983, IJCAI.

[25]  Bruce A. Reed,et al.  Finding approximate separators and computing tree width quickly , 1992, STOC '92.

[26]  A A Schäffer,et al.  Faster sequential genetic linkage computations. , 1993, American journal of human genetics.

[27]  Dan Geiger,et al.  Optimization of Pearl's Method of Conditioning and Greedy-Like Approximation Algorithms for the Vertex Feedback Set Problem , 1996, Artif. Intell..

[28]  G. P. Vogler,et al.  Efficient methods for computing linkage likelihoods of recessive diseases in inbred pedigrees , 1991, Genetic epidemiology.

[29]  C. Greenberg,et al.  Limb-girdle muscular dystrophy and Miyoshi myopathy in an aboriginal Canadian kindred map to LGMD2B and segregate with the same haplotype. , 1996, American journal of human genetics.

[30]  Reuven Bar-Yehuda,et al.  Approximation algorithms for the vertex feedback set problem with applications to constraint satisfaction and Bayesian inference , 1994, SODA '94.

[31]  Piotr Berman,et al.  Constant Ratio Approximations of the Weighted Feedback Vertex Set Problem for Undirected Graphs , 1995, ISAAC.

[32]  J. Ott,et al.  Strategies for multilocus linkage analysis in humans. , 1984, Proceedings of the National Academy of Sciences of the United States of America.

[33]  Shimon Even,et al.  Graph Algorithms , 1979 .