Ignoring linkage disequilibrium among tightly linked markers induces false-positive evidence of linkage for affected sib pair analysis.

Most multipoint linkage programs assume linkage equilibrium among the markers being studied. This assumption is appropriate for sparsely spaced markers with intermarker distances exceeding a few centimorgans, because linkage equilibrium is expected over such intervals in almost all populations. However, recent advances in high-throughput genotyping technology have made much denser marker sets available, and linkage disequilibrium (LD) may exist among these markers. Applying linkage analyses that assume linkage equilibrium to dense markers may therefore lead to bias. Here, we demonstrate that, when some or all of the parental genotypes are missing, assuming linkage equilibrium among tightly linked markers in strong LD can cause apparent oversharing of multipoint identity by descent (IBD) between sib pairs and, in turn, false-positive evidence of linkage in multipoint model-free analysis of affected sib pair data. LD can also mimic linkage between a disease locus and multiple tightly linked markers, thereby causing false-positive evidence of linkage under parametric models, particularly when heterogeneity LOD score approaches are applied. The bias can be eliminated by including parental genotype data and can be reduced by including additional unaffected siblings in the analysis.
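The oversharing effect can be illustrated with a toy calculation. The sketch below is not the simulation design of the study; it assumes a single biallelic marker with allele frequency p, no parental genotypes, no recombination, and the extreme case of perfect LD, in which the same marker is counted n_copies times as if the copies were independent. The function names (pair_likelihoods, expected_sharing) are hypothetical. Under the null hypothesis of no linkage, the expected posterior proportion of alleles shared IBD should equal 0.5, and it does only when the marker is counted once.

```python
from itertools import product

# Prior IBD distribution for a sib pair: P(IBD = 0, 1, 2).
PRIOR = (0.25, 0.5, 0.25)


def genotype(a, b):
    """Return an unordered genotype as a sorted tuple of alleles."""
    return tuple(sorted((a, b)))


def pair_likelihoods(p):
    """P(sib1 genotype, sib2 genotype | IBD = k) at one biallelic marker.

    Parents are untyped, so founder alleles are drawn from the population
    frequencies (allele A with frequency p, allele B with frequency 1 - p).
    """
    freq = {"A": p, "B": 1.0 - p}
    lik = {}

    def add(pair, k, prob):
        lik.setdefault(pair, [0.0, 0.0, 0.0])[k] += prob

    # IBD = 0: four independent alleles.
    for a, b, c, d in product("AB", repeat=4):
        add((genotype(a, b), genotype(c, d)), 0,
            freq[a] * freq[b] * freq[c] * freq[d])
    # IBD = 1: allele a is shared by both sibs.
    for a, b, c in product("AB", repeat=3):
        add((genotype(a, b), genotype(a, c)), 1, freq[a] * freq[b] * freq[c])
    # IBD = 2: both sibs carry the same two alleles.
    for a, b in product("AB", repeat=2):
        g = genotype(a, b)
        add((g, g), 2, freq[a] * freq[b])
    return lik


def expected_sharing(p, n_copies):
    """Expected posterior proportion of alleles shared IBD under no linkage.

    The data are generated from ONE marker, but the analysis treats it as
    n_copies independent markers (perfect LD wrongly analysed as linkage
    equilibrium), so each IBD likelihood is raised to the power n_copies.
    """
    total = 0.0
    for lik_k in pair_likelihoods(p).values():
        # True probability of observing this genotype pair.
        prob = sum(pr * lk for pr, lk in zip(PRIOR, lik_k))
        # Posterior weights under the misspecified model.
        w = [pr * lk ** n_copies for pr, lk in zip(PRIOR, lik_k)]
        total += prob * (0.5 * w[1] + w[2]) / sum(w)
    return total


if __name__ == "__main__":
    for n in (1, 2, 5):
        print(n, round(expected_sharing(0.5, n), 4))
```

With n_copies = 1 the model is correctly specified and the expected sharing is 0.5. With n_copies greater than 1, genotype-identical sib pairs are given too much weight toward IBD = 2, so the expected sharing exceeds 0.5 even though no linkage is present. Real dense-marker data show weaker LD than this extreme case, so the size of the inflation here is only indicative.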
