Haplotype and linkage disequilibrium architecture for human cancer-associated genes.

To facilitate association-based linkage studies, we have studied the linkage disequilibrium (LD) and haplotype architecture around five genes of interest for cancer risk: ATM, BRCA1, BRCA2, RAD51, and TP53. Single nucleotide polymorphisms (SNPs) were identified and used to construct haplotypes that span 93-200 kb per locus, with an average spacing of one SNP per 12 kb. These markers were genotyped in four ethnically defined population samples of 48 each: African Americans, Asian Americans, Hispanic Americans, and European Americans. Haplotypes were inferred using an expectation-maximization (EM) algorithm, and the data were analyzed using D', r², Fisher's exact P-values, and the four-gamete test for recombination. LD levels varied widely between loci, from continuously high LD across 200 kb to a virtual absence of LD across a similar length of genome. LD structure also varied at each gene and between the populations studied. This variation indicates that the success of linkage-based studies will require a precise description of LD at each locus and in each population to be studied. One striking consistency between genes was that, at each locus, a modest number of haplotypes present in each population accounted for a high fraction of the total number of chromosomes. We conclude that each locus has its own genomic profile with regard to LD, and that despite this there is a widespread trend of relatively low haplotype diversity. As a result, a low marker density should be adequate to identify haplotypes that represent the common variation at a locus, thereby decreasing the cost and increasing the efficiency of association studies.
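The abstract names D', r², and the four-gamete test without giving formulas. As an illustration only, the following Python sketch computes them for one pair of biallelic SNPs using the standard textbook definitions. It assumes phased haplotypes are already available (for example, as the output of an EM step) and that alleles are coded 0/1. The function names `pairwise_ld` and `four_gamete_test` are hypothetical and do not come from the study.

```python
def pairwise_ld(haplotypes):
    """Return (|D'|, r^2) for two biallelic SNPs.

    haplotypes: list of (a, b) tuples, one per chromosome, with alleles
    coded 0 or 1 at SNP A and SNP B (an assumed input format).
    """
    n = len(haplotypes)
    p_a = sum(a for a, _ in haplotypes) / n            # frequency of allele 1 at SNP A
    p_b = sum(b for _, b in haplotypes) / n            # frequency of allele 1 at SNP B
    p_ab = sum(1 for a, b in haplotypes if a == 1 and b == 1) / n

    if p_a in (0.0, 1.0) or p_b in (0.0, 1.0):
        raise ValueError("both SNPs must be polymorphic in the sample")

    d = p_ab - p_a * p_b                               # raw disequilibrium coefficient D

    # D is normalized by its maximum attainable magnitude given the allele frequencies.
    if d >= 0:
        d_max = min(p_a * (1 - p_b), (1 - p_a) * p_b)
    else:
        d_max = min(p_a * p_b, (1 - p_a) * (1 - p_b))

    d_prime = abs(d) / d_max
    r2 = d * d / (p_a * (1 - p_a) * p_b * (1 - p_b))
    return d_prime, r2


def four_gamete_test(haplotypes):
    """Return True if all four two-locus gametes (00, 01, 10, 11) are observed.

    Under an infinite-sites assumption (no recurrent mutation), observing all
    four gametes implies at least one historical recombination between the SNPs.
    """
    return len(set(haplotypes)) == 4


if __name__ == "__main__":
    # Toy data, not from the study: three of the four gametes are present.
    sample = [(0, 0)] * 6 + [(0, 1)] * 2 + [(1, 1)] * 2
    print(pairwise_ld(sample), four_gamete_test(sample))
```

In the toy sample, only three of the four possible gametes occur, so |D'| reaches its maximum while r² stays well below 1. This is one reason the two measures are usually reported together.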
