Haplotype diversity across 100 candidate genes for inflammation, lipid metabolism, and blood pressure regulation in two populations.

Recent studies have suggested that a significant fraction of the human genome is contained in blocks of strong linkage disequilibrium, ranging from ~5 to >100 kb in length, and that within these blocks a few common haplotypes may account for >90% of the observed haplotypes. Furthermore, previous studies have suggested that common haplotypes in candidate genes are generally shared across populations and represent the majority of chromosomes in each population. The conclusions drawn from these preliminary studies, however, are based on an incomplete knowledge of the variation in the regions examined. To bridge this gap in knowledge, we have completely resequenced 100 candidate genes in a population of African descent and one of European descent. Although these genes have been well studied because of their medical importance, we demonstrate that a large amount of sequence variation has not yet been described. We also report that the average number of inferred haplotypes per gene, when complete data is used, is higher than in previous reports and that the number and proportion of all haplotypes represented by common haplotypes per gene is variable. Furthermore, we demonstrate that haplotypes shared between the two populations constitute only a fraction of the total number of haplotypes observed and that these shared haplotypes represent fewer of the African-descent chromosomes than was expected from previous studies. Finally, we show that restricting variation discovery to coding regions does not adequately describe all common haplotypes or the true haplotype block structure observed when all common variation is used to infer haplotypes. These data, derived from complete knowledge of genetic variation in these genes, suggest that the haplotype architecture of candidate genes across the human genome is more complex than previously suggested, with important implications for candidate gene and genomewide association studies.

[1]  E. Boerwinkle,et al.  Haplotype structure and population genetic inferences from nucleotide-sequence variation in human lipoprotein lipase. , 1998, American journal of human genetics.

[2]  C. Carlson,et al.  Selecting a maximally informative set of single-nucleotide polymorphisms for association analyses using linkage disequilibrium. , 2004, American journal of human genetics.

[3]  R. Noelle,et al.  Combined effects of calcineurin inhibitors or sirolimus with anti-CD40L mAb on alloengraftment under nonmyeloablative conditions. , 2002, Blood.

[4]  L. Kruglyak Prospects for whole-genome linkage disequilibrium mapping of common disease genes , 1999, Nature Genetics.

[5]  Peter Donnelly,et al.  A comparison of bayesian methods for haplotype reconstruction from population genotype data. , 2003, American journal of human genetics.

[6]  M. Rieder,et al.  Sequence variation in the human angiotensin converting enzyme , 1999, Nature Genetics.

[7]  K. Tokunaga,et al.  CD36 polymorphism is associated with protection from cerebral malaria. , 2003, American journal of human genetics.

[8]  N. Risch Searching for genetic determinants in the new millennium , 2000, Nature.

[9]  W S Watkins,et al.  The distribution of human genetic diversity: a comparison of mitochondrial, autosomal, and Y-chromosome data. , 2000, American journal of human genetics.

[10]  L. Cardon,et al.  Association study designs for complex diseases , 2001, Nature Reviews Genetics.

[11]  Timothy B. Stockwell,et al.  The Sequence of the Human Genome , 2001, Science.

[12]  E. Lander,et al.  Meta-analysis of genetic association studies supports a contribution of common variants to susceptibility to common disease , 2003, Nature Genetics.

[13]  D. Hartl,et al.  Principles of population genetics , 1981 .

[14]  J. Kere,et al.  Microsatellite diversity and the demographic history of modern humans. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[15]  Yusuke Nakamura,et al.  Gene-based SNP discovery as part of the Japanese Millennium Genome Project: identification of 190 562 genetic variations in the human genome , 2002, Journal of Human Genetics.

[16]  N. Shen,et al.  Patterns of single-nucleotide polymorphisms in candidate genes for blood-pressure homeostasis , 1999, Nature Genetics.

[17]  A. Jeffreys,et al.  Intensely punctate meiotic recombination in the class II region of the major histocompatibility complex , 2001, Nature Genetics.

[18]  J. Stephens,et al.  Haplotype Variation and Linkage Disequilibrium in 313 Human Genes , 2001, Science.

[19]  L. Jorde,et al.  Nucleotide diversity and haplotype structure of the human angiotensinogen gene in two populations. , 2002, American journal of human genetics.

[20]  P. Donnelly,et al.  A new statistical method for haplotype reconstruction from population data. , 2001, American journal of human genetics.

[21]  W. Ewens THE MAINTENANCE OF ALLELES BY MUTATION. , 1964, Genetics.

[22]  Francis S. Collins,et al.  Variations on a Theme: Cataloging Human DNA Sequence Variation , 1997, Science.

[23]  Kun Zhang,et al.  HaploBlockFinder: Haplotype Block Analyses , 2003, Bioinform..

[24]  L Tiret,et al.  Sequence diversity in 36 candidate genes for cardiovascular disorders. , 1999, American journal of human genetics.

[25]  S. Chanock,et al.  Variants in the VCAM1 gene and risk for symptomatic stroke in sickle cell disease. , 2002, Blood.

[26]  J. Akey,et al.  Distribution of recombination crossovers and the origin of haplotype blocks: the interplay of population history, recombination, and mutation. , 2002, American journal of human genetics.

[27]  Momiao Xiong,et al.  Randomly distributed crossovers may generate block-like patterns of linkage disequilibrium: an act of genetic drift , 2003, Human Genetics.

[28]  Julie A Schneider,et al.  DNA variability of human genes , 2003, Mechanisms of Ageing and Development.

[29]  F. Collins,et al.  A vision for the future of genomics research , 2003, Nature.

[30]  D. Botstein,et al.  Discovering genotypes underlying human phenotypes: past successes for mendelian disease, future approaches for complex disease , 2003, Nature Genetics.

[31]  B. J. Carey,et al.  Chromosome-wide distribution of haplotype blocks and the role of recombination hot spots , 2003, Nature Genetics.

[32]  Lon R. Cardon,et al.  Efficient selective screening of haplotype tag SNPs , 2003, Bioinform..

[33]  N Risch,et al.  The Future of Genetic Studies of Complex Human Diseases , 1996, Science.

[34]  D. Trégouët,et al.  Heterogeneity of linkage disequilibrium in human genes has implications for association studies of common diseases. , 2002, Human molecular genetics.

[35]  A. Jeffreys,et al.  Recombination hotspots rather than population history dominate linkage disequilibrium in the MHC class II region. , 2003, Human molecular genetics.

[36]  J. Schneider,et al.  Direct measurement of the male recombination fraction in the human beta-globin hot spot. , 2002, Human molecular genetics.

[37]  E. Boerwinkle,et al.  Cladistic structure within the human Lipoprotein lipase gene and its implications for phenotypic association studies. , 2000, Genetics.

[38]  M. Daly,et al.  A map of human genome sequence variation containing 1.42 million single nucleotide polymorphisms , 2001, Nature.

[39]  Deborah A. Nickerson,et al.  Additional SNPs and linkage-disequilibrium analyses are necessary for whole-genome association studies in humans , 2003, Nature Genetics.

[40]  M. Daly,et al.  High-resolution haplotype structure in the human genome , 2001, Nature Genetics.

[41]  R S Judson,et al.  Complex promoter and coding region beta 2-adrenergic receptor haplotypes alter receptor expression and predict in vivo responsiveness. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[42]  Dmitri V. Zaykin,et al.  Effectiveness of computational methods in haplotype prediction , 2002, Human Genetics.

[43]  D. Nickerson,et al.  Variation is the spice of life , 2001, Nature Genetics.

[44]  W S Watkins,et al.  Population genomics: a bridge from evolutionary history to genetic medicine. , 2001, Human molecular genetics.

[45]  A. Clark,et al.  Sequence variation and linkage disequilibrium in the human T-cell receptor beta (TCRB) locus. , 2001, American journal of human genetics.

[46]  S. P. Fodor,et al.  Blocks of Limited Haplotype Diversity Revealed by High-Resolution Scanning of Human Chromosome 21 , 2001, Science.

[47]  J. Ioannidis,et al.  Replication validity of genetic association studies , 2001, Nature Genetics.

[48]  F. Collins,et al.  New goals for the U.S. Human Genome Project: 1998-2003. , 1998, Science.

[49]  S. Gabriel,et al.  Quality and completeness of SNP databases , 2003, Nature Genetics.

[50]  Frank Dudbridge,et al.  Haplotype tagging for the identification of common disease genes , 2001, Nature Genetics.

[51]  R. Griffiths,et al.  Archaic African and Asian lineages in the genetic ancestry of modern humans. , 1997, American journal of human genetics.

[52]  S. Gabriel,et al.  The Structure of Haplotype Blocks in the Human Genome , 2002, Science.

[53]  J. Wall,et al.  Haplotype blocks and linkage disequilibrium in the human genome , 2003, Nature Reviews Genetics.

[54]  Lon R. Cardon,et al.  A first-generation linkage disequilibrium map of human chromosome 22 , 2002, Nature.

[55]  Pierre Puget,et al.  Decoding the literature on genetic variation , 2003, Nature Biotechnology.

[56]  Marek Kimmel,et al.  Haplotype and linkage disequilibrium architecture for human cancer-associated genes. , 2002, Genome research.

[57]  A. Chakravarti,et al.  Haplotype inference in random population samples. , 2002, American journal of human genetics.

[58]  M. Stephens,et al.  Modeling linkage disequilibrium and identifying recombination hotspots using single-nucleotide polymorphism data. , 2003, Genetics.