Recombination and linkage disequilibrium in Arabidopsis thaliana

Linkage disequilibrium (LD) is a major aspect of the organization of genetic variation in natural populations. Here we describe the genome-wide pattern of LD in a sample of 19 Arabidopsis thaliana accessions using 341,602 non-singleton SNPs. LD decays within 10 kb on average, considerably faster than previously estimated. Tag SNP selection algorithms and 'hide-the-SNP' simulations suggest that genome-wide association mapping will require only 40%–50% of the observed SNPs, a reduction similar to estimates in a sample of African Americans. An Affymetrix genotyping array containing 250,000 SNPs has been designed based on these results; we demonstrate that it should have more than adequate coverage for genome-wide association mapping. The extent of LD is highly variable, and we find clear evidence of recombination hotspots, which seem to occur preferentially in intergenic regions. LD also reflects the action of selection, and it is more extensive between nonsynonymous polymorphisms than between synonymous polymorphisms.
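
As an illustration of the kind of calculation behind tag SNP selection, the following minimal Python sketch computes the pairwise LD statistic r² and then picks tags greedily. It is not the procedure used in the study. It assumes each accession is inbred and can be coded as a single haplotype of 0/1 alleles. The function names (`r_squared`, `greedy_tags`), the r² threshold of 0.8, and the toy data are all hypothetical choices made for this example.

```python
def r_squared(a, b):
    """r^2 between two biallelic SNPs, each a list of 0/1 alleles per accession."""
    n = len(a)
    pa = sum(a) / n                                   # allele frequency at SNP a
    pb = sum(b) / n                                   # allele frequency at SNP b
    pab = sum(x * y for x, y in zip(a, b)) / n        # frequency of the 1-1 haplotype
    d = pab - pa * pb                                 # disequilibrium coefficient D
    denom = pa * (1 - pa) * pb * (1 - pb)
    return d * d / denom if denom > 0 else 0.0        # monomorphic SNPs give 0


def greedy_tags(snps, threshold=0.8):
    """Greedily pick tag SNPs so every SNP has r^2 >= threshold with some tag."""
    m = len(snps)
    # covers[i][j] is True when SNP i would serve as a tag for SNP j.
    covers = [
        [i == j or r_squared(snps[i], snps[j]) >= threshold for j in range(m)]
        for i in range(m)
    ]
    untagged = set(range(m))
    tags = []
    while untagged:
        # Choose the untagged SNP that covers the most still-untagged SNPs;
        # ties go to the lowest index.
        best = max(sorted(untagged),
                   key=lambda i: sum(covers[i][j] for j in untagged))
        tags.append(best)
        untagged -= {j for j in untagged if covers[best][j]}
    return tags


# Toy data (assumed): 4 SNPs typed in 6 accessions.
toy_snps = [
    [0, 0, 0, 1, 1, 1],
    [0, 0, 0, 1, 1, 1],   # identical to SNP 0
    [1, 1, 1, 0, 0, 0],   # mirror image of SNP 0, so r^2 is still 1
    [0, 1, 0, 1, 0, 1],   # only weakly correlated with the others
]
tags = greedy_tags(toy_snps)
print(tags)  # [0, 3] for this toy input
```

In this toy input, two of the four SNPs are enough to tag the rest. The fraction needed in real data depends on the sample, the threshold, and the local extent of LD.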
