Common Sequence Polymorphisms Shaping Genetic Diversity in Arabidopsis thaliana

The genomes of individuals from the same species vary in sequence as a result of different evolutionary processes. To examine the patterns of, and the forces shaping, sequence variation in Arabidopsis thaliana, we performed high-density array resequencing of 20 diverse strains (accessions). More than 1 million nonredundant single-nucleotide polymorphisms (SNPs) were identified at moderate false discovery rates (FDRs), and ∼4% of the genome was identified as being highly dissimilar or deleted relative to the reference genome sequence. Patterns of polymorphism are highly nonrandom among gene families, with genes mediating interaction with the biotic environment having exceptional polymorphism levels. At the chromosomal scale, regional variation in polymorphism was readily apparent. A scan for recent selective sweeps revealed several candidate regions, including a notable example in which almost all variation was removed in a 500-kilobase window. Analyzing the polymorphisms we describe in larger sets of accessions will enable a detailed understanding of forces shaping population-wide sequence variation in A. thaliana.
