Tomato SNP Discovery by EST Mining and Resequencing

Many economically important crop species are relatively depauparate in genetic diversity (e.g., soybean, peanut, tomato). DNA polymorphism within cultivated tomato has been estimated to be low based on molecular markers. Through mining of more than 148,000 public tomato expressed sequence tags (ESTs) and full-length cDNAs, we identified 764 EST clusters with potential single nucleotide polymorphisms (SNPs) among more than 15 tomato lines. By sequencing regions from 53 of these clusters in two to three lines, we discovered a wealth of nucleotide polymorphism (62 SNPs and 12 indels in 21 Unigenes), resulting in a verification rate of 27.2% (28 of 103 SNPs predicted in EST clusters were verified). We hypothesize that five regions with 1.6–13-fold more diversity relative to other tested regions are associated with introgressions from wild relatives. Identifying polymorphic, expressed genes in the tomato genome will be useful for both tomato improvement and germplasm conservation.

[1]  S. Tanksley,et al.  RFLP analysis of phylogenetic relationships and genetic variation in the genus Lycopersicon , 1990, Theoretical and Applied Genetics.

[2]  V. Poysa,et al.  Development and characterization of simple sequence repeat (SSR) markers and their use in determining relationships among Lycopersicon esculentum cultivars , 2002, Theoretical and Applied Genetics.

[3]  S. Rustgi,et al.  Molecular markers from the transcribed/expressed region of the genome in higher plants , 2004, Functional & Integrative Genomics.

[4]  S. Tanksley,et al.  Yield and quality evaluations on a pair of processing tomato lines nearly isogenic for the Tm2a gene for resistance to the tobacco mosaic virus , 2004, Euphytica.

[5]  M. A. Stevens,et al.  Genetics and breeding , 1986 .

[6]  Mark H. Wright,et al.  The SOL Genomics Network. A Comparative Resource for Solanaceae Biology and Beyond1 , 2005, Plant Physiology.

[7]  Marek J. Sergot,et al.  SEAN: SNP prediction and display program utilizing EST sequence clusters , 2006, Bioinform..

[8]  M. Nei,et al.  Estimation of average heterozygosity and genetic distance from a small number of individuals. , 1978, Genetics.

[9]  Valentín,et al.  Chapter 2. , 1998, Annals of the ICRP.

[10]  J. Hachey,et al.  Development of sequence characterized DNA markers linked to a dominant verticillium wilt resistance gene in tomato. , 1998, Genome.

[11]  S. Tanksley,et al.  The I2C family from the wilt disease resistance locus I2 belongs to the nucleotide binding, leucine-rich repeat superfamily of plant resistance genes. , 1997, The Plant cell.

[12]  S. Tanksley,et al.  Comparative fine mapping of fruit quality QTLs on chromosome 4 introgressions derived from two wild tomato species , 2004, Euphytica.

[13]  M. West,et al.  Evaluation of AFLPs for germplasm fingerprinting and assessment of genetic diversity in cultivars of tomato (Lycopersicon esculentum L.). , 2004, Genome.

[14]  G. A. Watterson On the number of segregating sites in genetical models without recombination. , 1975, Theoretical population biology.

[15]  Jonathan Pevsner,et al.  Basic Local Alignment Search Tool (BLAST) , 2005 .

[16]  Gregory D. Schuler,et al.  Database resources of the National Center for Biotechnology Information: update , 2004, Nucleic acids research.

[17]  G. Martin,et al.  High density molecular linkage maps of the tomato and potato genomes. , 1992, Genetics.

[18]  A. Kawabe,et al.  DNA polymorphism at the cytosolic phosphoglucose isomerase (PgiC) locus of the wild plant Arabidopsis thaliana. , 2000, Genetics.

[19]  S. Tanksley,et al.  Use of isogenic lines and simultaneous probing to identify DNA markers tightly linked to the tm-2a gene in tomato. , 1988, Genetics.

[20]  M. Ganal,et al.  Construction and testing of a microsatellite database containing more than 500 tomato varieties , 2002, Theoretical and Applied Genetics.

[21]  T. Ideker,et al.  Mining SNPs from EST databases. , 1999, Genome research.

[22]  M. Smulders,et al.  Use of microsatellites to evaluate genetic diversity and species relationships in the genus Lycopersicon , 2001, Theoretical and Applied Genetics.

[23]  T. C. Nesbitt,et al.  Comparative sequencing in the genus Lycopersicon. Implications for the evolution of fruit size in the domestication of cultivated tomatoes. , 2002, Genetics.

[24]  V. Le Corre,et al.  Nucleotide variability at the acetyl coenzyme A carboxylase gene and the signature of herbicide selection in the grass weed Alopecurus myosuroides (Huds.). , 2004, Molecular biology and evolution.

[25]  G. Huttley,et al.  Nucleotide polymorphism in the chalcone synthase‐A locus and evolution of the chalcone synthase multigene family of common morning glory Ipomoea purpurea , 1997 .

[26]  Xavier Messeguer,et al.  DnaSP, DNA polymorphism analyses by the coalescent and other methods , 2003, Bioinform..

[27]  D. Hartl,et al.  Principles of population genetics , 1981 .

[28]  E. Kabelka,et al.  Discovery of single nucleotide polymorphisms in Lycopersicon esculentum by computer aided analysis of expressed sequence tags , 2004 .

[29]  Jody Hey,et al.  Principles of population genetics (2nd edn) , 1989 .

[30]  G. Bonnema,et al.  The Cf-ECP2 gene is linked to, but not part of, the Cf-4/Cf-9 cluster on the short arm of chromosome 1 in tomato , 1999, Molecular and General Genetics MGG.

[31]  D. Labie,et al.  Molecular Evolution , 1991, Nature.

[32]  Jonathan D. G. Jones,et al.  Dispersion of the Cf-4 disease resistance gene in Lycopersicon germplasm , 2000, Heredity.

[33]  B. Picó,et al.  Genetic variability and relationship of closely related Spanish traditional cultivars of tomato as detected by SRAP and SSR markers , 2005 .

[34]  K. Suzuki,et al.  Construction and testing of JT-60 , 1987 .

[35]  A. Rafalski,et al.  Discovery and application of single nucleotide polymorphism markers in plants. , 2001 .

[36]  R. Verkerk,et al.  Mapping strategy for resistance genes in tomato based on RFLPs between cultivars: Cf9 (resistance to Cladosporium fulvum) on chromosome 1 , 1992, Theoretical and Applied Genetics.