Exploring genetic variation in the tomato (Solanum section Lycopersicon) clade by whole-genome sequencing.

We explored genetic variation by sequencing a selection of 84 tomato accessions and related wild species representative of the Lycopersicon, Arcanum, Eriopersicon and Neolycopersicon groups, which has yielded a huge amount of precious data on sequence diversity in the tomato clade. Three new reference genomes were reconstructed to support our comparative genome analyses. Comparative sequence alignment revealed group-, species- and accession-specific polymorphisms, explaining characteristic fruit traits and growth habits in the various cultivars. Using gene models from the annotated Heinz 1706 reference genome, we observed differences in the ratio between non-synonymous and synonymous SNPs (dN/dS) in fruit diversification and plant growth genes compared to a random set of genes, indicating positive selection and differences in selection pressure between crop accessions and wild species. In wild species, the number of single-nucleotide polymorphisms (SNPs) exceeds 10 million, i.e. 20-fold higher than found in most of the crop accessions, indicating dramatic genetic erosion of crop and heirloom tomatoes. In addition, the highest levels of heterozygosity were found for allogamous self-incompatible wild species, while facultative and autogamous self-compatible species display a lower heterozygosity level. Using whole-genome SNP information for maximum-likelihood analysis, we achieved complete tree resolution, whereas maximum-likelihood trees based on SNPs from ten fruit and growth genes show incomplete resolution for the crop accessions, partly due to the effect of heterozygous SNPs. Finally, results suggest that phylogenetic relationships are correlated with habitat, indicating the occurrence of geographical races within these groups, which is of practical importance for Solanum genome evolution studies.

[1]  Frederick Mosteller,et al.  Data Analysis and Regression , 1978 .

[2]  C. Rieder,et al.  Greatwall kinase , 2004, The Journal of cell biology.

[3]  J. Thompson,et al.  CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. , 1994, Nucleic acids research.

[4]  S. Tanksley,et al.  The genetic basis of pear-shaped tomato fruit , 1999, Theoretical and Applied Genetics.

[5]  D. Zamir,et al.  An alternative pathway to beta -carotene formation in plant chromoplasts discovered by map-based cloning of beta and old-gold color mutations in tomato. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[6]  C. Elsik,et al.  Comparative Genomics of Plant Chromosomes , 2000, Plant Cell.

[7]  G. Martin,et al.  Deductions about the Number, Organization, and Evolution of Genes in the Tomato Genome Based on Analysis of a Large Expressed Sequence Tag Collection and Selective Genomic Sequencing Article, publication date, and citation information can be found at www.plantcell.org/cgi/doi/10.1105/tpc.010478. , 2002, The Plant Cell Online.

[8]  S. Knapp Tobacco to tomatoes: a phylogenetic perspective on fruit diversity in the Solanaceae. , 2002, Journal of experimental botany.

[9]  S. Tanksley,et al.  A new class of regulatory genes underlying the cause of pear-shaped tomato fruit , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[10]  T. C. Nesbitt,et al.  Comparative sequencing in the genus Lycopersicon. Implications for the evolution of fruit size in the domestication of cultivated tomatoes. , 2002, Genetics.

[11]  S. Tanksley,et al.  RFLP analysis of phylogenetic relationships and genetic variation in the genus Lycopersicon , 1990, Theoretical and Applied Genetics.

[12]  D. Grierson,et al.  Identification and genetic analysis of normal and mutant phytoene synthase genes of tomato by sequencing, complementation and co-suppression , 1993, Plant Molecular Biology.

[13]  S. Tanksley,et al.  Genetics of actin-related sequences in tomato , 1986, Theoretical and Applied Genetics.

[14]  R. Verkerk,et al.  Mapping strategy for resistance genes in tomato based on RFLPs between cultivars: Cf9 (resistance to Cladosporium fulvum) on chromosome 1 , 1992, Theoretical and Applied Genetics.

[15]  D. Bittel,et al.  Dosage response of rye genes in a wheat background , 1992, Theoretical and Applied Genetics.

[16]  GENEALOGICAL FOOTPRINTS OF SPECIATION PROCESSES IN WILD TOMATOES: DEMOGRAPHY AND EVIDENCE FOR HISTORICAL GENE FLOW , 2005, Evolution; international journal of organic evolution.

[17]  K. Hammer Das Domestikationssyndrom , 1984, Die Kulturpflanze.

[18]  D. Spooner,et al.  Comparison of AFLPs with other markers for phylogenetic inference in wild tomatoes [Solanum L. section Lycopersicon (Mill.) Wettst.] , 2005 .

[19]  D. Spooner,et al.  New Species of Wild Tomatoes (Solanum Section Lycopersicon: Solanaceae) from Northern Peru , 2005 .

[20]  Michael J. Adams,et al.  DPVweb: a comprehensive database of plant and fungal virus genes and genomes , 2005, Nucleic Acids Res..

[21]  Bruce D. Smith,et al.  The Molecular Genetics of Crop Domestication , 2006, Cell.

[22]  Genetic Resources, Chromosome Engineering, and Crop Improvement : Vegetable Crops, Volume 3 , 2006 .

[23]  E. Earle,et al.  Estimation of nuclear DNA content of plants by flow cytometry , 2007, Plant Molecular Biology Reporter.

[24]  E. D. Earle,et al.  Nuclear DNA content of some important plant species , 1991, Plant Molecular Biology Reporter.

[25]  Yuling Bai,et al.  Domestication and Breeding of Tomatoes: What have We Gained and What Can We Gain in the Future? , 2007, Annals of botany.

[26]  J. Panero Systematic botany monographs , 2008, Brittonia.

[27]  D. Spooner,et al.  Taxonomy of wild tomatoes and their relatives (Solanum sect. Lycopersicoides, sect. Juglandifolia, sect. Lycopersicon; Solanaceae). , 2008 .

[28]  L. Moyle Ecological and Evolutionary Genomics in the Wild Tomatoes (Solanum Sect. Lycopersicon) , 2008, Evolution; international journal of organic evolution.

[29]  Cole Trapnell,et al.  Ultrafast and memory-efficient alignment of short DNA sequences to the human genome , 2009, Genome Biology.

[30]  S. Tanksley,et al.  Regulatory change in YABBY-like transcription factor led to evolution of extreme fruit size during tomato domestication , 2008, Nature Genetics.

[31]  Jinghua Xiao,et al.  Single feature polymorphisms between two rice cultivars detected using a median polish method , 2009, Theoretical and Applied Genetics.

[32]  L. Stein,et al.  JBrowse: a next-generation genome browser. , 2009, Genome research.

[33]  Gonçalo R. Abecasis,et al.  The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..

[34]  Richard Durbin,et al.  Sequence analysis Fast and accurate short read alignment with Burrows – Wheeler transform , 2009 .

[35]  R. Mott,et al.  The 1001 Genomes Project for Arabidopsis thaliana , 2009, Genome Biology.

[36]  A. Gnirke,et al.  High-quality draft assemblies of mammalian genomes from massively parallel sequence data , 2010, Proceedings of the National Academy of Sciences.

[37]  L. Anderson,et al.  Structural Differences in Chromosomes Distinguish Species in the Tomato Clade , 2010, Cytogenetic and Genome Research.

[38]  D. Altshuler,et al.  A map of human genome variation from population-scale sequencing , 2010, Nature.

[39]  Richard Durbin,et al.  Fast and accurate long-read alignment with Burrows–Wheeler transform , 2010, Bioinform..

[40]  Paramvir S. Dehal,et al.  FastTree 2 – Approximately Maximum-Likelihood Trees for Large Alignments , 2010, PloS one.

[41]  A. Michel,et al.  Population structure and genetic differentiation associated with breeding history and selection in tomato (Solanum lycopersicum L.) , 2011, Heredity.

[42]  A. Michel,et al.  Distribution of SUN, OVATE, LC, and FAS in the Tomato Germplasm and the Relationship to Fruit Shape Diversity1[C][W][OA] , 2011, Plant Physiology.

[43]  Qiushui He,et al.  SNP-Based Typing: A Useful Tool to Study Bordetella pertussis Populations , 2011, PloS one.

[44]  N. Ranc,et al.  Increase in Tomato Locule Number Is Controlled by Two Single-Nucleotide Polymorphisms Located Near WUSCHEL1[C][W] , 2011, Plant Physiology.

[45]  M. Ercolano,et al.  Solanum sect. Lycopersicon , 2011 .

[46]  M. Nei,et al.  MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. , 2011, Molecular biology and evolution.

[47]  M. Causse,et al.  Genetic Diversity in Tomato (Solanum lycopersicum) and Its Wild Relatives , 2012 .

[48]  John W. Scott,et al.  High-Density SNP Genotyping of Tomato (Solanum lycopersicum L.) Reveals Patterns of Genetic Variation Due to Breeding , 2012, PloS one.

[49]  Hans de Jong,et al.  Chromosome evolution in Solanum traced by cross-species BAC-FISH. , 2012, The New phytologist.

[50]  R. Visser,et al.  Structural homology in the Solanaceae: analysis of genomic regions in support of synteny studies in tomato, potato and pepper. , 2012, The Plant journal : for cell and molecular biology.

[51]  Pablo Cingolani,et al.  © 2012 Landes Bioscience. Do not distribute. , 2022 .

[52]  Kevin R. Thornton,et al.  The Drosophila melanogaster Genetic Reference Panel , 2012, Nature.

[53]  Daniel W. A. Buchan,et al.  The tomato genome sequence provides insights into fleshy fruit evolution , 2012, Nature.

[54]  Anthony M. Bolger,et al.  Comparative transcriptomics reveals patterns of selection in domesticated and wild tomato , 2013, Proceedings of the National Academy of Sciences.

[55]  Oscar Westesson,et al.  Visualizing next-generation sequencing data with JBrowse , 2013, Briefings Bioinform..

[56]  Nilgun Donmez,et al.  SCARPA: scaffolding reads with practical algorithms , 2013, Bioinform..