The USDA cucumber (Cucumis sativus L.) collection: genetic diversity, population structure, genome-wide association studies, and core collection development

Germplasm collections are a crucial resource to conserve natural genetic diversity and provide a source of novel traits essential for sustained crop improvement. Optimal collection, preservation and utilization of these materials depends upon knowledge of the genetic variation present within the collection. Here we use the high-throughput genotyping-by-sequencing (GBS) technology to characterize the United States National Plant Germplasm System (NPGS) collection of cucumber (Cucumis sativus L.). The GBS data, derived from 1234 cucumber accessions, provided more than 23 K high-quality single-nucleotide polymorphisms (SNPs) that are well distributed at high density in the genome (~1 SNP/10.6 kb). The SNP markers were used to characterize genetic diversity, population structure, phylogenetic relationships, linkage disequilibrium, and population differentiation of the NPGS cucumber collection. These results, providing detailed genetic analysis of the U.S. cucumber collection, complement NPGS descriptive information regarding geographic origin and phenotypic characterization. We also identified genome regions significantly associated with 13 horticulturally important traits through genome-wide association studies (GWAS). Finally, we developed a molecularly informed, publicly accessible core collection of 395 accessions that represents at least 96% of the genetic variation present in the NPGS. Collectively, the information obtained from the GBS data enabled deep insight into the diversity present and genetic relationships among accessions within the collection, and will provide a valuable resource for genetic analyses, gene discovery, crop improvement, and germplasm preservation.Cucumber: Core collection for future crop improvementGenetic analysis of the US cucumber collection provides valuable insights into the plant’s diversity and generates a core resource for future research. Breeding crop plants requires in-depth understanding of plant genetics and the identification of key genes that enhance traits such as disease and stress resistance. Zhangjun Fei at the Boyce Thompson Institute in Ithaca, Rebecca Grumet at Michigan State University, and co-workers across the US used high-throughput genetic sequencing to characterize the 1,234 cucumber accessions held by the US National Plant Germplasm System. The team uncovered 23,000 specific genome variations that revealed the genetic diversity, population structure and genetic differentiation of the cucumber. They identified genome regions associated with 13 traits important to cultivation and selected 395 accessions covering 96 percent of the collection’s genetic variation to make a publicly accessible core of data.

[1]  Todd C. Wehner,et al.  QTL mapping of downy and powdery mildew resistances in PI 197088 cucumber with genotyping-by-sequencing in RIL population , 2018, Theoretical and Applied Genetics.

[2]  Xueyong Zhang,et al.  Genomics-based plant germplasm research (GPGR) , 2017 .

[3]  Ying Li,et al.  A genomic variation map provides insights into the genetic basis of cucumber domestication and diversity , 2013, Nature Genetics.

[4]  P. Simon,et al.  Early orange mass 400, early orange mass 402, and late orange mass 404 : High-carotene cucumber germplasm , 1997 .

[5]  J. Staub,et al.  Genetic variation in cucumber (Cucumis sativus L.) as assessed by random amplified polymorphic DNA1 , 1999, Genetic Resources and Crop Evolution.

[6]  J. Staub,et al.  Genetic mapping and QTL analysis of horticultural traits in cucumber (Cucumis sativus L.) using recombinant inbred lines , 2003, Theoretical and Applied Genetics.

[7]  J. Staub,et al.  Genetic diversity in cucumber (Cucumis sativus L.): I. A reevaluation of the U.S. germplasm collection , 1996, Genetic Resources and Crop Evolution.

[8]  M. Stephens,et al.  Inference of population structure using multilocus genotype data: linked loci and correlated allele frequencies. , 2003, Genetics.

[9]  J. Garcia-Mas,et al.  Cucurbit Genetics and Genomics: A Look to the Future , 2017 .

[10]  T. Wehner,et al.  STAYGREEN, STAY HEALTHY: a loss-of-susceptibility mutation in the STAYGREEN gene provides durable, broad-spectrum disease resistances for over 50 years of US cucumber production. , 2018, The New phytologist.

[11]  X. Gu,et al.  Genetic Analysis and QTL Mapping of Resistance to Gummy Stem Blight in Cucumis sativus Seedling Stage. , 2017, Plant disease.

[12]  R. Tyagi,et al.  Revised genebank standards for management of plant genetic resources , 2015 .

[13]  Zhonghua Zhang,et al.  Genetic Diversity and Population Structure of Cucumber (Cucumis sativus L.) , 2012, PloS one.

[14]  Yong-Bi Fu,et al.  The Vulnerability of Plant Genetic Resources Conserved Ex Situ , 2017 .

[15]  T. Wehner,et al.  Anthracnose resistance of the cucumber germplasm collection in North Carolina field tests. , 1995 .

[16]  Dawei Li,et al.  Next-generation sequencing, FISH mapping and synteny-based modeling reveal mechanisms of decreasing dysploidy in Cucumis. , 2014, The Plant journal : for cell and molecular biology.

[17]  QTL mapping for downy mildew resistance in cucumber inbred line WI7120 (PI 330628) , 2016, Theoretical and Applied Genetics.

[18]  T. Wehner NC-42 and NC-43: Root-Knot Nematode-Resistant Cucumber , 1996 .

[19]  M. Colle,et al.  Screening the Cucumber Plant Introduction Collection for Young Fruit Resistance to Phytophthora capsici , 2014 .

[20]  T. Wehner,et al.  Root-knot Nematode Resistance in Cucumber and Horned Cucumber , 1993 .

[21]  S. Renner,et al.  Phylogenetics of Cucumis (Cucurbitaceae): Cucumber (C. sativus) belongs in an Asian/Australian clade far from melon (C. melo) , 2007, BMC Evolutionary Biology.

[22]  T. Wehner,et al.  Evaluation of the U.S. cucumber germplasm collection for root size using a subjective rating technique , 2004, Euphytica.

[23]  M. Nee,et al.  Biosystematic Monograph of the Genus Cucumis (Cucurbitaceae)-Botanical Identification of Cucumbers and Melons. , 1994 .

[24]  T. Whitaker,et al.  Cucurbits, Botany, Cultivation, and Utilization , 1962 .

[25]  S. Renner,et al.  Cucumber (Cucumis sativus) and melon (C. melo) have numerous wild relatives in Asia and Australia, and the sister species of melon is from Australia , 2010, Proceedings of the National Academy of Sciences.

[26]  Manuel A. R. Ferreira,et al.  PLINK: a tool set for whole-genome association and population-based linkage analyses. , 2007, American journal of human genetics.

[27]  Zhongyuan Hu,et al.  A re-sequencing-based ultra-dense genetic map reveals a gummy stem blight resistance-associated gene in Cucumis melo , 2017, DNA research : an international journal for rapid publication of reports on genes and genomes.

[28]  A. Brown,et al.  Core collections: a practical approach to genetic resources management , 1989 .

[29]  T. Wehner,et al.  Screening the Cucumber Germplasm Collection for Fruit Yield and Quality , 2002 .

[30]  Y. Weng,et al.  Molecular mapping reveals structural rearrangements and quantitative trait loci underlying traits with local adaptation in semi-wild Xishuangbanna cucumber (Cucumis sativus L. var. xishuangbannanesis Qi et Yuan) , 2014, Theoretical and Applied Genetics.

[31]  H. Paris,et al.  Occidental diffusion of cucumber (Cucumis sativus) 500-1300 CE: two routes to Europe. , 2012, Annals of botany.

[32]  T. Wehner,et al.  Downy Mildew Resistance of the Cucumber Germplasm Collection in North Carolina Field Tests , 1997 .

[33]  Jiming Jiang,et al.  Chromosome rearrangements during domestication of cucumber as revealed by high-density genetic mapping and draft genome assembly. , 2012, The Plant journal : for cell and molecular biology.

[34]  Tae-Ho Lee,et al.  SNPhylo: a pipeline to construct a phylogenetic tree from huge SNP data , 2014, BMC Genomics.

[35]  Robert J. Elshire,et al.  TASSEL-GBS: A High Capacity Genotyping by Sequencing Analysis Pipeline , 2014, PloS one.

[36]  K. Barker,et al.  Resistance to Root-knot Nematodes in Cucumber and Horned Cucumber. , 1991, Journal of nematology.

[37]  J. Staub,et al.  Gone Global: Familiar and Exotic Cucurbits Have Asian Origins , 2013 .

[38]  J. Staub,et al.  The Formation of Test Arrays and a Core Collection in Cucumber Using Phenotypic and Molecular Marker Data , 2002 .

[39]  Stacey S. Cherny,et al.  Evaluating the effective numbers of independent tests and significant p-value thresholds in commercial genotyping arrays and public imputation reference datasets , 2011, Human Genetics.

[40]  Ying Liu,et al.  FaST linear mixed models for genome-wide association studies , 2011, Nature Methods.

[41]  T. Wehner,et al.  Resistance of Cucumber Cultivars to a New Strain of Cucurbit Downy Mildew , 2012 .

[42]  T. Walters Historical overview on domesticated plants in China with special emphasis on the Cucurbitaceae , 1989, Economic Botany.

[43]  David K. Smith,et al.  ggtree: an r package for visualization and annotation of phylogenetic trees with their covariates and other associated data , 2017 .

[44]  L. Davies,et al.  Who is sowing our seeds? A systematic review of the use of plant genetic resources in research , 2017, Genetic Resources and Crop Evolution.

[45]  Y. Weng,et al.  QTL mapping of domestication and diversifying selection related traits in round-fruited semi-wild Xishuangbanna cucumber (Cucumis sativus L. var. xishuangbannanesis) , 2017, Theoretical and Applied Genetics.

[46]  J. Staub,et al.  Genetic diversity in cucumber (cucumis sativus L.): Iii. An evaluation of Indian germplasm , 1997, Genetic Resources and Crop Evolution.

[47]  Namshin Kim,et al.  GenoCore: A simple and fast algorithm for core subset selection from large genotype datasets , 2017, PloS one.

[48]  Gonçalo R. Abecasis,et al.  The variant call format and VCFtools , 2011, Bioinform..

[49]  Ashutosh Kumar Singh,et al.  Genetic diversity in Indian cucumber based on microsatellite and morphological markers , 2013 .

[50]  Robert J. Elshire,et al.  A Robust, Simple Genotyping-by-Sequencing (GBS) Approach for High Diversity Species , 2011, PloS one.

[51]  T. Wehner,et al.  Genetic Resources of Cucumber , 2016 .

[52]  T. Wehner,et al.  Screening the cucumber germplasm collection for resistance to gummy stem blight in North Carolina field tests. , 2000 .

[53]  X. Gu,et al.  Inheritance and QTL mapping of resistance to gummy stem blight in cucumber stem , 2017, Molecular Breeding.

[54]  Richard Durbin,et al.  Sequence analysis Fast and accurate short read alignment with Burrows – Wheeler transform , 2009 .

[55]  T. Wehner,et al.  Field and Detached-fruit Screening Tests for Resistance to Belly Rot in Cucumber , 2004 .

[56]  K. Lamour,et al.  A Detached Cucumber Fruit Method to Screen for Resistance to Phytophthora capsici and Effect of Fruit Age on Susceptibility to Infection. , 2006, Plant disease.

[57]  Y. Weng,et al.  STAYGREEN (CsSGR) is a candidate for the anthracnose (Colletotrichum orbiculare) resistance locus cla in Gy14 cucumber , 2018, Theoretical and Applied Genetics.

[58]  J. Staub,et al.  Genetic diversity in cucumber (Cucumis sativus L.): IV. An evaluation of Chinese germplasm1 , 1999, Genetic Resources and Crop Evolution.

[59]  K. Ando,et al.  Evaluation of Altered Cucumber Plant Architecture as a Means to Reduce Phytophthora capsici Disease Incidence on Cucumber Fruit , 2006 .

[60]  Matthew R. Pocock,et al.  The Bioperl toolkit: Perl modules for the life sciences. , 2002, Genome research.

[61]  R. Grumet,et al.  Aboveground Screening for Genotypic Differences in Cucumber Root Growth in the Greenhouse and Field , 1992 .

[62]  Qifa Zhang,et al.  Genome-wide association studies of 14 agronomic traits in rice landraces , 2010, Nature Genetics.

[63]  Kathleen R. Reitsma,et al.  Powdery Mildew Resistance in the U.S. National Plant Germplasm System Cucumber Collection , 2005 .