Development and application of Single Primer Enrichment Technology (SPET) SNP assay for population genomics analysis and candidate gene discovery in lettuce.

Single primer enrichment technology (SPET) is a novel high-throughput genotyping method based on short-read sequencing of specific genomic regions harboring polymorphisms. SPET provides an efficient and reproducible method for genotyping target loci, overcoming the limits associated with other reduced representation library sequencing methods that are based on a random sampling of genomic loci. The possibility to sequence regions surrounding a target SNP allows the discovery of thousands of closely linked, novel SNPs. In this work, we report the design and application of the first SPET panel in lettuce, consisting of 41,547 probes spanning the whole genome and designed to target both coding (~96%) and intergenic (~4%) regions. A total of 81,531 SNPs were surveyed in 160 lettuce accessions originating from a total of 10 countries in Europe, America, and Asia and representing 10 horticultural types. Model ancestry population structure clearly separated the cultivated accessions (Lactuca sativa) from accessions of its presumed wild progenitor (L. serriola), revealing a total of six genetic subgroups that reflected a differentiation based on cultivar typology. Phylogenetic relationships and principal component analysis revealed a clustering of butterhead types and a general differentiation between germplasm originating from Western and Eastern Europe. To determine the potentiality of SPET for gene discovery, we performed genome-wide association analysis for main agricultural traits in L. sativa using six models (GLM naive, MLM, MLMM, CMLM, FarmCPU, and BLINK) to compare their strength and power for association detection. Robust associations were detected for seed color on chromosome 7 at 50 Mbp. Colocalization of association signals was found for outer leaf color and leaf anthocyanin content on chromosome 9 at 152 Mbp and on chromosome 5 at 86 Mbp. The association for bolting time was detected with the GLM, BLINK, and FarmCPU models on chromosome 7 at 164 Mbp. Associations were detected in chromosomal regions previously reported to harbor candidate genes for these traits, thus confirming the effectiveness of SPET for GWAS. Our findings illustrated the strength of SPET for discovering thousands of variable sites toward the dissection of the genomic diversity of germplasm collections, thus allowing a better characterization of lettuce collections.

[1]  P. Tripodi Next generation sequencing technologies to explore the diversity of germplasm resources: Achievements and trends in tomato , 2022, Computational and structural biotechnology journal.

[2]  D. Bassi,et al.  Genetic dissection of fruit maturity date in apricot (P. armeniaca L.) through a Single Primer Enrichment Technology (SPET) approach , 2022, BMC Genomics.

[3]  R. Kalendar,et al.  MLO Proteins from Tomato (Solanum lycopersicum L.) and Related Species in the Broad Phylogenetic Context , 2022, Plants.

[4]  J. Park,et al.  Genome-wide core sets of SNP markers and Fluidigm assays for rapid and effective genotypic identification of Korean cultivars of lettuce (Lactuca sativa L.) , 2022, Horticulture research.

[5]  Qinsong Yang,et al.  Molecular and metabolic insights into anthocyanin biosynthesis during leaf coloration in autumn , 2021 .

[6]  C. Fankhauser,et al.  Phototropin-mediated perception of light direction in leaves regulates blade flattening , 2021, Plant physiology.

[7]  P. C. Sharma,et al.  Genome-wide association mapping reveals key genomic regions for physiological and yield-related traits under salinity stress in wheat (Triticum aestivum L.). , 2021, Genomics.

[8]  H. Park,et al.  QTL Analysis of Stem Elongation and Flowering Time in Lettuce Using Genotyping-by-Sequencing , 2021, Genes.

[9]  Yong He,et al.  Advanced high-throughput plant phenotyping techniques for genome-wide association studies: A review , 2021, Journal of advanced research.

[10]  Ivan Simko,et al.  Mapping and identification of genetic loci affecting earliness of bolting and flowering in lettuce , 2021, Theoretical and Applied Genetics.

[11]  Huanming Yang,et al.  Whole-genome resequencing of 445 Lactuca accessions reveals the domestication history of cultivated lettuce , 2021, Nature Genetics.

[12]  R. Michelmore,et al.  A Composite Analysis of Flowering Time Regulation in Lettuce , 2021, Frontiers in Plant Science.

[13]  Sunchung Park,et al.  Population genetics and genome‐wide association studies provide insights into the influence of selective breeding on genetic variation in lettuce , 2021, The plant genome.

[14]  Zhiwu Zhang,et al.  GAPIT Version 3: Boosting Power and Accuracy for Genomic Association and Prediction , 2020, bioRxiv.

[15]  H. Matsumura,et al.  Identification of two QTLs for resistance to Fusarium wilt race 1 in lettuce (Lactuca sativa L.) , 2020, Euphytica.

[16]  J. Thelen,et al.  Label-Free Quantitative Phosphoproteomics Reveals Signaling Dynamics Involved in Embryogenic Competence Acquisition in Sugarcane. , 2020, Journal of proteome research.

[17]  Javier Herrero,et al.  Construction of a high density linkage map in Oil Palm using SPET markers , 2020, Scientific Reports.

[18]  Xiaojing Ma,et al.  Transcription Factors FHY3 and FAR1 Regulate Light-Induced CIRCADIAN CLOCK ASSOCIATED1 Gene Expression in Arabidopsis , 2020, Plant Cell.

[19]  Ivan Simko,et al.  Genome-wide association mapping reveals loci for shelf life and developmental rate of lettuce , 2020, Theoretical and Applied Genetics.

[20]  Fei Gao,et al.  CNSA: a data repository for archiving omics data , 2020, bioRxiv.

[21]  Qinghua Zhang,et al.  Bin-based genome-wide association analyses improve power and resolution in QTL mapping and identify favorable alleles from multiple parents in a four-way MAGIC rice population , 2019, Theoretical and Applied Genetics.

[22]  R. Irizarry ggplot2 , 2019, Introduction to Data Science.

[23]  F. Wendt,et al.  Identity informative SNP associations in the UK Biobank. , 2019, Forensic science international. Genetics.

[24]  R. Michelmore,et al.  Characterization of four polymorphic genes controlling red leaf colour in lettuce that have undergone disruptive selection since domestication , 2019, Plant biotechnology journal.

[25]  G. Aprea,et al.  Single Primer Enrichment Technology (SPET) for High-Throughput Genotyping in Tomato and Eggplant Germplasm , 2019, Front. Plant Sci..

[26]  M. Morgante,et al.  Single primer enrichment technology as a tool for massive genotyping: a benchmark on black poplar and maize. , 2019, Annals of botany.

[27]  Zhiwu Zhang,et al.  BLINK: a package for the next level of genome-wide association studies with both individuals and markers in the millions , 2018, GigaScience.

[28]  Gang Li,et al.  FAR1-RELATED SEQUENCE (FRS) and FRS-RELATED FACTOR (FRF) Family Proteins in Arabidopsis Growth and Development , 2018, Front. Plant Sci..

[29]  Sudhir Kumar,et al.  MEGA X: Molecular Evolutionary Genetics Analysis across Computing Platforms. , 2018, Molecular biology and evolution.

[30]  Jianping Wang,et al.  Development and Applications of a High Throughput Genotyping Tool for Polyploid Crops: Single Nucleotide Polymorphism (SNP) Array , 2018, Front. Plant Sci..

[31]  R. Michelmore,et al.  RNA sequencing provides insights into the evolution of lettuce and the regulation of flavonoid biosynthesis , 2017, Nature Communications.

[32]  Xun Xu,et al.  Genome assembly with in vitro proximity ligation data and whole-genome triplication in lettuce , 2017, Nature Communications.

[33]  Keiichi Mochida,et al.  Exploring Genetic Diversity in Plants Using High-Throughput Sequencing Techniques , 2016, Current genomics.

[34]  Moo Jung Kim,et al.  Nutritional value, bioactive compounds and health benefits of lettuce (Lactuca sativa L.) , 2016 .

[35]  Safina Khan,et al.  FHY3 and FAR1 Act Downstream of Light Stable Phytochromes , 2016, Front. Plant Sci..

[36]  Zhiwu Zhang,et al.  Iterative Usage of Fixed and Random Effect Models for Powerful and Efficient Genome-Wide Association Studies , 2016, PLoS genetics.

[37]  A. Allan,et al.  Transcriptome analysis and transient transformation suggest an ancient duplicated MYB transcription factor as a candidate gene for leaf red coloration in peach , 2014, BMC Plant Biology.

[38]  T. V. van Hintum,et al.  Next-generation genebanking: plant genetic resources management and utilization in the sequencing era , 2014 .

[39]  E. Pante,et al.  Use of RAD sequencing for delimiting species , 2014, Heredity.

[40]  Yibo Dong,et al.  Genotyping-By-Sequencing for Plant Genetic Diversity Analysis: A Lab Guide for SNP Genotyping , 2014 .

[41]  Edward S. Buckler,et al.  A SUPER Powerful Method for Genome Wide Association Study , 2014, PloS one.

[42]  Xuehui Huang,et al.  Natural variations and genome-wide association studies in crop plants. , 2014, Annual review of plant biology.

[43]  Soon-Jae Kwon,et al.  Association analysis of bacterial leaf spot resistance and SNP markers derived from expressed sequence tags (ESTs) in lettuce (Lactuca sativa L.) , 2014, Molecular Breeding.

[44]  M. Morgante,et al.  An Extensive Evaluation of Read Trimming Effects on Illumina NGS Data Analysis , 2013, PloS one.

[45]  Soon-Jae Kwon,et al.  Genome-wide association of 10 horticultural traits with expressed sequence tag-derived SNP markers in a collection of lettuce lines , 2013, The Crop Journal.

[46]  K. Miyawaki,et al.  Phototropin 2 is involved in blue light-induced anthocyanin accumulation in Fragaria x ananassa fruits , 2013, Journal of Plant Research.

[47]  C. Loley,et al.  A unifying framework for robust association testing, estimation, and genetic model selection using the generalized linear model , 2013, European Journal of Human Genetics.

[48]  Ivan Simko,et al.  Development of genomic SSR markers for fingerprinting lettuce (Lactuca sativa L.) cultivars and mapping genes , 2013, BMC Plant Biology.

[49]  Trevor W. Rife,et al.  Genotyping‐by‐Sequencing for Plant Breeding and Genetics , 2012 .

[50]  S. Deschamps,et al.  Genotyping-by-Sequencing in Plants , 2012, Biology.

[51]  U. Lohwasser,et al.  Genetic resources collections of leafy vegetables (lettuce, spinach, chicory, artichoke, asparagus, lamb’s lettuce, rhubarb and rocket salad): composition and gaps , 2012, Genetic Resources and Crop Evolution.

[52]  Bjarni J. Vilhjálmsson,et al.  An efficient multi-locus mixed model approach for genome-wide association studies in structured populations , 2012, Nature Genetics.

[53]  R. Michelmore,et al.  Development and application of a 6.5 million feature Affymetrix Genechip® for massively parallel discovery of single position polymorphisms in lettuce (Lactuca spp.) , 2012, BMC Genomics.

[54]  Gonçalo R. Abecasis,et al.  The variant call format and VCFtools , 2011, Bioinform..

[55]  Marcel Martin Cutadapt removes adapter sequences from high-throughput sequencing reads , 2011 .

[56]  M. DePristo,et al.  A framework for variation discovery and genotyping using next-generation DNA sequencing data , 2011, Nature Genetics.

[57]  Zhiwu Zhang,et al.  Mixed linear model approach adapted for genome-wide association studies , 2010, Nature Genetics.

[58]  T. V. van Hintum,et al.  Comparison of anonymous and targeted molecular markers for the estimation of genetic diversity in ex situ conserved Lactuca , 2009, TAG. Theoretical and applied genetics. Theoretische und angewandte Genetik.

[59]  Ivan Simko Development of EST-SSR markers for the study of population structure in lettuce (Lactuca sativa L.). , 2009, The Journal of heredity.

[60]  Robert D Schnabel,et al.  SNP discovery and allele frequency estimation by deep sequencing of reduced representation libraries , 2008, Nature Methods.

[61]  Edward S. Buckler,et al.  TASSEL: software for association mapping of complex traits in diverse samples , 2007, Bioinform..

[62]  C. Honda,et al.  Isolation and functional analysis of a MYB transcription factor gene that is a key regulator for the development of red coloration in apple skin. , 2007, Plant & cell physiology.

[63]  Michael Riefler,et al.  Structure and function of cytokinin oxidase/dehydrogenase genes of maize, rice, Arabidopsis and other species , 2003, Journal of Plant Research.

[64]  J. Memelink,et al.  Transcription factors controlling plant secondary metabolism: what regulates the regulators? , 2002, Phytochemistry.

[65]  S. Shete,et al.  On estimating the heterozygosity and polymorphism information content value. , 2000, Theoretical population biology.

[66]  Pablo Cingolani Variant Annotation and Functional Prediction: SnpEff. , 2022, Methods in molecular biology.

[67]  Andrew H. Paterson,et al.  Application of genotyping by sequencing technology to a variety of crop breeding programs. , 2016, Plant science : an international journal of experimental plant biology.

[68]  Claude-Alain H. Roten,et al.  Fast and accurate short read alignment with Burrows–Wheeler transform , 2009, Bioinform..