Development and Evaluation of SoySNP50K, a High-Density Genotyping Array for Soybean

The objective of this research was to identify single nucleotide polymorphisms (SNPs) and to develop an Illumina Infinium BeadChip that contained over 50,000 SNPs from soybean (Glycine max L. Merr.). A total of 498,921,777 reads 35–45bp in length were obtained from DNA sequence analysis of reduced representation libraries from several soybean accessions which included six cultivated and two wild soybean (G. soja Sieb. et Zucc.) genotypes. These reads were mapped to the soybean whole genome sequence and 209,903 SNPs were identified. After applying several filters, a total of 146,161 of the 209,903 SNPs were determined to be ideal candidates for Illumina Infinium II BeadChip design. To equalize the distance between selected SNPs, increase assay success rate, and minimize the number of SNPs with low minor allele frequency, an iteration algorithm based on a selection index was developed and used to select 60,800 SNPs for Infinium BeadChip design. Of the 60,800 SNPs, 50,701 were targeted to euchromatic regions and 10,000 to heterochromatic regions of the 20 soybean chromosomes. In addition, 99 SNPs were targeted to unanchored sequence scaffolds. Of the 60,800 SNPs, a total of 52,041 passed Illumina’s manufacturing phase to produce the SoySNP50K iSelect BeadChip. Validation of the SoySNP50K chip with 96 landrace genotypes, 96 elite cultivars and 96 wild soybean accessions showed that 47,337 SNPs were polymorphic and generated successful SNP allele calls. In addition, 40,841 of the 47,337 SNPs (86%) had minor allele frequencies ≥10% among the landraces, elite cultivars and the wild soybean accessions. A total of 620 and 42 candidate regions which may be associated with domestication and recent selection were identified, respectively. The SoySNP50K iSelect SNP beadchip will be a powerful tool for characterizing soybean genetic diversity and linkage disequilibrium, and for constructing high resolution linkage maps to improve the soybean whole genome sequence assembly.

[1]  M. Ganal,et al.  Development of a Large SNP Genotyping Array and Generation of High-Density Genetic Maps in Tomato , 2012, PloS one.

[2]  Riccardo Velasco,et al.  Genome-Wide SNP Detection, Validation, and Development of an 8K SNP Array for Apple , 2012, PloS one.

[3]  John M. Burke,et al.  SNP Discovery and Development of a High-Density Genotyping Array for Sunflower , 2012, PloS one.

[4]  O. Martin,et al.  A Large Maize (Zea mays L.) SNP Genotyping Array: Development and Germplasm Genotyping, and Genetic Mapping to Compare with the B73 Reference Genome , 2011, PloS one.

[5]  Hongwei Jiang,et al.  An Integrated Quantitative Trait Locus Map of Oil Content in Soybean, Glycine max (L.) Merr., Generated Using a Meta-Analysis Method for Mining Genes , 2011 .

[6]  Xiaoling Liang,et al.  Identification of functional genetic variations underlying drought tolerance in maize using SNP markers. , 2011, Journal of integrative plant biology.

[7]  T. Vuong,et al.  Confirmation of quantitative trait loci for resistance to multiple-HG types of soybean cyst nematode (Heterodera glycines Ichinohe) , 2011, Euphytica.

[8]  Rong Zhou,et al.  Quantitative trait loci analysis of stem strength and related traits in soybean , 2011, Euphytica.

[9]  A. Legarra,et al.  Can we predict the quality of an equine breeding for the CSO from genomics , 2011 .

[10]  J. Schmutz,et al.  Whole-genome sequencing and intensive analysis of the undomesticated soybean (Glycine soja Sieb. and Zucc.) genome , 2010, Proceedings of the National Academy of Sciences.

[11]  Bo Wang,et al.  Resequencing of 31 wild and cultivated soybean genomes identifies patterns of genetic diversity and selection , 2010, Nature Genetics.

[12]  Rex T. Nelson,et al.  Abundance of SSR Motifs and Development of Candidate Polymorphic SSR Markers (BARCSOYSSR_1.0) in Soybean , 2010 .

[13]  P. Cregan,et al.  High-throughput SNP discovery and assay development in common bean , 2010, BMC Genomics.

[14]  M. Rothschild,et al.  Development and Application of High-density SNP Arrays in Genomic Studies of Domestic Animals , 2010 .

[15]  Thomas E. Carter,et al.  A high density integrated genetic linkage map of soybean and the development of a 1536 universal soy linkage panel for quantitative trait locus mapping. , 2010 .

[16]  T. Sakurai,et al.  Genome sequence of the palaeopolyploid soybean , 2010, Nature.

[17]  Steven B Cannon,et al.  High-throughput SNP discovery through deep resequencing of a reduced representation library to anchor and orient scaffolds in the soybean whole genome sequence , 2010, BMC Genomics.

[18]  Stefano Lonardi,et al.  Development and implementation of high-throughput SNP genotyping in barley , 2009, BMC Genomics.

[19]  Denis Milan,et al.  Design of a High Density SNP Genotyping Assay in the Pig Using SNPs Identified and Characterized by Next Generation Sequencing Technology , 2009, PloS one.

[20]  J. Dvorak,et al.  Single nucleotide polymorphism genotyping in polyploid wheat with the Illumina GoldenGate assay , 2009, Theoretical and Applied Genetics.

[21]  Timothy P. L. Smith,et al.  Development and Characterization of a High Density SNP Genotyping Assay for Cattle , 2009, PloS one.

[22]  M. Mhlanga,et al.  High-Throughput SNP Genotyping: Combining Tag SNPs and Molecular Beacons , 2009, Methods in molecular biology.

[23]  Richard Shen,et al.  Medium- to high-throughput SNP genotyping using VeraCode microbeads. , 2009, Methods in molecular biology.

[24]  Shengnan Jin,et al.  High-throughput methods for SNP genotyping. , 2009, Methods in molecular biology.

[25]  J. Dvorak,et al.  A high-throughput strategy for screening of bacterial artificial chromosome libraries and anchoring of clones on a genetic map constructed with single nucleotide polymorphisms , 2009, BMC Genomics.

[26]  R. Durbin,et al.  Mapping Quality Scores Mapping Short Dna Sequencing Reads and Calling Variants Using P

, 2022 .

[27]  P. Cornelius,et al.  SOYBEAN QTL FOR YIELD AND YIELD COMPONENTS ASSOCIATED WITH GLYCINE SOJA ALLELES , 2008 .

[28]  R. Shoemaker,et al.  High-throughput genotyping with the GoldenGate assay in the complex genome of soybean , 2008, Theoretical and Applied Genetics.

[29]  K. Chase,et al.  A Soybean Transcript Map: Gene Distribution, Haplotype and Single-Nucleotide Polymorphism Analysis , 2007, Genetics.

[30]  Laurent Excoffier,et al.  Arlequin (version 3.0): An integrated software package for population genetics data analysis , 2005, Evolutionary bioinformatics online.

[31]  R. Klein,et al.  Power analysis for genome-wide association studies , 2007, BMC Genetics.

[32]  P. Cregan,et al.  BARCSoySNP23: a panel of 23 selected SNPs for soybean cultivar identification , 2007, Theoretical and Applied Genetics.

[33]  Rajeev K. Varshney,et al.  Recent history of artificial outcrossing facilitates whole-genome association mapping in elite inbred crop varieties , 2006, Proceedings of the National Academy of Sciences.

[34]  Randall L. Nelson,et al.  Impacts of genetic bottlenecks on soybean genome diversity , 2006, Proceedings of the National Academy of Sciences.

[35]  John J. Grefenstette,et al.  Application of machine learning in SNP discovery , 2006, BMC Bioinformatics.

[36]  J. E. Specht,et al.  A new integrated genetic linkage map of the soybean , 2004, Theoretical and Applied Genetics.

[37]  J. Gai,et al.  QTL mapping of ten agronomic traits on the soybean (Glycine max L. Merr.) genetic map and their association with EST markers , 2004, Theoretical and Applied Genetics.

[38]  A. Walsh,et al.  Mining single-nucleotide polymorphisms from hexaploid wheat ESTs. , 2003, Genome.

[39]  P. Cregan,et al.  Single-nucleotide polymorphisms in soybean. , 2003, Genetics.

[40]  R. Shoemaker,et al.  Molecular Marker Analysis of Seed Size in Soybean , 2003 .

[41]  J. Meyer,et al.  Introgression of a quantitative trait locus for yield from Glycine soja into commercial soybean cultivars , 2003, Theoretical and Applied Genetics.

[42]  Yujun Zhang,et al.  A fine physical map of the rice chromosome 4. , 2002, Genome research.

[43]  Y. Minobe,et al.  Search for and analysis of single nucleotide polymorphisms (SNPs) in rice (Oryza sativa, Oryza rufipogon) and establishment of SNP markers. , 2002, DNA research : an international journal for rapid publication of reports on genes and genomes.

[44]  D. Sleper,et al.  Molecular characterization of resistance to Heterodera glycines in soybean PI 438489B , 2001, Theoretical and Applied Genetics.

[45]  James E. Specht,et al.  Soybean response to water : A QTL analysis of drought tolerance , 2001 .

[46]  Gabor T. Marth,et al.  A general approach to single-nucleotide polymorphism discovery , 1999, Nature Genetics.

[47]  K. Chase,et al.  Genetics of soybean agronomic traits: I. Comparison of three related recombinant inbred populations , 1999 .

[48]  K. Lark,et al.  An Integrated Genetic Linkage Map of the Soybean Genome , 1999 .

[49]  A. Brookes The essence of SNPs. , 1999, Gene.

[50]  Gregory D Schuler,et al.  Sequence mapping by electronic PCR , 1997, Genome research.

[51]  T. Carter,et al.  Genetic Base for North American Public Soybean Cultivars Released between 1947 and 1988 , 1994 .