Genotyping of Soybean Cultivars With Medium-Density Array Reveals the Population Structure and QTNs Underlying Maturity and Seed Traits

Soybean was domesticated about 5,000 to 6,000 years ago in China. Although genotyping technologies such as genotyping by sequencing (GBS) and high-density arrays are available, a medium-density SNP array is a convenient and economical way to genotype cultivars or populations in genetic studies and in molecular breeding. In this study, 235 cultivars collected from China, Japan, the USA, Canada, and some other countries were genotyped using the SoySNP8k iSelect BeadChip with 7,189 single nucleotide polymorphisms (SNPs). In total, 4,471 polymorphic SNP markers were used to analyze population structure and to perform a genome-wide association study (GWAS). The most likely K value was 7, indicating that this population can be divided into 7 subpopulations, which accords well with the geographic origins of the cultivars or accessions studied. The LD decay distance was estimated at 184 kb, where r² dropped to half of its maximum value (0.205). GWAS using FarmCPU detected a stable quantitative trait nucleotide (QTN) for hilum color and seed color, consistent with known loci or genes. Although no universal QTNs for flowering time and maturity were identified across all environments, a total of 30 consistent QTNs were detected for flowering time (R1) or maturity (R7 and R8) on 16 chromosomes; most of them corresponded to the known E1 to E4 genes or to QTL regions reported in SoyBase (soybase.org). Of 16 consistent QTNs for protein and oil contents, 11 had antagonistic effects on protein and oil content, 4 affected oil content solely, and one affected protein content solely. The information gained in this study demonstrates the usefulness of the medium-density SNP array in genotyping for genetic studies and molecular breeding.
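
The LD decay criterion mentioned above (the physical distance at which r² falls to half of its maximum) can be illustrated with a minimal Python sketch. It is not the procedure used in this study: the function name `ld_half_decay_distance`, the input layout (distance-bin midpoints in kb paired with mean r² per bin), and the linear interpolation between bins are all assumptions made for illustration.

```python
from typing import Optional, Sequence


def ld_half_decay_distance(
    distances_kb: Sequence[float], mean_r2: Sequence[float]
) -> Optional[float]:
    """Distance (kb) at which mean r2 first falls to half of its maximum.

    Hypothetical helper: inputs are assumed to be distance-bin midpoints
    and the mean r2 observed in each bin. Returns None if r2 never
    reaches the half-maximum within the supplied range.
    """
    if len(distances_kb) != len(mean_r2) or len(distances_kb) == 0:
        raise ValueError("inputs must be non-empty and of equal length")

    # Sort the bins by distance so the curve is scanned left to right.
    pairs = sorted(zip(distances_kb, mean_r2))
    r2_values = [r for _, r in pairs]
    max_r2 = max(r2_values)
    if max_r2 <= 0:
        raise ValueError("maximum r2 must be positive")

    threshold = max_r2 / 2.0
    start = r2_values.index(max_r2)  # begin scanning at the maximum
    prev_d, prev_r = pairs[start]

    for d, r in pairs[start + 1:]:
        if r <= threshold:
            # Linear interpolation between the two bins that bracket
            # the half-maximum threshold.
            frac = (prev_r - threshold) / (prev_r - r)
            return prev_d + frac * (d - prev_d)
        prev_d, prev_r = d, r
    return None


# Toy values (not data from this study).
toy_distances = [0.0, 100.0, 200.0, 300.0]
toy_r2 = [0.5, 0.375, 0.125, 0.0625]
half_decay = ld_half_decay_distance(toy_distances, toy_r2)
```

With these toy values the half-maximum threshold is 0.25, which lies midway between the bins at 100 kb and 200 kb. In practice a fitted decay curve is often used instead of linear interpolation; the sketch only shows the half-maximum criterion itself.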
