Bridging the genotyping gap: using genotyping by sequencing (GBS) to add high-density SNP markers and new value to traditional bi-parental mapping and breeding populations

Genotyping by sequencing (GBS) is the latest application of next-generation sequencing protocols for the purposes of discovering and genotyping SNPs in a variety of crop species and populations. Unlike other high-density genotyping technologies which have mainly been applied to general interest “reference” genomes, the low cost of GBS makes it an attractive means of saturating mapping and breeding populations with a high density of SNP markers. One barrier to the widespread use of GBS has been the difficulty of the bioinformatics analysis as the approach is accompanied by a high number of erroneous SNP calls which are not easily diagnosed or corrected. In this study, we use a 384-plex GBS protocol to add 30,984 markers to an indica (IR64) × japonica (Azucena) mapping population consisting of 176 recombinant inbred lines of rice (Oryza sativa) and we release our imputation and error correction pipeline to address initial GBS data sparsity and error, and streamline the process of adding SNPs to RIL populations. Using the final imputed and corrected dataset of 30,984 markers, we were able to map recombination hot and cold spots and regions of segregation distortion across the genome with a high degree of accuracy, thus identifying regions of the genome containing putative sterility loci. We mapped QTL for leaf width and aluminum tolerance, and were able to identify additional QTL for both phenotypes when using the full set of 30,984 SNPs that were not identified using a subset of only 1,464 SNPs, including a previously unreported QTL for aluminum tolerance located directly within a recombination hotspot on chromosome 1. These results suggest that adding a high density of SNP markers to a mapping or breeding population through GBS has a great value for numerous applications in rice breeding and genetics research.

[1]  M. Yano,et al.  Relationship between transmission ratio distortion and genetic divergence in intraspecific rice crosses , 2011, Molecular Genetics and Genomics.

[2]  M. Yano,et al.  Diverse variation of reproductive barriers in three intraspecific rice crosses. , 2002, Genetics.

[3]  B. Williams,et al.  An Integrated Physical and Genetic Map of the Rice Genome , 2002, The Plant Cell Online.

[4]  Xuehui Huang,et al.  High-throughput genotyping by whole-genome resequencing. , 2009, Genome research.

[5]  M. Soller,et al.  Detecting marker-QTL linkage and estimating QTL gene effect and map location using a saturated genetic map. , 1993, Genetics.

[6]  A. Fujiyama,et al.  A map of rice genome variation reveals the origin of cultivated rice , 2012, Nature.

[7]  Keyan Zhao,et al.  Genetic Architecture of Aluminum Tolerance in Rice (Oryza sativa) Determined through Genome-Wide Association Analysis and QTL Mapping , 2011, PLoS genetics.

[8]  M. Lorieux,et al.  A Genetic Model for the Female Sterility Barrier Between Asian and African Cultivated Rice Species , 2010, Genetics.

[9]  M. Lorieux,et al.  Identification of five new blast resistance genes in the highly blast-resistant rice variety IR64 using a QTL mapping strategy , 2003, Theoretical and Applied Genetics.

[10]  M. Lynch,et al.  Genetics and Analysis of Quantitative Traits , 1996 .

[11]  Robert J. Elshire,et al.  A Robust, Simple Genotyping-by-Sequencing (GBS) Approach for High Diversity Species , 2011, PloS one.

[12]  Songnian Hu,et al.  SNP deserts of Asian cultivated rice: genomic regions under domestication , 2009, Journal of evolutionary biology.

[13]  Lin Fang,et al.  Resequencing 50 accessions of cultivated and wild rice yields markers for identifying agronomically important genes , 2011, Nature Biotechnology.

[14]  A. Parco,et al.  Polymorphism, distribution, and segregation of AFLP markers in a doubled haploid rice population , 2009, Theoretical and Applied Genetics.

[15]  P. Vos,et al.  AFLP: a new technique for DNA fingerprinting. , 1995, Nucleic acids research.

[16]  D. Soltis,et al.  Microsatellite marker development for Galax urceolata (Diapensiaceae). , 2011, American journal of botany.

[17]  F Alex Feltus,et al.  An SNP resource for rice genetics and breeding based on subspecies indica and japonica genome alignments. , 2004, Genome research.

[18]  S. Hittalmani,et al.  Molecular marker assisted tagging of morphological and physiological traits under two contrasting moisture regimes at peak vegetative stage in rice (Oryza sativa L.) , 2000, Euphytica.

[19]  P. Etter,et al.  Rapid SNP Discovery and Genetic Mapping Using Sequenced RAD Markers , 2008, PloS one.

[20]  Y. Hsing,et al.  Comparative analyses of linkage maps and segregation distortion of two F₂ populations derived from japonica crossed with indica rice. , 2010, Hereditas.

[21]  P. Bagali,et al.  Molecular mapping of quantitative trait loci associated with seedling tolerance to salt stress in rice (Oryza sativa L.). , 2000 .

[22]  Richard Durbin,et al.  Fast and accurate long-read alignment with Burrows–Wheeler transform , 2010, Bioinform..

[23]  B. Courtois,et al.  Genetic Analysis of Water Use Efficiency in Rice (Oryza sativa L.) at the Leaf Level , 2010, Rice.

[24]  M. Blaxter,et al.  Genome-wide genetic marker discovery and genotyping using next-generation sequencing , 2011, Nature Reviews Genetics.

[25]  M. Yano,et al.  A genome-wide survey of reproductive barriers in an intraspecific hybrid. , 2001, Genetics.

[26]  R. MacCurdy,et al.  Three-Dimensional Root Phenotyping with a Novel Imaging and Software Platform1[C][W][OA] , 2011, Plant Physiology.

[27]  Richard Durbin,et al.  Sequence analysis Fast and accurate short read alignment with Burrows – Wheeler transform , 2009 .

[28]  S. Lin,et al.  A high-density rice genetic linkage map with 2275 markers using a single F2 population. , 1998, Genetics.

[29]  B. Ford-Lloyd,et al.  Mapping AFLP markers associated with subspecific differentiation of Oryza sativa (rice) and an investigation of segregation distortion , 1998, Heredity.

[30]  Yujun Zhang,et al.  A fine physical map of the rice chromosome 4. , 2002, Genome research.

[31]  C. Magorokosho,et al.  QTL mapping in three tropical maize populations reveals a set of constitutive and adaptive genomic regions for drought tolerance , 2012, Theoretical and Applied Genetics.

[32]  Z. Li,et al.  Gene actions of QTLs affecting several agronomic traits resolved in a recombinant inbred rice population and two backcross populations , 2003, Theoretical and Applied Genetics.

[33]  Y. Minobe,et al.  Search for and analysis of single nucleotide polymorphisms (SNPs) in rice (Oryza sativa, Oryza rufipogon) and establishment of SNP markers. , 2002, DNA research : an international journal for rapid publication of reports on genes and genomes.

[34]  E S Lander,et al.  Systematic detection of errors in genetic linkage data. , 1992, Genomics.

[35]  G. S. Khush,et al.  QTL × environment interactions in rice. I. Heading date and plant height , 2003, Theoretical and Applied Genetics.

[36]  Kenneth L. McNally,et al.  High-throughput single nucleotide polymorphism genotyping for breeding applications in rice using the BeadXpress platform , 2011, Molecular Breeding.

[37]  E. Choi,et al.  Quantitative trait loci for phytate in rice grain and their relationship with grain micronutrient content , 2007, Euphytica.

[38]  Steven L Salzberg,et al.  Fast gapped-read alignment with Bowtie 2 , 2012, Nature Methods.

[39]  MICROSATELLITE MARKER DEVELOPMENT, MAPPING AND APPLICATIONS IN RICE GENETICS AND BREEDING , 1997 .

[40]  S. Leal Genetics and Analysis of Quantitative Traits , 2001 .

[41]  N. Huang,et al.  Chromosomal regions associated with segregation distortion of molecular markers in F2 , backcross, doubled haploid, and recombinant inbred populations in rice (Oryza sativa L.) , 1997, Molecular and General Genetics MGG.

[42]  V. Singh,et al.  Identification of QTL for growth- and grain yield-related traits in rice across nine locations of Asia , 2003, Theoretical and Applied Genetics.

[43]  D. Kudrna,et al.  Diversity Arrays Technology (DArT) for whole-genome profiling of barley. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[44]  Edward S. Buckler,et al.  TASSEL: software for association mapping of complex traits in diverse samples , 2007, Bioinform..

[45]  B. Mangin,et al.  Constructing confidence intervals for QTL location. , 1994, Genetics.

[46]  A. Paterson,et al.  Gene actions of QTLs affecting several agronomic traits resolved in a recombinant inbred rice population and two testcross populations , 2003, Theoretical and Applied Genetics.

[47]  M. Yano,et al.  Physical maps and recombination frequency of six rice chromosomes. , 2003, The Plant journal : for cell and molecular biology.

[48]  U. Rosyara,et al.  Family-based mapping of quantitative trait loci in plant breeding populations with resistance to Fusarium head blight in wheat as an illustration , 2009, Theoretical and Applied Genetics.

[49]  M. Lorieux,et al.  Identification of five new blast resistance genes in the highly blast-resistant rice variety IR64 using a QTL mapping strategy , 2003, Theoretical and Applied Genetics.

[50]  D Siegmund,et al.  Statistical methods for mapping quantitative trait loci from a dense set of markers. , 1999, Genetics.

[51]  Gregory D May,et al.  A comparative transcriptomic study of an allotetraploid and its diploid progenitors illustrates the unique advantages and challenges of RNA-seq in plant species. , 2012, American journal of botany.