High-throughput SNP Profiling of Genetic Resources in Crop Plants Using Genotyping Arrays

Using high-throughput DNA sequencing technologies, it is now possible to quickly and reliably identify many thousands to millions of SNPs in a species. They can subsequently serve as markers for the development of large genotyping arrays. Large numbers of individuals derived from gene banks, landraces, breeding material and varieties can be genotyped with such arrays at an extremely high marker density in a fast, efficient and highly reproducible way. Based on our experience, we provide in this chapter an overview on various aspects that have to be considered within the process of developing such genotyping arrays, including the SNP discovery and/or collection, possible selection criteria for SNPs to be put on the array, SNP scoring and allele calling as well as data assembly for the analysis of millions of genotypes. To make the best use of these genotyping data, it will be very important to establish databases containing marker data from many genotyping experiments in order to simplify downstream data processing for scientific as for breeding purposes.

[1]  J. Chen,et al.  Genome-wide genetic changes during modern breeding of maize , 2012, Nature Genetics.

[2]  Jan van Oeveren,et al.  Complexity Reduction of Polymorphic Sequences (CRoPS™): A Novel Approach for Large-Scale Polymorphism Discovery in Complex Genomes , 2007, PloS one.

[3]  B. S. Dhillon,et al.  Extent and genome-wide distribution of linkage disequilibrium in commercial maize germplasm , 2011, Theoretical and Applied Genetics.

[4]  K. Gunderson,et al.  Design of tag SNP whole genome genotyping arrays. , 2009, Methods in molecular biology.

[5]  Randall L. Nelson,et al.  Development and Evaluation of SoySNP50K, a High-Density Genotyping Array for Soybean , 2013, PloS one.

[6]  Marta Matvienko,et al.  De novo assembly and characterization of the carrot transcriptome reveals novel genes, new markers, and genetic diversity , 2011, BMC Genomics.

[7]  C. Town,et al.  Genome-wide SNP discovery in tetraploid alfalfa using 454 sequencing and high resolution melting analysis , 2011, BMC Genomics.

[8]  John M. Burke,et al.  SNP Discovery and Development of a High-Density Genotyping Array for Sunflower , 2012, PloS one.

[9]  J. Dvorak,et al.  Population- and genome-specific patterns of linkage disequilibrium and SNP variation in spring and winter wheat (Triticum aestivum L.) , 2010, BMC Genomics.

[10]  Mark H. Wright,et al.  Large‐Scale Discovery of Gene‐Enriched SNPs , 2009 .

[11]  Takuji Sasaki,et al.  The map-based sequence of the rice genome , 2005, Nature.

[12]  Riccardo Velasco,et al.  Genome-Wide SNP Detection, Validation, and Development of an 8K SNP Array for Apple , 2012, PloS one.

[13]  M. Ganal,et al.  SNP discovery by amplicon sequencing and multiplex SNP genotyping in the allopolyploid species Brassica napus. , 2010, Genome.

[14]  Steven B Cannon,et al.  High-throughput SNP discovery through deep resequencing of a reduced representation library to anchor and orient scaffolds in the soybean whole genome sequence , 2010, BMC Genomics.

[15]  S. Deschamps,et al.  Rapid Genome‐wide Single Nucleotide Polymorphism Discovery in Soybean and Rice via Deep Resequencing of Reduced Representation Libraries with the Illumina Genome Analyzer , 2010 .

[16]  Detlef Weigel,et al.  Next-generation genetics in plants , 2008, Nature.

[17]  P. Heslop-Harrison,et al.  Organisation of the plant genome in chromosomes. , 2011, The Plant journal : for cell and molecular biology.

[18]  R. Veilleux,et al.  Integration of Two Diploid Potato Linkage Maps with the Potato Genome Sequence , 2012, PloS one.

[19]  Peter Tiffin,et al.  Pervasive gene content variation and copy number variation in maize and its undomesticated progenitor. , 2010, Genome research.

[20]  J. Rafalski,et al.  Association genetics in crop improvement. , 2010, Current opinion in plant biology.

[21]  Mark H. Wright,et al.  Genome-wide association mapping reveals a rich genetic architecture of complex traits in Oryza sativa , 2011, Nature communications.

[22]  Weihua Chang,et al.  Whole-genome genotyping. , 2006, Methods in enzymology.

[23]  Robert J. Elshire,et al.  A Robust, Simple Genotyping-by-Sequencing (GBS) Approach for High Diversity Species , 2011, PloS one.

[24]  Thomas Altmann,et al.  SNP identification in crop plants. , 2009, Current opinion in plant biology.

[25]  Jianbing Yan,et al.  Genetic Characterization and Linkage Disequilibrium Estimation of a Global Maize Collection Using SNP Markers , 2009, PloS one.

[26]  C. Saintenac,et al.  Targeted analysis of nucleotide and copy number variation by exon capture in allotetraploid wheat genome , 2011, Genome Biology.

[27]  Bo Wang,et al.  Resequencing of 31 wild and cultivated soybean genomes identifies patterns of genetic diversity and selection , 2010, Nature Genetics.

[28]  J. Poland,et al.  Development of High-Density Genetic Maps for Barley and Wheat Using a Novel Two-Enzyme Genotyping-by-Sequencing Approach , 2012, PloS one.

[29]  T. Joshi,et al.  SNP discovery by high-throughput sequencing in soybean , 2010, BMC Genomics.

[30]  Dorrie Main,et al.  Development and Evaluation of a 9K SNP Array for Peach by Internationally Coordinated SNP Detection and Validation in Breeding Germplasm , 2012, PloS one.

[31]  Paul D. Shaw,et al.  Natural variation in a homolog of Antirrhinum CENTRORADIALIS contributed to spring growth habit and environmental adaptation in cultivated barley , 2012, Nature Genetics.

[32]  M. Blaxter,et al.  Genome-wide genetic marker discovery and genotyping using next-generation sequencing , 2011, Nature Reviews Genetics.

[33]  Dawn H. Nagel,et al.  The B73 Maize Genome: Complexity, Diversity, and Dynamics , 2009, Science.

[34]  Xuehui Huang,et al.  High-throughput genotyping by whole-genome resequencing. , 2009, Genome research.

[35]  David Edwards,et al.  Discovering genetic polymorphisms in next-generation sequencing data. , 2009, Plant biotechnology journal.

[36]  T. Shah,et al.  Comparative SNP and Haplotype Analysis Reveals a Higher Genetic Diversity and Rapider LD Decay in Tropical than Temperate Germplasm in Maize , 2011, PloS one.

[37]  Weihua Chang,et al.  Whole-genome genotyping with the single-base extension assay , 2005, Nature Methods.

[38]  C. Kole,et al.  Arabidopsis Genome Initiative , 2016 .

[39]  S. Jackson,et al.  Next-generation sequencing technologies and their implications for crop genetics and breeding. , 2009, Trends in biotechnology.

[40]  Jean-Luc Jannink,et al.  Population genetics of genomics-based crop improvement methods. , 2011, Trends in genetics : TIG.

[41]  Uwe Scholz,et al.  From RNA-seq to large-scale genotyping - genomics resources for rye (Secale cereale L.) , 2011, BMC Plant Biology.

[42]  Antoine Janssen,et al.  Sequence-Based Genotyping for Marker Discovery and Co-Dominant Scoring in Germplasm and Populations , 2012, PloS one.

[43]  O. Martin,et al.  A Large Maize (Zea mays L.) SNP Genotyping Array: Development and Germplasm Genotyping, and Genetic Mapping to Compare with the B73 Reference Genome , 2011, PloS one.

[44]  J. Rogers,et al.  Crop genome sequencing: lessons and rationales. , 2011, Trends in plant science.

[45]  Robert J. Elshire,et al.  A First-Generation Haplotype Map of Maize , 2009, Science.

[46]  M. Yano,et al.  Discovery of Genome-Wide DNA Polymorphisms in a Landrace Cultivar of Japonica Rice by Whole-Genome Sequencing , 2011, Plant & cell physiology.

[47]  A. Janssen,et al.  Sequence-based SNP genotyping in durum wheat. , 2013, Plant biotechnology journal.

[48]  J. Cañizares,et al.  Transcriptome sequencing for SNP discovery across Cucumis melo , 2012, BMC Genomics.

[49]  Jian Wang,et al.  Genome-wide patterns of genetic variation among elite maize inbred lines , 2010, Nature Genetics.

[50]  F. Christians,et al.  High-density genechip oligonucleotide probe arrays. , 2002, Advances in biochemical engineering/biotechnology.

[51]  Evandro Novaes,et al.  High-throughput gene and SNP discovery in Eucalyptus grandis, an uncharacterized genome , 2008, BMC Genomics.

[52]  M. Ganal,et al.  Development of a Large SNP Genotyping Array and Generation of High-Density Genetic Maps in Tomato , 2012, PloS one.

[53]  Bin Han,et al.  Resequencing rice genomes: an emerging new era of rice genomics. , 2013, Trends in genetics : TIG.

[54]  M. Metzker Sequencing technologies — the next generation , 2010, Nature Reviews Genetics.

[55]  Yan Long,et al.  Single nucleotide polymorphism (SNP) discovery in the polyploid Brassica napus using Solexa transcriptome sequencing. , 2009, Plant biotechnology journal.

[56]  M. Goddard,et al.  Prediction of total genetic value using genome-wide dense marker maps. , 2001, Genetics.

[57]  S. Myles,et al.  Rapid Genomic Characterization of the Genus Vitis , 2010, PloS one.

[58]  J. Dvorak,et al.  Single nucleotide polymorphism genotyping in polyploid wheat with the Illumina GoldenGate assay , 2009, Theoretical and Applied Genetics.

[59]  M. Handzic ) 5 , 1990 .

[60]  J. Dvorak,et al.  Annotation-based genome-wide SNP discovery in the large and complex Aegilops tauschii genome using next-generation sequencing without a reference genome sequence , 2011, BMC Genomics.

[61]  T. Richmond,et al.  Targeted re-sequencing of the allohexaploid wheat exome. , 2012, Plant biotechnology journal.

[62]  Richard M. Clark,et al.  Sequencing of natural strains of Arabidopsis thaliana with short reads. , 2008, Genome research.

[63]  Edward S. Buckler,et al.  Genetic structure and domestication history of the grape , 2011, Proceedings of the National Academy of Sciences.

[64]  G. Valè,et al.  Identification of SNP and SSR markers in eggplant using RAD tag sequencing , 2011, BMC Genomics.

[65]  Karsten M. Borgwardt,et al.  Whole-genome sequencing of multiple Arabidopsis thaliana populations , 2011, Nature Genetics.

[66]  Uwe Scholz,et al.  Unlocking the Barley Genome by Chromosomal and Comparative Genomics[W][OA] , 2011, Plant Cell.

[67]  Xun Xu,et al.  Comparative population genomics of maize domestication and improvement , 2012, Nature Genetics.

[68]  S. Salvi,et al.  High-throughput SNP discovery and genotyping in durum wheat (Triticum durum Desf.) , 2011, Theoretical and Applied Genetics.

[69]  The Arabidopsis Genome Initiative Analysis of the genome sequence of the flowering plant Arabidopsis thaliana , 2000, Nature.

[70]  Detlef Weigel,et al.  Fast-forward genetics enabled by new sequencing technologies. , 2011, Trends in plant science.