Reliable In Silico Identification of Sequence Polymorphisms and Their Application for Extending the Genetic Map of Sugar Beet (Beta vulgaris)

Molecular markers are a highly valuable tool for creating genetic maps. Like in many other crops, sugar beet (Beta vulgaris L.) breeding is increasingly supported by the application of such genetic markers. Single nucleotide polymorphism (SNP) based markers have a high potential for automated analysis and high-throughput genotyping. We developed a bioinformatics workflow that uses Sanger and 2nd-generation sequence data for detection, evaluation and verification of new transcript-associated SNPs from sugar beet. RNAseq data from one parent of an established mapping population were produced by 454-FLX sequencing and compared to Sanger ESTs derived from the other parent. The workflow established for SNP detection considers the quality values of both types of reads, provides polymorphic alignments as well as selection criteria for reliable SNP detection and allows painless generation of new genetic markers within genes. We obtained a total of 14,323 genic SNPs and InDels. According to empirically optimised settings for the quality parameters, we classified these SNPs into four usability categories. Validation of a subset of the in silico detected SNPs by genotyping the mapping population indicated a high success rate of the SNP detection. Finally, a total of 307 new markers were integrated with existing data into a new genetic map of sugar beet which offers improved resolution and the integration of terminal markers.

[1]  David Edwards,et al.  Redundancy based detection of sequence polymorphisms in expressed sequence tag data using autoSNP , 2003, Bioinform..

[2]  Philippe Chaumeil,et al.  Automated SNP Detection in Expressed Sequence Tags: Statistical Considerations and Application to Maritime Pine Sequences , 2004, Plant Molecular Biology.

[3]  Eric S. Lander,et al.  An SNP map of the human genome generated by reduced representation shotgun sequencing , 2000, Nature.

[4]  S. Wanamaker,et al.  Genome-wide SNP discovery and linkage analysis in barley based on genes responsive to abiotic stress , 2005, Molecular Genetics and Genomics.

[5]  Juliane C. Dohm,et al.  High-throughput identification of genetic markers using representational oligonucleotide microarray analysis , 2010, Theoretical and Applied Genetics.

[6]  J. Jansen,et al.  Constructing dense genetic linkage maps , 2001, Theoretical and Applied Genetics.

[7]  J. Dvorak,et al.  Annotation-based genome-wide SNP discovery in the large and complex Aegilops tauschii genome using next-generation sequencing without a reference genome sequence , 2011, BMC Genomics.

[8]  J. O'Brien,et al.  Construction of a 'unigene' cDNA clone set by oligonucleotide fingerprinting allows access to 25 000 potential sugar beet genes. , 2002, The Plant journal : for cell and molecular biology.

[9]  D. Pérez-Marín,et al.  Direct prediction of bioethanol yield in sugar beet pulp using near infrared spectroscopy. , 2011, Bioresource technology.

[10]  A. Rafalski Applications of single nucleotide polymorphisms in crop genetics. , 2002, Current opinion in plant biology.

[11]  I. Eujayl,et al.  Empirical evaluation of DArT, SNP, and SSR marker-systems for genotyping, clustering, and assigning sugar beet hybrid varieties into populations. , 2012, Plant science : an international journal of experimental plant biology.

[12]  Jack A. M. Leunissen,et al.  QualitySNP: a pipeline for detecting single nucleotide polymorphisms and insertions/deletions in EST data from diploid and polyploid species , 2006, BMC Bioinformatics.

[13]  T. Schmidt,et al.  A sugar beet (Beta vulgaris L.) reference FISH karyotype for chromosome and chromosome-arm identification, integration of genetic linkage groups and analysis of major repeat family distribution. , 2012, The Plant journal : for cell and molecular biology.

[14]  Richard Reinhardt,et al.  Haplotype divergence in Beta vulgaris and microsynteny with sequenced plant genomes. , 2009, The Plant journal : for cell and molecular biology.

[15]  Roeland E. Voorrips,et al.  Software for the calculation of genetic linkage maps , 2001 .

[16]  S Rozen,et al.  Primer3 on the WWW for general users and for biologist programmers. , 2000, Methods in molecular biology.

[17]  Kazutaka Katoh,et al.  Multiple alignment of DNA sequences with MAFFT. , 2009, Methods in molecular biology.

[18]  Tianzhu Zhang,et al.  Unintended consequences of bioethanol feedstock choice in China. , 2012, Bioresource technology.

[19]  Richard G. F. Visser,et al.  RECORD: a novel method for ordering loci on a genetic linkage map , 2005, Theoretical and Applied Genetics.

[20]  Zhong Wang,et al.  Next-generation transcriptome assembly , 2011, Nature Reviews Genetics.

[21]  C. Jung,et al.  Chromosomal assignment of the nine linkage groups of sugar beet (Beta vulgaris L.) using primary trisomics , 1997, Theoretical and Applied Genetics.

[22]  R. Voorrips MapChart: software for the graphical presentation of linkage maps and QTLs. , 2002, The Journal of heredity.

[23]  J. Reif,et al.  Genome-wide association mapping of agronomic traits in sugar beet , 2011, Theoretical and Applied Genetics.

[24]  G. Kirov,et al.  Universal, robust, highly quantitative SNP allele frequency measurement in DNA pools , 2002, Human Genetics.

[25]  P. Cregan,et al.  Discovery of SNPs in soybean genotypes frequently used as the parents of mapping populations in the United States and Korea. , 2005, The Journal of heredity.

[26]  Christian Jung,et al.  Analysis of DNA polymorphisms in sugar beet (Beta vulgaris L.) and development of an SNP-based map of expressed genes , 2007, Theoretical and Applied Genetics.

[27]  Alexander Goesmann,et al.  The genome of the recently domesticated crop plant sugar beet (Beta vulgaris) , 2013, Nature.

[28]  G. P. Telles,et al.  Trimming and clustering sugarcane ESTs , 2001 .

[29]  A. Syvänen Accessing genetic variation: genotyping single nucleotide polymorphisms , 2001, Nature Reviews Genetics.

[30]  Yong Li,et al.  An Arabidopsis thaliana T-DNA mutagenized population (GABI-Kat) for flanking sequence tag-based reverse genetics , 2003, Plant Molecular Biology.

[31]  Hans Lehrach,et al.  Palaeohexaploid ancestry for Caryophyllales inferred from extensive gene-based physical and genetic mapping of the sugar beet genome (Beta vulgaris). , 2012, The Plant journal : for cell and molecular biology.

[32]  P. Green,et al.  Base-calling of automated sequencer traces using phred. I. Accuracy assessment. , 1998, Genome research.

[33]  Yu Zhang,et al.  Calling SNPs without a reference sequence , 2010, BMC Bioinformatics.

[34]  Gabor T. Marth,et al.  A general approach to single-nucleotide polymorphism discovery , 1999, Nature Genetics.

[35]  B. Göttgens,et al.  A new RNASeq-based reference transcriptome for sugar beet and its application in transcriptome-scale analysis of vernalization and gibberellin responses , 2012, BMC Genomics.

[36]  E. D. Earle,et al.  Nuclear DNA content of some important plant species , 1991, Plant Molecular Biology Reporter.