In silico analysis on frequency and distribution of microsatellites in ESTs of some cereal species.

During the last decade microsatellites or SSRs (simple sequence repeats) have been proven to be the markers of choice in plant genetics research and for breeding purposes because of their hypervariability and ease of detection. However, development of these markers is expensive, labour intensive and time consuming, in particular, if they are being developed from genomic libraries. In the context of large-scale sequencing and genomics programmes in various cereal species at different laboratories, a large set of expressed sequence tags (ESTs) is being generated, which can be used to search for microsatellites. Keeping in view the importance of such type of SSRs, available ESTs of some cereal species like barley, maize, oats, rice, rye and wheat were investigated for a study of abundance, frequency and distribution of various types of microsatellites. SSRs were present in about 7% to 10% of the total ESTs in the investigated cereal genomes. On the basis of surveying EST sequences amounting to 75.2 Mb in barley, 54.7 Mb in maize, 43.9 Mb in rice, 3.7 Mb in rye, 41.6 Mb in sorghum and 37.5 Mb in wheat, the frequency of SSRs was 1/7.5 kb in barley, 1/7.5 kb in maize, 1/6.2 kb in wheat, 1/5.5 kb in rye and sorghum and 1/3.9 kb in rice. The overall average SSR frequency for these species is 1/6.0 kb. Trimeric repeats are the most abundant (54% to 78%) class of microsatellites followed by dimeric repeats (17% to 40%). Among the trimeric repeats the motifs CCG are the most common in all the cases ranging from 32% in wheat to 49% in sorghum. When all these SSRs were analysed for assessing their potential to develop new markers, unique primer pairs could be designed for 30% to 70% of the total non-redundant microsatellites which are up to 3% of total ESTs in the studied species.

[1]  B. Saal,et al.  Development of simple sequence repeat markers in rye (Secale cereale L.). , 1999, Genome.

[2]  D. Tautz Hypervariability of simple sequences as a general source for polymorphic DNA markers. , 1989, Nucleic acids research.

[3]  M. Morgante,et al.  Microsatellites are preferentially associated with nonrepetitive DNA in plant genomes , 2002, Nature Genetics.

[4]  Assessment of genotypic variation among cultivated durum wheat based on EST-SSRS and genomic SSRS , 2001 .

[5]  W. Powell,et al.  Isolation of EST-derived microsatellite markers for genotyping the A and B genomes of wheat , 2002, Theoretical and Applied Genetics.

[6]  M. Sorrells,et al.  Data mining for simple sequence repeats in expressed sequence tags from barley, maize, rice, sorghum and wheat , 2002, Plant Molecular Biology.

[7]  R. Varshney,et al.  The development and use of microsatellite markers for genetic analysis and plant breeding with emphasis on bread wheat , 2000, Euphytica.

[8]  D. Metzgar,et al.  Selection against frameshift mutations limits microsatellite expansion in coding DNA. , 2000, Genome research.

[9]  S. Larson Plant Genotyping: The DNA Fingerprinting of Plants , 2002, Heredity.

[10]  M. Morgante,et al.  A simple sequence repeat-based linkage map of barley. , 2000, Genetics.

[11]  R. Henry,et al.  Microsatellites derived from ESTs, and their comparison with those derived by other methods. , 2001 .

[12]  G. Scoles,et al.  The development of oat microsatellite markers and their use in identifying relationships among Avena species and oat cultivars , 2000, Theoretical and Applied Genetics.

[13]  M. Senior,et al.  Maize simple repetitive DNA sequences: abundance and allele variation. , 1996, Genome.

[14]  R K Varshney,et al.  Generation and comparison of EST-derived SSRs and SNPs in barley (Hordeum vulgare L.). , 2004, Hereditas.

[15]  W. Powell,et al.  Polymorphism revealed by simple sequence repeats , 1996 .

[16]  M. Morgante,et al.  PCR-amplified microsatellites as markers in plant genetics. , 1993, The Plant journal : for cell and molecular biology.

[17]  L. Lipovich,et al.  Mapping and genome organization of microsatellite sequences in rice (Oryza sativa L.) , 2000, Theoretical and Applied Genetics.

[18]  Winston A Hide,et al.  A comprehensive approach to clustering of expressed human gene sequence: the sequence tag alignment and consensus knowledge base. , 1999, Genome research.

[19]  D. Bhattramakki,et al.  An integrated SSR and RFLP linkage map of Sorghum bicolor (L.) Moench. , 2000, Genome.

[20]  R. Bernardo,et al.  Inconsistency between SSR groupings and genetic backgrounds of white corn inbreds , 2001 .

[21]  M. Ganal,et al.  A microsatellite map of wheat. , 1998, Genetics.

[22]  A. Inagaki,et al.  Microsatellite DNA markers for rice chromosomes , 1996, Theoretical and Applied Genetics.

[23]  D. Marshall,et al.  Computational and experimental characterization of physically clustered simple sequence repeats in plants. , 2000, Genetics.

[24]  R. Henry,et al.  Microsatellite markers from sugarcane (Saccharum spp.) ESTs cross transferable to erianthus and sorghum. , 2001, Plant science : an international journal of experimental plant biology.

[25]  J. Beckmann,et al.  Toward a Unified Approach to Genetic Mapping of Eukaryotes Based on Sequence Tagged Microsatellite Sites , 1990, Bio/Technology.

[26]  R. J. Henry,et al.  Analysis of SSRs derived from grape ESTs , 2000, Theoretical and Applied Genetics.

[27]  L. Lipovich,et al.  Computational and experimental analysis of microsatellites in rice (Oryza sativa L.): frequency, length variation, transposon associations, and genetic marker potential. , 2001, Genome research.