Computational and experimental analysis of microsatellites in rice (Oryza sativa L.): frequency, length variation, transposon associations, and genetic marker potential.

A total of 57.8 Mb of publicly available rice (Oryza sativa L.) DNA sequence was searched to determine the frequency and distribution of different simple sequence repeats (SSRs) in the genome. SSR loci were categorized into two groups based on the length of the repeat motif. Class I, or hypervariable markers, consisted of SSRs ≥20 bp, and Class II, or potentially variable markers, consisted of SSRs ≥12 bp but <20 bp. The occurrence of Class I SSRs in end-sequences of EcoRI- and HindIII-digested BAC clones was one SSR per 40 kb, whereas in continuous genomic sequence (represented by 27 fully sequenced BAC and PAC clones), the frequency was one SSR every 16 kb. Class II SSRs were estimated to occur every 3.7 kb in BAC ends and every 1.9 kb in fully sequenced BAC and PAC clones. GC-rich trinucleotide repeats (TNRs) were most abundant in protein-coding portions of ESTs and in fully sequenced BACs and PACs, whereas AT-rich TNRs showed no such preference, and di- and tetranucleotide repeats were most frequently found in noncoding, intergenic regions of the rice genome. Microsatellites with poly(AT)n repeats represented the most abundant and polymorphic class of SSRs but were frequently associated with the Micropon family of miniature inverted-repeat transposable elements (MITEs) and were difficult to amplify. A set of 200 Class I SSR markers was developed and integrated into the existing microsatellite map of rice, providing immediate links between the genetic, physical, and sequence-based maps. This contribution brings the number of microsatellite markers that have been rigorously evaluated for amplification, map position, and allelic diversity in Oryza spp. to a total of 500.
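
The length-based classification and the "one SSR per N kb" frequencies described above can be illustrated with a short sketch. This is not the search procedure used in the study, which the abstract does not specify; it is a minimal example that assumes perfect (uninterrupted) di-, tri-, and tetranucleotide repeats found with a regular expression. The function names `find_ssrs` and `kb_per_ssr`, the requirement of at least three repeat units, and the toy sequence are all illustrative assumptions.

```python
import re

# Assumption: only perfect repeats of a 2-4 bp motif with at least three
# consecutive units are considered. The lazy quantifier prefers the
# shortest motif that repeats at a given position.
SSR_RE = re.compile(r"([ACGT]{2,4}?)\1{2,}")

def find_ssrs(seq, min_len=12, class1_len=20):
    """Return (start, motif, length_bp, class) for each SSR of >= min_len bp.

    Class I:  total repeat length >= 20 bp
    Class II: total repeat length >= 12 bp and < 20 bp
    """
    hits = []
    for m in SSR_RE.finditer(seq.upper()):
        motif = m.group(1)
        length = m.end() - m.start()
        # Skip homopolymer runs (e.g. "AA" repeated) and short repeats.
        if len(set(motif)) == 1 or length < min_len:
            continue
        ssr_class = "I" if length >= class1_len else "II"
        hits.append((m.start(), motif, length, ssr_class))
    return hits

def kb_per_ssr(total_bp, n_ssrs):
    """Average spacing between SSRs, in kb of sequence searched per SSR."""
    return total_bp / n_ssrs / 1000.0

# Toy sequence (made up for illustration): one (AT)10 and one (CGG)4 repeat.
toy = "GCTCAGTCCA" + "AT" * 10 + "GGCATCGTAC" + "CGG" * 4 + "TTGACTCAGA"
for start, motif, length, ssr_class in find_ssrs(toy):
    print(f"pos {start}: ({motif})n, {length} bp, Class {ssr_class}")

# Hypothetical counts: 10 SSRs in 160 kb corresponds to one SSR every 16 kb.
print(kb_per_ssr(160_000, 10))
```

A production search would also need to handle imperfect or compound repeats and overlapping candidates, which this sketch ignores.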
