Sorghum Genome Sequencing by Methylation Filtration

Sorghum bicolor is a close relative of maize and is a staple crop in Africa and much of the developing world because of its superior tolerance of arid growth conditions. We have generated sequence from the hypomethylated portion of the sorghum genome by applying methylation filtration (MF) technology. The evidence suggests that 96% of the genes have been sequence tagged, with an average coverage of 65% across their length. Remarkably, this level of gene discovery was accomplished after generating a raw coverage of less than 300 megabases of the 735-megabase genome. MF preferentially captures exons and introns, promoters, microRNAs, and simple sequence repeats, and minimizes interspersed repeats, thus providing a robust view of the functional parts of the genome. The sorghum MF sequence set is beneficial to research on sorghum and is also a powerful resource for comparative genomics among the grasses and across the entire plant kingdom. Thousands of hypothetical gene predictions in rice and Arabidopsis are supported by the sorghum dataset, and genomic similarities highlight evolutionarily conserved regions that will lead to a better understanding of rice and Arabidopsis.
