Prediction of co-regulated genes in Bacillus subtilis on the basis of upstream elements conserved across three closely related species

BackgroundIdentification of co-regulated genes is essential for elucidating transcriptional regulatory networks and the function of uncharacterized genes. Although co-regulated genes should have at least one common sequence element, it is generally difficult to identify these genes from the presence of this element because it is very easily obscured by noise. To overcome this problem, we used conserved information from three closely related species: Bacillus subtilis,B. halodurans and B. stearothermophilus.ResultsEven though such species have a limited number of clearly orthologous genes, we obtained 1,884 phylogenetically conserved elements from the upstream intergenic regions of 1,568 B. subtilis genes. Similarity between these elements was used to cluster these genes. No other a priori knowledge on genes and elements was used. We could identify some genes known or suggested to be regulated by a common transcription factor as well as genes regulated by a common attenuation effector.ConclusionsWe confirmed that our method generates relatively few false positives in clusters with higher scores and that general elements such as -35/-10 boxes and Shine-Dalgarno sequence are not major obstacles. Moreover, we identified some plausible additional members of groups of known co-regulated genes. Thus, our approach is promising for exploring potentially co-regulated genes.

[1]  G. Church,et al.  Systematic determination of genetic network architecture , 1999, Nature Genetics.

[2]  Kenta Nakai,et al.  DBTBS: a database of Bacillus subtilis promoters and transcription factors , 2001, Nucleic Acids Res..

[3]  G. Church,et al.  A comprehensive library of DNA-binding site matrices for 55 proteins applied to the complete Escherichia coli K-12 genome. , 1998, Journal of molecular biology.

[4]  Y. Lu,et al.  Function of RNA secondary structures in transcriptional attenuation of the Bacillus subtilis pyr operon. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[5]  D. Botstein,et al.  Cluster analysis and display of genome-wide expression patterns. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[6]  G. Homuth,et al.  The dnaK operon of Bacillus subtilis is heptacistronic , 1997, Journal of bacteriology.

[7]  G. Rapoport,et al.  CtsR, a novel regulator of stress and heat shock response, controls clp and molecular chaperone gene expression in Gram‐positive bacteria , 1999, Molecular microbiology.

[8]  Y. Nakamura,et al.  Complete genome sequence of the alkaliphilic bacterium Bacillus halodurans and genomic sequence comparison with Bacillus subtilis. , 2000, Nucleic acids research.

[9]  K. Makarova,et al.  Conservation of the binding site for the arginine repressor in all bacterial lineages , 2001, Genome Biology.

[10]  Y. Lu,et al.  Roles of the three transcriptional attenuators of the Bacillus subtilis pyrimidine biosynthetic operon in the regulation of its expression , 1995, Journal of bacteriology.

[11]  M S Waterman,et al.  Identification of common molecular subsequences. , 1981, Journal of molecular biology.

[12]  M. Rose,et al.  ClpE, a novel type of HSP100 ATPase, is part of the CtsR heat shock regulon of Bacillus subtilis , 1999, Molecular microbiology.

[13]  M. Grunberg‐Manago,et al.  Aminoacyl-tRNA synthetase gene regulation in Bacillus subtilis. , 1996, Biochimie.

[14]  G D Stormo,et al.  A comparative genomics approach to prediction of new members of regulons. , 2001, Genome research.

[15]  Denis Thieffry,et al.  Prediction of transcriptional regulatory sites in the complete genome sequence of Escherichia coli K-12 , 1998, Bioinform..

[16]  T Yada,et al.  Modeling and predicting transcriptional units of Escherichia coli genes using hidden Markov models. , 1999, Bioinformatics.

[17]  E. Koonin,et al.  Prediction of transcription regulatory sites in Archaea by a comparative genomic approach. , 2000, Nucleic acids research.

[18]  A. Ogiwara,et al.  Evaluation and characterization of catabolite-responsive elements (cre) of Bacillus subtilis. , 2000, Nucleic acids research.

[19]  N. Ogasawara,et al.  Systematic function analysis of Bacillus subtilis genes. , 2000, Research in microbiology.

[20]  W. Miller,et al.  Long human-mouse sequence alignments reveal novel regulatory elements: a reason to sequence the mouse genome. , 1997, Genome research.

[21]  W R Pearson,et al.  Flexible sequence similarity searching with the FASTA3 program package. , 2000, Methods in molecular biology.

[22]  I. Moszer The complete genome of Bacillus subtilis: from sequence annotation to data management and analysis , 1998, FEBS letters.

[23]  L. Christiansen,et al.  Xanthine metabolism in Bacillus subtilis: characterization of the xpt-pbuX operon and evidence for purine- and nitrogen-controlled expression of genes involved in xanthine salvage and catabolism , 1997, Journal of bacteriology.

[24]  K. Kobayashi,et al.  Combined transcriptome and proteome analysis as a powerful approach to study genes under glucose repression in Bacillus subtilis. , 2001, Nucleic acids research.

[25]  S. Salzberg,et al.  Prediction of operons in microbial genomes. , 2001, Nucleic acids research.

[26]  E. Koonin,et al.  Computer analysis of transcription regulatory patterns in completely sequenced bacterial genomes. , 1999, Nucleic acids research.

[27]  G. Church,et al.  Predicting regulons and their cis-regulatory motifs by comparative genomics. , 2000, Nucleic acids research.

[28]  Robert R. Sokal,et al.  A statistical method for evaluating systematic relationships , 1958 .

[29]  D. Botstein,et al.  Cluster analysis and display of genome-wide expression patterns. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[30]  I-Min A. Dubchak,et al.  Active conservation of noncoding sequences revealed by three-way species comparisons. , 2000, Genome research.

[31]  Temple F. Smith,et al.  Operons in Escherichia coli: genomic analyses and predictions. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[32]  G. Church,et al.  Conservation of DNA regulatory motifs and discovery of new motifs in microbial genomes. , 2000, Genome research.

[33]  A. Goffeau,et al.  The complete genome sequence of the Gram-positive bacterium Bacillus subtilis , 1997, Nature.

[34]  T. Henkin,et al.  The S box regulon: a new global transcription termination control system for methionine and cysteine biosynthesis genes in Gram‐positive bacteria , 1998, Molecular microbiology.

[35]  C. Lawrence,et al.  Human-mouse genome comparisons to locate regulatory sites , 2000, Nature Genetics.

[36]  P. Brown,et al.  Exploring the metabolic and genetic control of gene expression on a genomic scale. , 1997, Science.

[37]  J. Liu,et al.  Phylogenetic footprinting of transcription factor binding sites in proteobacterial genomes. , 2001, Nucleic acids research.