Validating regulatory predictions from diverse bacteria with mutant fitness data

Although transcriptional regulation is fundamental to understanding bacterial physiology, the targets of most bacterial transcription factors are not known. Comparative genomics has been used to identify likely targets of some of these transcription factors, but these predictions typically lack experimental support. Here, we used mutant fitness data, which measures the importance of each gene for a bacterium’s growth across many conditions, to test regulatory predictions from RegPrecise, a curated collection of comparative genomics predictions. Because characterized transcription factors often have correlated fitness with one of their targets (either positively or negatively), correlated fitness patterns provide support for the comparative genomics predictions. At a false discovery rate of 3%, we identified significant cofitness for at least one target of 158 TFs in 107 ortholog groups and from 24 bacteria. Thus, high-throughput genetics can be used to identify a high-confidence subset of the sequence-based regulatory predictions.

[1]  Inna Dubchak,et al.  RegPredict: an integrated system for regulon inference in prokaryotes by comparative genomics approach , 2010, Nucleic Acids Res..

[2]  Robert C. Edgar,et al.  BIOINFORMATICS APPLICATIONS NOTE , 2001 .

[3]  Kathleen Marchal,et al.  COLOMBOS v2.0: an ever expanding collection of bacterial expression compendia , 2013, Nucleic Acids Res..

[4]  Ming Zhang,et al.  Comparing sequences without using alignments: application to HIV/SIV subtyping , 2007, BMC Bioinformatics.

[5]  J. Collins,et al.  Large-Scale Mapping and Validation of Escherichia coli Transcriptional Regulation from a Compendium of Expression Profiles , 2007, PLoS biology.

[6]  W. Boos,et al.  The role of the trehalose system in regulating the maltose regulon of Escherichia coli , 1999, Molecular microbiology.

[7]  V. Wendisch Genome-wide expression analysis in Corynebacterium glutamicum using DNA microarrays. , 2003, Journal of biotechnology.

[8]  Dmitry A Rodionov,et al.  Comparative genomic reconstruction of transcriptional regulatory networks in bacteria. , 2007, Chemical reviews.

[9]  Erin Beck,et al.  TIGRFAMs and Genome Properties in 2013 , 2012, Nucleic Acids Res..

[10]  Charles Elkan,et al.  Fitting a Mixture Model By Expectation Maximization To Discover Motifs In Biopolymer , 1994, ISMB.

[11]  Paramvir S. Dehal,et al.  Systematic mapping of two component response regulators to gene targets in a model sulfate reducing bacterium , 2011, Genome Biology.

[12]  Bor-Sen Chen,et al.  Identifying regulatory targets of cell cycle transcription factors using gene expression and ChIP-chip data , 2007, BMC Bioinformatics.

[13]  Kelly M. Wetmore,et al.  Deep Annotation of Protein Function across Diverse Bacteria from Mutant Phenotypes , 2016 .

[14]  Jeremiah J. Faith,et al.  Many Microbe Microarrays Database: uniformly normalized Affymetrix compendia with structured experimental metadata , 2007, Nucleic Acids Res..

[15]  Anna Lyubetskaya,et al.  ChIP-Seq and the complexity of bacterial transcriptional regulation. , 2013, Current topics in microbiology and immunology.

[16]  Kelly M. Wetmore,et al.  Rapid Quantification of Mutant Fitness in Diverse Bacteria by Sequencing Randomly Bar-Coded Transposons , 2015, mBio.

[17]  Kei-Hoi Cheung,et al.  Advancing translational research with the Semantic Web , 2007, BMC Bioinformatics.

[18]  Adam P. Arkin,et al.  Conservation of Transcription Start Sites within Genes across a Bacterial Genus , 2014, mBio.

[19]  Julio Collado-Vides,et al.  RegulonDB v8.0: omics data sets, evolutionary conservation, regulatory phrases, cross-validated gold standards and more , 2012, Nucleic Acids Res..

[20]  William J. Riehl,et al.  RegPrecise 3.0 – A resource for genome-scale exploration of transcriptional regulation in bacteria , 2013, BMC Genomics.

[21]  E. Pérez-Rueda,et al.  Identification and analysis of DNA-binding transcription factors in Bacillus subtilis and other Firmicutes- a genomic approach , 2006, BMC Genomics.

[22]  G. Stormo,et al.  Identifying protein-binding sites from unaligned DNA fragments. , 1989, Proceedings of the National Academy of Sciences of the United States of America.

[23]  A. Ishihama,et al.  Transcription profile of Escherichia coli: genomic SELEX search for regulatory targets of transcription factors , 2016, Nucleic acids research.

[24]  Adam P. Arkin,et al.  Evidence-Based Annotation of Gene Function in Shewanella oneidensis MR-1 Using Genome-Wide Fitness Profiling across 121 Conditions , 2011, PLoS genetics.

[25]  Sarah A. Teichmann,et al.  Genomic repertoires of DNA-binding transcription factors across the tree of life , 2010, Nucleic acids research.

[26]  J. Collado-Vides,et al.  Identifying global regulators in transcriptional regulatory networks in bacteria. , 2003, Current opinion in microbiology.

[27]  Lee Ann McCue,et al.  Making connections between novel transcription factors and their DNA motifs. , 2005, Genome research.