REMap: Operon map of M. tuberculosis based on RNA sequence data.

A map of the transcriptional organization of genes of an organism is a basic tool that is necessary to understand and facilitate a more accurate genetic manipulation of the organism. Operon maps are largely generated by computational prediction programs that rely on gene conservation and genome architecture and may not be physiologically relevant. With the widespread use of RNA sequencing (RNAseq), the prediction of operons based on actual transcriptome sequencing rather than computational genomics alone is much needed. Here, we report a validated operon map of Mycobacterium tuberculosis, developed using RNAseq data from both the exponential and stationary phases of growth. At least 58.4% of M. tuberculosis genes are organized into 749 operons. Our prediction algorithm, REMap (RNA Expression Mapping of operons), considers the many cases of transcription coverage of intergenic regions, and avoids dependencies on functional annotation and arbitrary assumptions about gene structure. As a result, we demonstrate that REMap is able to more accurately predict operons, especially those that contain long intergenic regions or functionally unrelated genes, than previous operon prediction programs. The REMap algorithm is publicly available as a user-friendly tool that can be readily modified to predict operons in other bacteria.

[1]  Adamandia Kapopoulou,et al.  TubercuList--10 years after. , 2011, Tuberculosis.

[2]  Olga T. Schubert,et al.  Erratum to Genome-wide mapping of transcriptional start sites defines an extensive leaderless transcriptome in Mycobacterium tuberculosis [Cell Reports 5, (2013) 1121-1131] , 2014 .

[3]  Patrick Deschavanne,et al.  Horizontal transfer of a virulence operon to the ancestor of Mycobacterium tuberculosis. , 2006, Molecular biology and evolution.

[4]  A M Dannenberg,et al.  Immunopathogenesis of pulmonary tuberculosis. , 1993, Hospital practice.

[5]  A. Profumo,et al.  The Mycobacterium tuberculosis Rv2358-furB operon is induced by zinc. , 2004, Research in microbiology.

[6]  B. Gicquel,et al.  Analysis of the phthiocerol dimycocerosate locus of Mycobacterium tuberculosis. Evidence that this lipid is involved in the cell wall permeability barrier. , 2001, The Journal of biological chemistry.

[7]  W. Jacobs,et al.  Site-specific integration of mycobacteriophage L5: integration-proficient vectors for Mycobacterium smegmatis, Mycobacterium tuberculosis, and bacille Calmette-Guérin. , 1991, Proceedings of the National Academy of Sciences of the United States of America.

[8]  S. Salzberg,et al.  Prediction of operons in microbial genomes. , 2001, Nucleic acids research.

[9]  Dirk Schnappinger,et al.  Inhibition of Respiration by Nitric Oxide Induces a Mycobacterium tuberculosis Dormancy Program , 2003, The Journal of experimental medicine.

[10]  S. Cole,et al.  Genome‐wide regulon and crystal structure of BlaI (Rv1846c) from Mycobacterium tuberculosis , 2009, Molecular microbiology.

[11]  C. Walsh,et al.  Identification of a Mycobacterium tuberculosis gene cluster encoding the biosynthetic enzymes for assembly of the virulence-conferring siderophore mycobactin. , 1998, Chemistry & biology.

[12]  Alimuddin Zumla,et al.  WHO's 2013 global report on tuberculosis: successes, threats, and opportunities , 2013, The Lancet.

[13]  B. Gicquel,et al.  A Mycobacterium tuberculosis operon encoding ESAT-6 and a novel low-molecular-mass culture filtrate protein (CFP-10). , 1998, Microbiology.

[14]  Kristin Reiche,et al.  The primary transcriptome of the major human pathogen Helicobacter pylori , 2010, Nature.

[15]  E. Brown,et al.  A Mycobacterial Operon Essential for Virulence In Vivo and Invasion and Intracellular Persistence in Macrophages , 2006, Infection and Immunity.

[16]  A. Cataldi,et al.  The gene encoding P27 lipoprotein and a putative antibiotic-resistance gene form an operon in Mycobacterium tuberculosis and Mycobacterium bovis. , 2000, Microbiology.

[17]  P. Roback,et al.  A predicted operon map for Mycobacterium tuberculosis , 2007, Nucleic acids research.

[18]  Ioannis Xenarios,et al.  Database resources for the tuberculosis community. , 2013, Tuberculosis.

[19]  F. Bange,et al.  Polymorphic Nucleotide within the Promoter of Nitrate Reductase (NarGHJI) Is Specific for Mycobacterium tuberculosis , 2003, Journal of Clinical Microbiology.

[20]  R. Rutherford,et al.  The transcriptome of Mycobacterium tuberculosis , 2010, Applied Microbiology and Biotechnology.

[21]  S. Salzberg,et al.  Whole-Genome Comparison of Mycobacterium tuberculosis Clinical and Laboratory Strains , 2002, Journal of bacteriology.

[22]  Sarita Ranjan,et al.  MycoperonDB: a database of computationally identified operons and transcriptional units in Mycobacteria , 2006, BMC Bioinformatics.

[23]  C. Locht,et al.  Identification of novel intergenic repetitive units in a mycobacterial two‐component system operon , 1997, Molecular microbiology.

[24]  Weidong Tian,et al.  Revealing of Mycobacterium marinum Transcriptome by RNA-seq , 2013, PloS one.

[25]  M. Bashyam,et al.  A study of mycobacterial transcriptional apparatus: identification of novel features in promoter elements , 1996, Journal of bacteriology.

[26]  N. C. Gey van Pittius,et al.  The complex architecture of mycobacterial promoters. , 2013, Tuberculosis.

[27]  Jaya Sivaswami Tyagi,et al.  Transcription and autoregulation of the Rv3134c-devR-devS operon of Mycobacterium tuberculosis. , 2005, Microbiology.

[28]  T. Parish,et al.  The Critical Role of embC in Mycobacterium tuberculosis , 2008, Journal of bacteriology.

[29]  W. Jacobs,et al.  inhA, a gene encoding a target for isoniazid and ethionamide in Mycobacterium tuberculosis. , 1994, Science.

[30]  Molecular analysis of Mycobacterium tuberculosis phosphate specific transport system in Mycobacterium smegmatis. Characterization of recombinant 38 kDa (PstS-1). , 2001, Microbial pathogenesis.

[31]  S. Cole,et al.  Comparative and functional genomics of the Mycobacterium tuberculosis complex. , 2002, Microbiology.

[32]  Y. Akhter,et al.  Clusters of PE and PPE genes of Mycobacterium tuberculosis are organized in operons: Evidence that PE Rv2431c is co‐transcribed with PPE Rv2430c and their gene products interact with each other , 2006, FEBS letters.

[33]  G. Riccardi,et al.  Rv2686c-Rv2687c-Rv2688c, an ABC Fluoroquinolone Efflux Pump in Mycobacterium tuberculosis , 2004, Antimicrobial Agents and Chemotherapy.

[34]  Peter D. Karp,et al.  Using functional and organizational information to improve genome-wide computational prediction of transcription units on pathway-genome databases , 2004, Bioinform..

[35]  J. O'Brien,et al.  Genetic Organization of the Region Encoding Regulation, Biosynthesis, and Transport of Rhizobactin 1021, a Siderophore Produced by Sinorhizobium meliloti , 2001, Journal of bacteriology.

[36]  G. Schoolnik,et al.  Mycobacterium tuberculosis gene expression during adaptation to stationary phase and low-oxygen dormancy. , 2004, Tuberculosis.

[37]  W. Bishai,et al.  The aerosol rabbit model of TB latency, reactivation and immune reconstitution inflammatory syndrome. , 2008, Tuberculosis.

[38]  Nalin Rastogi,et al.  Molecular Evolutionary History of Tubercle Bacilli Assessed by Study of the Polymorphic Nucleotide within the Nitrate Reductase (narGHJI) Operon Promoter , 2005, Journal of Clinical Microbiology.

[39]  N. Casali,et al.  Regulation of the Mycobacterium tuberculosis mce1 Operon , 2006, Journal of bacteriology.

[40]  T. Elliott,et al.  Transcriptional control of the nuo operon which encodes the energy-conserving NADH dehydrogenase of Salmonella typhimurium , 1995, Journal of bacteriology.

[41]  J. Betts,et al.  Evaluation of a nutrient starvation model of Mycobacterium tuberculosis persistence by gene and protein expression profiling , 2002, Molecular microbiology.

[42]  Shruti Jain,et al.  mymA operon of Mycobacterium tuberculosis: its regulation and importance in the cell envelope. , 2003, FEMS microbiology letters.

[43]  Temple F. Smith,et al.  Operons in Escherichia coli: genomic analyses and predictions. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[44]  Olga T. Schubert,et al.  Genome-wide Mapping of Transcriptional Start Sites Defines an Extensive Leaderless Transcriptome in Mycobacterium tuberculosis , 2014, Cell Reports.

[45]  Ying Xu,et al.  DOOR: a database for prokaryotic operons , 2008, Nucleic Acids Res..

[46]  Hadley Wickham,et al.  ggplot2 - Elegant Graphics for Data Analysis (2nd Edition) , 2017 .

[47]  I. Onorato,et al.  An outbreak involving extensive transmission of a virulent strain of Mycobacterium tuberculosis. , 1998, The New England journal of medicine.

[48]  B. Tjaden,et al.  Computational analysis of bacterial RNA-Seq data , 2013, Nucleic acids research.

[49]  James A. Raleigh,et al.  Tuberculous Granulomas Are Hypoxic in Guinea Pigs, Rabbits, and Nonhuman Primates , 2008, Infection and Immunity.