Recurring cluster and operon assembly for Phenylacetate degradation genes

BackgroundA large number of theories have been advanced to explain why genes involved in the same biochemical processes are often co-located in genomes. Most of these theories have been dismissed because empirical data do not match the expectations of the models. In this work we test the hypothesis that cluster formation is most likely due to a selective pressure to gradually co-localise protein products and that operon formation is not an inevitable conclusion of the process.ResultsWe have selected an exemplar well-characterised biochemical pathway, the phenylacetate degradation pathway, and we show that its complex history is only compatible with a model where a selective advantage accrues from moving genes closer together. This selective pressure is likely to be reasonably weak and only twice in our dataset of 102 genomes do we see independent formation of a complete cluster containing all the catabolic genes in the pathway. Additionally, de novo clustering of genes clearly occurs repeatedly, even though recombination should result in the random dispersal of such genes in their respective genomes. Interspecies gene transfer has frequently replaced in situ copies of genes resulting in clusters that have similar content but very different evolutionary histories.ConclusionOur model for cluster formation in prokaryotes, therefore, consists of a two-stage selection process. The first stage is selection to move genes closer together, either because of macromolecular crowding, chromatin relaxation or transcriptional regulation pressure. This proximity opportunity sets up a separate selection for co-transcription.

[1]  J. Lawrence Selfish operons and speciation by gene transfer. , 1997, Trends in microbiology.

[2]  C. MacCluer,et al.  A metabolic force for gene clustering , 2004, Bulletin of mathematical biology.

[3]  M. Demerec,et al.  Complex Loci in Microorganisms , 1959 .

[4]  J. Spieth,et al.  Operons in C. elegans: Polycistronic mRNA precursors are processed by trans-splicing of SL2 to downstream coding regions , 1993, Cell.

[5]  Pietro Liò,et al.  The Origin and Evolution of Operons: The Piecewise Building of the Proteobacterial Histidine Operon , 2005, Journal of Molecular Evolution.

[6]  O. Gascuel,et al.  A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood. , 2003, Systematic biology.

[7]  Thomas J Naughton,et al.  Assessment of methods for amino acid matrix selection and their use on empirical data shows that ad hoc assumptions for choice of matrix are not justified , 2006, BMC Evolutionary Biology.

[8]  Csaba Pál,et al.  Evidence against the selfish operon theory. , 2004, Trends in genetics : TIG.

[9]  George E. Fox,et al.  Conserved Gene Clusters in Bacterial Genomes Provide Further Support for the Primacy of RNA , 1997, Journal of Molecular Evolution.

[10]  E. R. Olivera,et al.  The phenylacetyl‐CoA catabolon: a complex catabolic unit with broad biotechnological applications , 2001, Molecular microbiology.

[11]  E. Díaz,et al.  Molecular characterization of the phenylacetic acid catabolic pathway in Pseudomonas putida U: the phenylacetyl-CoA catabolon. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[12]  E. Díaz,et al.  Catabolism of Phenylacetic Acid in Escherichia coli , 1998, The Journal of Biological Chemistry.

[13]  Eric J Alm,et al.  Correction: The Life-Cycle of Operons , 2006, PLoS Genetics.

[14]  P. R. ten Wolde,et al.  Statistical analysis of the spatial distribution of operons in the transcriptional regulation network of Escherichia coli. , 2003, Journal of molecular biology.

[15]  Simon Wong,et al.  Birth of a metabolic gene cluster in yeast by adaptive gene relocation , 2005, Nature Genetics.

[16]  Katherine H. Huang,et al.  Operon formation is driven by co-regulation and not by horizontal gene transfer. , 2005, Genome research.

[17]  T. Blumenthal Trans-splicing and polycistronic transcription in Caenorhabditis elegans. , 1995, Trends in genetics : TIG.

[18]  Mark Wilkinson,et al.  Of clades and clans: terms for phylogenetic relationships in unrooted trees. , 2007, Trends in ecology & evolution.

[19]  Robert C. Edgar,et al.  MUSCLE: a multiple sequence alignment method with reduced time and space complexity , 2004, BMC Bioinformatics.

[20]  A. Arkin,et al.  The Life-Cycle of Operons , 2006, PLoS genetics.

[21]  J. Felsenstein Phylogenies from molecular sequences: inference and reliability. , 1988, Annual review of genetics.

[22]  Hanah Margalit,et al.  Chromosomal organization is shaped by the transcription regulatory network. , 2005, Trends in genetics : TIG.

[23]  David L. Wheeler,et al.  GenBank , 2015, Nucleic Acids Res..

[24]  J. Monod,et al.  [Operon: a group of genes with the expression coordinated by an operator]. , 1960, Comptes rendus hebdomadaires des seances de l'Academie des sciences.

[25]  B. Snel,et al.  Conservation of gene order: a fingerprint of proteins that physically interact. , 1998, Trends in biochemical sciences.

[26]  R. Ellis,et al.  Macromolecular crowding: an important but neglected aspect of the intracellular environment. , 2001, Current opinion in structural biology.

[27]  E. Koonin,et al.  Evolution of mosaic operons by horizontal gene transfer and gene displacement in situ , 2003, Genome Biology.

[28]  J. Thompson,et al.  CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. , 1994, Nucleic acids research.

[29]  J. McInerney,et al.  Fatty acid biosynthesis in Mycobacterium tuberculosis: Lateral gene transfer, adaptive evolution, and gene duplication , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[30]  M. Elowitz,et al.  Protein Mobility in the Cytoplasm ofEscherichia coli , 1999, Journal of bacteriology.

[31]  Davide Pisani,et al.  Supertrees disentangle the chimerical origin of eukaryotic genomes. , 2007, Molecular biology and evolution.

[32]  Wolfgang Eisenreich,et al.  Functional genomics by NMR spectroscopy. Phenylacetate catabolism in Escherichia coli. , 2003, European journal of biochemistry.