How many genes can make a cell: the minimal-gene-set concept.

Several theoretical and experimental studies have endeavored to derive the minimal set of genes that are necessary and sufficient to sustain a functioning cell under ideal conditions, that is, in the presence of unlimited amounts of all essential nutrients and in the absence of any adverse factors, including competition. A comparison of the first two completed bacterial genomes, those of the parasites Haemophilus influenzae and Mycoplasma genitalium, produced a version of the minimal gene set consisting of approximately 250 genes. Very similar estimates were obtained by analyzing viable gene knockouts in Bacillus subtilis, M. genitalium, and Mycoplasma pneumoniae. With the accumulation and comparison of multiple complete genome sequences, it became clear that only approximately 80 genes of the 250 in the original minimal gene set are represented by orthologs in all life forms. For approximately 15% of the genes from the minimal gene set, viable knockouts were obtained in M. genitalium; unexpectedly, these included even some of the universal genes. Thus, some of the genes that were included in the first version of the minimal gene set, based on a limited genome comparison, could be, in fact, dispensable. The majority of these genes, however, are likely to encode essential functions but, in the course of evolution, are subject to nonorthologous gene displacement, that is, recruitment of unrelated or distantly related proteins for the same function. Further theoretical and experimental studies within the framework of the minimal-gene-set concept and the ultimate construction of a minimal genome are expected to advance our understanding of the basic principles of cell functioning by systematically detecting nonorthologous gene displacement and deciphering the roles of essential but functionally uncharacterized genes.

[1]  Manuel Peitsch,et al.  A genome-based approach for the identification of essential bacterial genes , 1998, Nature Biotechnology.

[2]  T. Traut,et al.  A minimal gene set for cellular life derived by comparison of complete bacterial genomes , 1998 .

[3]  R. Lo,et al.  O-sialoglycoprotease from Pasteurella haemolytica. , 1995, Methods in enzymology.

[4]  J. Logsdon,et al.  Thermotoga heats up lateral gene transfer. , 1999, Current biology : CB.

[5]  A. Mushegian,et al.  The minimal genome concept. , 1999, Current opinion in genetics & development.

[6]  Michael Y. Galperin,et al.  Comparison of archaeal and bacterial genomes: computer analysis of protein sequences predicts novel functions and suggests a chimeric origin for the archaea , 1997, Molecular microbiology.

[7]  S. Salzberg,et al.  Evidence for lateral gene transfer between Archaea and Bacteria from genome sequence of Thermotoga maritima , 1999, Nature.

[8]  Detlef D. Leipe,et al.  Did DNA replication evolve twice independently? , 1999, Nucleic acids research.

[9]  P. Bork,et al.  Non-orthologous gene displacement. , 1996, Trends in genetics : TIG.

[10]  Michael Y. Galperin,et al.  Beyond complete genomes: from sequence to structure and function. , 1998, Current opinion in structural biology.

[11]  M. Cho,et al.  Policy forum: genetics. Ethical considerations in synthesizing a minimal genome. , 1999, Science.

[12]  E. Koonin,et al.  A Minimal Gene Complement for Cellular Life and Reconstruction of Primitive Life Forms by Analysis of Complete Bacterial Genomes , 1998 .

[13]  T. Gojobori,et al.  Identification of a ribonuclease H gene in both Mycoplasma genitalium and Mycoplasma pneumoniae by a new method for exhaustive identification of ORFs in the complete genome sequences , 1999, FEBS letters.

[14]  L. Orgel,et al.  Phylogenetic Classification and the Universal Tree , 1999 .

[15]  R. Lo,et al.  [47] O-sialoglycoprotease from Pasteurella haemolytica , 1995 .

[16]  W. Doolittle,et al.  Lateral genomics. , 1999, Trends in cell biology.

[17]  U. Hübscher,et al.  Clamping down on clamps and clamp loaders--the eukaryotic replication factor C. , 1998, European journal of biochemistry.

[18]  D. Söll,et al.  Aminoacyl-tRNA synthesis: divergent routes to a common goal. , 1997, Trends in biochemical sciences.

[19]  Mitsuhiro Itaya,et al.  An estimation of minimal genome size required for life , 1995, FEBS letters.

[20]  Nikos Kyrpides,et al.  Universal Protein Families and the Functional Content of the Last Universal Common Ancestor , 1999, Journal of Molecular Evolution.

[21]  E V Koonin,et al.  Complete genome sequences of cellular life forms: glimpses of theoretical evolutionary genomics. , 1996, Current opinion in genetics & development.

[22]  Arthur L. Caplan,et al.  Policy forum: genetics. Ethical considerations in synthesizing a minimal genome. , 1999, Science.

[23]  D. Mills,et al.  An extracellular Darwinian experiment with a self-duplicating nucleic acid molecule. , 1967, Proceedings of the National Academy of Sciences of the United States of America.

[24]  Michael Y. Galperin,et al.  The COG database: a tool for genome-scale analysis of protein functions and evolution , 2000, Nucleic Acids Res..

[25]  T. Gaasterland,et al.  Microbial genescapes: phyletic and functional patterns of ORF distribution among prokaryotes. , 1998, Microbial & comparative genomics.

[26]  E. Koonin,et al.  Gleaning non-trivial structural, functional and evolutionary information about proteins by iterative database searches. , 1999, Journal of molecular biology.

[27]  D. Eisenberg,et al.  Assigning protein functions by comparative genome analysis: protein phylogenetic profiles. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[28]  O. White,et al.  Global transposon mutagenesis and a minimal Mycoplasma genome. , 1999, Science.

[29]  Y. Naot,et al.  Molecular Biology and Pathogenicity of Mycoplasmas , 1998, Microbiology and Molecular Biology Reviews.

[30]  E. Koonin,et al.  Genomics: Re-evaluation of translation machinery evolution , 1998, Current Biology.

[31]  D. Söll,et al.  One polypeptide with two aminoacyl-tRNA synthetase activities. , 2000, Science.

[32]  R. Fleischmann,et al.  Whole-genome random sequencing and assembly of Haemophilus influenzae Rd. , 1995, Science.

[33]  Doolittle Wf Phylogenetic Classification and the Universal Tree , 1999 .

[34]  E V Koonin,et al.  Evidence for massive gene exchange between archaeal and bacterial hyperthermophiles. , 1998, Trends in genetics : TIG.

[35]  M. Kreitman,et al.  Methods to detect selection in populations with applications to the human. , 2000, Annual review of genomics and human genetics.

[36]  W. Fitch Distinguishing homologous from analogous proteins. , 1970, Systematic zoology.

[37]  D. Lipman,et al.  A genomic perspective on protein families. , 1997, Science.

[38]  T Gaasterland,et al.  Constructing multigenome views of whole microbial genomes. , 1998, Microbial & comparative genomics.

[39]  R. Fleischmann,et al.  The Minimal Gene Complement of Mycoplasma genitalium , 1995, Science.