Identifying functional links between genes using conserved chromosomal proximity.

When a pair of genes remains close together on the chromosome across multiple genomes, their functions are likely to be linked. Here, we present a systematic evaluation, using 42 complete microbial genomes from 25 phylogenetic groups, of how reliably this observation predicts gene function. We find a relationship between the number of phylogenetic groups in which a gene pair is proximate and the probability that the pair belongs to a common pathway. Our method produces 1586 links between ortholog families, each substantiated by observed proximity in genomes representing at least three phylogenetic groups. Of the pairs annotated in the KEGG database, 80% belong to the same KEGG biological pathway.
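
The counting step described above can be illustrated with a minimal Python sketch. It is not the authors' implementation: the input layout (a list of `(family_id, position)` tuples per genome), the helper names, and the 300 bp proximity cutoff `MAX_DISTANCE_BP` are all assumptions made for illustration, and chromosome circularity and strand are ignored. Only the rule of requiring support from at least three phylogenetic groups comes from the text.

```python
from collections import defaultdict

# Hypothetical cutoff: two genes count as "proximate" when their positions
# differ by at most this many base pairs. The value is an assumption.
MAX_DISTANCE_BP = 300


def proximate_pairs(genes, max_distance=MAX_DISTANCE_BP):
    """Return ortholog-family pairs that lie close together in one genome.

    `genes` is a list of (family_id, position) tuples for a single genome.
    The chromosome is treated as linear for simplicity.
    """
    pairs = set()
    ordered = sorted(genes, key=lambda g: g[1])
    for i, (fam_a, pos_a) in enumerate(ordered):
        for fam_b, pos_b in ordered[i + 1:]:
            if pos_b - pos_a > max_distance:
                break  # later genes are even farther away
            if fam_a != fam_b:
                pairs.add(tuple(sorted((fam_a, fam_b))))
    return pairs


def count_supporting_groups(genomes, group_of):
    """Count, per family pair, the distinct phylogenetic groups showing proximity.

    Several genomes from the same group contribute only one unit of support.
    """
    groups_by_pair = defaultdict(set)
    for genome_id, genes in genomes.items():
        for pair in proximate_pairs(genes):
            groups_by_pair[pair].add(group_of[genome_id])
    return {pair: len(groups) for pair, groups in groups_by_pair.items()}


def predicted_links(genomes, group_of, min_groups=3):
    """Keep only pairs supported by at least `min_groups` phylogenetic groups."""
    counts = count_supporting_groups(genomes, group_of)
    return {pair: n for pair, n in counts.items() if n >= min_groups}


# Toy data: four genomes drawn from three phylogenetic groups.
genomes = {
    "g1": [("A", 100), ("B", 250), ("C", 5000)],
    "g2": [("A", 10), ("B", 200)],
    "g3": [("B", 1000), ("A", 1100), ("C", 1200)],
    "g4": [("A", 0), ("B", 100)],
}
group_of = {"g1": "P1", "g2": "P1", "g3": "P2", "g4": "P3"}

links = predicted_links(genomes, group_of)
```

In this toy data, families A and B are proximate in all three groups and are retained, while the pairs involving C are seen in only one group and are dropped. Counting groups rather than genomes keeps closely related genomes from inflating the support for a pair.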
