On the Classification and Evolution of Protein Modules

We review our efforts to classify modules, the functional units of many proteins. The data from the sequencing projects for various model organisms are extremely helpful in deducing the evolution of proteins and modules. For example, a dramatic increase in modular proteins can be observed from yeast to C. elegans, in accordance with the new protein functions that had to be introduced in multicellular organisms. Our sequence characterization of modules relies on sensitive similarity search algorithms and on the collection of multiple sequence alignments for each module. To trace the evolution of modules and to further automate the classification, we have developed a sequence alerting system and a module alerting system that check newly arriving sequence data for the presence of already classified modules. Using these systems, we were able to identify an unexpected similarity between extracellular C1Q modules and bacterial proteins.
