3D Complex: A Structural Classification of Protein Complexes

Most of the proteins in a cell assemble into complexes to carry out their function. It is therefore crucial to understand the physicochemical properties as well as the evolution of interactions between proteins. The Protein Data Bank represents an important source of information for such studies, because more than half of the structures are homo- or heteromeric protein complexes. Here we propose the first hierarchical classification of whole protein complexes of known 3-D structure, based on representing their fundamental structural features as a graph. This classification provides the first overview of all the complexes in the Protein Data Bank and allows nonredundant sets to be derived at different levels of detail. This reveals that between one-half and two-thirds of known structures are multimeric, depending on the level of redundancy accepted. We also analyse the structures in terms of the topological arrangement of their subunits and find that they form a small number of arrangements compared with all theoretically possible ones. This is because most complexes contain four subunits or less, and the large majority are homomeric. In addition, there is a strong tendency for symmetry in complexes, even for heteromeric complexes. Finally, through comparison of Biological Units in the Protein Data Bank with the Protein Quaternary Structure database, we identified many possible errors in quaternary structure assignments. Our classification, available as a database and Web server at http://www.3Dcomplex.org, will be a starting point for future work aimed at understanding the structure and evolution of protein complexes.

[1]  Sarah A Teichmann,et al.  Novel specificities emerge by stepwise duplication of functional modules. , 2005, Genome research.

[2]  W. Pearson Rapid and sensitive sequence comparison with FASTP and FASTA. , 1990, Methods in enzymology.

[3]  J. Janin,et al.  A dissection of specific and non-specific protein-protein interfaces. , 2004, Journal of molecular biology.

[4]  Sarah A Teichmann,et al.  The origins and evolution of functional modules: lessons from protein complexes , 2006, Philosophical Transactions of the Royal Society B: Biological Sciences.

[5]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[6]  Robert D. Finn,et al.  iPfam: visualization of protein?Cprotein interactions in PDB at domain and amino acid resolutions , 2005, Bioinform..

[7]  Robert B Sim,et al.  Disease-associated Mutations in Human Mannose-binding Lectin Compromise Oligomerization and Activity of the Final Protein* , 2004, Journal of Biological Chemistry.

[8]  Lindsay Sawyer,et al.  The two types of 3-dehydroquinase have distinct structures but catalyze the same overall reaction , 1999, Nature Structural Biology.

[9]  A. Murzin,et al.  Evolution of protein fold in the presence of functional constraints. , 2006, Current Opinion in Structural Biology.

[10]  Ryan Day,et al.  A consensus view of fold space: Combining SCOP, CATH, and the Dali Domain Dictionary , 2003, Protein science : a publication of the Protein Society.

[11]  Fred P. Davis,et al.  PIBASE: a comprehensive database of structurally defined protein interfaces , 2005, Bioinform..

[12]  Jacques Monod,et al.  Allosteric Proteins and Cellular Control Systems , 1989 .

[13]  J M Thornton,et al.  Conservation helps to identify biologically relevant crystal contacts. , 2001, Journal of molecular biology.

[14]  Philip E. Bourne,et al.  Functional Coverage of the Human Genome by Existing Structures, Structural Genomics Targets, and Homology Models , 2005, PLoS Comput. Biol..

[15]  B. Alberts The Cell as a Collection of Protein Machines: Preparing the Next Generation of Molecular Biologists , 1998, Cell.

[16]  Robert B. Russell,et al.  3did: interacting protein domains of known three-dimensional structure , 2004, Nucleic Acids Res..

[17]  I. Ispolatov,et al.  Binding properties and evolution of homodimers in protein–protein interaction networks , 2005, Nucleic acids research.

[18]  Cyrus Chothia,et al.  Divergence of interdomain geometry in two-domain proteins. , 2006, Structure.

[19]  Janet M. Thornton,et al.  Automatic inference of protein quaternary structure from crystals , 2003 .

[20]  Patrice Koehl,et al.  The ASTRAL Compendium in 2004 , 2003, Nucleic Acids Res..

[21]  Peter G Wolynes,et al.  A survey of flexible protein binding mechanisms and their transition states using native topology based energy landscapes. , 2005, Journal of molecular biology.

[22]  R. Russell,et al.  The relationship between sequence and interaction divergence in proteins. , 2003, Journal of molecular biology.

[23]  D. Goeddel,et al.  TNF-R1 Signaling: A Beautiful Pathway , 2002, Science.

[24]  P. Bork,et al.  Proteome survey reveals modularity of the yeast cell machinery , 2006, Nature.

[25]  Saraswathi Vishveshwara,et al.  Oligomeric protein structure networks: insights into protein-protein interactions , 2005, BMC Bioinformatics.

[26]  A J Olson,et al.  Structural symmetry and protein function. , 2000, Annual review of biophysics and biomolecular structure.

[27]  Peter J Tonge,et al.  Crystal Structure of Mycobacterium tuberculosis MenB, a Key Enzyme in Vitamin K2 Biosynthesis* , 2003, Journal of Biological Chemistry.

[28]  K Schulten,et al.  VMD: visual molecular dynamics. , 1996, Journal of molecular graphics.

[29]  R. Liddington,et al.  Dimer formation drives the activation of the cell death protease caspase 9 , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[30]  Ron D. Appel,et al.  ExPASy: the proteomics server for in-depth protein knowledge and analysis , 2003, Nucleic Acids Res..

[31]  S. Beychok,et al.  Probes of subunit assembly and reconstitution pathways in multisubunit proteins. , 1979, Annual review of biochemistry.

[32]  A. Holmgren,et al.  Folding of human superoxide dismutase: disulfide reduction prevents dimerization and produces marginally stable monomers. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[33]  M Vijayan,et al.  Variability in quaternary association of proteins with the same tertiary fold: A case study and rationalization involving legume lectins , 1999, Proteins.

[34]  A G Murzin,et al.  SCOP: a structural classification of proteins database for the investigation of sequences and structures. , 1995, Journal of molecular biology.

[35]  H. Wolfson,et al.  Studies of protein‐protein interfaces: A statistical analysis of the hydrophobic effect , 1997, Protein science : a publication of the Protein Society.

[36]  David C. Jones,et al.  CATH--a hierarchic classification of protein domain structures. , 1997, Structure.

[37]  R. Ozawa,et al.  A comprehensive two-hybrid analysis to explore the yeast protein interactome , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[38]  R. Huber,et al.  Crystal structure of the 20S proteasome from the archaeon T. acidophilum at 3.4 A resolution. , 1995, Science.

[39]  C. Chothia,et al.  The Packing Density in Proteins: Standard Radii and Volumes , 1999 .

[40]  J. Berger,et al.  Structure of the Rho Transcription Terminator Mechanism of mRNA Recognition and Helicase Loading , 2003, Cell.

[41]  Michael Schroeder,et al.  SCOPPI: a structural classification of protein–protein interfaces , 2005, Nucleic Acids Res..

[42]  T. Earnest,et al.  From words to literature in structural proteomics , 2003, Nature.

[43]  J. Thornton,et al.  PQS: a protein quaternary structure file server. , 1998, Trends in biochemical sciences.

[44]  Denise Gorse,et al.  Morphological aspects of oligomeric protein structures. , 2005, Progress in biophysics and molecular biology.

[45]  Burkhard Rost,et al.  Target space for structural genomics revisited , 2002, Bioinform..

[46]  Kyungsook Han,et al.  PSIbase: a database of Protein Structural Interactome map (PSIMAP) , 2005, Bioinform..

[47]  Nils J. Nilsson,et al.  Principles of Artificial Intelligence , 1980, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[48]  Emden R. Gansner,et al.  Graphviz - Open Source Graph Drawing Tools , 2001, GD.

[49]  C. Chothia,et al.  Principles of protein–protein recognition , 1975, Nature.