An examination of the conservation of surface patch polarity for proteins

MOTIVATION The solubility of a protein is crucial for its function and is therefore an evolutionary constraint. As the solubility of a protein is related to the distribution of polar and hydrophobic residues on its solvent accessible surface, such a constraint should provide a valuable insight into the evolution of protein surfaces. We examine how the surfaces of proteins have evolved by considering how the average hydrophobicities of patches of surface residues vary across homologous proteins. We derive distributions for the average hydrophobicity/philicity of surface patches at a residue-based level-which we refer to as the residue hydrophobic density. This is computed for a set of 28 monomeric proteins and their homologues. The resulting distributions are compared with a set of randomized sequences, with the same residue content. RESULTS We find that the patches, involving typically more than 10 residues, maintain a more hydrophilic surface than one would expect from a random substitution model, indicating a cooperative behaviour for these surfaces residues in terms of this single variable. SUPPLEMENTARY INFORMATION Additional plots for all of the proteins examined in this paper can be found at: http://www.ebi.ac.uk/~shanahan/PCon/index.html

[1]  Yinan Wei,et al.  Solution structure of a de novo protein from a designed combinatorial library , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[2]  Maria Jesus Martin,et al.  The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003 , 2003, Nucleic Acids Res..

[3]  G Vriend,et al.  Correlated Mutation Analyses on Very Large Sequence Families , 2002, Chembiochem : a European journal of chemical biology.

[4]  Christopher M. Dobson,et al.  Protein-misfolding diseases: Getting out of shape , 2002, Nature.

[5]  Vincent Lombard,et al.  The EMBL Nucleotide Sequence Database , 2002, Nucleic Acids Res..

[6]  M. Nakasako Large-scale networks of hydration water molecules around proteins investigated by cryogenic X-ray crystallography. , 2001, Cellular and molecular biology.

[7]  Yael Mandel-Gutfreund,et al.  Contributions of residue pairing to β-sheet formation:conservation and covariation of amino acid residue pairs on antiparallel β-strands 1 1 Edited by J. Thornton , 2001 .

[8]  J. Thornton,et al.  Discriminating between homodimeric and monomeric proteins in the crystalline state , 2000, Proteins.

[9]  B. Rost,et al.  Effective use of sequence correlation and conservation in fold recognition. , 1999, Journal of molecular biology.

[10]  R. Ranganathan,et al.  Evolutionarily conserved pathways of energetic connectivity in protein families. , 1999, Science.

[11]  M. S. Chapman,et al.  Critical initial real-space refinement in the structure determination of arginine kinase. , 1999, Acta crystallographica. Section D, Biological crystallography.

[12]  W R Taylor,et al.  Coevolving protein residues: maximum likelihood identification and relationship to structure. , 1999, Journal of molecular biology.

[13]  Janet Sim,et al.  Amino acid substitutions affecting protein solubility: high level expression of Streptomyces clavuligerus isopenicillin N synthase in Escherichia coli , 1999 .

[14]  G S Weston,et al.  Three-dimensional structure of AmpC beta-lactamase from Escherichia coli bound to a transition-state analogue: possible implications for the oxyanion hypothesis and for inhibitor design. , 1998, Biochemistry.

[15]  M. S. Chapman,et al.  Transition state structure of arginine kinase: implications for catalysis of bimolecular reactions. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[16]  W. Jenkins Three solutions of the protein solubility problem. , 1998, Protein science : a publication of the Protein Society.

[17]  W. C. Barker,et al.  The PIR-International Protein Sequence Database. , 1998, Nucleic acids research.

[18]  S. Jones,et al.  Analysis of protein-protein interaction sites using surface patches. , 1997, Journal of molecular biology.

[19]  Thomas L. Madden,et al.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. , 1997, Nucleic acids research.

[20]  A. Valencia,et al.  Correlated mutations contain information about protein-protein interaction. , 1997, Journal of molecular biology.

[21]  C. Sander,et al.  Correlated mutations and residue contacts in proteins , 1994, Proteins.

[22]  C. Sander,et al.  Correlated Mutations and Residue Contacts , 1994 .

[23]  Gert Vriend,et al.  A common motif in G-protein-coupled seven transmembrane helix receptors , 1993, J. Comput. Aided Mol. Des..

[24]  Simon J. Hubbard,et al.  Department of Biochemistry and Molecular Biology , 2006 .

[25]  Catherine H. Schein,et al.  Solubility as a Function of Protein Structure and Solvent Components , 1990, Bio/Technology.

[26]  C. DeLisi,et al.  Hydrophobicity scales and computational techniques for detecting amphipathic structures in proteins. , 1987, Journal of molecular biology.

[27]  T. Arakawa,et al.  Theory of protein solubility. , 1985, Methods in enzymology.

[28]  R. Cramer,et al.  Measurement of correlation of partition coefficients of polar amino acids. , 1981, Molecular pharmacology.