Natural-like function in artificial WW domains

Protein sequences evolve through random mutagenesis with selection for optimal fitness. Cooperative folding into a stable tertiary structure is one aspect of fitness, but evolutionary selection ultimately operates on function, not on structure. In the accompanying paper, we proposed a model for the evolutionary constraint on a small protein interaction module (the WW domain) through application of statistical coupling analysis (SCA), a statistical analysis of multiple sequence alignments. Construction of artificial protein sequences directed only by the SCA showed that the information extracted by this analysis is sufficient to engineer the WW fold at atomic resolution. Here, we demonstrate that these artificial WW sequences function like their natural counterparts, showing class-specific recognition of proline-containing target peptides. Consistent with SCA predictions, a distributed network of residues mediates functional specificity in WW domains. The ability to recapitulate natural-like function in designed sequences shows that a relatively small quantity of sequence information is sufficient to specify the global energetics of amino acid interactions.
