Comprehensive in silico mutagenesis highlights functionally important residues in proteins

MOTIVATION Mutating residues into alanine (alanine scanning) is one of the fastest experimental means of probing hypotheses about protein function. Alanine scans can reveal functional hot spots, i.e. residues that alter function upon mutation. In vitro mutagenesis is cumbersome and costly: probing all residues in a protein is typically as impossible as substituting by all non-native amino acids. In contrast, such exhaustive mutagenesis is feasible in silico. RESULTS Previously, we developed SNAP to predict functional changes due to non-synonymous single nucleotide polymorphisms. Here, we applied SNAP to all experimental mutations in the ASEdb database of alanine scans; we identi.ed 70% of the hot spots (>or=1 kCal/mol change in binding energy); more severe changes were predicted more accurately. Encouraged, we carried out a complete all-against-all in silico mutagenesis for human glucokinase. Many of the residues predicted as functionally important have indeed been con.rmed in the literature, others await experimental veri.cation, and our method is ready to aid in the design of in vitro mutagenesis. AVAILABILITY ASEdb and glucokinase scores are available at http://www.rostlab.org/services/SNAP. For submissions of large/whole proteins for processing please contact the author.

[1]  D. Eisenberg,et al.  Protein function in the post-genomic era , 2000, Nature.

[2]  Zhilei Chen,et al.  A highly sensitive selection method for directed evolution of homing endonucleases , 2005, Nucleic acids research.

[3]  A. Ménez,et al.  Characterization of the Functional Epitope on the Urokinase Receptor , 2006, Journal of Biological Chemistry.

[4]  B. Rost,et al.  Conservation and prediction of solvent accessibility in protein families , 1994, Proteins.

[5]  M. Stoffel,et al.  Nonsense mutation in the glucokinase gene causes early-onset non-insulin-dependent diabetes mellitus , 1992, Nature.

[6]  B. Rost,et al.  SNAP: predict effect of non-synonymous polymorphisms on function , 2007, Nucleic acids research.

[7]  B. Rost How to Use Protein 1- D Structure Predicted by PROFphd , 2005 .

[8]  Motonori Ota,et al.  The Protein Mutant Database , 1999, Nucleic Acids Res..

[9]  K. Nishikawa,et al.  Constructing a protein mutant database. , 1994, Protein engineering.

[10]  C. Epstein,et al.  Role of the Amino-Acid ‘Code’ and of Selection for Conformation in the Evolution of Proteins , 1966, Nature.

[11]  A. Bogan,et al.  Anatomy of hot spots in protein interfaces. , 1998, Journal of molecular biology.

[12]  M. Inouye,et al.  Cysteine-Scanning Analysis of the Dimerization Domain of EnvZ, an Osmosensing Histidine Kinase , 2003, Journal of bacteriology.

[13]  Rolf Apweiler,et al.  The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000 , 2000, Nucleic Acids Res..

[14]  L. Pauling,et al.  Evolutionary Divergence and Convergence in Proteins , 1965 .

[15]  H. Wolfson,et al.  Spatial chemical conservation of hot spot interactions in protein-protein complexes , 2007, BMC Biology.

[16]  Burkhard Rost,et al.  ISIS: interaction sites identified from sequence , 2007, Bioinform..

[17]  V. Bryson,et al.  Evolving Genes and Proteins. , 1965, Science.

[18]  L. Serrano,et al.  Predicting changes in the stability of proteins and protein complexes: a study of more than 1000 mutations. , 2002, Journal of molecular biology.

[19]  F M Matschinsky,et al.  Familial hyperinsulinism caused by an activating glucokinase mutation. , 1998, The New England journal of medicine.

[20]  G. Weiss,et al.  Rapid mapping of protein functional epitopes by combinatorial alanine scanning. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[21]  James R Horn,et al.  Shotgun Alanine Scanning Shows That Growth Hormone Can Bind Productively to Its Receptor through a Drastically Minimized Interface* , 2005, Journal of Biological Chemistry.

[22]  John M. Walker,et al.  The Proteomics Protocols Handbook , 2005, Humana Press.

[23]  A Yamaguchi,et al.  Cysteine‐scanning mutagenesis around transmembrane segment VI of Tn10‐encoded metal‐tetracycline/H+ antiporter , 1999, FEBS letters.

[24]  K. Georgeson,et al.  Molecular determination of agouti-related protein binding to human melanocortin-4 receptor. , 2003, Molecular pharmacology.

[25]  B. Rost PHD: predicting one-dimensional protein structure by profile-based neural networks. , 1996, Methods in enzymology.

[26]  W. Delano Unraveling hot spots in binding interfaces: progress and challenges. , 2002, Current opinion in structural biology.

[27]  Cathy H. Wu,et al.  The Universal Protein Resource (UniProt) , 2006, Nucleic Acids Research.

[28]  Zhimin Xiang,et al.  Pharmacological characterization of 40 human melanocortin-4 receptor polymorphisms with the endogenous proopiomelanocortin-derived agonists and the agouti-related protein (AGRP) antagonist. , 2006, Biochemistry.

[29]  Steven Henikoff,et al.  SIFT: predicting amino acid changes that affect protein function , 2003, Nucleic Acids Res..

[30]  S. Davis,et al.  The role of charged residues mediating low affinity protein-protein recognition at the cell surface by CD2. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[31]  Holger Gohlke,et al.  Targeting protein-protein interactions with small molecules: challenges and perspectives for computational binding epitope detection and ligand finding. , 2006, Current medicinal chemistry.

[32]  D. Baker,et al.  A simple physical model for binding energy hot spots in protein–protein complexes , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[33]  Peng Yue,et al.  SNPs3D: Candidate gene and SNP selection for association studies , 2006, BMC Bioinformatics.

[34]  Sidney W. Fox,et al.  CHAPTER 5 – Protein Molecules: Intraspecific and Interspecific Variations , 1962 .

[35]  R Shapiro,et al.  Site-directed mutagenesis of histidine-13 and histidine-114 of human angiogenin. Alanine derivatives inhibit angiogenin-induced angiogenesis. , 1989, Biochemistry.

[36]  Klaus Brusgaard,et al.  The second activating glucokinase mutation (A456V): implications for glucose homeostasis and diabetes therapy. , 2002, Diabetes.

[37]  Burkhard Rost,et al.  Protein–Protein Interaction Hotspots Carved into Sequences , 2007, PLoS Comput. Biol..

[38]  P. Bork,et al.  Human non-synonymous SNPs: server and survey. , 2002, Nucleic acids research.

[39]  T. Clackson,et al.  A hot spot of binding energy in a hormone-receptor interface , 1995, Science.

[40]  Kurt S. Thorn,et al.  ASEdb: a database of alanine mutations and their effects on the free energy of binding in protein interactions , 2001, Bioinform..

[41]  K. Sharp,et al.  Protein folding and association: Insights from the interfacial and thermodynamic properties of hydrocarbons , 1991, Proteins.

[42]  M. Pantoliano,et al.  Identification and concerted function of two receptor binding surfaces on basic fibroblast growth factor required for mitogenesis. , 1994, The Journal of biological chemistry.

[43]  Teruyuki Nishimura,et al.  Structural basis for allosteric regulation of the monomeric allosteric enzyme human glucokinase. , 2004, Structure.