Amino acids determining enzyme-substrate specificity in prokaryotic and eukaryotic protein kinases

The binding between a protein kinase (PK) and its target is highly specific, even though many different PKs share significant sequence and structural homology. There must therefore be specificity-determining residues (SDRs) that enable different PKs to recognize their unique substrates. Here we use and further develop a computational procedure to discover putative SDRs (PSDRs) in protein families. A family of homologous proteins is split into orthologous proteins, which are assumed to have the same specificity, and paralogous proteins, which have different specificities. We reason that PSDRs must be similar among orthologs but different among paralogs. Our statistical procedure and evolutionary model identify such residues by discriminating a functional signal from a phylogenetic one. As case studies we investigate the prokaryotic two-component system and the eukaryotic AGC PKs (i.e., cAMP-dependent PK, cGMP-dependent PK, and PKC). Without using experimental data, we predict PSDRs in prokaryotic and eukaryotic PKs and suggest precise mutations that may convert the specificity of one PK to that of another. We compare our predictions with current experimental results and obtain considerable agreement. Our analysis unifies much of the existing data on PK specificity. Finally, we find PSDRs that lie outside the active site. Based on our results, as well as structural and biochemical characterizations of eukaryotic PKs, we propose the testable hypothesis of “specificity via differential activation” as a way for the cell to control kinase specificity.
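
The ortholog/paralog reasoning above can be illustrated with a small sketch. The abstract does not name the scoring function, so the use of mutual information between residue identity and specificity group is an assumption made here for illustration, as are the function names `column_mutual_information` and `rank_columns` and the toy alignment. Sequences within one group stand in for orthologs (same specificity); different groups stand in for paralogs.

```python
from collections import Counter
from math import log2


def column_mutual_information(column, groups):
    """Mutual information (in bits) between residue and group label.

    column: residues observed at one alignment position, one per sequence.
    groups: specificity-group label for each sequence (same order).
    Assumption: mutual information is used as the score; the abstract
    does not state which statistic the authors use.
    """
    n = len(column)
    residue_counts = Counter(column)
    group_counts = Counter(groups)
    joint_counts = Counter(zip(column, groups))
    mi = 0.0
    for (residue, group), count in joint_counts.items():
        p_joint = count / n
        # p_joint / (p_residue * p_group), written with raw counts
        ratio = count * n / (residue_counts[residue] * group_counts[group])
        mi += p_joint * log2(ratio)
    return mi


def rank_columns(alignment, groups):
    """Return (score, column_index) pairs, highest score first."""
    columns = zip(*alignment)  # transpose rows (sequences) into columns
    scores = [
        (column_mutual_information(col, groups), i)
        for i, col in enumerate(columns)
    ]
    return sorted(scores, reverse=True)


# Hypothetical toy alignment: four sequences, three aligned positions.
alignment = [
    "GKA",  # group A
    "GKS",  # group A
    "GRA",  # group B
    "GRS",  # group B
]
groups = ["A", "A", "B", "B"]

for score, index in rank_columns(alignment, groups):
    print(f"column {index}: {score:.2f} bits")
```

In this toy case, column 1 (K in group A, R in group B) is conserved within each group and differs between groups, so it scores highest. Column 0 is conserved everywhere and column 2 varies independently of the grouping, so both score zero. The sketch omits the step the abstract describes as discriminating a functional signal from a phylogenetic one; a raw score like this would also be high for positions that differ between groups only because of shared ancestry.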
