Experimental identification of specificity determinants in the domain linker of a LacI/GalR protein: Bioinformatics‐based predictions generate true positives and false negatives

In protein families, conserved residues often contribute to a common general function, such as DNA‐binding. However, unique attributes for each homolog (e.g. recognition of alternative DNA sequences) must arise from variation in other functionally‐important positions. The locations of these “specificity determinant” positions are obscured amongst the background of varied residues that do not make significant contributions to either structure or function. To isolate specificity determinants, a number of bioinformatics algorithms have been developed. When applied to the LacI/GalR family of transcription regulators, several specificity determinants are predicted in the 18 amino acids that link the DNA‐binding and regulatory domains. However, results from alternative algorithms are only in partial agreement with each other. Here, we experimentally evaluate these predictions using an engineered repressor comprising the LacI DNA‐binding domain, the LacI linker, and the GalR regulatory domain (LLhG). “Wild‐type” LLhG has altered DNA specificity and weaker lacO1 repression compared to LacI or a similar LacI:PurR chimera. Next, predictions of linker specificity determinants were tested, using amino acid substitution and in vivo repression assays to assess functional change. In LLhG, all predicted sites are specificity determinants, as well as three sites not predicted by any algorithm. Strategies are suggested for diminishing the number of false negative predictions. Finally, individual substitutions at LLhG specificity determinants exhibited a broad range of functional changes that are not predicted by bioinformatics algorithms. Results suggest that some variants have altered affinity for DNA, some have altered allosteric response, and some appear to have changed specificity for alternative DNA ligands. Proteins 2008. © 2008 Wiley‐Liss, Inc.

[1]  S. Luria,et al.  Transduction of lactose-utilizing ability among strains of E. coli and S. dysenteriae and the properties of the transducing phage particles. , 1960, Virology.

[2]  G. Buttin,et al.  [REGULATORY MECHANISMS IN THE BIOSYNTHESIS OF THE ENZYMES OF GALACTOSE METABOLISM IN ESCHERICHIA COLI K 12. I. THE INDUCED BIOSYNTHESIS OF GALACTOKINASE AND THE SIMULTANEOUS INDUCTION OF THE ENZYMATIC SEQUENCE]. , 1963, Journal of Molecular Biology.

[3]  G. Buttin Mécanismes régulateurs dans la biosynthèse des enzymes du métabolisme du galactose chez Escherichia coli K12: I. La biosynthèse induite de la galactokinase et l'induction simultanée de la séquence enzymatique , 1963 .

[4]  A. Riggs,et al.  lac repressor--operator interaction. II. Effect of galactosides and other ligands. , 1970, Journal of molecular biology.

[5]  Syr-yaung Lin,et al.  Lac Repressor Binding to DNA not containing the Lac Operator and to Synthetic Poly dAT , 1970, Nature.

[6]  W. Gilbert,et al.  The nucleotide sequence of the lac operator. , 1973, Proceedings of the National Academy of Sciences of the United States of America.

[7]  F. Neidhardt,et al.  Culture Medium for Enterobacteria , 1974, Journal of bacteriology.

[8]  A. Riggs,et al.  A comparison of lac repressor binding to operator and to nonoperator DNA. , 1975, Biochemical and biophysical research communications.

[9]  J. Sadler,et al.  A perfectly symmetric lac operator binds the lac repressor very tightly. , 1983, Proceedings of the National Academy of Sciences of the United States of America.

[10]  B. Müller-Hill,et al.  Possible ideal lac operator: Escherichia coli lac operator-like sequences from eukaryotic genomes lack the central G X C pair. , 1984, Proceedings of the National Academy of Sciences of the United States of America.

[11]  G. Stewart,et al.  pHG165: a pBR322 copy number derivative of pUC8 for cloning and expression. , 1986, Plasmid.

[12]  S. Rudikoff,et al.  Purification and properties of Gal repressor:pL-galR fusion in pKC31 plasmid vector. , 1987, The Journal of biological chemistry.

[13]  J. Ha,et al.  Role of the hydrophobic effect in stability of site-specific protein-DNA complexes. , 1989, Journal of molecular biology.

[14]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[15]  Jeffrey H. Miller,et al.  A short course in bacterial genetics , 1992 .

[16]  Jeffrey H. Miller A Short Course in Bacterial Genetics: A Laboratory Manual and Handbook for Escherichia coli and Rela , 1992 .

[17]  S. Adhya,et al.  A family of bacterial regulators homologous to Gal and Lac repressors. , 1992, The Journal of biological chemistry.

[18]  K. Y. Choi,et al.  Role of the purine repressor hinge sequence in repressor function , 1994, Journal of bacteriology.

[19]  M. Schumacher,et al.  Crystal structure of LacI member, PurR, bound to DNA: minor groove binding by alpha helices. , 1994, Science.

[20]  R. S. Spolar,et al.  Coupling of local folding to site-specific binding of proteins to DNA. , 1994, Science.

[21]  S. Roy,et al.  The non-inducible nature of super-repressors of the gal operon in Escherichia coli. , 1995, Journal of molecular biology.

[22]  C. Sander,et al.  A method to predict functional residues in proteins , 1995, Nature Structural Biology.

[23]  M. Schumacher,et al.  Mechanism of corepressor-mediated specific DNA binding by the purine repressor , 1995, Cell.

[24]  G. Chang,et al.  Crystal Structure of the Lactose Operon Repressor and Its Complexes with DNA and Inducer , 1996, Science.

[25]  F. Cohen,et al.  An evolutionary trace method defines binding surfaces common to protein families. , 1996, Journal of molecular biology.

[26]  M. Slijper,et al.  Formation of the hinge helix in the lac represser is induced upon binding to the lac operator , 1996, Nature Structural Biology.

[27]  Jeffrey Miller,et al.  Genetic Studies of Lac Repressor: 4000 Single Amino Acid Substitutions and Analysis of the Resulting Phenotypes on the Basis of the Protein Structure , 1996, German Conference on Bioinformatics.

[28]  M. Schumacher,et al.  The X-ray Structure of the PurR-Guanine-purF Operator Complex Reveals the Contributions of Complementary Electrostatic Surfaces and a Water-mediated Hydrogen Bond to Corepressor Specificity and Binding Affinity* , 1997, The Journal of Biological Chemistry.

[29]  H. Pedersen,et al.  Protein‐induced fit: the CRP activator protein changes sequence‐specific DNA recognition by the CytR repressor, a highly flexible LacI member , 1997, The EMBO journal.

[30]  Rolf Apweiler,et al.  The SWISS-PROT protein sequence data bank and its supplement TrEMBL , 1997, Nucleic Acids Res..

[31]  P E Bourne,et al.  Protein structure alignment by incremental combinatorial extension (CE) of the optimal path. , 1998, Protein engineering.

[32]  C. Jørgensen,et al.  DNA‐binding characteristics of the Escherichia coli CytR regulator: a relaxed spacing requirement between operator half‐sites is provided by a flexible, unstructured interdomain linker , 1998, Molecular microbiology.

[33]  S. Egan,et al.  Amino Acid-DNA Contacts by RhaS: an AraC Family Transcription Activator , 1999, Journal of bacteriology.

[34]  Byungkook Lee,et al.  GalR mutants defective in repressosome formation. , 1999, Genes & development.

[35]  A. Koehler,et al.  The role of lysine 55 in determining the specificity of the purine repressor for its operators through minor groove interactions. , 1999, Journal of molecular biology.

[36]  K. Matthews,et al.  Operator DNA sequence variation enhances high affinity binding by hinge helix mutants of lactose repressor protein. , 2000, Biochemistry.

[37]  M. Lewis,et al.  A closer view of the conformation of the Lac repressor bound to operator , 2000, Nature Structural Biology.

[38]  S. Bell,et al.  Charting a course through RNA polymerase , 2000, Nature Structural Biology.

[39]  Rolf Apweiler,et al.  The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000 , 2000, Nucleic Acids Res..

[40]  R. Russell,et al.  Analysis and prediction of functional sub-types from protein sequence alignments. , 2000, Journal of molecular biology.

[41]  M. Lewis,et al.  Crystallographic analysis of Lac repressor bound to natural operator O1. , 2001, Journal of molecular biology.

[42]  R. Kaptein,et al.  Strong DNA binding by covalently linked dimeric Lac headpiece: Evidence for the crucial role of the hinge helices , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[43]  K. Matthews,et al.  Engineered disulfide linking the hinge regions within lactose repressor dimer increases operator affinity, decreases sequence selectivity, and alters allostery. , 2001, Biochemistry.

[44]  L. Swint-Kruse,et al.  Plasticity of quaternary structure: Twenty‐two ways to form a LacI dimer , 2001, Protein science : a publication of the Protein Society.

[45]  O. Lichtarge,et al.  Structural clusters of evolutionary trace residues are statistically significant and common in proteins. , 2002, Journal of molecular biology.

[46]  L. Mirny,et al.  Using orthologous and paralogous proteins to identify specificity determining residues , 2002, Genome Biology.

[47]  S. Adhya,et al.  Genetic Analysis of GalR Tetramerization in DNA Looping during Repressosome Assembly* , 2002, The Journal of Biological Chemistry.

[48]  Kevin L. Griffith,et al.  Measuring beta-galactosidase activity in bacteria: cell growth, permeabilization, and enzyme assays in 96-well arrays. , 2002, Biochemical and biophysical research communications.

[49]  B. Pettitt,et al.  Fine‐tuning function: Correlation of hinge domain interactions with functional distinctions between LacI and PurR , 2002, Protein science : a publication of the Protein Society.

[50]  Jianpeng Ma,et al.  Allosteric transition pathways in the lactose repressor protein core domains: Asymmetric motions in a homodimer , 2003, Protein science : a publication of the Protein Society.

[51]  Liskin Swint-Kruse,et al.  Perturbation from a distance: mutations that alter LacI function through long-range effects. , 2003, Biochemistry.

[52]  K. Nishikawa,et al.  Parallel evolution of ligand specificity between LacI/GalR family repressors and periplasmic sugar-binding proteins. , 2003, Molecular biology and evolution.

[53]  B. Kallipolitis,et al.  A role for the interdomain linker region of the Escherichia coli CytR regulator in repression complex formation. , 2004, Journal of molecular biology.

[54]  O. Lichtarge,et al.  A family of evolution-entropy hybrid methods for ranking protein residues by importance. , 2004, Journal of molecular biology.

[55]  M. Gelfand,et al.  Automated selection of positions determining functional specificity of proteins by comparative analysis of orthologous groups in protein families , 2004, Protein science : a publication of the Protein Society.

[56]  Conrad C. Huang,et al.  UCSF Chimera—A visualization system for exploratory research and analysis , 2004, J. Comput. Chem..

[57]  M. Schumacher,et al.  Structural Basis for Allosteric Control of the Transcription Regulator CcpA by the Phosphoprotein HPr-Ser46-P , 2004, Cell.

[58]  O. Lichtarge,et al.  Evolutionary Trace of G Protein-coupled Receptors Reveals Clusters of Residues That Determine Global and Class-specific Functions* , 2004, Journal of Biological Chemistry.

[59]  Mikhail S. Gelfand,et al.  SDPpred: a tool for prediction of amino acid residues that determine differences in functional specificity of homologous proteins , 2004, Nucleic Acids Res..

[60]  Eugene I. Shakhnovich,et al.  Determining functional specificity from protein sequences , 2005, Bioinform..

[61]  Eugene I. Shakhnovich,et al.  Predicting specificity-determining residues in two large eukaryotic transcription factor families , 2005, Nucleic acids research.

[62]  M. Schumacher,et al.  Phosphoprotein Crh-Ser46-P Displays Altered Binding to CcpA to Effect Carbon Catabolite Regulation* , 2006, Journal of Biological Chemistry.

[63]  L. Swint-Kruse,et al.  Extrinsic interactions dominate helical propensity in coupled binding and folding of the lactose repressor protein hinge helix. , 2006, Biochemistry.

[64]  Alfonso Valencia,et al.  Phylogeny-independent detection of functional residues , 2006, Bioinform..

[65]  Olivier Lichtarge,et al.  Evolutionary trace report_maker: a new type of service for comparative analysis of proteins , 2006, Bioinform..

[66]  Wei Cai,et al.  Prediction of functional specificity determinants from protein sequences using log-likelihood ratios , 2006, Bioinform..

[67]  V. Tretyachenko-Ladokhina,et al.  Flexibility and adaptability in binding of E. coli cytidine repressor to different operators suggests a role in differential gene regulation. , 2006, Journal of molecular biology.

[68]  Jack F Kirsch,et al.  Identification of functional paralog shift mutations: Conversion of Escherichia coli malate dehydrogenase to a lactate dehydrogenase , 2007, Proceedings of the National Academy of Sciences.

[69]  Anna R Panchenko,et al.  Functional specificity lies within the properties and evolutionary changes of amino acids. , 2007, Journal of molecular biology.

[70]  L. Swint-Kruse,et al.  Functional consequences of exchanging domains between LacI and PurR are mediated by the intervening linker sequence , 2007, Proteins.

[71]  Kai Ye,et al.  Tracing evolutionary pressure , 2008, Bioinform..

[72]  J. Trewhella,et al.  Ligand-induced conformational changes and conformational dynamics in the solution structure of the lactose repressor protein. , 2008, Journal of molecular biology.

[73]  Kai Ye,et al.  Multi-RELIEF: a method to recognize specificity determining residues from multiple sequence alignments using a Machine-Learning approach for feature weighting , 2008, Bioinform..

[74]  H. Scheraga,et al.  Identification of GATC‐ and CCGG‐recognizing Type II REases and their putative specificity‐determining positions using Scan2S—A novel motif scan algorithm with optional secondary structure constraints , 2008, Proteins.