Transitive homology-guided structural studies lead to discovery of Cro proteins with 40% sequence identity but different folds

Proteins that share common ancestry may differ in structure and function because of divergent evolution of their amino acid sequences. For a typical diverse protein superfamily, the properties of a few scattered members are known from experiment. A satisfying picture of functional and structural evolution in relation to sequence changes, however, may require characterization of a larger, well-chosen subset. Here, we employ a “stepping-stone” method, based on transitive homology, to target sequences intermediate between two related proteins with known divergent properties. We apply the approach to the question of how new protein folds can evolve from preexisting folds and, in particular, to an evolutionary change in secondary structure and oligomeric state in the Cro family of bacteriophage transcription factors, initially identified by sequence-structure comparison of distant homologs from phages P22 and λ. We report crystal structures of two Cro proteins, Xfaso 1 and Pfl 6, with sequences intermediate between those of P22 and λ. The two domains share 40% sequence identity but differ by a switch from α-helix to β-sheet in a C-terminal region spanning ≈25 residues. Sedimentation analysis also suggests a correlation between helix-to-sheet conversion and strengthened dimerization.
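
The stepping-stone idea can be illustrated with a minimal sketch. It is not the procedure used in this work: practical transitive homology searches rely on database search tools and statistical scores, whereas this example assumes pre-aligned sequences of equal length, scores pairs by plain percent identity, and links them with a breadth-first search. The function names, the 30% cutoff, and the toy sequences are all hypothetical.

```python
from collections import deque


def percent_identity(a: str, b: str) -> float:
    """Percent identity over aligned columns where neither sequence has a gap."""
    if len(a) != len(b):
        raise ValueError("sequences must be pre-aligned to equal length")
    pairs = [(x, y) for x, y in zip(a, b) if x != "-" and y != "-"]
    if not pairs:
        return 0.0
    return 100.0 * sum(x == y for x, y in pairs) / len(pairs)


def stepping_stone_path(seqs, start, goal, threshold=30.0):
    """Shortest chain from start to goal in which every consecutive pair
    of sequences has at least `threshold` percent identity.

    Returns a list of sequence names, or None if no chain exists.
    The threshold is an illustrative assumption, not a published cutoff.
    """
    queue = deque([[start]])
    seen = {start}
    while queue:
        path = queue.popleft()
        last = path[-1]
        if last == goal:
            return path
        for name, seq in seqs.items():
            if name not in seen and percent_identity(seqs[last], seq) >= threshold:
                seen.add(name)
                queue.append(path + [name])
    return None


# Toy, made-up aligned sequences: A and C share no identity directly,
# but B is 50% identical to each and so acts as a stepping stone.
toy = {
    "A": "AAAAAAAAAA",
    "B": "AAAAACCCCC",
    "C": "CCCCCCCCCC",
    "D": "GGGGGGGGGG",  # unrelated to the others
}

path = stepping_stone_path(toy, "A", "C")
```

In this toy set, no direct link between A and C passes the cutoff, so the search has to route through the intermediate B. That mirrors the motivation for targeting sequences that lie between two distant homologs.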
