Crystal structure of YbaK protein from Haemophilus influenzae (HI1434) at 1.8 Å resolution: Functional implications

Structural genomics of proteins of unknown function most straightforwardly assists with assignment of biochemical activity when the new structure resembles that of proteins whose functions are known. When a new fold is revealed, the universe of known folds is enriched, and once the function is determined by other means, novel structure‐function relationships are established. The previously unannotated protein HI1434 from H. influenzae provides a hybrid example of these two paradigms. It is a member of a microbial protein family, labeled in SwissProt as YbaK and ebsC. The crystal structure at 1.8 Å resolution reported here reveals a fold that is only remotely related to the C‐lectin fold, in particular to endostatin, and thus is not sufficiently similar to imply that YbaK proteins are saccharide binding proteins. However, a crevice that may accommodate a small ligand is evident. The putative binding site contains only one invariant residue, Lys46, which carries a functional group that could play a role in catalysis, indicating that YbaK is probably not an enzyme. Detailed sequence analysis, including a number of newly sequenced microbial organisms, highlights sequence homology to an insertion domain in prolyl‐tRNA synthetases (proRS) from prokaryote, a domain whose function is unknown. A HI1434‐based model of the insertion domain shows that it should also contain the putative binding site. Being part of a tRNA synthetases, the insertion domain is likely to be involved in oligonucleotide binding, with possible roles in recognition/discrimination or editing of prolyl‐tRNA. By analogy, YbaK may also play a role in nucleotide or oligonucleotide binding, the nature of which is yet to be determined. Proteins 2000;40:86–97. © 2000 Wiley‐Liss, Inc.

[1]  J. Thompson,et al.  CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. , 1994, Nucleic acids research.

[2]  I. Beacham,et al.  Silent genes in bacteria: the previously designated 'cryptic' ilvHI locus of 'Salmonella typhimurium LT2' is active in natural isolates. , 1995, FEMS microbiology letters.

[3]  V S Lamzin,et al.  wARP: improvement and extension of crystallographic phases by weighted averaging of multiple-refined dummy atomic models. , 1997, Acta crystallographica. Section D, Biological crystallography.

[4]  W. McClain,et al.  Distinctive acceptor-end structure and other determinants of Escherichia coli tRNAPro identity. , 1994, Nucleic acids research.

[5]  Olivier Poch,et al.  Partition of tRNA synthetases into two classes based on mutually exclusive sets of sequence motifs , 1990, Nature.

[6]  C. Sander,et al.  Challenging times for bioinformatics , 1995, Nature.

[7]  E V Koonin,et al.  Evolution of aminoacyl-tRNA synthetases--analysis of unique domain architectures and phylogenetic trees reveals a complex history of horizontal gene transfer events. , 1999, Genome research.

[8]  J. Pflugrath,et al.  Crystal orientation and X-ray pattern prediction routines for area-detector diffractometer systems in macromolecular crystallography , 1987 .

[9]  J M Thornton,et al.  Three-dimensional structure analysis of PROSITE patterns. , 1999, Journal of molecular biology.

[10]  A. Fersht Review Lecture Enzymic editing mechanisms and the genetic code , 1981, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[11]  R. Fleischmann,et al.  Whole-genome random sequencing and assembly of Haemophilus influenzae Rd. , 1995, Science.

[12]  R J Read,et al.  Crystallography & NMR system: A new software suite for macromolecular structure determination. , 1998, Acta crystallographica. Section D, Biological crystallography.

[13]  W. Hendrickson Determination of macromolecular structures from anomalous diffraction of synchrotron radiation. , 1991, Science.

[14]  William Arbuthnot Sir Lane,et al.  Endostatin: An Endogenous Inhibitor of Angiogenesis and Tumor Growth , 1997, Cell.

[15]  S. Kim,et al.  Structure-based assignment of the biochemical function of a hypothetical protein: a test case of structural genomics. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[16]  V. Biou,et al.  [31] Treatment of multiwavelength anomalous diffraction data as a special case of multiple isomorphous replacement. , 1997, Methods in enzymology.

[17]  G. Dunny,et al.  Cloning and molecular analysis of genes affecting expression of binding substance, the recipient-encoded receptor(s) mediating mating aggregate formation in Enterococcus faecalis , 1993, Journal of bacteriology.

[18]  C Colovos,et al.  The 1.8 A crystal structure of the ycaC gene product from Escherichia coli reveals an octameric hydrolase of unknown specificity. , 1998, Structure.

[19]  P. Kraulis A program to produce both detailed and schematic plots of protein structures , 1991 .

[20]  T. Blundell,et al.  Structure of pentameric human serum amyloid P component , 1994, Nature.

[21]  Z. Otwinowski,et al.  Processing of X-ray diffraction data collected in oscillation mode. , 1997, Methods in enzymology.

[22]  Thomas C. Terwilliger,et al.  Automated MAD and MIR structure solution , 1999, Acta crystallographica. Section D, Biological crystallography.

[23]  J F Gibrat,et al.  Surprising similarities in structure comparison. , 1996, Current opinion in structural biology.

[24]  Sung-Hou Kim,et al.  Sparse matrix sampling: a screening method for crystallization of proteins , 1991 .

[25]  G. Murshudov,et al.  Refinement of macromolecular structures by the maximum-likelihood method. , 1997, Acta crystallographica. Section D, Biological crystallography.

[26]  D. Moras,et al.  Class II aminoacyl transfer RNA synthetases: crystal structure of yeast aspartyl-tRNA synthetase complexed with tRNA(Asp) , 1991, Science.

[27]  A. Goffeau,et al.  The complete genome sequence of the Gram-positive bacterium Bacillus subtilis , 1997, Nature.

[28]  P. Bork,et al.  Metabolism and evolution of Haemophilus influenzae deduced from a whole-genome comparison with Escherichia coli , 1996, Current Biology.

[29]  Yunje Cho,et al.  Structure-based identification of a novel NTPase from Methanococcus jannaschii , 1999, Nature Structural Biology.

[30]  C. Ehresmann,et al.  The Structure of Threonyl-tRNA Synthetase-tRNAThr Complex Enlightens Its Repressor Activity and Reveals an Essential Zinc Ion in the Active Site , 1999, Cell.

[31]  V S Lamzin,et al.  Automated refinement of protein models. , 1993, Acta crystallographica. Section D, Biological crystallography.

[32]  Thomas L. Madden,et al.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. , 1997, Nucleic acids research.

[33]  C. Sander,et al.  Dali: a network tool for protein structure comparison. , 1995, Trends in biochemical sciences.

[34]  S. Barondes,et al.  X-ray crystal structure of the human dimeric S-Lac lectin, L-14-II, in complex with lactose at 2.9-A resolution. , 1994, The Journal of biological chemistry.

[35]  J. Navaza,et al.  AMoRe: an automated package for molecular replacement , 1994 .

[36]  O. Herzberg,et al.  Structure of S-lectin, a developmentally regulated vertebrate beta-galactoside-binding protein. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[37]  D. Carrington,et al.  Polypeptide ligation occurs during post-translational modification of concanavalin A , 1985, Nature.

[38]  Anastassis Perrakis,et al.  Automated protein model building combined with iterative structure refinement , 1999, Nature Structural Biology.

[39]  R. Timpl,et al.  Crystal structure of the angiogenesis inhibitor endostatin at 1.5 Å resolution , 1998, The EMBO journal.

[40]  Collaborative Computational,et al.  The CCP4 suite: programs for protein crystallography. , 1994, Acta crystallographica. Section D, Biological crystallography.

[41]  A. Murzin Structure classification‐based assessment of CASP3 predictions for the fold recognition targets , 1999, Proteins.

[42]  N. Vyas Atomic features of protein-carbohydrate interactions , 1991 .

[43]  J. Zou,et al.  Improved methods for building protein models in electron density maps and the location of errors in these models. , 1991, Acta crystallographica. Section A, Foundations of crystallography.

[44]  M. Koizumi,et al.  Structure of coagulation factors IX/X-binding protein, a heterodimer of C-type lectin domains , 1997, Nature Structural Biology.

[45]  K. Musier-Forsyth,et al.  Species-specific differences in the operational RNA code for aminoacylation of tRNAPro. , 1998, Biochemistry.

[46]  Thomas Boehm,et al.  Antiangiogenic therapy of experimental cancer does not induce acquired drug resistance , 1997, Nature.

[47]  K. J. Brown,et al.  Histidine-rich glycoprotein and platelet factor 4 mask heparan sulfate proteoglycans recognized by acidic and basic fibroblast growth factor. , 1994, Biochemistry.

[48]  R. Timpl,et al.  Zinc-dependent dimers observed in crystals of human endostatin. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[49]  Burkhard Rost,et al.  PHD - an automatic mail server for protein secondary structure prediction , 1994, Comput. Appl. Biosci..

[50]  A. Brünger Free R value: a novel statistical quantity for assessing the accuracy of crystal structures , 1992, Nature.

[51]  I. Beacham,et al.  Identification and sequence analysis of a silent gene (ushA0) in Salmonella typhimurium. , 1986, Journal of molecular biology.

[52]  Janet L. Smith Determination of three-dimensional structure by multiwavelength anomalous diffraction: Current Opinion in Structural Biology 1991, 1:1002–1011 , 1991 .

[53]  A. Murzin OB(oligonucleotide/oligosaccharide binding)‐fold: common structural and functional solution for non‐homologous sequences. , 1993, The EMBO journal.