An integrated approach to structural genomics.

Structural genomics aims at determining a set of protein structures that will represent all domain folds present in the biosphere. These structures can be used as the basis for the homology modelling of the majority of all remaining protein domains or, indeed, proteins. Structural genomics therefore promises to provide a comprehensive structural description of the protein universe. To achieve this, a broad scientific effort is required. The Berlin-based "Protein Structure Factory" (PSF) plans to contribute to this effort by setting up a local infrastructure for the low-cost, high-throughput analysis of soluble human proteins. In close collaboration with the German Human Genome Project (DHGP) protein-coding genes will be expressed in Escherichia coli or yeast. Affinity-tagged proteins will be purified semi-automatically for biophysical characterization and structure analysis by X-ray diffraction methods and NMR spectroscopy. In all steps of the structure analysis process, possibilities for automation, parallelization and standardization will be explored. Major new facilities that are created for the PSF include a robotic station for large-scale protein crystallization, an NMR center and an experimental station for protein crystallography at the synchrotron storage ring BESSY II in Berlin.

[1]  Michael Y. Galperin,et al.  Beyond complete genomes: from sequence to structure and function. , 1998, Current opinion in structural biology.

[2]  Hans Lehrach,et al.  Automated array technologies for gene expression profiling , 1997 .

[3]  D. Fischer,et al.  Assigning folds to the proteins encoded by the genome of Mycoplasma genitalium. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[4]  Anastassis Perrakis,et al.  Automated protein model building combined with iterative structure refinement , 1999, Nature Structural Biology.

[5]  P. Hajduk,et al.  Discovering High-Affinity Ligands for Proteins: SAR by NMR , 1996, Science.

[6]  A. Lesk,et al.  The relation between the divergence of sequence and structure in proteins. , 1986, The EMBO journal.

[7]  V S Lamzin,et al.  Automated refinement for protein crystallography. , 1997, Methods in enzymology.

[8]  T. Earnest,et al.  X-ray crystal structures of 70S ribosome functional complexes. , 1999, Science.

[9]  R J Read,et al.  Crystallography & NMR system: A new software suite for macromolecular structure determination. , 1998, Acta crystallographica. Section D, Biological crystallography.

[10]  U. Heinemann,et al.  New aspects of electron transfer revealed by the crystal structure of a truncated bovine adrenodoxin, Adx(4-108). , 1998, Structure.

[11]  H Oschkinat,et al.  Automated assignment of multidimensional nuclear magnetic resonance spectra. , 1994, Methods in enzymology.

[12]  D Eisenberg,et al.  A 3D-1D substitution matrix for protein fold recognition that includes predicted secondary structure of the sequence. , 1997, Journal of molecular biology.

[13]  Yasuhiko Yoshida,et al.  Cell‐free production and stable‐isotope labeling of milligram quantities of proteins , 1999, FEBS letters.

[14]  Peer Bork,et al.  SMART, a simple modular architecture research tool , 1998 .

[15]  Patricia C Weber,et al.  [2] Overview of protein crystallization methods. , 1997, Methods in enzymology.

[16]  J Schultz,et al.  SMART, a simple modular architecture research tool: identification of signaling domains. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[17]  Mckusick Va Genomics: structural and functional studies of genomes. , 1997 .

[18]  G. Montelione,et al.  A banner year for membranes , 1999, Nature Structural Biology.

[19]  Peer Bork,et al.  Characterization of targeting domains by sequence analysis: glycogen-binding domains in protein phosphatases , 1998, Journal of Molecular Medicine.

[20]  R. Miller,et al.  The Shake-and-Bake structure determination of triclinic lysozyme. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[21]  David T. Jones,et al.  Protein superfamilles and domain superfolds , 1994, Nature.

[22]  T. Terwilliger,et al.  Rapid protein-folding assay using green fluorescent protein , 1999, Nature Biotechnology.

[23]  Asako Saegusa,et al.  Japan's genome programme goes ahead, with protein analysis , 1998, Nature.

[24]  P. Bork,et al.  Secreted Fringe-like Signaling Molecules May Be Glycosyltransferases , 1997, Cell.

[25]  Sung-Hou Kim,et al.  Crystal structure of a small heat-shock protein , 1998, Nature.

[26]  J. Newman,et al.  Class‐directed structure determination: Foundation for a protein structure initiative , 1998, Protein science : a publication of the Protein Society.

[27]  S H Kim,et al.  Crystal structures of eukaryotic translation initiation factor 5A from Methanococcus jannaschii at 1.8 A resolution. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[28]  A. Sali 100,000 protein structures for the biologist , 1998, Nature Structural Biology.

[29]  Wayne A Hendrickson,et al.  [28] Phase determination from multiwavelength anomalous diffraction measurements. , 1997, Methods in enzymology.

[30]  Peer Bork,et al.  Sequences and topology Deriving biological knowledge from genomic sequences , 1998 .

[31]  J L Sussman,et al.  Protein Data Bank archives of three-dimensional macromolecular structures. , 1997, Methods in enzymology.

[32]  J. Zou,et al.  Towards the automatic interpretation of macromolecular electron-density maps: qualitative and quantitative matching of protein sequence to map. , 1996, Acta crystallographica. Section D, Biological crystallography.

[33]  Preparation and crystallization of a cross‐linked complex of bovine adrenodoxin and adrenodoxin reductase , 1997, Proteins.

[34]  U Mueller,et al.  Thermal stability and atomic-resolution crystal structure of the Bacillus caldolyticus cold shock protein. , 2000, Journal of molecular biology.

[35]  D. Lockhart,et al.  Functional Genomics , 1999, Springer Netherlands.

[36]  P Bork,et al.  Homology-based fold predictions for Mycoplasma genitalium proteins. , 1998, Journal of molecular biology.

[37]  G T Montelione,et al.  HYPER: A hierarchical algorithm for automatic determination of protein dihedral-angle constraints and stereospecific CβH2 resonance assignments from NMR data , 1999, Journal of biomolecular NMR.

[38]  L Shapiro,et al.  The Argonne Structural Genomics Workshop: Lamaze class for the birth of a new science. , 1998, Structure.

[39]  S. Meier-Ewert,et al.  Application of robotic technology to automated sequence fingerprint analysis by oligonucleotide hybridisation. , 1994, Journal of biotechnology.

[40]  Sung-Hou Kim Shining a light on structural genomics , 1998, Nature Structural Biology.

[41]  C. Chothia One thousand families for the molecular biologist , 1992, Nature.

[42]  V. Ramakrishnan,et al.  Structure of a bacterial 30S ribosomal subunit at 5.5 Å resolution , 1999, Nature.

[43]  Suganthi Balasubramanian,et al.  Protein alchemy: Changing β-sheet into α-helix , 1997, Nature Structural Biology.

[44]  R M Sweet,et al.  Integrated software for a macromolecular crystallography synchrotron beamline. , 1998, Acta crystallographica. Section D, Biological crystallography.

[45]  G N Murshudov,et al.  Validation tools: can they indicate the information content of macromolecular crystal structures? , 1998, Structure.

[46]  Craig M. Ogata,et al.  MAD phasing grows up , 1998, Nature Structural Biology.

[47]  S Fortier,et al.  Critical-point analysis in protein electron-density map interpretation. , 1997, Methods in enzymology.

[48]  Howard A. Padmore,et al.  The Macromolecular Crystallography Facility at the Advanced Light Source , 1995 .

[49]  T L Blundell,et al.  A database of globular protein structural domains: clustering of representative family members into similar folds. , 1996, Folding & design.

[50]  A. Skerra,et al.  One-step affinity purification of bacterially produced proteins by means of the "Strep tag" and immobilized recombinant core streptavidin. , 1994, Journal of chromatography. A.

[51]  O. Ptitsyn,et al.  Why do globular proteins fit the limited set of folding patterns? , 1987, Progress in biophysics and molecular biology.

[52]  S H Kim,et al.  The crystal structure of an Fe-superoxide dismutase from the hyperthermophile Aquifex pyrophilus at 1.9 A resolution: structural basis for thermostability. , 1997, Journal of molecular biology.

[53]  Poul Nissen,et al.  Placement of protein and RNA structures into a 5 Å-resolution map of the 50S ribosomal subunit , 1999, Nature.

[54]  T Gaasterland,et al.  Structural genomics taking shape. , 1998, Trends in genetics : TIG.

[55]  R. Gentz,et al.  Genetic Approach to Facilitate Purification of Recombinant Proteins with a Novel Metal Chelate Adsorbent , 1988, Bio/Technology.

[56]  P. Hajduk,et al.  Discovering High-Affinity Ligands for Proteins , 1997, Science.

[57]  P. Bork,et al.  Predicting functions from protein sequences—where are the bottlenecks? , 1998, Nature Genetics.

[58]  H Oschkinat,et al.  Automated NOESY interpretation with ambiguous distance restraints: the refined NMR solution structure of the pleckstrin homology domain from beta-spectrin. , 1997, Journal of molecular biology.

[59]  Peer Bork,et al.  A superfamily of conserved domains in DNA damage‐ responsive cell cycle checkpoint proteins , 1997, FASEB journal : official publication of the Federation of American Societies for Experimental Biology.