Addition of missing loops and domains to protein models by x-ray solution scattering.

Inherent flexibility and conformational heterogeneity in proteins can often result in the absence of loops and even entire domains in structures determined by x-ray crystallographic or NMR methods. X-ray solution scattering offers the possibility of obtaining complementary information regarding the structures of these disordered protein regions. Methods are presented for adding missing loops or domains by fixing a known structure and building the unknown regions to fit the experimental scattering data obtained from the entire particle. Simulated annealing was used to minimize a scoring function containing the discrepancy between the experimental and calculated patterns and the relevant penalty terms. In low-resolution models where interface location between known and unknown parts is not available, a gas of dummy residues represents the missing domain. In high-resolution models where the interface is known, loops or domains are represented as interconnected chains (or ensembles of residues with spring forces between the C(alpha) atoms), attached to known position(s) in the available structure. Native-like folds of missing fragments can be obtained by imposing residue-specific constraints. After validation in simulated examples, the methods have been applied to add missing loops or domains to several proteins where partial structures were available.

[1]  C. D. Gelatt,et al.  Optimization by Simulated Annealing , 1983, Science.

[2]  E P Morris,et al.  The structure of the acto-myosin subfragment 1 complex: results of searches using data from electron microscopy and x-ray crystallography. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[3]  D I Svergun,et al.  A Map of Protein-rRNA Distribution in the 70 SEscherichia coli Ribosome* , 2000, The Journal of Biological Chemistry.

[4]  M. Levitt A simplified representation of protein conformations for rapid simulation of protein folding. , 1976, Journal of molecular biology.

[5]  Sebastian Doniach,et al.  Protein structure prediction constrained by solution X-ray scattering data and structural homology identification. , 2002, Journal of molecular biology.

[6]  P. Debye,et al.  Zerstreuung von Röntgenstrahlen , 1915 .

[7]  M. Sippl Calculation of conformational ensembles from potentials of mean force. An approach to the knowledge-based prediction of local structures in globular proteins. , 1990, Journal of molecular biology.

[8]  G J Barton,et al.  Application of multiple sequence alignment profiles to improve protein secondary structure prediction , 2000, Proteins.

[9]  D. I. Svergun,et al.  Structure Analysis by Small-Angle X-Ray and Neutron Scattering , 1987 .

[10]  Lester Ingber,et al.  Simulated annealing: Practice versus theory , 1993 .

[11]  W. Taylor,et al.  Global fold determination from a small number of distance restraints. , 1995, Journal of molecular biology.

[12]  D. I. Svergun,et al.  Solution scattering from biopolymers: advanced contrast‐variation data analysis , 1994 .

[13]  A W Ashton,et al.  Pentameric and decameric structures in solution of serum amyloid P component by X-ray and neutron scattering and molecular modelling analyses. , 1997, Journal of molecular biology.

[14]  F. Dauvergne,et al.  The localisation method used at EMBL , 1982 .

[15]  D I Svergun,et al.  Large differences are observed between the crystal and solution quaternary structures of allosteric aspartate transcarbamylase in the R state , 1997, Proteins.

[16]  G J Kleywegt,et al.  Validation of protein models from Calpha coordinates alone. , 1997, Journal of molecular biology.

[17]  D. Svergun,et al.  CRYSOL : a program to evaluate X-ray solution scattering of biological macromolecules from atomic coordinates , 1995 .

[18]  G L Gilliland,et al.  The three-dimensional structure of a glutathione S-transferase from the mu gene class. Structural analysis of the binary complex of isoenzyme 3-3 and glutathione at 2.2-A resolution. , 1992, Biochemistry.

[19]  Peter V. Konarev,et al.  MASSHA – a graphics system for rigid-body modelling of macromolecular complexes against solution scattering data , 2001 .

[20]  William H. Press,et al.  Numerical recipes , 1990 .

[21]  J. Tainer,et al.  Crystal structures of a schistosomal drug and vaccine target: glutathione S-transferase from Schistosoma japonica and its complex with the leading antischistosomal drug praziquantel. , 1995, Journal of molecular biology.

[22]  Dmitri I. Svergun,et al.  A software system for rigid body modelling of solution scattering data , 2000 .

[23]  Frank Potthast,et al.  Local interactions and protein folding: A three-dimensional off-lattice approach , 1997 .

[24]  D I Svergun,et al.  Determination of domain structure of proteins from X-ray solution scattering. , 2001, Biophysical journal.

[25]  Cheryl H. Arrowsmith,et al.  Protein production: feeding the crystallographers and NMR spectroscopists , 2000, Nature Structural Biology.

[26]  G. Gilliland,et al.  Three‐Dimensional structure of schistosoma japonicum glutathione s‐transferase fused with a six‐amino acid conserved neutralizing epitope of gp41 from hiv , 1994, Protein science : a publication of the Protein Society.

[27]  K. Dill,et al.  An iterative method for extracting energy-like quantities from protein structures. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[28]  Dmitri I. Svergun,et al.  Automated matching of high- and low-resolution structural models , 2001 .

[29]  Biochemical and structural characterization of recombinant copper-metallothionein from Saccharomyces cerevisiae. , 1999, European journal of biochemistry.

[30]  A. Dorfman,et al.  Structure of the Ankyrin-binding Domain of α-Na,K-ATPase* , 1998, The Journal of Biological Chemistry.

[31]  K. Nagai,et al.  Generation of β-globin by sequence-specific proteolysis of a hybrid protein produced in Escherichia coli , 1984, Nature.

[32]  D I Svergun,et al.  Crystal Versus Solution Structures of Thiamine Diphosphate-dependent Enzymes* , 2000, The Journal of Biological Chemistry.

[33]  D. Svergun,et al.  Conformation of the Drosophila Motor Protein Non-claret Disjunctional in Solution from X-ray and Neutron Scattering* , 2001, The Journal of Biological Chemistry.

[34]  T. Darden,et al.  Modeling zymogen protein C. , 2000, Biophysical journal.

[35]  D. Smith,et al.  Single-step purification of polypeptides expressed in Escherichia coli as fusions with glutathione S-transferase. , 1988, Gene.

[36]  J. Stull,et al.  Structures of calmodulin and a functional myosin light chain kinase in the activated complex: a neutron scattering study. , 1997, Biochemistry.

[37]  D I Svergun,et al.  Protein hydration in solution: experimental observation by x-ray and neutron scattering. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[38]  J. Hajdu,et al.  Crystal structure of reduced protein R2 of ribonucleotide reductase: the structural basis for oxygen activation at a dinuclear iron site. , 1996, Structure.

[39]  C. Boulin,et al.  Data acquisition systems for linear and area X-ray detectors using delay line readout , 1988 .

[40]  M. Parker,et al.  Crystallization of glutathione S-transferase from human placenta. , 1990, Journal of molecular biology.

[41]  J. Bordas,et al.  X-ray diffraction and scattering on disordered systems using synchrotron radiation , 1983 .

[42]  F. Kozielski,et al.  The crystal structure of the minus-end-directed microtubule motor protein ncd reveals variable dimer conformations. , 1999, Structure.

[43]  J. Kraut,et al.  Loop and subdomain movements in the mechanism of Escherichia coli dihydrofolate reductase: crystallographic evidence. , 1997, Biochemistry.

[44]  G J Williams,et al.  The Protein Data Bank: a computer-based archival file for macromolecular structures. , 1978, Archives of biochemistry and biophysics.

[45]  D. Langs,et al.  On 'globbicity' of low-resolution protein structures. , 1999, Acta crystallographica. Section D, Biological crystallography.

[46]  D I Svergun,et al.  Restoring low resolution structure of biological macromolecules from solution scattering using simulated annealing. , 1999, Biophysical journal.

[47]  G J Barton,et al.  Evaluation and improvement of multiple sequence methods for protein secondary structure prediction , 1999, Proteins.

[48]  R. Jernigan,et al.  Self‐consistent estimation of inter‐residue protein contact energies based on an equilibrium mixture approximation of residues , 1999, Proteins.

[49]  R. Huber,et al.  The Geometry of the Reactive Site and of the Peptide Groups in Trypsin, Trypsinogen and its Complexes with Inhibitors , 1983 .

[50]  D. Svergun,et al.  A direct indirect method of small-angle scattering data treatment , 1993 .

[51]  W. Kabsch A discussion of the solution for the best rotation to relate two sets of vectors , 1978 .

[52]  D I Svergun,et al.  A model of the quaternary structure of the Escherichia coli F1 ATPase from X-ray solution scattering and evidence for structural changes in the delta subunit during ATP hydrolysis. , 1998, Biophysical journal.

[53]  Geoffrey J. Barton,et al.  JPred : a consensus secondary structure prediction server , 1999 .

[54]  M Levitt,et al.  Recognizing native folds by the arrangement of hydrophobic and polar residues. , 1995, Journal of molecular biology.

[55]  C. Boulin,et al.  Data appraisal, evaluation and display for synchrotron radiation experiments: Hardware and software , 1986 .

[56]  Structure of the fibrinogen γ‐chain integrin binding and factor XIIIa cross‐linking sites obtained through carrier protein driven crystallization , 1999, Protein science : a publication of the Protein Society.

[57]  R Diamond,et al.  Real-space refinement of the structure of hen egg-white lysozyme. , 1977, Journal of molecular biology.