Characterizing the Space of interatomic Distance Distribution Functions Consistent with Solution Scattering Data

Scattering of neutrons and X-rays from molecules in solution offers alternative approaches to the study of a wide range of macromolecular structures in their solution state without crystallization. We study one part of the problem of elucidating three-dimensional structure from solution scattering data, determining the distribution of interatomic distances, P(r), where r is the distance between two atoms in the protein molecule. This problem is known to be ill-conditioned: for a single observed diffraction pattern, there may be many consistent distance distribution functions, and there is a risk of overfitting the observed scattering data. We propose a new approach to avoiding this problem: accepting the validity of multiple alternative P(r) curves rather than seeking a single "best." We place linear constraints to ensure that a computed P(r) is consistent with the experimental data. The constraints enforce smoothness in the P(r) curve, ensure that the P(r) curve is a probability distribution, and allow for experimental error. We use these constraints to precisely describe the space of all consistent P(r) curves as a polytope of histogram values or Fourier coefficients. We develop a linear programming approach to sampling the space of consistent, realistic P(r) curves. On both experimental and simulated scattering data, our approach efficiently generates ensembles of such curves that display substantial diversity.

[1]  David C. Jones,et al.  CATH--a hierarchic classification of protein domain structures. , 1997, Structure.

[2]  P. B. Moore,et al.  Small-angle scattering. Information content and error analysis , 1980 .

[3]  D. Svergun,et al.  Small-angle scattering studies of biological macromolecules in solution , 2003 .

[4]  David Avis,et al.  A pivoting algorithm for convex hulls and vertex enumeration of arrangements and polyhedra , 1992, Discret. Comput. Geom..

[5]  Sebastian Doniach,et al.  Fold recognition aided by constraints from small angle X-ray scattering data. , 2005, Protein engineering, design & selection : PEDS.

[6]  M. Wall,et al.  Large-scale shape changes in proteins and macromolecular complexes. , 2000, Annual review of physical chemistry.

[7]  J. Trewhella,et al.  Solution structure of the cAMP-dependent protein kinase catalytic subunit and its contraction upon binding the protein kinase inhibitor peptide. , 1993, Biochemistry.

[8]  J. Trewhella,et al.  Denatured states of ribonuclease A have compact dimensions and residual secondary structure. , 1992, Biochemistry.

[9]  Maxim V. Petoukhov,et al.  New methods for domain structure determination of proteins from solution scattering data , 2003 .

[10]  S. Hansen,et al.  The Maximum-Entropy Method without the Positivity Constraint - Applications to the Determination of the Distance-Distribution Function in Small-Angle Scattering , 1994 .

[11]  A. Tikhonov On the stability of inverse problems , 1943 .

[12]  Dmitri I. Svergun,et al.  New developments in direct shape determination from small angle scattering , 1991 .

[13]  Sebastian Doniach,et al.  Reconstruction of low-resolution three-dimensional density maps from one-dimensional small-angle X-ray solution scattering data for biomolecules , 2000 .

[14]  S Doniach,et al.  Characterization of transient intermediates in lysozyme folding with time-resolved small-angle X-ray scattering. , 1999, Journal of molecular biology.

[15]  James E. Bray,et al.  The CATH database: an extended protein family resource for structural and functional genomics , 2003, Nucleic Acids Res..

[16]  A. Guinier,et al.  La diffraction des rayons X aux très petits angles : application à l'étude de phénomènes ultramicroscopiques , 1939 .

[17]  O. Glatter,et al.  A new method for the evaluation of small‐angle scattering data , 1977 .

[18]  E Pantos,et al.  Low-resolution structures of proteins in solution retrieved from X-ray scattering with a genetic algorithm. , 1998, Biophysical journal.

[19]  G. Fournet,et al.  Small‐Angle Scattering of X‐Rays , 1956 .

[20]  Sebastian Doniach,et al.  Protein structure prediction constrained by solution X-ray scattering data and structural homology identification. , 2002, Journal of molecular biology.

[21]  D. Svergun,et al.  Small-angle scattering: a view on the properties, structures and structural changes of biological macromolecules in solution , 2003, Quarterly Reviews of Biophysics.

[22]  Dmitri I Svergun,et al.  Global rigid body modeling of macromolecular complexes against small-angle scattering data. , 2005, Biophysical journal.

[23]  A. V. Semenyuk,et al.  Small-angle-scattering-data treatment by the regularization method , 1988 .

[24]  Mario Bertero,et al.  The Stability of Inverse Problems , 1980 .

[25]  C. Bailey-Kellogg,et al.  Analysis of self-associating proteins by singular value decomposition of solution scattering data. , 2008, Biophysical journal.

[26]  K. Siegbahn,et al.  SMALL-ANGLE SCATTERING OF X-RAYS , 1958 .

[27]  Dmitri I. Svergun,et al.  New developments in direct shape determination from small-angle scattering. 2. Uniqueness , 1996 .

[28]  D. Svergun,et al.  Prototype of a database for rapid protein classification based on solution scattering data , 2003 .

[29]  D. Svergun,et al.  CRYSOL : a program to evaluate X-ray solution scattering of biological macromolecules from atomic coordinates , 1995 .

[30]  M. Brillouin,et al.  La diffraction des rayons X aux très petits angles: application a l'étude de phénomènes ultramicroscopiques , 1939 .

[31]  Dmitri I. Svergun,et al.  Determination of the regularization parameter in indirect-transform methods using perceptual criteria , 1992 .

[32]  M. Blackledge,et al.  Structural characterization of flexible proteins using small-angle X-ray scattering. , 2007, Journal of the American Chemical Society.

[33]  J. Trewhella,et al.  Solution structure of the cAMP-dependent protein kinase catalytic subunit and its contraction upon binding the protein kinase inhibitor peptide. , 1993, Biochemistry.

[34]  D I Svergun,et al.  Determination of domain structure of proteins from X-ray solution scattering. , 2001, Biophysical journal.

[35]  S. Steenstrup Deconvolution in the Presence of Noise Using the Maximum Entropy Principle , 1985 .