The use of model selection in the model-free analysis of protein dynamics

Model-free analysis of NMR relaxation data, which is widely used for the study of protein dynamics, consists of the separation of the global rotational diffusion from internal motions relative to the diffusion frame and the description of these internal motions by amplitude and timescale. Five model-free models exist, each of which describes a different type of motion. Model-free analysis requires the selection of the model which best describes the dynamics of the NH bond. It will be demonstrated that the model selection technique currently used has two significant flaws, under-fitting, and not selecting a model when one ought to be selected. Under-fitting breaks the principle of parsimony causing bias in the final model-free results, visible as an overestimation of S2 and an underestimation of τe and Rex. As a consequence the protein falsely appears to be more rigid than it actually is. Model selection has been extensively developed in other fields. The techniques known as Akaike's Information Criteria (AIC), small sample size corrected AIC (AICc), Bayesian Information Criteria (BIC), bootstrap methods, and cross-validation will be compared to the currently used technique. To analyse the variety of techniques, synthetic noisy data covering all model-free motions was created. The data consists of two types of three-dimensional grid, the Rex grids covering single motions with chemical exchange {S2,τe,Rex}, and the Double Motion grids covering two internal motions {Sf2,Ss2,τs}. The conclusion of the comparison is that for accurate model-free results, AIC model selection is essential. As the method neither under, nor over-fits, AIC is the best tool for applying Occam's razor and has the additional benefits of simplifying and speeding up model-free analysis.

[1]  Zucchini,et al.  An Introduction to Model Selection. , 2000, Journal of mathematical psychology.

[2]  Ronald M. Levy,et al.  Propagation of experimental uncertainties using the Lipari-Szabo model-free analysis of protein dynamics , 1998, Journal of biomolecular NMR.

[3]  E. Meirovitch,et al.  A structural mode-coupling approach to 15N NMR relaxation in proteins. , 2001, Journal of the American Chemical Society.

[4]  Clifford M. Hurvich,et al.  Regression and time series model selection in small samples , 1989 .

[5]  R. A. Leibler,et al.  On Information and Sufficiency , 1951 .

[6]  Paul C. Driscoll,et al.  Deviations from the simple two-parameter model-free approach to the interpretation of nitrogen-15 nuclear magnetic relaxation of proteins , 1990 .

[7]  D. Weber,et al.  A Bayesian statistical method for the detection and quantification of rotational diffusion anisotropy from NMR relaxation data. , 2000, Journal of magnetic resonance.

[8]  V. Orekhov,et al.  Model-free approach beyond the borders of its applicability. , 1997, Journal of magnetic resonance.

[9]  T. Pawson,et al.  Backbone dynamics of a free and phosphopeptide-complexed Src homology 2 domain studied by 15N NMR relaxation. , 1994, Biochemistry.

[10]  G. Lipari Model-free approach to the interpretation of nuclear magnetic resonance relaxation in macromolecules , 1982 .

[11]  G T Montelione,et al.  Estimation of dynamic parameters from NMR relaxation data using the Lipari-Szabo model-free approach and Bayesian statistical methods. , 1999, Journal of magnetic resonance.

[12]  Christopher D. Kroenke,et al.  The Static Magnetic Field Dependence of Chemical Exchange Linebroadening Defines the NMR Chemical Shift Time Scale , 2000 .

[13]  P. Wright,et al.  Intramolecular motions of a zinc finger DNA-binding domain from Xfin characterized by proton-detected natural abundance carbon-13 heteronuclear NMR spectroscopy , 1991 .

[14]  L. Nicholson,et al.  An improved method for distinguishing between anisotropic tumbling and chemical exchange in analysis of 15N relaxation parameters , 2001, Journal of biomolecular NMR.

[15]  P. Wright,et al.  Anisotropic rotational diffusion in model-free analysis for a ternary DHFR complex , 2001, Journal of biomolecular NMR.

[16]  H. Akaike,et al.  Information Theory and an Extension of the Maximum Likelihood Principle , 1973 .

[17]  A. Palmer,et al.  Backbone dynamics of Escherichia coli ribonuclease HI: correlations with structure and function in an active enzyme. , 1995, Journal of molecular biology.

[18]  A. Szabó,et al.  Model-free approach to the interpretation of nuclear magnetic resonance relaxation in macromolecules. 1. Theory and range of validity , 1982 .

[19]  H. Carr,et al.  The Principles of Nuclear Magnetism , 1961 .

[20]  David R. Anderson,et al.  Model selection and inference : a practical information-theoretic approach , 2000 .

[21]  A. Szabó,et al.  Model-free approach to the interpretation of nuclear magnetic resonance relaxation in macromolecules. 2. Analysis of experimental results , 1982 .

[22]  David R. Anderson,et al.  Model selection and multimodel inference : a practical information-theoretic approach , 2003 .