Inference for dynamic systems and conformational sampling for protein folding are two problems, both motivated by applied data, that pose computational challenges from a statistical perspective. Dynamic systems are often described by a set of coupled differential equations, and estimating the parameters of these models from noisy data can require solving the equations numerically many times. Many of these models also produce rough likelihood surfaces, which make sampling difficult. We introduce a method for Bayesian inference on these models, using a multiple-chain framework that exploits the underlying mathematical structure and interpolates the posterior to improve efficiency. In protein folding, a large conformational space must be searched for low-energy states, and any energy function constructed on the states is at best approximate. We propose a method for sampling fragment conformations that accounts for geometric and energetic constraints, and we explore ideas for folding entire proteins that account for uncertain energy landscapes and learn from data more effectively. These ingredients are combined into a framework for improving protein structure predictions.