Assessment of uncertainty in protein backbone NMR assignments using Bayesian model averaging

Nuclear magnetic resonance (NMR) spectroscopy is a key experimental technique for determination of protein structure, dynamics, and interactions. However, the data it provides are noisy and incomplete, and there is no generally accepted measure of uncertainty in the resulting interpretations. This paper proposes a statistical modeling framework for an essential step in the NMR process, backbone resonance assignment. Our approach views NMR data as random observations, and employs Bayesian model averaging to quantify uncertainty in terms of standard errors. Results with experimental data demonstrate that the approach is able to quantify the ambiguity in spectral information as well as the ambiguity in assignment. The proposed framework is complementary to existing automated methods. It can benefit NMR spectroscopy by identifying regions of assigned sequence that are certain enough for use in further analysis (e.g. structure determination and dynamics), and by annotating deposited assignments with the amount of information provided by the particular protocols utilized.

[1]  David S Wishart,et al.  RefDB: A database of uniformly referenced protein chemical shifts , 2003, Journal of biomolecular NMR.

[2]  Ronald M. Levy,et al.  Protein sequential resonance assignments by combinatorial enumeration using 13Cα chemical shifts and their (i, i−1) sequential connectivities , 2002, Journal of biomolecular NMR.

[3]  R. Stevens,et al.  Global Efforts in Structural Genomics , 2001, Science.

[4]  Thomas Szyperski,et al.  Protein NMR spectroscopy in structural genomics , 2000, Nature Structural Biology.

[5]  Kurt Wüthrich,et al.  Sequence-specific NMR assignment of proteins by global fragment mapping with the program Mapper , 2000, Journal of biomolecular NMR.

[6]  H. Atreya,et al.  A tracked approach for automated NMR assignments in proteins (TATAPRO) , 2000, Journal of biomolecular NMR.

[7]  Chris Bailey-Kellogg,et al.  The NOESY jigsaw: automated protein secondary structure and main-chain assignment from sparse, unassigned NMR data , 2000, RECOMB '00.

[8]  H N Moseley,et al.  Automated analysis of NMR assignments and structures for proteins. , 1999, Current opinion in structural biology.

[9]  Mark J Howard,et al.  Protein NMR spectroscopy , 1998, Current Biology.

[10]  G. Montelione,et al.  Automated analysis of protein NMR assignments using methods from artificial intelligence. , 1997, Journal of molecular biology.

[11]  Kuo-Bin Li,et al.  Automated Resonance Assignment of Proteins Using Heteronuclear 3D NMR, 1. Backbone Spin Systems Extraction and Creation of Polypeptides , 1997, J. Chem. Inf. Comput. Sci..

[12]  R A Goldstein,et al.  Protein heteronuclear NMR assignments using mean-field simulated annealing. , 1997, Journal of magnetic resonance.

[13]  S. Talukdar,et al.  Automated probabilistic method for assigning backbone resonances of (13C,15N)-labeled proteins , 1997, Journal of biomolecular NMR.

[14]  S. Grzesiek,et al.  NMRPipe: A multidimensional spectral processing system based on UNIX pipes , 1995, Journal of biomolecular NMR.

[15]  D. Madigan,et al.  Model Selection and Accounting for Model Uncertainty in Graphical Models Using Occam's Window , 1994 .

[16]  W. M. Westler,et al.  A relational database for sequence-specific protein NMR data , 1991, Journal of biomolecular NMR.

[17]  K. Wüthrich NMR of proteins and nucleic acids , 1988 .

[18]  D. Rubin Bayesianly Justifiable and Relevant Frequency Calculations for the Applied Statistician , 1984 .

[19]  J. Lukin,et al.  MONTE: An automated Monte Carlo based approach to nuclear magnetic resonance assignment of proteins , 2003, Journal of biomolecular NMR.

[20]  T. Hitchens,et al.  Data requirements for reliable chemical shift assignments in deuterated proteins , 2003, Journal of biomolecular NMR.

[21]  Tao Jiang,et al.  Bayesian Modeling and Computation in Bioinformatics Research , 2002 .

[22]  Horst Kessler,et al.  Automated backbone assignment of labeled proteins using the threshold accepting algorithm , 1998, Journal of biomolecular NMR.

[23]  Kurt Wüthrich,et al.  GARANT-a general algorithm for resonance assignment of multidimensional nuclear magnetic resonance spectra , 1997, J. Comput. Chem..