PICKY: a novel SVD-based NMR spectra peak picking method

Motivation: Picking peaks from experimental NMR spectra is a key unsolved problem for automated NMR protein structure determination. Such a process is a prerequisite for resonance assignment, nuclear overhauser enhancement (NOE) distance restraint assignment, and structure calculation tasks. Manual or semi-automatic peak picking, which is currently the prominent way used in NMR labs, is tedious, time consuming and costly. Results: We introduce new ideas, including noise-level estimation, component forming and sub-division, singular value decomposition (SVD)-based peak picking and peak pruning and refinement. PICKY is developed as an automated peak picking method. Different from the previous research on peak picking, we provide a systematic study of the proposed method. PICKY is tested on 32 real 2D and 3D spectra of eight target proteins, and achieves an average of 88% recall and 74% precision. PICKY is efficient. It takes PICKY on average 15.7 s to process an NMR spectrum. More important than these numbers, PICKY actually works in practice. We feed peak lists generated by PICKY to IPASS for resonance assignment, feed IPASS assignment to SPARTA for fragments generation, and feed SPARTA fragments to FALCON for structure calculation. This results in high-resolution structures of several proteins, for example, TM1112, at 1.25 Å. Availability: PICKY is available upon request. The peak lists of PICKY can be easily loaded by SPARKY to enable a better interactive strategy for rapid peak picking. Contact: mli@uwaterloo.ca

[1]  Claudio Nicolini,et al.  Neural networks for the peak-picking of nuclear magnetic resonance spectra , 1993, Neural Networks.

[2]  M. Williamson,et al.  Automated protein structure calculation from NMR data , 2009, Journal of biomolecular NMR.

[3]  Peter Güntert,et al.  Automated structure determination from NMR spectra , 2009, European Biophysics Journal.

[4]  K. Wüthrich NMR of proteins and nucleic acids , 1988 .

[5]  M. Billeter,et al.  MUNIN: A new approach to multi-dimensional NMR spectra interpretation , 2001, Journal of biomolecular NMR.

[6]  Martin Billeter,et al.  MUNIN: Application of three-way decomposition to the analysis of heteronuclear NMR relaxation data** , 2001, Journal of biomolecular NMR.

[7]  Harold Gulliksen,et al.  Contributions to mathematical psychology , 1964 .

[8]  Richard A. Harshman,et al.  Foundations of the PARAFAC procedure: Models and conditions for an "explanatory" multi-model factor analysis , 1970 .

[9]  Simon A. Corne,et al.  An artificial neural network for classifying cross peaks in two-dimensional NMR spectra , 1992 .

[10]  G. W. Stewart,et al.  On the Early History of the Singular Value Decomposition , 1993, SIAM Rev..

[11]  Joos Vandewalle,et al.  A Multilinear Singular Value Decomposition , 2000, SIAM J. Matrix Anal. Appl..

[12]  Xin Gao,et al.  IPASS : Error Tolerant NMR Backbone Resonance Assignment by Linear Programming , 2009 .

[13]  J. Chang,et al.  Analysis of individual differences in multidimensional scaling via an n-way generalization of “Eckart-Young” decomposition , 1970 .

[14]  G. W. STEWARTt ON THE EARLY HISTORY OF THE SINGULAR VALUE DECOMPOSITION * , 2022 .

[15]  Robert Powers,et al.  A common sense approach to peak picking in two-, three-, and four-dimensional spectra using automatic computer analysis of contour diagrams , 1991 .

[16]  Gerard J. Kleywegt,et al.  A versatile approach toward the partially automatic recognition of cross peaks in 2D 1H NMR spectra , 1990 .

[17]  A. Rouh,et al.  Bayesian signal extraction from noisy FT NMR spectra , 1994, Journal of Biomolecular NMR.

[18]  M. Billeter,et al.  Automated peak picking and peak integration in macromolecular NMR spectra using AUTOPSY. , 1998, Journal of magnetic resonance.

[19]  K. Wüthrich,et al.  Protein NMR structure determination with automated NOE-identification in the NOESY spectra using the new software ATNOS , 2002, Journal of biomolecular NMR.

[20]  H. Kalbitzer,et al.  A general Bayesian method for an automated signal class recognition in 2D NMR spectra combined with a multivariate discriminant analysis , 1995, Journal of biomolecular NMR.

[21]  A. Bax,et al.  Protein backbone chemical shifts predicted from searching a database for torsion angle and sequence homology , 2007, Journal of biomolecular NMR.

[22]  P. Bradley,et al.  Toward High-Resolution de Novo Structure Prediction for Small Proteins , 2005, Science.

[23]  Shuai Cheng Li,et al.  Fragment‐HMM: A new approach to protein structure prediction , 2008, Protein science : a publication of the Protein Society.

[24]  Bruce A. Johnson,et al.  NMR View: A computer program for the visualization and analysis of NMR data , 1994, Journal of biomolecular NMR.

[25]  A. Altieri,et al.  Automation of NMR structure determination of proteins. , 2004, Current opinion in structural biology.