Gasa: a Graph-Based Automated NMR Backbone Resonance Sequential Assignment Program

The success in backbone resonance sequential assignment is fundamental to three dimensional protein structure determination via Nuclear Magnetic Resonance (NMR) spectroscopy. Such a sequential assignment can roughly be partitioned into three separate steps: grouping resonance peaks in multiple spectra into spin systems, chaining the resultant spin systems into strings, and assigning these strings to non-overlapping consecutive amino acid residues in the target protein. Separately dealing with these three steps has been adopted in many existing assignment programs, and it works well on protein NMR data with close-to-ideal quality, while only moderately or even poorly on most real protein datasets, where noises as well as data degeneracies occur frequently. We propose in this work to partition the sequential assignment not by physical steps, but only virtual steps, and use their outputs to cross validate each other. The novelty lies in the places, where the ambiguities at the grouping step will be resolved in finding the highly confident strings at the chaining step, and the ambiguities at the chaining step will be resolved by examining the mappings of strings at the assignment step. In this way, all ambiguities at the sequential assignment will be resolved globally and optimally. The resultant assignment program is called Graph-based Approach for Sequential Assignment (GASA), which has been compared to several recent similar developments including PACES, RANDOM, MARS, and RIBRA. The performance comparisons with these works demonstrated that GASA is more promising for practical use.

[1]  Zhi-Zhong Chen,et al.  Computational Assignment of Protein Backbone Nmr Peaks Byefficient Bounding and Filtering , 2003, J. Bioinform. Comput. Biol..

[2]  Chris Bailey-Kellogg,et al.  A random graph approach to NMR sequential assignment , 2004, J. Comput. Biol..

[3]  Xiang Wan,et al.  CISA: Combined NMR Resonance Connectivity Information Determination and Sequential Assignment , 2007, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[4]  Brian E Coggins,et al.  PACES: Protein sequential assignment by computer-assisted exhaustive search , 2003, Journal of biomolecular NMR.

[5]  Wen-Lian Hsu,et al.  RIBRA - An Error-Tolerant Algorithm for the NMR Backbone Assignment Problem , 2006, J. Comput. Biol..

[6]  G. Montelione,et al.  Automated analysis of protein NMR assignments using methods from artificial intelligence. , 1997, Journal of molecular biology.

[7]  G. Wagner,et al.  NMR spectroscopy: a multifaceted approach to macromolecular structure , 2000, Quarterly Reviews of Biophysics.

[8]  Gordon M Crippen,et al.  CASA: An Efficient Automated Assignment of Protein Mainchain NMR Data Using an Ordered Tree Search Algorithm , 2005, Journal of biomolecular NMR.

[9]  Wen-Lian Hsu,et al.  GANA—a genetic algorithm for NMR backbone resonance assignment , 2005, 2005 IEEE Computational Systems Bioinformatics Conference - Workshops (CSBW'05).

[10]  Kurt Wüthrich,et al.  GARANT-a general algorithm for resonance assignment of multidimensional nuclear magnetic resonance spectra , 1997, J. Comput. Chem..

[11]  J. Lukin,et al.  MONTE: An automated Monte Carlo based approach to nuclear magnetic resonance assignment of proteins , 2003, Journal of biomolecular NMR.

[12]  Kurt Wüthrich,et al.  Sequence-specific NMR assignment of proteins by global fragment mapping with the program Mapper , 2000, Journal of biomolecular NMR.

[13]  Tao Jiang,et al.  Automated assignment of backbone NMR peaks using constrained bipartite matching , 2002, Comput. Sci. Eng..

[14]  M. Zweckstetter,et al.  Mars - robust automatic backbone assignment of proteins , 2004, Journal of biomolecular NMR.

[15]  Timothy F. Havel,et al.  Solution conformation of proteinase inhibitor IIA from bull seminal plasma by 1H nuclear magnetic resonance and distance geometry. , 1985, Journal of molecular biology.

[16]  Kurt Wüthrich,et al.  GARANT‐a general algorithm for resonance assignment of multidimensional nuclear magnetic resonance spectra , 1997 .

[17]  Xiang Wan,et al.  Histogram-based Scoring Schemes for Protein Nmr Resonance Assignment , 2004, J. Bioinform. Comput. Biol..