A constraint logic programming approach to 3D structure determination of large protein complexes

This paper describes a novel framework, built on constraint logic programming and parallelism, that determines which parts of a protein's primary sequence correspond to the α-helices extracted from low-resolution 3D descriptions of large protein complexes. The association is found by extracting constraints on the length, relative position, and connectivity of the helices from the 3D information, then solving these constraints under the guidance of a secondary structure prediction algorithm. Parallelism is employed to improve performance on large proteins. The framework provides a fast, inexpensive alternative for determining the exact tertiary structure of unknown proteins.
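
To make the mapping problem concrete, the following is a minimal sketch of the length constraint alone, written in Python as a stand-in for a constraint logic programming formulation. It is not the authors' implementation: the function name `assign_helices`, the `tolerance` parameter, and the input layout are assumptions made for illustration, and the relative-position and connectivity constraints are omitted. The only outside fact used is the standard α-helix rise of roughly 1.5 Å per residue, which converts a helix length measured in the 3D data into an expected residue count.

```python
RISE_PER_RESIDUE = 1.5  # approximate rise along an alpha-helix axis, in angstroms


def assign_helices(helix_lengths, segments, tolerance=2):
    """Enumerate one-to-one assignments of 3D helices to sequence segments.

    helix_lengths: helix lengths in angstroms, as measured in the 3D data.
    segments: inclusive (start, end) residue ranges predicted to be helical.
    tolerance: allowed length mismatch in residues (hypothetical parameter).
    Returns a list of tuples; entry i of a tuple is the segment index
    assigned to helix i.
    """
    solutions = []

    def compatible(h, s):
        # Length constraint: predicted segment length must be close to the
        # residue count implied by the measured helix length.
        start, end = segments[s]
        expected = helix_lengths[h] / RISE_PER_RESIDUE
        return abs((end - start + 1) - expected) <= tolerance

    def search(h, used, partial):
        # Depth-first backtracking over helices, in index order.
        if h == len(helix_lengths):
            solutions.append(tuple(partial))
            return
        for s in range(len(segments)):
            if s not in used and compatible(h, s):
                used.add(s)
                partial.append(s)
                search(h + 1, used, partial)
                partial.pop()
                used.remove(s)

    search(0, set(), [])
    return solutions


if __name__ == "__main__":
    # Three helices of 15, 30, and 15 angstroms (about 10, 20, 10 residues).
    helices = [15.0, 30.0, 15.0]
    predicted = [(5, 14), (20, 39), (50, 60)]
    print(assign_helices(helices, predicted, tolerance=1))
```

With length as the only constraint, the two short helices remain interchangeable, so more than one assignment survives. This is the kind of ambiguity that the additional position and connectivity constraints described above are meant to reduce.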
