A constraint solver for discrete lattices, its parallelization, and application to protein structure prediction

This paper presents the design, implementation, and application of a constraint programming framework on 3D crystal lattices. The framework makes it possible to express and solve constraints on the structural relationships among entities placed in a 3D lattice. Both sequential and parallel implementations of the framework are described, along with experiments showing that it outperforms more traditional frameworks (e.g. constraints on finite domains and integer programming) when used to model lattice constraints. The framework is motivated by, and applied to, the protein structure prediction problem, i.e. predicting the 3D structure of a protein from its primary amino acid sequence. Results are presented and compared with the performance of other constraint-based solutions to this problem. Copyright © 2007 John Wiley & Sons, Ltd.
