Constraint Programming in Structural Bioinformatics

Bioinformatics aims at applying computer science methods to the wealth of data collected in a variety of experiments in life sciences (e.g. cell and molecular biology, biochemistry, medicine, etc.) in order to help analysing such data and eliciting new knowledge from it. In addition to string processing bioinformatics is often identified with machine learning used for mining the large banks of bio-data available in electronic format, namely in a number of web servers. Nevertheless, there are opportunities of applying other computational techniques in some bioinformatics applications. In this paper, we report the application of constraint programming to address two structural bioinformatics problems, protein structure prediction and protein interaction (docking). The efficient application of constraint programming requires innovative modelling of these problems, as well as the development of advanced propagation techniques (e.g. global reasoning and propagation), which were adopted in Chemera, a system that is currently used to support biochemists in their research.

[1]  P. Mahadevan,et al.  An overview , 2007, Journal of Biosciences.

[2]  L. Krippahl,et al.  BiGGER: A new (soft) docking algorithm for predicting protein interactions , 2000, Proteins.

[3]  C. Costa,et al.  The Structure of an Electron Transfer Complex Containing a Cytochrome c and a Peroxidase* , 1999, The Journal of Biological Chemistry.

[4]  M Czjzek,et al.  Heteronuclear NMR and soft docking: an experimental approach for a structural model of the cytochrome c553-ferredoxin complex. , 2000, Biochemistry.

[5]  Pedro Barahona,et al.  Machine Learned Heuristics to Improve Constraint Satisfaction , 2004, SBIA.

[6]  Ludwig Krippahl,et al.  Modeling protein complexes with BiGGER , 2003, Proteins.

[7]  Ludwig Krippahl,et al.  Electron transfer complexes of cytochrome c peroxidase from Paracoccus denitrificans containing more than one cytochrome. , 2003, Biochemistry.

[8]  Haim J. Wolfson,et al.  Geometric hashing: an overview , 1997 .

[9]  M J Sternberg,et al.  Use of pair potentials across protein interfaces in screening predicted docked complexes , 1999, Proteins.

[10]  Ludwig Krippahl,et al.  Synechocystis ferredoxin/ferredoxin‐NADP+‐reductase/NADP+ complex: Structural model obtained by NMR‐restrained docking , 2005, FEBS letters.

[11]  E. Katchalski‐Katzir,et al.  Molecular surface recognition: determination of geometric fit between proteins and their ligands by correlation techniques. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[12]  Ludwig Krippahl,et al.  Integrating protein structural information , 2003 .

[13]  K. Wüthrich,et al.  Torsion angle dynamics for NMR structure calculation with the new program DYANA. , 1997, Journal of molecular biology.

[14]  Matthew L. Ginsberg,et al.  Limited Discrepancy Search , 1995, IJCAI.

[15]  Stephen E Harding,et al.  The electron transfer complexes of cytochrome c peroxidase from Paracoccus denitrificans. , 2003, Biochemistry.

[16]  Pedro Barahona,et al.  Propagating N-Ary Rigid-Body Constraints , 2003, CP.

[17]  Françoise Guerlesquin,et al.  A novel approach for assesing macromolecular complexes combining soft‐docking calculations with NMR data , 2001, Protein science : a publication of the Protein Society.

[18]  Agostino Dovier,et al.  Using Secondary Structure Information for Protein Folding in CLP(FD) , 2002, Electron. Notes Theor. Comput. Sci..

[19]  Bruce Randall Donald,et al.  A Polynomial-Time Algorithm for De Novo Protein Backbone Structure Determination from Nuclear Magnetic Resonance Data , 2006, J. Comput. Biol..

[20]  Alessandro Dal Palù,et al.  Constraint Logic Programming approach to protein structure prediction , 2004, BMC Bioinformatics.

[21]  François Fages,et al.  The Biochemical Abstract Machine BIOCHAM , 2004, CMSB.

[22]  Francesca Rossi,et al.  Principles and Practice of Constraint Programming – CP 2003 , 2003, Lecture Notes in Computer Science.

[23]  François Fages,et al.  Modelling and querying interaction networks in the biochemical abstract machine BIOCHAM , 2002 .

[24]  Ludwig Krippahl,et al.  Pseudoazurin–Nitrite Reductase Interactions , 2005, Chembiochem : a European journal of chemical biology.

[25]  Pedro Barahona,et al.  Applying Constraint Programming to Rigid Body Protein Docking , 2005, CP.

[26]  Pedro Barahona,et al.  PSICO: Solving Protein Structures with Constraint Programming and Optimization , 2002, Constraints.

[27]  José Carlos Almeida Santos,et al.  Mining protein structure data , 2006 .

[28]  Rolf Backofen,et al.  A Constraint-Based Approach to Fast and Exact Structure Prediction in Three-Dimensional Protein Models , 2006, Constraints.

[29]  C. Dominguez,et al.  HADDOCK: a protein-protein docking approach based on biochemical or biophysical information. , 2003, Journal of the American Chemical Society.

[30]  Ludwig Krippahl Integrating Web Resources to Model Protein Structure and Function , 2006, Reasoning Web.

[31]  Pedro Barahona,et al.  Applying Constraint Programming to Protein Structure Determination , 1999, CP.

[32]  C Kooperberg,et al.  Assembly of protein tertiary structures from fragments with similar local sequences using simulated annealing and Bayesian scoring functions. , 1997, Journal of molecular biology.