Combining Constraint Programming and Multidimensional Scaling to solve Distance Geometry Problems

In this paper we address the merits of combining constraint propagation, typical of Constraint Programming (CP), with local search to solve problems where each method alone can perform poorly. This combined approach is used in solving distance geometry problems, namely those arisen in the determination of spatial configuration of molecular structures from Nuclear Magnetic Resonance (NMR) data. In this problem, we apply a special purpose constraint propagation with local search optimisation based on a Multidimensional Scaling (MDS) method. In this case the methods are combined by using the CP algorithm to provide an approximate solution, which the MDS algorithm then refines until the desired solution is obtained. Thus we overcome two difficulties in this type of problem: on the one hand, great precision is difficult to achieve with the CP approach due to the combinatorial explosion of enumerating the variable domains; on the other hand, a good initial guess is very important for the efficiency of an MDS algorithm.

[1]  I. J. Schoenberg Remarks to Maurice Frechet's Article ``Sur La Definition Axiomatique D'Une Classe D'Espace Distances Vectoriellement Applicable Sur L'Espace De Hilbert , 1935 .

[2]  A. Householder,et al.  Discussion of a set of points in terms of their mutual distances , 1938 .

[3]  W. Torgerson Multidimensional scaling: I. Theory and method , 1952 .

[4]  J. Gower Some distance properties of latent root and vector methods used in multivariate analysis , 1966 .

[5]  R. Sibson Studies in the Robustness of Multidimensional Scaling: Perturbational Analysis of Classical Scaling , 1979 .

[6]  Eugene C. Freuder,et al.  The Complexity of Some Polynomial Network Consistency Algorithms for Constraint Satisfaction Problems , 1985, Artif. Intell..

[7]  B. Hendrickson The Molecular Problem: Determining Conformation from Pairwise Distances , 1990 .

[8]  Pascal Van Hentenryck,et al.  A Generic Arc-Consistency Algorithm and its Specializations , 1992, Artif. Intell..

[9]  F. Glover,et al.  In Modern Heuristic Techniques for Combinatorial Problems , 1993 .

[10]  Christian Bessiere,et al.  Arc-Consistency and Arc-Consistency Again , 1993, Artif. Intell..

[11]  C. Toniolo,et al.  N alpha-benzyloxycarbonyl-alpha-aminoisobutyryl-glycyl-L-isoleucyl- L-leucine methyl ester monohydrate. , 1994, Acta crystallographica. Section C, Crystal structure communications.

[12]  A. Fersht,et al.  Protein-protein recognition: crystal structural analysis of a barnase-barstar complex at 2.0-A resolution. , 1994, Biochemistry.

[13]  Nicolas Beldiceanu,et al.  Introducing global constraints in CHIP , 1994 .

[14]  Jorge J. Moré,et al.  E-optimal solutions to distance geometry problems via global continuation , 1995, Global Minimization of Nonconvex Energy Functions: Molecular Conformation and Protein Folding.

[15]  Jorge Nocedal,et al.  A Limited Memory Algorithm for Bound Constrained Optimization , 1995, SIAM J. Sci. Comput..

[16]  R. Huber,et al.  Crystal structure of desulforedoxin from Desulfovibrio gigas determined at 1.8 A resolution: a novel non-heme iron protein structure. , 1995, Journal of molecular biology.

[17]  B. Rupp,et al.  THE STRUCTURE OF BOVINE PANCREATIC TRYPSIN INHIBITOR AT 125K: DEFINITION OF CARBOXYL-TERMINAL RESIDUES GLYCINE-57 AND ALANINE-58 , 1995 .

[18]  F. A. Lootsma Distance Matrix Completion by Numerical Optimization , 1997 .

[19]  Jorge Nocedal,et al.  Algorithm 778: L-BFGS-B: Fortran subroutines for large-scale bound-constrained optimization , 1997, TOMS.

[20]  M. Trosset Computing Distances Between Convex Sets and Subsets of the Positive Semidefinite Matrices , 1997 .

[21]  Michael W. Trosset,et al.  Applications of Multidimensional Scaling to Molecular Conformation , 1997 .

[22]  E. Stavridi,et al.  Hydrophobic side‐chain size is a determinant of the three‐dimensional structure of the p53 oligomerization domain , 1997, The EMBO journal.

[23]  M. Trosset A New Formulation of the Nonmetric Strain Problem in Multidimensional Scaling , 1998 .

[24]  I. Moura,et al.  NMR determination of the global structure of the 113Cd derivative of desulforedoxin: Investigation of the hydrogen bonding pattern at the metal center , 1998, Protein science : a publication of the Protein Society.

[25]  Pedro Barahona,et al.  Applying Constraint Programming to Protein Structure Determination , 1999, CP.

[26]  M. Trosset,et al.  Deriving interatomic distance bounds from chemical structure , 1999 .

[27]  Pedro Barahona,et al.  PSICO: Solving Protein Structures with Constraint Programming and Optimization , 2002, Constraints.