Optimizing an Empirical Scoring Function for Transmembrane Protein Structure Determination

We examine the problem of transmembrane protein structure determination. Like many questions that arise in biological research, this problem cannot be addressed generally by traditional laboratory experimentation alone. Instead, an approach that integrates experiment and computation is required. We formulate the transmembrane protein structure determination problem as a bound-constrained optimization problem using a special empirical scoring function, called Bundler, as the objective function. In this paper, we describe the optimization problem and its mathematical properties, and we examine results obtained using two different derivative-free optimization algorithms.

[1]  Henry D. Shapiro,et al.  Algorithms from P to NP (vol. 1): design and efficiency , 1991 .

[2]  C. Storey,et al.  Aspiration Based Simulated Annealing Algorithm , 1997, J. Glob. Optim..

[3]  Peter L. Freddolino,et al.  Prediction of structure and function of G protein-coupled receptors , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[4]  Henry D. Shapiro,et al.  Algorithms from P to NP , 1991 .

[5]  C C Huang,et al.  Annotating PDB files with scene information. , 1995, Journal of molecular graphics.

[6]  Mark Daniel Rintoul,et al.  Constrained Walks and Self-Avoiding Walks: Implications for Protein Structure Determination , 2000 .

[7]  C. Branden,et al.  Introduction to protein structure , 1991 .

[8]  Geoffrey C. Fox,et al.  A Comparison of Annealing Techniques for Academic Course Scheduling , 1997, PATAT.

[9]  J U Bowie,et al.  Helix packing in membrane proteins. , 1997, Journal of molecular biology.

[10]  Cecilia R. Aragon,et al.  Optimization by Simulated Annealing: An Experimental Evaluation; Part II, Graph Coloring and Number Partitioning , 1991, Oper. Res..

[11]  Z. Xiang,et al.  Extending the accuracy limits of prediction for side-chain conformations. , 2001, Journal of molecular biology.

[12]  Jean-Loup Faulon,et al.  Exploring the conformational space of membrane protein folds matching distance constraints , 2003, Protein science : a publication of the Protein Society.

[13]  G R Marshall,et al.  Novel approach to computer modeling of seven-helical transmembrane proteins: current progress in the test case of bacteriorhodopsin. , 2001, Acta biochimica Polonica.

[14]  Peter Salamon,et al.  Facts, Conjectures, and Improvements for Simulated Annealing , 1987 .

[15]  S H White,et al.  MPtopo: A database of membrane protein topology , 2001, Protein science : a publication of the Protein Society.

[16]  C. D. Gelatt,et al.  Optimization by Simulated Annealing , 1983, Science.

[17]  Jie Liang,et al.  Helix-helix packing and interfacial pairwise interactions of residues in membrane proteins. , 2001, Journal of molecular biology.

[18]  D Bergsma,et al.  Orphan G-protein coupled receptors: novel drug targets for the pharmaceutical industry. , 2000, Drug design and discovery.

[19]  Mauro Piccioni,et al.  A combined multistart-annealing algorithm for continuous global optimization , 1991 .

[20]  F Seno,et al.  Optimal potentials for predicting inter‐helical packing in transmembrane proteins , 2002, Proteins.

[21]  Donald Geman,et al.  Stochastic Relaxation, Gibbs Distributions, and the Bayesian Restoration of Images , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22]  Cecilia R. Aragon,et al.  Optimization by Simulated Annealing: An Experimental Evaluation; Part I, Graph Partitioning , 1989, Oper. Res..

[23]  Gareth R. Eaton,et al.  Distance Measurements in Biological Systems by EPR , 2002, Biological Magnetic Resonance.

[24]  O. SIAMJ.,et al.  ON THE CONVERGENCE OF PATTERN SEARCH ALGORITHMS , 1997 .

[25]  S H White,et al.  Energetics, stability, and prediction of transmembrane helices. , 2001, Journal of molecular biology.

[26]  Anthony K. Cheetham,et al.  The synthesis of the new zeolite, ERS-7, and the determination of its structure by simulated annealing and synchrotron X-ray powder diffraction , 1998 .

[27]  Tamara G. Kolda,et al.  Optimization by Direct Search: New Perspectives on Some Classical and Modern Methods , 2003, SIAM Rev..

[28]  Robert Michael Lewis,et al.  On the Local Convergence of Pattern Search , 2003, SIAM J. Optim..

[29]  Joseph S. Schoeniger,et al.  Optimal bundling of the transmembrane helices of integral membrane proteins using sparse distance constraints , 2004 .

[30]  Xavier Barril,et al.  Molecular modelling. , 2006, Molecular bioSystems.

[31]  D. Eisenberg Proteins. Structures and molecular properties, T.E. Creighton. W. H. Freeman and Company, New York (1984), 515, $36.95 , 1985 .

[32]  G Vriend,et al.  WHAT IF: a molecular modeling and drug design program. , 1990, Journal of molecular graphics.

[33]  Lester Ingber,et al.  Simulated annealing: Practice versus theory , 1993 .

[34]  Emile H. L. Aarts,et al.  Simulated Annealing: Theory and Applications , 1987, Mathematics and Its Applications.

[35]  SkjellumAnthony,et al.  A high-performance, portable implementation of the MPI message passing interface standard , 1996 .

[36]  William Gropp,et al.  Users guide for mpich, a portable implementation of MPI , 1996 .

[37]  Martin Pincus,et al.  Letter to the Editor - A Monte Carlo Method for the Approximate Solution of Certain Types of Constrained Optimization Problems , 1970, Oper. Res..

[38]  Tamara G. Kolda,et al.  On the Convergence of Asynchronous Parallel Pattern Search , 2002, SIAM J. Optim..

[39]  N. Metropolis,et al.  Equation of State Calculations by Fast Computing Machines , 1953, Resonance.

[40]  Philip M. Dean,et al.  An exploration of a novel strategy for superposing several flexible molecules , 1993, J. Comput. Aided Mol. Des..

[41]  G. Heijne,et al.  Genome‐wide analysis of integral membrane proteins from eubacterial, archaean, and eukaryotic organisms , 1998, Protein science : a publication of the Protein Society.

[42]  P. Halling Proteins: Structures and molecular properties (2nd edition). by Thomas E. Creighton, W. H. Freeman, New York, 1992, xiii + 512 pp, price £22.95. ISBN 0‐7167‐7030‐X , 1995 .

[43]  Frances M. G. Pearl,et al.  Gene3D: structural assignment for whole genes and genomes using the CATH domain structure database. , 2002, Genome research.

[44]  D. Goodsell,et al.  Automated docking of substrates to proteins by simulated annealing , 1990, Proteins.

[45]  John E. Dennis,et al.  Direct Search Methods on Parallel Machines , 1991, SIAM J. Optim..

[46]  P Herzyk,et al.  Using experimental information to produce a model of the transmembrane domain of the ion channel phospholamban. , 1998, Biophysical journal.

[47]  G. S. Stiles The effect of numerical precision upon simulated annealing , 1994 .

[48]  A. Krogh,et al.  Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes. , 2001, Journal of molecular biology.

[49]  J U Bowie,et al.  Helix‐bundle membrane protein fold templates , 1999, Protein science : a publication of the Protein Society.

[50]  Axel T. Brunger,et al.  X-PLOR Version 3.1: A System for X-ray Crystallography and NMR , 1992 .

[51]  M. J. D. Powell,et al.  Direct search algorithms for optimization calculations , 1998, Acta Numerica.

[52]  V. Torczon,et al.  RANK ORDERING AND POSITIVE BASES IN PATTERN SEARCH ALGORITHMS , 1996 .

[53]  Virginia Torczon,et al.  PDS: Direct Search Methods for Unconstrained Optimization on Either Sequential or Parallel Machines , 1992 .

[54]  Juan C. Meza,et al.  A Class of Trust-Region Methods for Parallel Optimization , 1999, SIAM J. Optim..

[55]  George D. Rose,et al.  Prediction of chain turns in globular proteins on a hydrophobic basis , 1978, Nature.

[56]  Tamara G. Kolda,et al.  APPSPACK 4.0: Asynchronous Parallel Pattern Search for Derivative-Free Optimization , 2004 .

[57]  Anthony Skjellum,et al.  A High-Performance, Portable Implementation of the MPI Message Passing Interface Standard , 1996, Parallel Comput..

[58]  H. Scheraga,et al.  An atomically detailed study of the folding pathways of protein A with the stochastic difference equation , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[59]  K. Palczewski,et al.  Crystal Structure of Rhodopsin: A G‐Protein‐Coupled Receptor , 2002, Chembiochem : a European journal of chemical biology.

[60]  L. Ingber Very fast simulated re-annealing , 1989 .

[61]  Fu-Hsieng Allisen Lee,et al.  Parallel simulated annealing on a message-passing multi-computer , 1996 .

[62]  Jie Liang,et al.  Higher-order interhelical spatial interactions in membrane proteins. , 2003, Journal of molecular biology.

[63]  N-city traveling salesman problem: Optimization by simulated annealings , 1986 .

[64]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[65]  F. Richards The interpretation of protein structures: total volume, group volume distributions and packing density. , 1974, Journal of molecular biology.

[66]  Tamara G. Kolda,et al.  Asynchronous Parallel Pattern Search for Nonlinear Optimization , 2001, SIAM J. Sci. Comput..

[67]  P. Adams,et al.  New applications of simulated annealing in X-ray crystallography and solution NMR. , 1997, Structure.