A Framework for Improving Protein Structure Predictions by Teamwork

Pridicting the three dimensional structure of proteins is a difficult task. In the last years several approaches have been proposed for performing this task taking into account different protein chemical and physical properties. As a result, a growing number of protein structure prediction tools is becoming available, some of them being specialized to work on either some aspects of the predictions or on some categories of proteins. In this context, it becomes useful to jointly apply different prediction techniques and combine their results in order to improve the quality of the predictions. However, several problems have to be solved in order to make this a viable possibility. In this paper we propose a framework allowing to (i) define a common reference applicative domain for different prediction techniques, (ii) characterize predictors through evaluating some quality parameters, (iii) characterize the performances of a team of predictors jointly applied over a prediction problem and (iv) obtain a unique prediction from the team. Finally, we highlight the application of this framework to the definition of a multi-agent system performing the team selection task, the integration of multiple, possibly heterogeneous, predictions and the translation of predictors inputs and outputs into a uniform data format.

[1]  Lawrence Hunter,et al.  Artificial Intelligence and Molecular Biology , 1992, AI Mag..

[2]  B. Rost,et al.  Prediction of protein secondary structure at better than 70% accuracy. , 1993, Journal of molecular biology.

[3]  Garland R. Marshall,et al.  Properties of intraglobular contacts in proteins: an approach to prediction of tertiary structure , 1994, 1994 Proceedings of the Twenty-Seventh Hawaii International Conference on System Sciences.

[4]  D. Fischer,et al.  Protein fold recognition using sequence‐derived predictions , 1996, Protein science : a publication of the Protein Society.

[5]  A. Valencia,et al.  Improving contact predictions by the combination of correlated mutations and other sources of sequence information. , 1997, Folding & design.

[6]  Giancarlo Mauri,et al.  Application of Evolutionary Algorithms to Protein Folding Prediction , 1997, Artificial Evolution.

[7]  N. Guex,et al.  SWISS‐MODEL and the Swiss‐Pdb Viewer: An environment for comparative protein modeling , 1997, Electrophoresis.

[8]  B. Rost,et al.  Protein fold recognition by prediction-based threading. , 1997, Journal of molecular biology.

[9]  O. Lund,et al.  Protein distance constraints predicted by neural networks and probability density functions. , 1997, Protein engineering.

[10]  M. Hill,et al.  Uclacyanins, stellacyanins, and plantacyanins are distinct subfamilies of phytocyanins: Plant‐specific mononuclear blue copper proteins , 1998, Protein science : a publication of the Protein Society.

[11]  P E Bourne,et al.  Protein structure alignment by incremental combinatorial extension (CE) of the optimal path. , 1998, Protein engineering.

[12]  J. Ponder,et al.  Protein structure prediction using a combination of sequence homology and global energy minimization: II. Energy functions , 1998 .

[13]  R. Friesner,et al.  Tertiary structure prediction of mixed α/β proteins via energy minimization , 1998 .

[14]  Andrew E. Torda,et al.  Sausage: protein threading with flexible force fields , 1999, Bioinform..

[15]  William E. Hart,et al.  Protein structure prediction with evolutionary algorithms , 1999 .

[16]  G J Barton,et al.  Evaluation and improvement of multiple sequence methods for protein secondary structure prediction , 1999, Proteins.

[17]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[18]  David Edwards,et al.  From Fold Recognition to Homology Modeling: An Analysis of Protein Modeling Challenges At Different Levels of Prediction Complexity , 2000, Comput. Chem..

[19]  D. Baker,et al.  De novo protein structure determination using sparse NMR data , 2000, Journal of biomolecular NMR.

[20]  C. Chothia,et al.  Assignment of homology to genome sequences using a library of hidden Markov models that represent all proteins of known structure. , 2001, Journal of molecular biology.

[21]  Marc A. Martí-Renom,et al.  EVA: continuous automatic evaluation of protein structure prediction servers , 2001, Bioinform..

[22]  Giovanni Soda,et al.  Bidirectional Dynamics for Protein Secondary Structure Prediction , 2001, Sequence Learning.

[23]  J Meller,et al.  Linear programming optimization and a double statistical filter for protein threading protocols , 2001, Proteins.

[24]  B. Rost Protein Structure Prediction in 1D, 2D, and 3D , 2002 .

[25]  Luigi Palopoli,et al.  Exploiting Agents for Improving Protein Structure Prediction by Teamwork , 2002 .