Prediction of Protein Topologies Using GIOHMMs and GRNNs

We develop and test new machine learning methods for the prediction of topological representations of protein structures in the form of coarseor ne-grained contact or distance maps that are translation and rotation invariant. The methods are based on generalized input-output hidden Markov models (GIOHMMs) and generalized recursive neural networks (GRNNs). The methods are used to predict topology directly in the ne-grained case and, in the coarsegrained case, indirectly by rst learning how to score candidate graphs and then using the scoring function to search the space of possible con gurations. Computer simulations show that the predictors achieve state-of-the-art performance.

[1]  Pierre Baldi,et al.  Prediction of contact maps by GIOHMMs and recurrent neural networks using lateral propagation from all four cardinal corners , 2002, ISMB.

[2]  Yoshua Bengio,et al.  Input-output HMMs for sequence processing , 1996, IEEE Trans. Neural Networks.

[3]  Gianluca Pollastri,et al.  Machine Learning Structural and Functional Proteomics , 2001 .

[4]  A. Lesk,et al.  Assessment of novel fold targets in CASP4: Predictions of three‐dimensional structures, secondary structures, and interresidue contacts , 2001, Proteins.

[5]  Pierre Baldi,et al.  Improving the prediction of protein secondary structure in three and eight classes using recurrent neural networks and profiles , 2002, Proteins.

[6]  Giovanni Soda,et al.  Exploiting the past and the future in protein secondary structure prediction , 1999, Bioinform..

[7]  A. Sali,et al.  Protein Structure Prediction and Structural Genomics , 2001, Science.

[8]  Michael I. Jordan,et al.  Factorial Hidden Markov Models , 1995, Machine Learning.

[9]  P Fariselli,et al.  Prediction of contact maps with neural networks and correlated mutations. , 2001, Protein engineering.

[10]  Pierre Baldi,et al.  Hybrid Modeling, HMM/NN Architectures, and Protein Applications , 1996, Neural Computation.

[11]  M Vendruscolo,et al.  Recovery of protein structure from contact maps. , 1997, Folding & design.

[12]  W. Kabsch,et al.  Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features , 1983, Biopolymers.

[13]  Alessandro Sperduti,et al.  A general framework for adaptive processing of data structures , 1998, IEEE Trans. Neural Networks.

[14]  P. Baldi,et al.  Prediction of coordination number and relative solvent accessibility in proteins , 2002, Proteins.