Protein secondary structure prediction using modular reciprocal bidirectional recurrent neural networks

The supervised learning of recurrent neural networks well-suited for prediction of protein secondary structures from the underlying amino acids sequence is studied. Modular reciprocal recurrent neural networks (MRR-NN) are proposed to model the strong correlations between adjacent secondary structure elements. Besides, a multilayer bidirectional recurrent neural network (MBR-NN) is introduced to capture the long-range intramolecular interactions between amino acids in formation of the secondary structure. The final modular prediction system is devised based on the interactive integration of the MRR-NN and the MBR-NN structures to arbitrarily engage the neighboring effects of the secondary structure types concurrent with memorizing the sequential dependencies of amino acids along the protein chain. The advanced combined network augments the percentage accuracy (Q₃) to 79.36% and boosts the segment overlap (SOV) up to 70.09% when tested on the PSIPRED dataset in three-fold cross-validation.

[1]  Thomas G. Dietterich,et al.  Bioinformatics The Machine Learning Approach 2nd ed. , 2001 .

[2]  Jinmiao Chen,et al.  Cascaded Bidirectional Recurrent Neural Networks for Protein Secondary Structure Prediction , 2007, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[3]  B. Rost,et al.  Redefining the goals of protein secondary structure prediction. , 1994, Journal of molecular biology.

[4]  A. Tramontano,et al.  Critical assessment of methods of protein structure prediction (CASP)—round IX , 2011, Proteins.

[5]  Aoife McLysaght,et al.  Porter: a new, accurate server for protein secondary structure prediction , 2005, Bioinform..

[6]  Giovanni Soda,et al.  Exploiting the past and the future in protein secondary structure prediction , 1999, Bioinform..

[7]  Ron Shamir,et al.  Artificial Intelligence and Heuristic Methods in Bioinformatics , 2003 .

[8]  Seyyed Ali Seyyedsalehi,et al.  Pruning neural networks for protein secondary structure prediction , 2008, 2008 8th IEEE International Conference on BioInformatics and BioEngineering.

[9]  G J Barton,et al.  Evaluation and improvement of multiple sequence methods for protein secondary structure prediction , 1999, Proteins.

[10]  Jimin Pei,et al.  Analysis of CASP8 targets, predictions and assessment methods , 2009, Database J. Biol. Databases Curation.

[11]  D T Jones,et al.  Protein secondary structure prediction based on position-specific scoring matrices. , 1999, Journal of molecular biology.

[12]  Alessandro Vullo,et al.  Accurate prediction of protein secondary structure and solvent accessibility by consensus combiners of sequence and structure information , 2007, BMC Bioinformatics.

[13]  B. Rost,et al.  A modified definition of Sov, a segment‐based measure for protein secondary structure prediction assessment , 1999, Proteins.

[14]  Seyyed Ali Seyyed Salehi,et al.  Modeling phones coarticulation effects in a neural network based speech recognition system , 2004, INTERSPEECH.

[15]  Pierre Baldi,et al.  Bioinformatics - the machine learning approach (2. ed.) , 2000 .

[16]  Russell Reed,et al.  Pruning algorithms-a survey , 1993, IEEE Trans. Neural Networks.

[17]  B. Rost,et al.  Prediction of protein secondary structure at better than 70% accuracy. , 1993, Journal of molecular biology.

[18]  Jose C. Principe,et al.  Neural and Adaptive Systems: Fundamentals through Simulations with CD-ROM , 1999 .

[19]  S. Bryant,et al.  Critical assessment of methods of protein structure prediction (CASP): Round II , 1997, Proteins.

[20]  Zafer Aydin,et al.  A signal processing application in genomic research: protein secondary structure prediction , 2006 .

[21]  Hakan Erdogan,et al.  Bayesian Protein Secondary Structure Prediction With Near-Optimal Segmentations , 2007, IEEE Transactions on Signal Processing.

[22]  Wei Chu,et al.  Bayesian segmental models with multiple sequence alignment profiles for protein secondary structure and contact map prediction , 2006, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[23]  T. Sejnowski,et al.  Predicting the secondary structure of globular proteins using neural network models. , 1988, Journal of molecular biology.

[24]  W. Kabsch,et al.  Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features , 1983, Biopolymers.

[25]  A A Salamov,et al.  Prediction of protein secondary structure by combining nearest-neighbor algorithms and multiple sequence alignments. , 1995, Journal of molecular biology.

[26]  Pierre Baldi,et al.  Improving the prediction of protein secondary structure in three and eight classes using recurrent neural networks and profiles , 2002, Proteins.

[27]  E. Korner,et al.  Cortical architecture and self-referential control for brain-like computation , 2002, IEEE Engineering in Medicine and Biology Magazine.

[28]  Juan Cui,et al.  Recent progresses in the application of machine learning approach for predicting protein functional class independent of sequence similarity , 2006, Proteomics.

[29]  Yanqing Zhang,et al.  Protein Secondary Structure Prediction Using Genetic Neural Support Vector Machines , 2007, 2007 IEEE 7th International Symposium on BioInformatics and BioEngineering.

[30]  Alessio Ceroni,et al.  Learning protein secondary structure from sequential and relational data , 2005, Neural Networks.

[31]  M. Mesulam,et al.  From sensation to cognition. , 1998, Brain : a journal of neurology.

[32]  N. Balakrishnan,et al.  Characterization of protein secondary structure , 2004, IEEE Signal Processing Magazine.

[33]  Hava Siegelmann,et al.  Application of expert networks for predicting proteins secondary structure. , 2007, Biomolecular engineering.

[34]  Jose C. Principe,et al.  Neural and adaptive systems : fundamentals through simulations , 2000 .

[35]  Pierre Baldi,et al.  Three-stage prediction of protein ?-sheets by neural networks, alignments and graph algorithms , 2005, ISMB.