Adaptive Filtering Techniques Combined with Natural Selection-Based Heuristic Algorithms in the Prediction of Protein-Protein Interactions

The analysis of protein-protein interactions (PPIs) is crucial to the understanding of cellular organizations, processes and functions. The reliability of the current experimental approaches interaction data is prone to error. Thus, a variety of computational methods have been developed to supplement the interactions that have been detected experimentally. The present paper’s main objective is to present a novel classification framework for predicting PPIs combining the advantages of two algorithmic methods’ categories (heuristic methods, adaptive filtering techniques) in order to produce high performance classifiers while maintaining their interpretability. Our goal is to find a simple mathematical equation that governs the best classifier enabling the extraction of biological knowledge. State-of-the-art adaptive filtering techniques were combined with the most contemporary heuristic methods which are based in the natural selection process. To the best of our knowledge, this is the first time that the proposed classification framework is applied and analyzed extensively for the problem of predicting PPIs. The proposed methodology was tested with a commonly used data set using all possible combinations of the selected adaptive filtering and heuristic techniques and comparisons were made. The best algorithmic combinations derived from these procedures were Genetic Algorithms with Extended Kalman Filters and Particle Swarm Optimization with Extended Kalman Filters. Using these algorithmic combinations high accuracy interpretable classifiers were produced.

[1]  David A. Gough,et al.  Predicting protein-protein interactions from primary structure , 2001, Bioinform..

[2]  Andrei L. Lomize,et al.  OPM: Orientations of Proteins in Membranes database , 2006, Bioinform..

[3]  Bernard Widrow,et al.  Adaptive switching circuits , 1988 .

[4]  Graham C. Goodwin,et al.  Dynamic System Identification: Experiment Design and Data Analysis , 2012 .

[5]  Mei Liu,et al.  Prediction of protein-protein interactions using random decision forest framework , 2005, Bioinform..

[6]  Erik L. L. Sonnhammer,et al.  Inparanoid: a comprehensive database of eukaryotic orthologs , 2004, Nucleic Acids Res..

[7]  Richard S. Sutton,et al.  Adapting Bias by Gradient Descent: An Incremental Version of Delta-Bar-Delta , 1992, AAAI.

[8]  Natalie Wilson,et al.  Human Protein Reference Database , 2004, Nature Reviews Molecular Cell Biology.

[9]  A. Valencia,et al.  Prediction of protein--protein interaction sites in heterocomplexes with neural networks. , 2002, European journal of biochemistry.

[10]  John H. Holland,et al.  Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .

[11]  Athanasios K. Tsakalidis,et al.  A new hybrid method for predicting protein interactions using Genetic Algorithms and Extended Kalman Filters , 2010, Proceedings of the 10th IEEE International Conference on Information Technology and Applications in Biomedicine.

[12]  John H. Holland,et al.  Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .

[13]  James Kennedy,et al.  Particle swarm optimization , 2002, Proceedings of ICNN'95 - International Conference on Neural Networks.

[14]  Limsoon Wong,et al.  Exploiting Indirect Neighbours and Topological Weight to Predict Protein Function from Protein-Protein Interactions , 2006, BioDM.

[15]  Michelle S. Scott,et al.  Predicting subcellular localization via protein motif co-occurrence. , 2004, Genome research.

[16]  Paulo S. R. Diniz,et al.  Adaptive Filtering: Algorithms and Practical Implementation , 1997 .

[17]  John Holland,et al.  Adaptation in Natural and Artificial Sys-tems: An Introductory Analysis with Applications to Biology , 1975 .

[18]  Geoffrey J. Barton,et al.  Probabilistic prediction and ranking of human protein-protein interactions , 2007, BMC Bioinformatics.

[19]  Limsoon Wong,et al.  Exploiting indirect neighbours and topological weight to predict protein function from protein--protein interactions , 2006 .

[20]  Saman K. Halgamuge,et al.  Particle Swarm Optimization with Self-Adaptive Acceleration Coefficients , 2002, FSKD.

[21]  Greg Welch,et al.  An Introduction to Kalman Filter , 1995, SIGGRAPH 2001.