Determination of protein-protein interaction through Artificial Neural Network and Support Vector Machine: A Comparative study

Protein-protein interactions (PPI) plays considerable role in most of the cellular processes and study of PPI enhances understanding of molecular mechanism of the cells. After emergence of proteomics, huge amount of protein sequences were generated but there interaction patterns are still unrevealed. Traditionally various techniques were used to predict PPI but are deficient in terms of accuracy. To overcome the limitations of experimental approaches numerous computational approaches were developed to find PPI. However previous computational approaches were based on descriptors, various external factors and protein sequences. In this article, a sequence based prediction model is proposed by using various machine learning approaches. A comparative study was done to understand efficiency of various machine learning approaches. Large amount of yeast PPI data have been analyzed. Same data has been incorporated for different classification approach like Artificial Neural Network (ANN) and Support Vector Machine (SVM), and compared their results. Existing methods with additional features were implemented to enhance the accuracy of the result. Thus it was concluded that efficiency of this model was more admirable than those existing sequence-based methods; therefore it can be effective for future proteomics research work.

[1]  Kyungsook Han,et al.  Sequence-based prediction of protein-protein interactions by means of rotation forest and autocorrelation descriptor. , 2010, Protein and peptide letters.

[2]  Chih-Jen Lin,et al.  A Practical Guide to Support Vector Classication , 2008 .

[3]  Sayan Mukherjee,et al.  Feature Selection for SVMs , 2000, NIPS.

[4]  T. Takagi,et al.  Assessment of prediction accuracy of protein function from protein–protein interaction data , 2001, Yeast.

[5]  David A. Gough,et al.  Predicting protein-protein interactions from primary structure , 2001, Bioinform..

[6]  Loris Nanni,et al.  Fusion of classifiers for predicting protein-protein interactions , 2005, Neurocomputing.

[7]  J H Lakey,et al.  Measuring protein-protein interactions. , 1998, Current opinion in structural biology.

[8]  Xue-wen Chen,et al.  KUPS: constructing datasets of interacting and non-interacting protein pairs with associated attributions , 2010, Nucleic Acids Res..

[9]  Yanzhi Guo,et al.  Using support vector machine combined with auto covariance to predict protein–protein interactions from protein sequences , 2008, Nucleic acids research.

[10]  Kui Zhang,et al.  Prediction of protein function using protein-protein interaction data , 2002, Proceedings. IEEE Computer Society Bioinformatics Conference.

[11]  Lei Wang,et al.  AdaBoost with SVM-based component classifiers , 2008, Eng. Appl. Artif. Intell..

[12]  Chris H. Q. Ding,et al.  Multi-class protein fold recognition using support vector machines and neural networks , 2001, Bioinform..

[13]  J M Gauthier,et al.  Protein--protein interaction maps: a lead towards cellular functions. , 2001, Trends in genetics : TIG.

[14]  M. Charton,et al.  The structural dependence of amino acid hydrophobicity parameters. , 1982, Journal of theoretical biology.

[15]  Loris Nanni,et al.  An ensemble of K-local hyperplanes for predicting protein-protein interactions , 2006, Bioinform..

[16]  D. Eisenberg,et al.  Detecting protein function and protein-protein interactions from genome sequences. , 1999, Science.

[17]  A. Valencia,et al.  Correlated mutations contain information about protein-protein interaction. , 1997, Journal of molecular biology.

[18]  Andrey Rzhetsky,et al.  Towards the Prediction of Complete Protein-Protein Interaction Networks , 2001, Pacific Symposium on Biocomputing.

[19]  Stanley Letovsky,et al.  Predicting protein function from protein/protein interaction data: a probabilistic approach , 2003, ISMB.

[20]  A. Valencia,et al.  Computational methods for the prediction of protein interactions. , 2002, Current opinion in structural biology.

[21]  Wing-Kin Sung,et al.  Probabilistic prediction of protein-protein interactions from the protein sequences , 2006, Comput. Biol. Medicine.

[22]  Ioannis Xenarios,et al.  Mining literature for protein-protein interactions , 2001, Bioinform..