RP-FIRF: Prediction of Self-interacting Proteins Using Random Projection Classifier Combining with Finite Impulse Response Filter

The self-interacting proteins (SIPs) plays a significant part in the organism and the regulation of cellular functions. Thence, we developed an effective algorithm to predict SIPs, named RP-FIRF, which merges the Random Projection (RP) classifier and Finite Impulse Response Filter (FIRF) together. More specifically, the Position Specific Scoring Matrix (PSSM) was firstly converted from protein sequence by exploiting Position Specific Iterated BLAST (PSI-BLAST). Then, we obtained the same size of matrix by implementing a valid matrix multiplication on PSSM, and applied FIRF approach to calculate the eigenvalues of each protein. The Principal Component Analysis (PCA) approach is used to extract the most relevant information. Finally, the performance of the proposed method is performed on human dataset. The results show that our model can achieve high average accuracies of 97.89% on human dataset using the 5-fold cross-validation, which demonstrate that our method is a useful tool for identifying SIPs.

[1]  Shuai Li,et al.  A MapReduce based parallel SVM for large-scale predicting protein-protein interactions , 2014, Neurocomputing.

[2]  Zhu-Hong You,et al.  Increasing the reliability of protein-protein interaction networks via non-convex semantic embedding , 2013, Neurocomputing.

[3]  Xing Chen,et al.  FMLNCSIM: fuzzy measure-based lncRNA functional similarity calculation model , 2016, Oncotarget.

[4]  Zhu-Hong You,et al.  Using Weighted Sparse Representation Model Combined with Discrete Cosine Transformation to Predict Protein-Protein Interactions from Protein Sequence , 2015, BioMed research international.

[5]  Xing Chen,et al.  Accurate prediction of protein-protein interactions by integrating potential evolutionary information embedded in PSSM profile and discriminative vector machine classifier , 2017, Oncotarget.

[6]  Zhu-Hong You,et al.  Using manifold embedding for assessing and predicting protein interactions from high-throughput experimental data , 2010, Bioinform..

[7]  MengChu Zhou,et al.  Highly Efficient Framework for Predicting Interactions Between Proteins , 2017, IEEE Transactions on Cybernetics.

[8]  Sylvie Ricard-Blum,et al.  MatrixDB, the extracellular matrix interaction database: updated content, a new navigator and expanded functionalities , 2014, Nucleic Acids Res..

[9]  Kara Dolinski,et al.  The BioGRID interaction database: 2015 update , 2014, Nucleic Acids Res..

[10]  Zhu-Hong You,et al.  ILNCSIM: improved lncRNA functional similarity calculation model , 2016, Oncotarget.

[11]  A. D. McLachlan,et al.  Profile analysis: detection of distantly related proteins. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[12]  L. Baisamy,et al.  Leucine Zipper-mediated Homo-oligomerization Regulates the Rho-GEF Activity of AKAP-Lbc* , 2005, Journal of Biological Chemistry.

[13]  Jiangning Song,et al.  SPAR: a random forest-based predictor for self-interacting proteins with fine-grained domain information , 2016, Amino Acids.

[14]  María Martín,et al.  UniProt: A hub for protein information , 2015 .

[15]  Zhu-Hong You,et al.  Detecting Protein-Protein Interactions with a Novel Matrix-Based Protein Sequence Representation and Support Vector Machines , 2015, BioMed research international.

[16]  Zhu-Hong You,et al.  Predicting Protein-Protein Interactions from Primary Protein Sequences Using a Novel Multi-Scale Local Feature Representation Scheme and the Random Forest , 2015, PloS one.

[17]  Xing Chen,et al.  PCVMZM: Using the Probabilistic Classification Vector Machines Model Combined with a Zernike Moments Descriptor to Predict Protein–Protein Interactions from Protein Sequences , 2017, International journal of molecular sciences.

[18]  Reza Ebrahimpour,et al.  PPIevo: protein-protein interaction prediction from PSSM based evolutionary information. , 2013, Genomics.

[19]  L Shapiro,et al.  Linking molecular affinity and cellular specificity in cadherin-mediated adhesion , 2009, Proceedings of the National Academy of Sciences.

[20]  Karin Breuer,et al.  InnateDB: systems biology of innate immunity and beyond—recent updates and continuing curation , 2012, Nucleic Acids Res..

[21]  Motonori Ota,et al.  Alteration of oligomeric state and domain architecture is essential for functional transformation between transferase and hydrolase with the same scaffold , 2009, Protein Science.

[22]  Xing Chen,et al.  Predicting protein-protein interactions from protein sequences by a stacked sparse autoencoder deep neural network. , 2017, Molecular bioSystems.

[23]  Jian Wang,et al.  Proteome-wide Prediction of Self-interacting Proteins Based on Multiple Properties* , 2013, Molecular & Cellular Proteomics.

[24]  Zhu-Hong You,et al.  t-LSE: A Novel Robust Geometric Approach for Modeling Protein-Protein Interaction Networks , 2013, PloS one.

[25]  Tianwei Yu,et al.  K-Profiles: A Nonlinear Clustering Method for Pattern Detection in High Dimensional Data , 2015, BioMed research international.

[26]  Shuai Li,et al.  Inverse-Free Extreme Learning Machine With Optimal Information Updating , 2016, IEEE Transactions on Cybernetics.

[27]  Xing-Ming Zhao,et al.  Improved method for predicting phi-turns in proteins using a two-stage classifier. , 2010, Protein and peptide letters.

[28]  Xing Chen,et al.  DroidDet: Effective and robust detection of android malware using static analysis along with rotation forest model , 2018, Neurocomputing.

[29]  Zhu-Hong You,et al.  An improved sequence-based prediction protocol for protein-protein interactions using amino acids substitution matrix and rotation forest ensemble classifiers , 2017, Neurocomputing.

[30]  Yong Zhou,et al.  Advancing the prediction accuracy of protein-protein interactions by utilizing evolutionary information from position-specific scoring matrix and ensemble classifier. , 2017, Journal of theoretical biology.

[31]  Yong Zhou,et al.  Ens-PPI: A Novel Ensemble Classifier for Predicting the Interactions of Proteins Using Autocovariance Transformation from PSSM , 2016, BioMed research international.

[32]  Zhen Ji,et al.  Large-Scale Protein-Protein Interactions Detection by Integrating Big Biosensing Data with Computational Model , 2014, BioMed research international.

[33]  Nik Bessis,et al.  CS-PSO: chaotic particle swarm optimization algorithm for solving combinatorial optimization problems , 2016, Soft Computing.

[34]  Xiaobo Zhou,et al.  A semi-supervised learning approach to predict synthetic genetic interactions by combining functional and topological properties of functional gene network , 2010, BMC Bioinformatics.

[35]  Xing Chen,et al.  Long non-coding RNAs and complex diseases: from experimental results to computational models , 2016, Briefings Bioinform..

[36]  Angel F. Lopez,et al.  The Dimeric Versus Monomeric Status of 14-3-3ζ Is Controlled by Phosphorylation of Ser58 at the Dimer Interface* , 2003, Journal of Biological Chemistry.

[37]  Zhu-Hong You,et al.  An ensemble approach for large-scale identification of protein-protein interactions using the alignments of multiple sequences , 2016, Oncotarget.

[38]  Rafael C. Jimenez,et al.  The MIntAct project—IntAct as a common curation platform for 11 molecular interaction databases , 2013, Nucleic Acids Res..

[39]  Zhen Ji,et al.  Assessing and predicting protein interactions by combining manifold embedding with multiple information integration , 2012, BMC Bioinformatics.

[40]  I. Ispolatov,et al.  Binding properties and evolution of homodimers in protein–protein interaction networks , 2005, Nucleic acids research.

[41]  S F Altschul,et al.  Iterated profile searches with PSI-BLAST--a tool for discovery in protein databases. , 1998, Trends in biochemical sciences.

[42]  Xing Chen,et al.  Improved protein-protein interactions prediction via weighted sparse representation model combining continuous wavelet descriptor and PseAA composition , 2016, BMC Systems Biology.

[43]  Javier De Las Rivas,et al.  Protein–Protein Interactions Essentials: Key Concepts to Building and Analyzing Interactome Networks , 2010, PLoS Comput. Biol..

[44]  Reza Ebrahimpour,et al.  LocFuse: human protein-protein interaction prediction via classifier fusion using protein localization information. , 2014, Genomics.

[45]  Zhu-Hong You,et al.  Increasing reliability of protein interactome by fast manifold embedding , 2013, Pattern Recognit. Lett..

[46]  Xing Chen,et al.  Identification of self-interacting proteins by exploring evolutionary information embedded in PSI-BLAST-constructed position specific scoring matrix , 2016, Oncotarget.

[47]  Xing Chen,et al.  Robust and accurate prediction of protein self-interactions from amino acids sequence using evolutionary information. , 2016, Molecular bioSystems.

[48]  Zhu-Hong You,et al.  A novel approach based on KATZ measure to predict associations of human microbiota with non‐infectious diseases , 2016, Bioinform..

[49]  Xing Chen,et al.  PSPEL: In Silico Prediction of Self-Interacting Proteins from Amino Acids Sequences Using Ensemble Learning , 2017, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[50]  Lior Rokach,et al.  Random Projection Ensemble Classifiers , 2009, ICEIS.

[51]  Yong Zhou,et al.  An improved efficient rotation forest algorithm to predict the interactions among proteins , 2018, Soft Comput..

[52]  Xing Chen,et al.  Sequence-based prediction of protein-protein interactions using weighted sparse representation model combined with global encoding , 2016, BMC Bioinformatics.

[53]  Xing Chen,et al.  PBMDA: A novel and effective path-based computational model for miRNA-disease association prediction , 2017, PLoS Comput. Biol..

[54]  Xiuquan Du,et al.  A Novel Feature Extraction Scheme with Ensemble Coding for Protein–Protein Interaction Prediction , 2014, International journal of molecular sciences.

[55]  J. Matthews,et al.  The power of two: protein dimerization in biology. , 2004, Trends in biochemical sciences.

[56]  Xing Chen,et al.  Construction of reliable protein-protein interaction networks using weighted sparse representation based classifier with pseudo substitution matrix representation features , 2016, Neurocomputing.

[57]  Hai-Cheng Yi,et al.  Detection of Interactions between Proteins by Using Legendre Moments Descriptor to Extract Discriminatory Information Embedded in PSSM , 2017, Molecules.