Using the Multi-instance Learning Method to Predict Protein-Protein Interactions with Domain Information

Identifying protein-protein interactions (PPIs) can help us to know the protein function and is critical for understanding the mechanisms of proteome. Recently, lots of computational methods such as the domain-based approach have been developed for predicting the protein-protein interactions. The conventional domain-based methods usually need to infer the interacting domain pairs from already known interacting sets of proteins, and then to predict the PPIs. However, it is difficult to provide the detailed information that which of the domain pairs will actually interact for the PPIs prediction. Therefore, it is of great importance to develop a new computational model which can ignore the information whether a domain pair is interacting or not. In this paper, we propose a novel method using multi-instance learning (MIL) for predicting protein-protein interactions based on the domain information. Firstly, the domain pairs of two proteins were composed. Then, we use the amino acid composition feature encoding method to encode the domain pairs. Finally, two multi-instance learning methods were used for training the data. The experiment results demonstrate that the proposed method is effective.

[1]  Zhi-Hua Zhou,et al.  Ensembles of Multi-instance Learners , 2003, ECML.

[2]  B. Honig,et al.  Structure-based prediction of protein-protein interactions on a genome-wide scale , 2012, Nature.

[3]  Bo Zhang,et al.  A geometrical representation of McCulloch-Pitts neural model and its applications , 1999, IEEE Trans. Neural Networks.

[4]  Yanzhi Guo,et al.  Using support vector machine combined with auto covariance to predict protein–protein interactions from protein sequences , 2008, Nucleic acids research.

[5]  Zhu-Hong You,et al.  Prediction of protein-protein interactions from amino acid sequences with ensemble extreme learning machines and principal component analysis , 2013, BMC Bioinformatics.

[6]  Chen Xiang Department of Computer Science and Technology , 2006 .

[7]  Zalmiyah Zakaria,et al.  Utilizing shared interacting domain patterns and Gene Ontology information to improve protein-protein interaction prediction , 2010, Comput. Biol. Medicine.

[8]  Jaques Reifman,et al.  Reconstituting protein interaction networks using parameter-dependent domain-domain interactions , 2012, BMC Bioinformatics.

[9]  Wang Fei,et al.  Structural Domain Based Multiple Instance Learning for Predicting Gram-Positive Bacterial Protein Subcellular Localization , 2009, 2009 International Joint Conference on Bioinformatics, Systems Biology and Intelligent Computing.

[10]  Jun Wang,et al.  Solving the Multiple-Instance Problem: A Lazy Learning Approach , 2000, ICML.

[11]  Sharmila Anishetty,et al.  A matrix based algorithm for Protein-Protein Interaction prediction using Domain-Domain Associations. , 2013, Journal of theoretical biology.

[12]  Reza Ebrahimpour,et al.  PPIevo: protein-protein interaction prediction from PSSM based evolutionary information. , 2013, Genomics.

[13]  Dong-Soo Han,et al.  A Computational Model for Predicting Protein Interactions Based on Multidomain Collaboration , 2012, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[14]  Juwen Shen,et al.  Predicting protein–protein interactions based only on sequences information , 2007, Proceedings of the National Academy of Sciences.

[15]  De-Shuang Huang,et al.  Predicting protein–protein interactions from sequence using correlation coefficient and high-quality interaction dataset , 2010, Amino Acids.

[16]  Farshad Fotouhi,et al.  Computational Approaches for Predicting Protein–Protein Interactions: A Survey , 2006, Journal of Medical Systems.

[17]  Gary D Bader,et al.  Computational Prediction of Protein–Protein Interactions , 2008, Molecular biotechnology.

[18]  Thomas Gärtner,et al.  Multi-Instance Kernels , 2002, ICML.

[19]  Jie Tang,et al.  Multiple-instance learning with instance selection via constructive covering algorithm , 2014, Tsinghua Science and Technology.

[20]  Hendrik Blockeel,et al.  Multi-Instance Learning , 2010, Encyclopedia of Machine Learning.