Incorporating fuzzy semantic similarity measure in detecting human protein complexes in PPI network: A multiobjective approach

Detection of protein complexes within protein-protein interaction networks (PPIN) is a valuable step toward the analysis of biological processes and pathways. Several high-throughput experimental techniques produce large number of PPIs that can be extensively utilized for constructing PPI network of a species. Decomposition of the whole PPI network into smaller and manageable modules is an ongoing challenge. Here we have developed a multi-objective algorithm for detecting human protein complexes by partitioning large human PPI network into clusters which serve as protein complexes. Some graphical properties like density, centrality etc., are utilized for building the objectives. Besides the graphical properties we have also exploited a fuzzy measure based semantic similarity approach to construct similarity based objective. The proposed technique is demonstrated in the human PPI network and the resulting complexes are analyzed in context of Gene Ontology (GO) and pathway enrichment. We have also compared our results with that of some state-of-the-art algorithms in context of different performance metrics. The biological relevance of our predicted complexes are also established here by linking them with 22 key disease classes.

[1]  Jacques van Helden,et al.  Evaluation of clustering algorithms for protein-protein interaction networks , 2006, BMC Bioinformatics.

[2]  Kalyanmoy Deb,et al.  A fast and elitist multiobjective genetic algorithm: NSGA-II , 2002, IEEE Trans. Evol. Comput..

[3]  A. Wagner How the global structure of protein interaction networks evolves , 2002, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[4]  Brad T. Sherman,et al.  The DAVID Gene Functional Classification Tool: a novel biological module-centric algorithm to functionally analyze large gene lists , 2007, Genome Biology.

[5]  Anton J. Enright,et al.  Detection of functional modules from protein interaction networks , 2003, Proteins.

[6]  L. Mirny,et al.  Protein complexes and functional modules in molecular networks , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[7]  H. Clevers,et al.  Linking Colorectal Cancer to Wnt Signaling , 2000, Cell.

[8]  Haiyuan Yu,et al.  Detecting overlapping protein complexes in protein-protein interaction networks , 2012, Nature Methods.

[9]  A. Barabasi,et al.  The human disease network , 2007, Proceedings of the National Academy of Sciences.

[10]  R. Holcombe,et al.  Wnt signaling and colon carcinogenesis: Beyond APC , 2011, Journal of carcinogenesis.

[11]  Serhiy Souchelnytskyi,et al.  Smad regulation in TGF-β signal transduction , 2022 .

[12]  C. Heldin,et al.  Smad regulation in TGF-beta signal transduction. , 2001, Journal of cell science.

[13]  A. Mukhopadhyay,et al.  Detecting protein complexes in a PPI network: a gene ontology based multi-objective evolutionary approach. , 2012, Molecular bioSystems.

[14]  Ichiro Takada,et al.  Wnt and PPARγ signaling in osteoblastogenesis and adipogenesis , 2009, Nature Reviews Rheumatology.

[15]  Olivier Bodenreider,et al.  Gene expression correlation and gene ontology-based similarity: an assessment of quantitative relationships , 2004, 2004 Symposium on Computational Intelligence in Bioinformatics and Computational Biology.

[16]  James M. Keller,et al.  Fuzzy Measures on the Gene Ontology for Gene Product Similarity , 2006, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[17]  Hans Clevers,et al.  Caught up in a Wnt storm: Wnt signaling in cancer. , 2003, Biochimica et biophysica acta.

[18]  Thomas Lengauer,et al.  A new measure for functional similarity of gene products based on Gene Ontology , 2006, BMC Bioinformatics.

[19]  Dipanwita Roy Chowdhury,et al.  Human protein reference database as a discovery resource for proteomics , 2004, Nucleic Acids Res..

[20]  Katsuhiko Murakami,et al.  PCDq: human protein complex database with quality index which summarizes different levels of evidences of protein complexes predicted from H-Invitational protein-protein interactions integrative dataset , 2012, BMC Systems Biology.

[21]  Delbert Dueck,et al.  Clustering by Passing Messages Between Data Points , 2007, Science.

[22]  Gary D. Bader,et al.  An automated method for finding molecular complexes in large protein interaction networks , 2003, BMC Bioinformatics.