Performance Analysis of Molecular Complex Detection in Social Network Datasets

Researches related graph dataset conducted for years. One of its main topics was community detection. The development of algorithms to do community detection continuously conducted by adjusting characteristics of datasets used. One of which is Molecular Complex Detection (MCODE) algorithm used to community detection in a dataset of protein-protein interaction (PPI). However, use of the algorithm still limited to PPI dataset only. The aim of this research was to conducted experiment usage of MCODE algorithm in other datasets such us social network datasets. An experiment conducted by comparing the performance of MCODE with other benchmark algorithms such us Label Propagation and Girvan-Newman. From the experiment performed was resulted that for modularity MCODE showed the best result when compared with others, followed Girvan-Newman and Label Propagation with its values were 0.67, 0.66 and 0.46, respectively. Furthermore, for a testing parameter such us running time and average clustering coefficient, MCODE showed better result compared with Girvan-Newman and Label Propagation. For running time, MCODE needed mean time as 0.053 s, GirvanNewman as 0.056 s and Label Propagation as 0.078 s and for test parameter of average clustering coefficient, MCODE was 0.37, Girvan-Newman was 0.44 and Label Propagation was 0.46. General Terms Data Mining, Graph Mining, Social Network Analysis, Algorithm

[1]  Jianhua Li,et al.  Automatic Threshold Calculation Based Label Propagation Algorithm for Overlapping Community , 2016, 2016 IEEE First International Conference on Data Science in Cyberspace (DSC).

[2]  Athina P. Petropulu,et al.  Detecting community structure using label propagation with consensus weight in complex network , 2014 .

[3]  Zhou Lihua,et al.  Improved Modularity Based on Girvan-Newman Modularity , 2012, 2012 Second International Conference on Intelligent System Design and Engineering Application.

[4]  J. Bezdek,et al.  FCM: The fuzzy c-means clustering algorithm , 1984 .

[5]  Laurent Jacques,et al.  Discriminative and Efficient Label Propagation on Complementary Graphs for Multi-Object Tracking , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Gary D. Bader,et al.  An automated method for finding molecular complexes in large protein interaction networks , 2003, BMC Bioinformatics.

[7]  Tijana Vujičić,et al.  Performance Analysis of Girvan-Newman Algorithm on Different Types of Random Graphs , 2016 .

[8]  Jian Xu,et al.  Community structure analysis using label propagation and flow-based ensemble learning , 2016, 2016 International Joint Conference on Neural Networks (IJCNN).

[9]  Erzsébet Merényi,et al.  SOM and MCODE methods of defining functional clusters in MRI of the brain , 2014, 2014 36th Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[10]  Mark Newman,et al.  Detecting community structure in networks , 2004 .

[11]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[12]  W. Zachary,et al.  An Information Flow Model for Conflict and Fission in Small Groups , 1977, Journal of Anthropological Research.

[13]  Arun Ross,et al.  Predicting Missing Demographic Information in Biometric Records Using Label Propagation Techniques , 2016, 2016 International Conference of the Biometrics Special Interest Group (BIOSIG).

[14]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[15]  Jacques van Helden,et al.  Evaluation of clustering algorithms for protein-protein interaction networks , 2006, BMC Bioinformatics.

[16]  D. Lusseau,et al.  The bottlenose dolphin community of Doubtful Sound features a large proportion of long-lasting associations , 2003, Behavioral Ecology and Sociobiology.

[17]  Jun Zhong,et al.  Personalized Activity Recognition Using Molecular Complex Detection Clustering , 2014, 2014 IEEE 11th Intl Conf on Ubiquitous Intelligence and Computing and 2014 IEEE 11th Intl Conf on Autonomic and Trusted Computing and 2014 IEEE 14th Intl Conf on Scalable Computing and Communications and Its Associated Workshops.

[18]  Réka Albert,et al.  Near linear time algorithm to detect community structures in large-scale networks. , 2007, Physical review. E, Statistical, nonlinear, and soft matter physics.

[19]  Damir Vukicevic,et al.  Community structure in networks: Girvan-Newman algorithm improvement , 2014, 2014 37th International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO).

[20]  Boleslaw K. Szymanski,et al.  Fuzzy overlapping community quality metrics , 2015, Social Network Analysis and Mining.

[21]  Zhang Zhen,et al.  Identifying the Communities in the Metabolic Network Using 'Component' Definition and Girvan-Newman Algorithm , 2015, 2015 14th International Symposium on Distributed Computing and Applications for Business Engineering and Science (DCABES).

[22]  Donald E. Knuth,et al.  The Stanford GraphBase - a platform for combinatorial computing , 1993 .

[23]  Abdelmounaam Rezgui,et al.  A Link Strength Based Label Propagation Algorithm for Community Detection , 2016, 2016 IEEE International Conferences on Big Data and Cloud Computing (BDCloud), Social Computing and Networking (SocialCom), Sustainable Computing and Communications (SustainCom) (BDCloud-SocialCom-SustainCom).

[24]  M. Newman,et al.  Finding community structure in networks using the eigenvectors of matrices. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.