IoMT-Based Association Rule Mining for the Prediction of Human Protein Complexes

The rapid growth of Internet-enabled devices has influenced the health industry because these devices deliver health-related information swiftly. One of their prominent characteristics is that they support physicians in the effective diagnosis of sensitive diseases. The Internet of Medical Things (IoMT) connects medical devices to computing nodes over the Internet to afford real-time communication between patients and clinicians and to help understand the interactions of human protein complexes. Secure and correct protein complex prediction plays an important role in understanding the principal mechanisms of various cellular processes and in elucidating the functionality of un-annotated proteins. Different experimental schemes have been developed to accomplish this task; however, these schemes have high error rates and are not efficient in terms of time, cost, privacy, and security. To tackle these limitations, numerous computational models have been developed that consider a protein complex as a dense sub-graph and use basic topological properties, such as density and degree statistics, as a feature set for protein complex prediction. Different kinds of sub-graph structures, e.g., ring, star, linear, and hybrid, have also been found in the Protein-Protein Interaction Network (PPIN); therefore, more advanced topological properties may help to predict these structures. Moreover, the amino acid sequence of a protein determines its formation; thus, sequence information is important for predicting interactions among proteins in a secure way. In this study, we compute basic as well as advanced topological features by considering the interaction network of human protein complexes in the IoMT environment. In addition, biological features, i.e., discrete wavelet coefficients, length, and entropy, are computed from the amino acid sequences of proteins.
Supervised learning methods based on association rules, such as Partial Tree (PART) and Non-Nested Generalized Exemplars (NNGE), are trained to identify human protein complexes on the basis of the integrated topological and biological properties. 10-fold cross-validation is used to evaluate the proposed methods. Experimental results show that association rule learners with integrated features outperform other complex mining algorithms, i.e., the probabilistic Bayesian Network (BN) and Random Forest, in terms of accuracy and efficiency, in addition to providing privacy.
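
As an illustration of the kind of features named above, the following minimal Python sketch computes two basic topological properties (density and degree statistics) for a candidate sub-graph and two sequence properties (length and Shannon entropy) for an amino acid sequence. The function names, the input representation, and the exact feature definitions are assumptions made for this example and are not taken from the study; the discrete wavelet coefficients and the advanced topological features are not covered here.

```python
import math
from collections import Counter


def topological_features(nodes, edges):
    """Density and degree statistics of an undirected sub-graph.

    Hypothetical helper: `nodes` is an iterable of protein identifiers and
    `edges` an iterable of (u, v) pairs. Self-loops, duplicate edges, and
    edges touching unknown nodes are ignored.
    """
    degree = {v: 0 for v in nodes}
    seen = set()
    for u, v in edges:
        key = frozenset((u, v))
        if u == v or key in seen or u not in degree or v not in degree:
            continue
        seen.add(key)
        degree[u] += 1
        degree[v] += 1
    n, m = len(degree), len(seen)
    # Density: fraction of the n*(n-1)/2 possible edges that are present.
    density = 2 * m / (n * (n - 1)) if n > 1 else 0.0
    degrees = list(degree.values())
    return {
        "density": density,
        "mean_degree": sum(degrees) / n if n else 0.0,
        "max_degree": max(degrees, default=0),
    }


def sequence_features(sequence):
    """Length and Shannon entropy (in bits) of an amino acid sequence."""
    total = len(sequence)
    counts = Counter(sequence)
    entropy = 0.0
    for count in counts.values():
        p = count / total
        entropy -= p * math.log2(p)
    return {"length": total, "entropy": entropy}
```

A fully connected triangle has density 1, whereas a star with three leaves has density 0.5, which hints at why density alone may not separate the ring, star, linear, and hybrid structures mentioned above. In a pipeline of this kind, the two dictionaries would be merged into one feature vector per candidate complex before training a classifier.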
