Combining High Speed ELM with a CNN Feature Encoding to Predict LncRNA-Disease Associations

Accumulating evidence indicates that lncRNAs play critical roles in many biological processes, particularly those underlying disease. Identifying potential lncRNA-disease associations is therefore important for disease prevention, diagnosis, and treatment, and for understanding cellular activity at the molecular level. Although novel technologies have identified a considerable number of associations between lncRNAs and diseases, they have inevitable drawbacks such as high cost, long turnaround time, and high error rates. For this reason, integrating multiple biological databases to predict potential lncRNA-disease associations is highly attractive. In this paper, we propose a model called ECLDA that predicts lncRNA-disease associations by combining a CNN with a high-speed ELM. First, feature vectors are constructed by integrating lncRNA functional similarity, disease semantic similarity, and Gaussian interaction profile kernel similarity. Second, a CNN is applied to mine local and higher-level abstract features from these vectors. Finally, the high-speed ELM is used to identify novel lncRNA-disease associations. ECLDA achieved an AUC of 0.9014 in 5-fold cross-validation. These results suggest that ECLDA could serve as a practical tool for biomedical research.
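The feature-construction and classification steps described above can be illustrated with a minimal NumPy sketch. This is not the ECLDA implementation: the toy association matrix, the bandwidth normalisation in `gip_kernel`, and the `SimpleELM` class with its hyperparameters are assumptions made for illustration. The CNN feature-encoding stage, the functional and semantic similarity terms, and the high-speed ELM variant used in the paper are omitted or replaced by a basic ridge-regularised ELM.

```python
import numpy as np


def gip_kernel(profiles):
    """Gaussian interaction profile kernel similarity between the rows of a
    binary association matrix (each row is one entity's interaction profile)."""
    profiles = np.asarray(profiles, dtype=float)
    sq_norms = (profiles ** 2).sum(axis=1)
    # Bandwidth normalised by the mean squared profile norm (assumed choice).
    mean_sq = sq_norms.mean()
    gamma = 1.0 / mean_sq if mean_sq > 0 else 1.0
    sq_dists = sq_norms[:, None] + sq_norms[None, :] - 2.0 * profiles @ profiles.T
    return np.exp(-gamma * np.maximum(sq_dists, 0.0))


class SimpleELM:
    """Basic extreme learning machine: random fixed hidden layer, output
    weights solved in closed form (hypothetical stand-in for the paper's ELM)."""

    def __init__(self, n_hidden=16, reg=1e-3, seed=0):
        self.n_hidden = n_hidden
        self.reg = reg
        self.seed = seed

    def _hidden(self, X):
        # Sigmoid activation of the random projection.
        return 1.0 / (1.0 + np.exp(-(X @ self.W + self.b)))

    def fit(self, X, y):
        rng = np.random.default_rng(self.seed)
        # Hidden-layer weights are drawn once and never trained.
        self.W = rng.normal(size=(X.shape[1], self.n_hidden))
        self.b = rng.normal(size=self.n_hidden)
        H = self._hidden(X)
        # Ridge-regularised least squares for the output weights.
        gram = H.T @ H + self.reg * np.eye(self.n_hidden)
        self.beta = np.linalg.solve(gram, H.T @ y)
        return self

    def predict_score(self, X):
        return self._hidden(X) @ self.beta


# Toy association matrix: 4 lncRNAs (rows) x 3 diseases (columns); made-up data.
A = np.array([[1, 0, 1],
              [0, 1, 0],
              [1, 0, 1],
              [0, 0, 1]])

lnc_sim = gip_kernel(A)    # 4 x 4 lncRNA similarity
dis_sim = gip_kernel(A.T)  # 3 x 3 disease similarity

# One feature vector per (lncRNA, disease) pair: the two similarity rows joined.
X = np.array([np.concatenate([lnc_sim[i], dis_sim[j]])
              for i in range(A.shape[0]) for j in range(A.shape[1])])
y = A.flatten().astype(float)

scores = SimpleELM().fit(X, y).predict_score(X)
```

Because the hidden layer is random and only the output weights are solved for, training reduces to a single linear solve, which is where the speed of ELM-style classifiers comes from. In a pipeline like the one described, the rows of `X` would first pass through the CNN encoder before reaching the ELM.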
