K_net: Lysine Malonylation Sites Identification With Neural Network

Lysine Malonylation (Kmal) is a newly discovered protein post-translational modifications (PTMs) type, which plays an important role in many biological processes. Therefore, identifying and understanding Kmal sites is very critical in the studies of biology and diseases. The typical methods are time-wasting and expensive. Nowadays, many researchers have proposed machine learning (ML) methods to deal with PTMs’s identification issue. Especially, some deep learning (DL) methods are also utilized in this field. In this work, we proposed K_net, which employed Convolutional Neural Network to identify the potential sites. Meanwhile, we proposed a new verification method Split to Equal Validation (SEV), which can well solve the impact of sample imbalance on prediction results. More Specifically, Acc, Sn, Sp, MCC and AUC values were adopted to evaluate the prediction performance of predictors. In total, CNN_Kmal achieved the better performance than other methods.

[1]  H. Doyle,et al.  Post-translational protein modifications in antigen recognition and autoimmunity. , 2001, Trends in immunology.

[2]  Donald Voet,et al.  A new project for a new year: A biochemistry and molecular biology digital library , 2003 .

[3]  Yanchun Liang,et al.  MusiteDeep: a deep‐learning framework for general and kinase‐specific phosphorylation site prediction , 2017, Bioinform..

[4]  R. Stephens,et al.  Unique microRNA molecular profiles in lung cancer diagnosis and prognosis. , 2006, Cancer cell.

[5]  De-Shuang Huang,et al.  IMKPse: Identification of Protein Malonylation Sites by the Key Features Into General PseAAC , 2019, IEEE Access.

[6]  Weimin Fan,et al.  A miR-20a/MAPK1/c-Myc regulatory feedback loop regulates breast carcinogenesis and chemoresistance , 2017, Cell Death and Differentiation.

[7]  Shao-Ping Shi,et al.  PLMLA: prediction of lysine methylation and lysine acetylation by combining multiple features. , 2012, Molecular bioSystems.

[8]  A Keith Dunker,et al.  Sequences and topology: disorder, modularity, and post/pre translation modification. , 2013, Current opinion in structural biology.

[9]  Panagiotis Paliogiannis,et al.  Homoarginine and all‐cause mortality: A systematic review and meta‐analysis , 2018, European journal of clinical investigation.

[10]  Lawrence H. Kushi,et al.  Recent Trends in Cardiovascular Mortality in the United States and Public Health Goals. , 2016, JAMA cardiology.

[11]  Md. Solayman,et al.  In silico analysis of nonsynonymous single nucleotide polymorphisms of the human adiponectin receptor 2 (ADIPOR2) gene , 2017, Comput. Biol. Chem..

[12]  Jianyu Shi,et al.  Predicting existing targets for new drugs base on strategies for missing interactions , 2016, BMC Bioinformatics.

[13]  Yuehui Chen,et al.  Classification of Protein Structure Classes on Flexible Neutral Tree , 2017, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[14]  Md. Rafsan Jani,et al.  iPromoter-FSEn: Identification of bacterial σ70 promoter sequences using feature subspace based ensemble classifier. , 2019, Genomics.

[15]  Andrea R. Beccari,et al.  ADPredict: ADP-ribosylation site prediction based on physicochemical and structural descriptors , 2018, Bioinform..

[16]  B. Bardot,et al.  Biosynthesis, Post-translation Modification, and Functional Characterization of Drm/Gremlin* , 2000, The Journal of Biological Chemistry.

[17]  Thomas J J Maal,et al.  Three-dimensional facial simulation in bilateral sagittal split osteotomy: a validation study of 100 patients. , 2015, Journal of oral and maxillofacial surgery : official journal of the American Association of Oral and Maxillofacial Surgeons.

[18]  De-Shuang Huang,et al.  Normalized Feature Vectors: A Novel Alignment-Free Sequence Comparison Method Based on the Numbers of Adjacent Amino Acids , 2013, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[19]  S. Barik,et al.  MicroRNAs: Processing, Maturation, Target Recognition and Regulatory Functions. , 2011, Molecular and cellular pharmacology.

[20]  Julie Maupin-Furlow,et al.  Post-translation modification in Archaea: lessons from Haloferax volcanii and other haloarchaea. , 2013, FEMS microbiology reviews.

[21]  Annette M. Molinaro,et al.  Loss-based estimation with cross-validation: applications to microarray data analysis , 2003, SKDD.

[22]  Yu Xue,et al.  DeepNitro: Prediction of Protein Nitration and Nitrosylation Sites by Deep Learning , 2018, Genom. Proteom. Bioinform..

[23]  Minoru Yoshida,et al.  Bone Morphogenetic Protein-2 Stimulates Runx2 Acetylation* , 2006, Journal of Biological Chemistry.

[24]  Gary Walsh,et al.  Post-translational modifications in the context of therapeutic proteins , 2006, Nature Biotechnology.

[25]  Li Wang,et al.  Predicting lysine glycation sites using bi-profile bayes feature extraction , 2017, Comput. Biol. Chem..

[26]  Xiaohui Wu,et al.  APAtrap: identification and quantification of alternative polyadenylation sites from RNA-seq data , 2018, Bioinform..

[27]  Yi Zhang,et al.  The First Identification of Lysine Malonylation Substrates and Its Regulatory Enzyme* , 2011, Molecular & Cellular Proteomics.

[28]  Li-na Wang,et al.  Computational prediction of species‐specific malonylation sites via enhanced characteristic strategy , 2016, Bioinform..

[29]  S. Brunak,et al.  Analysis and prediction of mammalian protein glycation. , 2006, Glycobiology.

[30]  Christodoulos A. Floudas,et al.  Proteome-wide post-translational modification statistics: frequency analysis and curation of the swiss-prot database , 2011, Scientific reports.