Machine intelligence in peptide therapeutics: A next‐generation tool for rapid disease screening

Discovery and development of biopeptides are time‐consuming, laborious, and dependent on various factors. Data‐driven computational methods, especially machine learning (ML) approach, can rapidly and efficiently predict the utility of therapeutic peptides. ML methods offer an array of tools that can accelerate and enhance decision making and discovery for well‐defined queries with ample and sophisticated data quality. Various ML approaches, such as support vector machines, random forest, extremely randomized tree, and more recently deep learning methods, are useful in peptide‐based drug discovery. These approaches leverage the peptide data sets, created via high‐throughput sequencing and computational methods, and enable the prediction of functional peptides with increased levels of accuracy. The use of ML approaches in the development of peptide‐based therapeutics is relatively recent; however, these techniques are already revolutionizing protein research by unraveling their novel therapeutic peptide functions. In this review, we discuss several ML‐based state‐of‐the‐art peptide‐prediction tools and compare these methods in terms of their algorithms, feature encodings, prediction scores, evaluation methodologies, and software utilities. We also assessed the prediction performance of these methods using well‐constructed independent data sets. In addition, we discuss the common pitfalls and challenges of using ML approaches for peptide therapeutics. Overall, we show that using ML models in peptide research can streamline the development of targeted peptide therapies.

[1]  Zhangxin Chen,et al.  ProLanGO: Protein Function Prediction Using Neural Machine Translation Based on a Recurrent Neural Network , 2017, Molecules.

[2]  Sven Rahmann,et al.  Application of Neural Networks in Diagnosing Cancer Disease using Demographic Data , 2010 .

[3]  B. Miralles,et al.  Antihypertensive peptides from food proteins: a review. , 2012, Food & function.

[4]  Renzhi Cao,et al.  Deep convolutional neural networks for predicting the quality of single protein structural models , 2019, bioRxiv.

[5]  Avinash Sonawane,et al.  Antimicrobial peptides and proteins in mycobacterial therapy: current status and future prospects. , 2014, Tuberculosis.

[6]  Virapong Prachayasittikul,et al.  CryoProtect: A Web Server for Classifying Antifreeze Proteins from Nonantifreeze Proteins , 2017 .

[7]  Edwin Wang,et al.  Putting benchmarks in their rightful place: The heart of computational biology , 2018, PLoS Comput. Biol..

[8]  Bin Li,et al.  Applications of machine learning in drug discovery and development , 2019, Nature Reviews Drug Discovery.

[9]  Gwang Lee,et al.  AIPpred: Sequence-Based Prediction of Anti-inflammatory Peptides Using Random Forest , 2018, Front. Pharmacol..

[10]  Gajendra P. S. Raghava,et al.  AHTPDB: a comprehensive platform for analysis and presentation of antihypertensive peptides , 2014, Nucleic Acids Res..

[11]  L. Otvos Peptide-based drug design: here and now. , 2008, Methods in molecular biology.

[12]  I. Kohane,et al.  Big Data and Machine Learning in Health Care. , 2018, JAMA.

[13]  Gajendra P. S. Raghava,et al.  CPPsite 2.0: a repository of experimentally validated cell-penetrating peptides , 2015, Nucleic Acids Res..

[14]  Jie Hou,et al.  DeepQA: improving the estimation of single protein model quality with deep belief networks , 2016, BMC Bioinformatics.

[15]  S. Wold,et al.  New chemical descriptors relevant for the design of biologically active peptides. A multivariate characterization of 87 amino acids. , 1998, Journal of medicinal chemistry.

[16]  R. Epand,et al.  Bacterial membrane lipids in the action of antimicrobial agents , 2011, Journal of peptide science : an official publication of the European Peptide Society.

[17]  Keehyoung Joo,et al.  ConDo: protein domain boundary prediction using coevolutionary information , 2018, Bioinform..

[18]  P. Agrawal,et al.  In silico approaches for predicting the half-life of natural and modified peptides in blood , 2018, PloS one.

[19]  Leyi Wei,et al.  AtbPpred: A Robust Sequence-Based Prediction of Anti-Tubercular Peptides Using Extremely Randomized Trees , 2019, Computational and structural biotechnology journal.

[20]  Tarmo Tamm,et al.  Prediction of Cell-Penetrating Peptides Using Artificial Neural Networks. , 2010, Current computer-aided drug design.

[21]  Ran Su,et al.  CPPred-FL: a sequence-based predictor for large-scale identification of cell-penetrating peptides by feature representation learning , 2018, Briefings Bioinform..

[22]  Hanmei Xu,et al.  DRAMP: a comprehensive data repository of antimicrobial peptides , 2016, Scientific Reports.

[23]  Hao Lv,et al.  iRNA-m7G: Identifying N7-methylguanosine Sites by Fusing Multiple Features , 2019, Molecular therapy. Nucleic acids.

[24]  Qiang Sun,et al.  Principal Component Analysis for Big Data , 2018, Wiley StatsRef: Statistics Reference Online.

[25]  K. Ngiam,et al.  Big data and machine learning algorithms for health-care delivery. , 2019, The Lancet. Oncology.

[26]  Yi Xiong,et al.  A Hadoop-Based Method to Predict Potential Effective Drug Combination , 2014, BioMed research international.

[27]  Wei Chen,et al.  Prediction of cell-penetrating peptides with feature selection techniques. , 2016, Biochemical and biophysical research communications.

[28]  Q. Zou,et al.  Gene2vec: gene subsequence embedding for prediction of mammalian N6-methyladenosine sites from mRNA , 2018, RNA.

[29]  Quan Zou,et al.  SecProMTB: Support Vector Machine‐Based Classifier for Secretory Proteins Using Imbalanced Data Sets Applied to Mycobacterium tuberculosis , 2019, Proteomics.

[30]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[31]  Sean Ekins,et al.  Exploiting machine learning for end-to-end drug discovery and development , 2019, Nature Materials.

[32]  Vineet K. Sharma,et al.  Prediction of anti-inflammatory proteins/peptides: an insilico approach , 2016, Journal of Translational Medicine.

[33]  Isaac Kohane,et al.  Machine Learning in Medicine. Reply. , 2019, The New England journal of medicine.

[34]  L. Gowda,et al.  Arachin derived peptides as selective angiotensin I-converting enzyme (ACE) inhibitors: Structure–activity relationship , 2010, Peptides.

[35]  D. Craik,et al.  The Future of Peptide‐based Drugs , 2013, Chemical biology & drug design.

[36]  Hao Lv,et al.  Identify origin of replication in Saccharomyces cerevisiae using two-step feature selection technique , 2018, Bioinform..

[37]  K. Chou,et al.  iRNA-3typeA: Identifying Three Types of Modification at RNA’s Adenosine Sites , 2018, Molecular therapy. Nucleic acids.

[38]  C. Poh,et al.  Peptides as Therapeutic Agents for Dengue Virus , 2017, International journal of medical sciences.

[39]  Ying Wang,et al.  Adaboost-SVM-based probability algorithm for the prediction of all mature miRNA sites based on structured-sequence features , 2019, Scientific Reports.

[40]  Xiangxiang Zeng,et al.  Prediction of Potential Disease-Associated MicroRNAs by Using Neural Networks , 2019, Molecular therapy. Nucleic acids.

[41]  Hao Lin,et al.  XG-PseU: an eXtreme Gradient Boosting based method for identifying pseudouridine sites , 2019, Molecular Genetics and Genomics.

[42]  Demis Hassabis,et al.  Mastering the game of Go without human knowledge , 2017, Nature.

[43]  S. Muggleton,et al.  Protein secondary structure prediction using logic-based machine learning. , 1992, Protein engineering.

[44]  A. Khusro,et al.  Anti-tubercular peptides: A quest of future therapeutic weapon to combat tuberculosis. , 2016, Asian Pacific journal of tropical medicine.

[45]  Manoj Kumar,et al.  Prediction and Analysis of Quorum Sensing Peptides Based on Sequence Features , 2015, PloS one.

[46]  Balachandran Manavalan,et al.  Random Forest-Based Protein Model Quality Assessment (RFMQA) Using Structural Features and Potential Energy Terms , 2014, PloS one.

[47]  K. Chou,et al.  REVIEW : Recent advances in developing web-servers for predicting protein attributes , 2009 .

[48]  Raghvendra Mall,et al.  DeepCrystal: A Deep Learning Framework for Sequence-based Protein Crystallization Prediction , 2018, 2018 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[49]  Yunyun Lv,et al.  High Throughput Identification of Antihypertensive Peptides from Fish Proteome Datasets , 2018, Marine drugs.

[50]  L. Nevin Advancing the beneficial use of machine learning in health care and medicine: Toward a community understanding , 2018, PLoS medicine.

[51]  B. Bassler How bacteria talk to each other: regulation of gene expression by quorum sensing. , 1999, Current opinion in microbiology.

[52]  Myeong Ok Kim,et al.  iBCE-EL: A New Ensemble Learning Framework for Improved Linear B-Cell Epitope Prediction , 2018, Front. Immunol..

[53]  Jiangning Song,et al.  Comprehensive review and assessment of computational methods for predicting RNA post-transcriptional modification sites from RNA sequences , 2019, Briefings Bioinform..

[54]  Jiangning Song,et al.  ACPred-FL: a sequence-based predictor using effective feature representation to improve the prediction of anti-cancer peptides , 2018, Bioinform..

[55]  Virapong Prachayasittikul,et al.  TargetAntiAngio: A Sequence-Based Tool for the Prediction and Analysis of Anti-Angiogenic Peptides , 2019, International journal of molecular sciences.

[56]  Jana Sperschneider,et al.  Machine learning in plant-pathogen interactions: empowering biological predictions from field scale to genome scale. , 2020, The New phytologist.

[57]  R. Kumar,et al.  Prediction of Cell-Penetrating Potential of Modified Peptides Containing Natural and Chemically Modified Residues , 2018, Front. Microbiol..

[58]  Robert Bals,et al.  Epithelial antimicrobial peptides in host defense against infection , 2000, Respiratory research.

[59]  Tao Zeng,et al.  Prediction of heme binding residues from protein sequences with integrative sequence profiles , 2012, Proteome Science.

[60]  Pierre Geurts,et al.  Extremely randomized trees , 2006, Machine Learning.

[61]  Wei Chen,et al.  iProEP: A Computational Predictor for Predicting Promoter , 2019, Molecular therapy. Nucleic acids.

[62]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[63]  Gaotao Shi,et al.  CPPred-RF: A Sequence-based Predictor for Identifying Cell-Penetrating Peptides and Their Uptake Efficiency. , 2017, Journal of proteome research.

[64]  Kumardeep Chaudhary,et al.  An in silico platform for predicting, screening and designing of antihypertensive peptides , 2015, Scientific Reports.

[65]  Gajendra P. S. Raghava,et al.  CPPsite: a curated database of cell penetrating peptides , 2012, Database J. Biol. Databases Curation.

[66]  Leyi Wei,et al.  mAHTPred: a sequence-based meta-predictor for improving the prediction of anti-hypertensive peptides using effective feature representation , 2018, Bioinform..

[67]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[68]  Miao Sun,et al.  AngularQA: Protein Model Quality Assessment with LSTM Networks , 2019 .

[69]  Q. Zou,et al.  SkipCPP-Pred: an improved and promising sequence-based predictor for predicting cell-penetrating peptides , 2017, BMC Genomics.

[70]  Balachandran Manavalan,et al.  MLACP: machine-learning-based prediction of anticancer peptides , 2017, Oncotarget.

[71]  Hao Lv,et al.  iRNA-m2G: Identifying N2-methylguanosine Sites Based on Sequence-Derived Information , 2019, Molecular therapy. Nucleic acids.

[72]  Jooyoung Lee,et al.  SVMQA: support‐vector‐machine‐based protein single‐model quality assessment , 2017, Bioinform..

[73]  James C. Collins,et al.  The Current State of Peptide Drug Discovery: Back to the Future? , 2017, Journal of medicinal chemistry.

[74]  Alexander A. Zamyatnin,et al.  The EROP-Moscow oligopeptide database , 2005, Nucleic Acids Res..

[75]  Guoying Zhang,et al.  ACPred-Fuse: fusing multi-view information improves the prediction of anticancer peptides , 2019, Briefings Bioinform..

[76]  George Kurian,et al.  Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation , 2016, ArXiv.

[77]  J. Hanley,et al.  The meaning and use of the area under a receiver operating characteristic (ROC) curve. , 1982, Radiology.

[78]  Yan He,et al.  Classification of Small GTPases with Hybrid Protein Features and Advanced Machine Learning Techniques , 2017, Current Bioinformatics.

[79]  Yi Xiong,et al.  Protein-protein interface hot spots prediction based on a hybrid feature selection strategy , 2018, BMC Bioinformatics.

[80]  Gajendra P. S. Raghava,et al.  CancerPPD: a database of anticancer peptides and proteins , 2014, Nucleic Acids Res..

[81]  Celine Vens,et al.  Identifying discriminative classification-based motifs in biological sequences , 2011, Bioinform..

[82]  Alessandro Sette,et al.  The Immune Epitope Database (IEDB): 2018 update , 2018, Nucleic Acids Res..

[83]  Andy Liaw,et al.  Extreme Gradient Boosting as a Method for Quantitative Structure-Activity Relationships , 2016, J. Chem. Inf. Model..

[84]  Hiroyuki Kurata,et al.  PreAIP: Computational Prediction of Anti-inflammatory Peptides by Integrating Multiple Complementary Features , 2019, Front. Genet..

[85]  Saravanan Vijayakumar,et al.  ACPP: A Web Server for Prediction and Design of Anti-cancer Peptides , 2014, International Journal of Peptide Research and Therapeutics.

[86]  Jeffrey Dean,et al.  Machine Learning in Medicine , 2019, The New England journal of medicine.

[87]  David T. Jones,et al.  Deep learning extends de novo protein modelling coverage of genomes using iteratively predicted structural constraints , 2018, Nature Communications.

[88]  Tianqi Chen,et al.  Extreme Gradient Boosting [R package xgboost version 1.2.0.1] , 2020 .

[89]  Gholamreza Haffari,et al.  Twenty years of bioinformatics research for protease-specific substrate and cleavage site prediction: a comprehensive revisit and benchmarking of existing methods , 2018, Briefings Bioinform..

[90]  Rui Gao,et al.  PTPD: predicting therapeutic peptides by deep learning and word2vec , 2019, BMC Bioinformatics.

[91]  Kipp W. Johnson,et al.  Machine learning in cardiovascular medicine: are we there yet? , 2018, Heart.

[92]  Arne Elofsson,et al.  Methods for estimation of model accuracy in CASP12 , 2017, bioRxiv.

[93]  Yi Xiong,et al.  An accurate feature‐based method for identifying DNA‐binding residues on protein surfaces , 2011, Proteins.

[94]  Hiroyuki Kurata,et al.  Efficient computational model for identification of antitubercular peptides by integrating amino acid patterns and properties , 2019, FEBS letters.

[95]  Evelien Wynendaele,et al.  Quorumpeps database: chemical space, microbial origin and functionality of quorum sensing peptides , 2012, Nucleic Acids Res..

[96]  Eyke Hüllermeier,et al.  An Exact Algorithm for F-Measure Maximization , 2011, NIPS.

[97]  P. Agrawal,et al.  NAGbinder: An approach for identifying N‐acetylglucosamine interacting residues of a protein from its primary sequence , 2019, Protein science : a publication of the Protein Society.

[98]  Hai-Cheng Yi,et al.  ACP-DL: A Deep Learning Long Short-Term Memory Model to Predict Anticancer Peptides Using High-Efficiency Feature Representation , 2019, Molecular therapy. Nucleic acids.

[99]  Johan A. K. Suykens,et al.  Least Squares Support Vector Machine Classifiers , 1999, Neural Processing Letters.

[100]  E. Pirogova,et al.  Evaluation of the use of therapeutic peptides for cancer treatment , 2017, Journal of Biomedical Science.

[101]  Wei Pan,et al.  On Efficient Large Margin Semisupervised Learning: Method and Theory , 2009, J. Mach. Learn. Res..

[102]  Jian Wang,et al.  StraPep: a structure database of bioactive peptides , 2018, Database J. Biol. Databases Curation.

[103]  Myeong Ok Kim,et al.  PIP-EL: A New Ensemble Learning Method for Improved Proinflammatory Peptide Predictions , 2018, Front. Immunol..

[104]  César G. Castellanos Domínguez,et al.  Applicability of semi-supervised learning assumptions for gene ontology terms prediction , 2016 .

[105]  Q. Zou,et al.  NucPosPred: Predicting species-specific genomic nucleosome positioning via four different modes of general PseKNC. , 2018, Journal of theoretical biology.

[106]  Andrew L. Ferguson,et al.  What can machine learning do for antimicrobial peptides, and what can antimicrobial peptides do for machine learning? , 2017, Interface Focus.

[107]  Balachandran Manavalan,et al.  4mCpred-EL: An Ensemble Learning Framework for Identification of DNA N4-Methylcytosine Sites in the Mouse Genome , 2019, Cells.

[108]  Susan M. Bridges,et al.  Prediction of Cell Penetrating Peptides by Support Vector Machines , 2011, PLoS Comput. Biol..

[109]  Shreyas Karnik,et al.  CAMP: a useful resource for research on antimicrobial peptides , 2009, Nucleic Acids Res..

[110]  Jiangning Song,et al.  Bastion3: a two-layer ensemble predictor of type III secreted effectors , 2018, Bioinform..

[111]  Roman M. Balabin,et al.  Neural network approach to quantum-chemistry data: accurate prediction of density functional theory energies. , 2009, The Journal of chemical physics.

[112]  Wei Chen,et al.  iATP: A Sequence Based Method for Identifying Anti-tubercular Peptides , 2020, Medicinal Chemistry.

[113]  Demis Hassabis,et al.  Mastering the game of Go with deep neural networks and tree search , 2016, Nature.

[114]  Junjie Chen,et al.  Pse-in-One: a web server for generating various modes of pseudo components of DNA, RNA, and protein sequences , 2015, Nucleic Acids Res..

[115]  Quan Zou,et al.  Incorporating Distance-based Top-n-gram and Random Forest to Identify Electron Transport Proteins. , 2019, Journal of proteome research.

[116]  Zicheng Cao,et al.  G-DipC: An Improved Feature Representation Method for Short Sequences to Predict the Type of Cargo in Cell-Penetrating Peptides , 2020, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[117]  Gajendra P. S. Raghava,et al.  Prediction of Antitubercular Peptides From Sequence Information Using Ensemble Classifier and Hybrid Features , 2018, Front. Pharmacol..

[118]  Yanchun Liang,et al.  Capsule network for protein post-translational modification site prediction , 2018, Bioinform..

[119]  Jiangning Song,et al.  Comparative analysis and prediction of quorum-sensing peptides using feature representation learning and machine learning algorithms , 2018, Briefings Bioinform..

[120]  Geoffrey I. Webb,et al.  DeepCleave: a deep learning predictor for caspase and matrix metalloprotease substrates and cleavage sites , 2019, Bioinform..

[121]  Hui Ding,et al.  A Random Forest Sub-Golgi Protein Classifier Optimized via Dipeptide and Amino Acid Composition Features , 2019, Front. Bioeng. Biotechnol..

[122]  Virapong Prachayasittikul,et al.  Meta-iAVP: A Sequence-Based Meta-Predictor for Improving the Prediction of Antiviral Peptides Using Effective Feature Representation , 2019, International journal of molecular sciences.

[123]  Yi Xiong,et al.  PDC-SGB: Prediction of effective drug combinations using a stochastic gradient boosting algorithm. , 2017, Journal of theoretical biology.

[124]  P. Agrawal,et al.  NeuroPIpred: a tool to predict, design and scan insect neuropeptides , 2019, Scientific Reports.

[125]  Jiangning Song,et al.  Bastion6: a bioinformatics approach for accurate prediction of type VI secreted effectors , 2018, Bioinform..

[126]  Bin Liu,et al.  Pse-in-One 2.0: An Improved Package of Web Servers for Generating Various Modes of Pseudo Components of DNA, RNA, and Protein Sequences , 2017 .

[127]  Jiangning Song,et al.  Quokka: a comprehensive tool for rapid and accurate prediction of kinase family‐specific phosphorylation sites in the human proteome , 2018, Bioinform..

[128]  Davor Juretic,et al.  DADP: the database of anuran defense peptides , 2012, Bioinform..

[129]  M. Khrestchatisky,et al.  Synthetic therapeutic peptides: science and market. , 2010, Drug discovery today.

[130]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[131]  Rajesh Kumar,et al.  Computing wide range of protein/peptide features from their sequence and structure , 2019 .

[132]  Stefan Kramer,et al.  Enhancing navigation in biomedical databases by community voting and database-driven text classification , 2009, BMC Bioinformatics.

[133]  Michael K Gilson,et al.  Discovering de novo peptide substrates for enzymes using machine learning , 2018, Nature Communications.

[134]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[135]  Balachandran Manavalan,et al.  iGHBP: Computational identification of growth hormone binding proteins from sequences using extremely randomised tree , 2018, Computational and structural biotechnology journal.

[136]  Hiroyuki Kurata,et al.  Computational identification of microbial phosphorylation sites by the enhanced characteristics of sequence information , 2019, Scientific Reports.

[137]  Gwang Lee,et al.  SDM6A: A Web-Based Integrative Machine-Learning Framework for Predicting 6mA Sites in the Rice Genome , 2019, Molecular therapy. Nucleic acids.

[138]  Andy Chi-Lung Lee,et al.  A Comprehensive Review on Current Advances in Peptide Drug Development and Design , 2019, International journal of molecular sciences.

[139]  K. Chou,et al.  iACP: a sequence-based tool for identifying anticancer peptides , 2016, Oncotarget.

[140]  Jiangning Song,et al.  MULTiPly: a novel multi-layer predictor for discovering general and specific types of promoters , 2019, Bioinform..

[141]  Hao Wang,et al.  Enhanced Prediction of Hot Spots at Protein-Protein Interfaces Using Extreme Gradient Boosting , 2018, Scientific Reports.

[142]  Chris H. Q. Ding,et al.  Minimum redundancy feature selection from microarray gene expression data , 2003, Computational Systems Bioinformatics. CSB2003. Proceedings of the 2003 IEEE Bioinformatics Conference. CSB2003.

[143]  Yan Lin,et al.  iTerm-PseKNC: a sequence-based tool for predicting bacterial transcriptional terminators , 2018, Bioinform..

[144]  R. Fitzgerald,et al.  Antihypertensive Peptides from Food Proteins , 2013 .

[145]  Morten Nielsen,et al.  IEDB-AR: immune epitope database—analysis resource in 2019 , 2019, Nucleic Acids Res..

[146]  A. Zamyatnin EROP-Moscow: specialized data bank for endogenous regulatory oligopeptides. , 1991, Protein sequences & data analysis.

[147]  Nitesh V. Chawla,et al.  Data Mining for Imbalanced Datasets: An Overview , 2005, The Data Mining and Knowledge Discovery Handbook.

[148]  Kumardeep Chaudhary,et al.  Cell Penetrating Peptides , 2016 .

[149]  M. Bliss Banting's, Best's, and Collip's accounts of the discovery of insulin. , 1982, Bulletin of the history of medicine.

[150]  Jenni A. M. Sidey-Gibbons,et al.  Machine learning in medicine: a practical introduction , 2019, BMC Medical Research Methodology.

[151]  Dong-Qing Wei,et al.  PredT4SE-Stack: Prediction of Bacterial Type IV Secreted Effectors From Protein Sequences Using a Stacked Ensemble Method , 2018, Front. Microbiol..

[152]  Jolene L. Lau,et al.  Therapeutic peptides: Historical perspectives, current development trends, and future directions. , 2017, Bioorganic & medicinal chemistry.

[153]  M. Morris,et al.  Twenty years of cell-penetrating peptides: from molecular mechanisms to therapeutics , 2009, British journal of pharmacology.

[154]  Gajendra P. S. Raghava,et al.  SATPdb: a database of structurally annotated therapeutic peptides , 2015, Nucleic Acids Res..

[155]  B. L. Betechuoh,et al.  Autoencoder networks for HIV classification , 2006 .

[156]  Wei Zhao,et al.  VisFeature: a stand-alone program for visualizing and analyzing statistical features of biological sequences , 2019, Bioinform..

[157]  Chao Bian,et al.  Whole Genome Sequencing of Chinese White Dolphin (Sousa chinensis) for High-Throughput Screening of Antihypertensive Peptides , 2019, Marine drugs.

[158]  M. Lefranc,et al.  DBAASP: database of antimicrobial activity and structure of peptides. , 2014, FEMS microbiology letters.

[159]  Jacek M. Zurada,et al.  Introduction to artificial neural systems , 1992 .

[160]  James M. Keller,et al.  A fuzzy K-nearest neighbor algorithm , 1985, IEEE Transactions on Systems, Man, and Cybernetics.

[161]  T. Hoffmann,et al.  Peptide therapeutics: current status and future directions. , 2015, Drug discovery today.

[162]  Manish Kumar,et al.  Face Recognition Using Eigen Faces and Artificial Neural Network , 2010 .

[163]  Hiroyuki Kurata,et al.  Prediction of S-nitrosylation sites by integrating support vector machines and random forest. , 2019, Molecular omics.

[164]  Leyi Wei,et al.  Meta-4mCpred: A Sequence-Based Meta-Predictor for Accurate DNA 4mC Site Prediction Using Effective Feature Representation , 2019, Molecular therapy. Nucleic acids.

[165]  Xiaowei Zhao,et al.  LAMP: A Database Linking Antimicrobial Peptides , 2013, PloS one.

[166]  Hiroyuki Kurata,et al.  Large-Scale Assessment of Bioinformatics Tools for Lysine Succinylation Sites , 2019, Cells.

[167]  Jiangning Song,et al.  PeNGaRoo, a combined gradient boosting and ensemble learning framework for predicting non-classical secreted proteins , 2019, Bioinform..

[168]  R. Kumar,et al.  CancerPDF: A repository of cancer-associated peptidome found in human biofluids , 2017, Scientific Reports.

[169]  V. Laengsri,et al.  ThalPred: a web-based prediction tool for discriminating thalassemia trait and iron deficiency anemia , 2019, BMC Medical Informatics and Decision Making.

[170]  Dong Wang,et al.  iLoc‐lncRNA: predict the subcellular location of lncRNAs by incorporating octamer composition into general PseKNC , 2018, Bioinform..

[171]  Kuldip K. Paliwal,et al.  Enhancing Protein Fold Prediction Accuracy Using Evolutionary and Structural Features , 2013, PRIB.

[172]  Nalini Schaduangrat,et al.  THPep: A machine learning-based approach for predicting tumor homing peptides , 2019, Comput. Biol. Chem..

[173]  D. Ruppert The Elements of Statistical Learning: Data Mining, Inference, and Prediction , 2004 .

[174]  Virapong Prachayasittikul,et al.  PAAP: a web server for predicting antihypertensive activity of peptides. , 2018, Future medicinal chemistry.

[175]  Gajendra P. S. Raghava,et al.  AntiTbPdb: a knowledgebase of anti-tubercular peptides , 2018, Database J. Biol. Databases Curation.

[176]  Andrew L. Ferguson,et al.  Machine learning-enabled discovery and design of membrane-active peptides. , 2017, Bioorganic & medicinal chemistry.

[177]  Balachandran Manavalan,et al.  mACPpred: A Support Vector Machine-Based Meta-Predictor for Identification of Anticancer Peptides , 2019, International journal of molecular sciences.

[178]  Jiangning Song,et al.  A comprehensive review and performance evaluation of bioinformatics tools for HLA class I peptide-binding prediction , 2020, Briefings Bioinform..

[179]  Geoffrey I. Webb,et al.  iLearn : an integrated platform and meta-learner for feature engineering, machine-learning analysis and modeling of DNA, RNA and protein sequence data , 2019, Briefings Bioinform..

[180]  Rakesh Kumar,et al.  dPABBs: A Novel in silico Approach for Predicting and Designing Anti-biofilm Peptides , 2016, Scientific Reports.

[181]  K. Borgwardt,et al.  Machine Learning in Medicine , 2015, Mach. Learn. under Resour. Constraints Vol. 3.

[182]  Kumardeep Chaudhary,et al.  In Silico Approach for Prediction of Antifungal Peptides , 2018, Front. Microbiol..

[183]  Anna Iwaniak,et al.  BIOPEP database of sensory peptides and amino acids. , 2016, Food research international.

[184]  Balachandran Manavalan,et al.  DHSpred: support-vector-machine-based human DNase I hypersensitive sites prediction using the optimal features selected by random forest , 2017, bioRxiv.

[185]  Kiattawee Choowongkomon,et al.  FeptideDB: A web application for new bioactive peptides from food protein , 2019, Heliyon.

[186]  Zengtang Qu,et al.  Improved Extreme Learning Machine based on the Sensitivity Analysis , 2018 .

[187]  Quan Zou,et al.  Selecting Essential MicroRNAs Using a Novel Voting Method , 2019, Molecular therapy. Nucleic acids.

[188]  A. Iwaniak,et al.  BIOPEP database and other programs for processing bioactive peptide sequences. , 2008, Journal of AOAC International.

[189]  Deborah Hix,et al.  The immune epitope database (IEDB) 3.0 , 2014, Nucleic Acids Res..

[190]  M. Bliss The discovery of insulin: the inside story. , 1997, Publication - American Institute of the History of Pharmacy.

[191]  Piyush Agrawal,et al.  SAMbinder: A web server for predicting SAM binding residues of a protein from its amino acid sequence , 2019, bioRxiv.

[192]  K. Chou,et al.  iRNA-PseColl: Identifying the Occurrence Sites of Different RNA Modifications by Incorporating Collective Effects of Nucleotides into PseKNC , 2017, Molecular therapy. Nucleic acids.

[193]  Kumardeep Chaudhary,et al.  In Silico Models for Designing and Discovering Novel Anticancer Peptides , 2013, Scientific Reports.

[194]  Gunnar Rätsch,et al.  Soft Margins for AdaBoost , 2001, Machine Learning.

[195]  Chee Kheong Siew,et al.  Extreme learning machine: Theory and applications , 2006, Neurocomputing.

[196]  Dong-Sheng Cao,et al.  PyBioMed: a python library for various molecular representations of chemicals, proteins and DNAs and their interactions , 2018, Journal of Cheminformatics.

[197]  Yi Xiong,et al.  PseUI: Pseudouridine sites identification based on RNA sequence information , 2018, BMC Bioinformatics.

[198]  Timur Shtatland,et al.  PepBank - a database of peptides based on sequence text mining and public peptide data sources , 2007, BMC Bioinformatics.

[199]  Balachandran Manavalan,et al.  Machine-Learning-Based Prediction of Cell-Penetrating Peptides and Their Uptake Efficiency with Improved Accuracy. , 2018, Journal of proteome research.

[200]  M Michael Gromiha,et al.  Seq2Feature: a comprehensive web-based feature extraction tool , 2019, Bioinform..

[201]  Wei Chen,et al.  iDNA4mC: identifying DNA N4‐methylcytosine sites based on nucleotide chemical properties , 2017, Bioinform..

[202]  Tatsuya Akutsu,et al.  Deep Learning with Evolutionary and Genomic Profiles for Identifying Cancer Subtypes , 2018, 2018 IEEE 18th International Conference on Bioinformatics and Bioengineering (BIBE).

[203]  A. M. Turing,et al.  Computing Machinery and Intelligence , 1950, The Philosophy of Artificial Intelligence.

[204]  Jie Hu,et al.  Empirical comparison and analysis of web-based cell-penetrating peptide prediction tools , 2019, Briefings Bioinform..

[205]  Raghvendra Mall,et al.  BCrystal: an interpretable sequence-based protein crystallization predictor , 2019, Bioinform..

[206]  Abdul Basit,et al.  Identification of Anticancer Peptides Using Optimal Feature Space of Chou's Split Amino Acid Composition and Support Vector Machine , 2017 .

[207]  Nithin V. George,et al.  KELM-CPPpred: Kernel Extreme Learning Machine Based Prediction Model for Cell-Penetrating Peptides. , 2018, Journal of proteome research.

[208]  Xia Li,et al.  APD3: the antimicrobial peptide database as a tool for research and education , 2015, Nucleic Acids Res..

[209]  Abdollah Dehzangi,et al.  PyFeat: a Python-based effective feature generation tool for DNA, RNA and protein sequences , 2019, Bioinform..

[210]  William Stafford Noble,et al.  Empirical comparison of web‐based antimicrobial peptide prediction tools , 2017, Bioinform..

[211]  H. Mohabatkar,et al.  Predicting anticancer peptides with Chou's pseudo amino acid composition and investigating their mutagenicity via Ames test. , 2014, Journal of theoretical biology.

[212]  Ran Su,et al.  Iterative feature representations improve N4-methylcytosine site prediction , 2019, Bioinform..

[213]  Feng-Min Li,et al.  Identifying anticancer peptides by using improved hybrid compositions , 2016, Scientific Reports.

[214]  Ran Su,et al.  PEPred-Suite: improved and robust prediction of therapeutic peptides using adaptive feature representation learning , 2019, Bioinform..

[215]  Geoffrey I. Webb,et al.  iFeature: a Python package and web server for features extraction and selection from protein and peptide sequences , 2018, Bioinform..

[216]  Gianluca Pollastri,et al.  CPPpred: prediction of cell penetrating peptides , 2013, Bioinform..

[217]  Muhammad Iqbal,et al.  iACP-GAEnsC: Evolutionary genetic algorithm based ensemble classification of anticancer peptides by utilizing hybrid feature space , 2017, Artif. Intell. Medicine.

[218]  Dong-Sheng Cao,et al.  propy: a tool to generate various modes of Chou's PseAAC , 2013, Bioinform..

[219]  Guangmin Liang,et al.  A Novel Hybrid Sequence-Based Model for Identifying Anticancer Peptides , 2018, Genes.

[220]  Virapong Prachayasittikul,et al.  ACPred: A Computational Tool for the Prediction and Analysis of Anticancer Peptides , 2019, Molecules.

[221]  A. Gautam,et al.  THPdb: Database of FDA-approved peptide and protein therapeutics , 2017, PloS one.