Recent Progress in Machine Learning-based Prediction of Peptide Activity for Drug Discovery.

Over the past decades, peptide as a therapeutic candidate has received increasing attention in drug discovery, especially for antimicrobial peptides (AMPs), anticancer peptides (ACPs) and antiinflammatory peptides (AIPs). It is considered that the peptides can regulate various complex diseases which are previously untouchable. In recent years, the critical problem of antimicrobial resistance drives the pharmaceutical industry to look for new therapeutic agents. Compared to organic small drugs, peptide- based therapy exhibits high specificity and minimal toxicity. Thus, peptides are widely recruited in the design and discovery of new potent drugs. Currently, large-scale screening of peptide activity with traditional approaches is costly, time-consuming and labor-intensive. Hence, in silico methods, mainly machine learning approaches, for their accuracy and effectiveness, have been introduced to predict the peptide activity. In this review, we document the recent progress in machine learning-based prediction of peptides which will be of great benefit to the discovery of potential active AMPs, ACPs and AIPs.

[1]  D. Hoskin,et al.  Cationic antimicrobial peptides as novel cytotoxic agents for cancer treatment , 2006, Expert opinion on investigational drugs.

[2]  Maqsood Hayat,et al.  iRSpot-GAEnsC: identifing recombination spots via ensemble classifier and extending the concept of Chou’s PseAAC to formulate DNA samples , 2015, Molecular Genetics and Genomics.

[3]  Manoj Kumar,et al.  AVPdb: a database of experimentally validated antiviral peptides targeting medically important viruses , 2013, Nucleic Acids Res..

[4]  Shreyas Karnik,et al.  CAMP: a useful resource for research on antimicrobial peptides , 2009, Nucleic Acids Res..

[5]  Qi Wang,et al.  In Silico Pharmacoepidemiologic Evaluation of Drug-Induced Cardiovascular Complications Using Combined Classifiers , 2018, J. Chem. Inf. Model..

[6]  Mikhail G. Dozmorov,et al.  Systems biology approach for mapping the response of human urothelial cells to infection by Enterococcus faecalis , 2007, BMC Bioinformatics.

[7]  Gwang Lee,et al.  AIPpred: Sequence-Based Prediction of Anti-inflammatory Peptides Using Random Forest , 2018, Front. Pharmacol..

[8]  Simon Fong,et al.  AmPEP: Sequence-based prediction of antimicrobial peptides using distribution patterns of amino acid properties and random forest , 2018, Scientific Reports.

[9]  Kuan Y. Chang,et al.  A Large-Scale Structural Classification of Antimicrobial Peptides , 2015, BioMed research international.

[10]  Amarda Shehu,et al.  Improving Recognition of Antimicrobial Peptides and Target Selectivity through Machine Learning and Genetic Programming , 2017, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[11]  Zhe Wang,et al.  APD: the Antimicrobial Peptide Database , 2004, Nucleic Acids Res..

[12]  CHUN WEI YAP,et al.  PaDEL‐descriptor: An open source software to calculate molecular descriptors and fingerprints , 2011, J. Comput. Chem..

[13]  Jiansong Fang,et al.  Predictions of BuChE Inhibitors Using Support Vector Machine and Naive Bayesian Classification Techniques in Drug Discovery , 2013, J. Chem. Inf. Model..

[14]  Gajendra P. S. Raghava,et al.  CancerPPD: a database of anticancer peptides and proteins , 2014, Nucleic Acids Res..

[15]  Davor Juretic,et al.  DADP: the database of anuran defense peptides , 2012, Bioinform..

[16]  W. H. Elliott,et al.  Data for Biochemical Research , 1986 .

[17]  Andrei Gabrielian,et al.  Predictive Model of Linear Antimicrobial Peptides Active against Gram-Negative Bacteria , 2018, J. Chem. Inf. Model..

[18]  Vladimir B. Bajic,et al.  DAMPD: a manually curated antimicrobial peptide database , 2011, Nucleic Acids Res..

[19]  G. Spyrou,et al.  C-PAmP: Large Scale Analysis and Database Construction Containing High Scoring Computationally Predicted Antimicrobial Peptides for All the Available Plant Species , 2013, PloS one.

[20]  Ri-Bo Huang,et al.  Recent development of peptide drugs and advance on theory and methodology of peptide inhibitor design. , 2015, Medicinal chemistry (Shariqah (United Arab Emirates)).

[21]  K. Chou Prediction of protein cellular attributes using pseudo‐amino acid composition , 2001, Proteins.

[22]  Ming Zhang,et al.  Comparing sequences without using alignments: application to HIV/SIV subtyping , 2007, BMC Bioinformatics.

[23]  William F. Porto,et al.  CS-AMPPred: An Updated SVM Model for Antimicrobial Activity Prediction in Cysteine-Stabilized Peptides , 2012, PloS one.

[24]  J. Bromberg,et al.  IL-10 immunosuppression in transplantation. , 1995, Current opinion in immunology.

[25]  Riadh Hammami,et al.  PhytAMP: a database dedicated to antimicrobial plant peptides , 2008, Nucleic Acids Res..

[26]  F. Albericio,et al.  The Pharmaceutical Industry in 2016. An Analysis of FDA Drug Approvals from a Perspective of the Molecule Type , 2017, Molecules.

[27]  Kumardeep Chaudhary,et al.  In Silico Models for Designing and Discovering Novel Anticancer Peptides , 2013, Scientific Reports.

[28]  Artem Cherkasov,et al.  BIOINFORMATICS ORIGINAL PAPER doi:10.1093/bioinformatics/btm068 Databases and ontologies AMPer: a database and an automated discovery tool for antimicrobial peptides , 2022 .

[29]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[30]  Gajendra P. S. Raghava,et al.  Analysis and prediction of antibacterial peptides , 2007, BMC Bioinformatics.

[31]  Yong Zhou,et al.  A Computational-Based Method for Predicting Drug-Target Interactions by Using Stacked Autoencoder Deep Neural Network , 2017, J. Comput. Biol..

[32]  Vineet K. Sharma,et al.  Prediction of anti-inflammatory proteins/peptides: an insilico approach , 2016, Journal of Translational Medicine.

[33]  Prabina Kumar Meher,et al.  Predicting antimicrobial peptides with improved accuracy by incorporating the compositional, physico-chemical and structural features into Chou’s general PseAAC , 2017, Scientific Reports.

[34]  Saravanan Vijayakumar,et al.  ACPP: A Web Server for Prediction and Design of Anti-cancer Peptides , 2014, International Journal of Peptide Research and Therapeutics.

[35]  Guillaume Castel,et al.  Phage Display of Combinatorial Peptide Libraries: Application to Antiviral Research , 2011, Molecules.

[36]  T. Hoffmann,et al.  Peptide therapeutics: current status and future directions. , 2015, Drug discovery today.

[37]  Lee Whitmore,et al.  The Peptaibol Database: a database for sequences and structures of naturally occurring peptaibols , 2004, Nucleic Acids Res..

[38]  Gajendra P. S. Raghava,et al.  Hemolytik: a database of experimentally determined hemolytic and non-hemolytic peptides , 2013, Nucleic Acids Res..

[39]  K. Chou,et al.  iAMP-2L: a two-level multi-label classifier for identifying antimicrobial peptides and their functional types. , 2013, Analytical biochemistry.

[40]  K. Chou,et al.  Prediction of Antimicrobial Peptides Based on Sequence Alignment and Feature Selection Methods , 2011, PloS one.

[41]  S. Piotto,et al.  YADAMP: yet another database of antimicrobial peptides. , 2012, International journal of antimicrobial agents.

[42]  Daniel J Rigden,et al.  Prediction of antimicrobial peptides based on the adaptive neuro-fuzzy inference system application. , 2012, Biopolymers.

[43]  Forest Baskett,et al.  An Algorithm for Finding Nearest Neighbors , 1975, IEEE Transactions on Computers.

[44]  Adam Godzik,et al.  Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences , 2006, Bioinform..

[45]  Hanmei Xu,et al.  DRAMP: a comprehensive data repository of antimicrobial peptides , 2016, Scientific Reports.

[46]  A. Mócsai,et al.  What is the future of targeted therapy in rheumatology: biologics or small molecules? , 2014, BMC Medicine.

[47]  Roger Beuerman,et al.  Defensins knowledgebase: a manually curated database and information source focused on the defensins family of antimicrobial peptides , 2006, Nucleic Acids Res..

[48]  Xingzhen Lao,et al.  Computational resources and tools for antimicrobial peptides , 2017, Journal of peptide science : an official publication of the European Peptide Society.

[49]  N. Kaplowitz,et al.  Drug-Induced Liver Injury , 2004, Clinical infectious diseases : an official publication of the Infectious Diseases Society of America.

[50]  Conan K. L. Wang,et al.  CyBase: a database of cyclic protein sequences and structures, with applications in protein discovery and engineering , 2007, Nucleic Acids Res..

[51]  Martin Mozina,et al.  Orange: data mining toolbox in python , 2013, J. Mach. Learn. Res..

[52]  K. Chou,et al.  iACP: a sequence-based tool for identifying anticancer peptides , 2016, Oncotarget.

[53]  Muhammad Iqbal,et al.  iACP-GAEnsC: Evolutionary genetic algorithm based ensemble classification of anticancer peptides by utilizing hybrid feature space , 2017, Artif. Intell. Medicine.

[54]  Gajendra P. S. Raghava,et al.  Computer-aided designing of immunosuppressive peptides based on IL-10 inducing potential , 2017, Scientific Reports.

[55]  M. V. Nogués,et al.  Discovering new in silico tools for antimicrobial peptide prediction. , 2012, Current drug targets.

[56]  Balachandran Manavalan,et al.  MLACP: machine-learning-based prediction of anticancer peptides , 2017, Oncotarget.

[57]  Amarda Shehu,et al.  Deep learning improves antimicrobial peptide recognition , 2018, Bioinform..

[58]  Guangmin Liang,et al.  A Novel Hybrid Sequence-Based Model for Identifying Anticancer Peptides , 2018, Genes.

[59]  Rui Liu,et al.  Discovery of Multitarget-Directed Ligands against Alzheimer's Disease through Systematic Prediction of Chemical-Protein Interactions , 2015, J. Chem. Inf. Model..

[60]  Shreyas Karnik,et al.  ClassAMP: A Prediction Tool for Classification of Antimicrobial Peptides , 2012, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[61]  Andrei Gabrielian,et al.  DBAASP v.2: an enhanced database of structure and antimicrobial/cytotoxic activity of natural and synthetic peptides , 2015, Nucleic acids research.

[62]  F. Albericio,et al.  The Pharmaceutical Industry in 2017. An Analysis of FDA Drug Approvals from the Perspective of Molecules , 2018, Molecules.

[63]  Chuang Liu,et al.  In silico polypharmacology of natural products , 2017, Briefings Bioinform..

[64]  Igor V. Tetko,et al.  Virtual Computational Chemistry Laboratory – Design and Description , 2005, J. Comput. Aided Mol. Des..

[65]  Yoshihiro Yamanishi,et al.  Prediction of drug–target interaction networks from the integration of chemical and genomic spaces , 2008, ISMB.

[66]  Tu T Ho,et al.  The polypharmacology of natural products. , 2018, Future medicinal chemistry.

[67]  Jyh-Shing Roger Jang,et al.  ANFIS: adaptive-network-based fuzzy inference system , 1993, IEEE Trans. Syst. Man Cybern..

[68]  Yadi Zhou,et al.  Prediction of chemical-protein interactions: multitarget-QSAR versus computational chemogenomic methods. , 2012, Molecular bioSystems.

[69]  Bakhtiar Affendi Rosdi,et al.  Prediction of Antimicrobial Peptides Based on Sequence Alignment and Support Vector Machine-Pairwise Algorithm Utilizing LZ-Complexity , 2015, BioMed research international.

[70]  C. Hawrylowicz,et al.  Potential role of interleukin-10-secreting regulatory T cells in allergy and asthma , 2005, Nature Reviews Immunology.

[71]  Yangyang He,et al.  Consensus models for CDK5 inhibitors in silico and their application to inhibitor discovery , 2014, Molecular Diversity.

[72]  Xia Li,et al.  APD2: the updated antimicrobial peptide database and its application in peptide design , 2008, Nucleic Acids Res..

[73]  Ian H. Witten,et al.  Data mining in bioinformatics using Weka , 2004, Bioinform..

[74]  Qi Wang,et al.  Discovery of neuroprotective compounds by machine learning approaches , 2016 .

[75]  K. Dohi,et al.  Allograft transduction of IL-10 prolongs survival following orthotopic liver transplantation , 1999, Gene Therapy.

[76]  Gajendra P. S. Raghava,et al.  AntiBP2: improved version of antibacterial peptide prediction , 2010, BMC Bioinformatics.

[77]  O L Franco,et al.  Computational tools for exploring sequence databases as a resource for antimicrobial peptides. , 2017, Biotechnology advances.

[78]  Faiza Hanif Waghu,et al.  CAMPR3: a database on sequences, structures and signatures of antimicrobial peptides , 2015, Nucleic Acids Res..

[79]  H. Mohabatkar,et al.  Predicting anticancer peptides with Chou's pseudo amino acid composition and investigating their mutagenicity via Ames test. , 2014, Journal of theoretical biology.

[80]  Faiza Hanif Waghu,et al.  CAMP: Collection of sequences and structures of antimicrobial peptides , 2013, Nucleic Acids Res..

[81]  Ming Wen,et al.  Deep-Learning-Based Drug-Target Interaction Prediction. , 2017, Journal of proteome research.

[82]  C. Tanford Contribution of Hydrophobic Interactions to the Stability of the Globular Conformation of Proteins , 1962 .

[83]  Luhua Lai,et al.  Deep Learning for Drug-Induced Liver Injury , 2015, J. Chem. Inf. Model..

[84]  J. Valadi,et al.  Recent trends in antimicrobial peptide prediction using machine learning techniques , 2017, Bioinformation.

[85]  Maqsood Hayat,et al.  "iSS-Hyb-mRMR": Identification of splicing sites using hybrid space of pseudo trinucleotide and pseudo tetranucleotide composition , 2016, Comput. Methods Programs Biomed..