AMP0: Species-Specific Prediction of Anti-microbial Peptides Using Zero and Few Shot Learning

Evolution of drug-resistant microbial species is one of the major challenges to global health. Development of new antimicrobial treatments such as antimicrobial peptides needs to be accelerated to combat this threat. However, the discovery of novel antimicrobial peptides is hampered by low-throughput biochemical assays. Computational techniques can be used for rapid screening of promising antimicrobial peptide candidates prior to testing in the wet lab. The vast majority of existing antimicrobial peptide predictors are non-targeted in nature, i.e., they can predict whether a given peptide sequence is antimicrobial, but they are unable to predict whether the sequence can target a particular microbial species. In this work, we have used zero and few shot machine learning to develop a targeted antimicrobial peptide activity predictor called AMP0. The proposed predictor takes the sequence of a peptide and any N/C-termini modifications together with the genomic sequence of a microbial species to generate targeted predictions. Cross-validation results show that the proposed scheme is particularly effective for targeted antimicrobial prediction in comparison to existing approaches and can be used for screening potential antimicrobial peptides in a targeted manner with only a small number of training examples for novel species. AMP0 webserver is available at http://ampzero.pythonanywhere.com.

[1]  M. Blaser,et al.  Evolutionary implications of microbial genome tetranucleotide frequency biases. , 2003, Genome research.

[2]  S Karlin,et al.  Comparisons of eukaryotic genomic sequences. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[3]  Nikos Komodakis,et al.  Dynamic Few-Shot Visual Learning Without Forgetting , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[4]  F. Mandl,et al.  Thinking Outside the Box-Novel Antibacterials To Tackle the Resistance Crisis. , 2018, Angewandte Chemie.

[5]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[6]  M. Natália D. S. Cordeiro,et al.  First Multitarget Chemo-Bioinformatic Model To Enable the Discovery of Antibacterial Peptides against Multiple Gram-Positive Pathogens , 2016, J. Chem. Inf. Model..

[7]  K Nishikawa,et al.  Differences in dinucleotide frequencies of human, yeast, and Escherichia coli genes. , 1997, DNA research : an international journal for rapid publication of reports on genes and genomes.

[8]  A. Ben-Hur,et al.  PAIRpred: Partner‐specific prediction of interacting residues from sequence and structure , 2014, Proteins.

[9]  S. Karlin,et al.  Comparative DNA analysis across diverse genomes. , 1998, Annual review of genetics.

[10]  Samy Bengio,et al.  Zero-Shot Learning by Convex Combination of Semantic Embeddings , 2013, ICLR.

[11]  M. Willcox,et al.  A Pilot Study of the Synergy between Two Antimicrobial Peptides and Two Common Antibiotics , 2019, Antibiotics.

[12]  W. Fontes,et al.  Influence of N‐terminus modifications on the biological activity, membrane interaction, and secondary structure of the antimicrobial peptide hylin‐a1 , 2011, Biopolymers.

[13]  Wei Wang,et al.  Antibiotic resistance: a rundown of a global crisis , 2018, Infection and drug resistance.

[14]  Simon Fong,et al.  AmPEP: Sequence-based prediction of antimicrobial peptides using distribution patterns of amino acid properties and random forest , 2018, Scientific Reports.

[15]  Hugo Larochelle,et al.  Optimization as a Model for Few-Shot Learning , 2016, ICLR.

[16]  C. L. Ventola The antibiotic resistance crisis: part 1: causes and threats. , 2015, P & T : a peer-reviewed journal for formulary management.

[17]  Scott J. Hultgren,et al.  Precision antimicrobial therapeutics: the path of least resistance? , 2018, npj Biofilms and Microbiomes.

[18]  Maria Luisa Mangoni,et al.  Effect of natural L- to D-amino acid conversion on the organization, membrane binding, and biological function of the antimicrobial peptides bombinins H. , 2006, Biochemistry.

[19]  V. V. Kleandrova,et al.  Enabling the Discovery and Virtual Screening of Potent and Safe Antimicrobial Peptides. Simultaneous Prediction of Antibacterial Activity and Cytotoxicity. , 2016, ACS combinatorial science.

[20]  Geoffrey E. Hinton,et al.  Zero-shot Learning with Semantic Output Codes , 2009, NIPS.

[21]  William Stafford Noble,et al.  Empirical comparison of web‐based antimicrobial peptide prediction tools , 2017, Bioinform..

[22]  Tao Xiang,et al.  Learning to Compare: Relation Network for Few-Shot Learning , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[23]  Sadaf Gull,et al.  AMAP: Hierarchical multi-label prediction of biologically active and antimicrobial peptides , 2019, Comput. Biol. Medicine.

[24]  Andrei Gabrielian,et al.  Predictive Model of Linear Antimicrobial Peptides Active against Gram-Negative Bacteria , 2018, J. Chem. Inf. Model..

[25]  Joan Bruna,et al.  Few-Shot Learning with Graph Neural Networks , 2017, ICLR.

[26]  S. Karlin,et al.  Dinucleotide relative abundance extremes: a genomic signature. , 1995, Trends in genetics : TIG.

[27]  XiangTao,et al.  Transductive Multi-View Zero-Shot Learning , 2015 .

[28]  Marc Torrent,et al.  A theoretical approach to spot active regions in antimicrobial proteins , 2009, BMC Bioinformatics.

[29]  Shaogang Gong,et al.  Semantic Autoencoder for Zero-Shot Learning , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[30]  Naruya Saitou,et al.  Estimation of bacterial species phylogeny through oligonucleotide frequency distances. , 2009, Genomics.

[31]  Miguel A. de Pedro,et al.  Emerging knowledge of regulatory roles of d-amino acids in bacteria , 2010, Cellular and Molecular Life Sciences.

[32]  Y. Tateno,et al.  Structural and Functional Differences in Two Cyclic Bacteriocins with the Same Sequences Produced by Lactobacilli , 2004, Applied and Environmental Microbiology.

[33]  Guozhi Yu,et al.  Predicting drug resistance evolution: insights from antimicrobial peptides and antibiotics , 2017, bioRxiv.

[34]  Faiza Hanif Waghu,et al.  CAMPR3: a database on sequences, structures and signatures of antimicrobial peptides , 2015, Nucleic Acids Res..

[35]  K Nishikawa,et al.  Genes from nine genomes are separated into their organisms in the dinucleotide composition space. , 1998, DNA research : an international journal for rapid publication of reports on genes and genomes.

[36]  Mark Goadrich,et al.  The relationship between Precision-Recall and ROC curves , 2006, ICML.

[37]  Philip H. S. Torr,et al.  An embarrassingly simple approach to zero-shot learning , 2015, ICML.

[38]  S. Karlin,et al.  Global dinucleotide signatures and analysis of genomic heterogeneity. , 1998, Current opinion in microbiology.

[39]  Richard S. Zemel,et al.  Prototypical Networks for Few-shot Learning , 2017, NIPS.

[40]  Virapong Prachayasittikul,et al.  HemoPred: a web server for predicting the hemolytic activity of peptides. , 2017, Future medicinal chemistry.

[41]  Dong Xu,et al.  Imbalanced multi-label learning for identifying antimicrobial peptides and their functional types , 2016, Bioinform..

[42]  Gajendra P. S. Raghava,et al.  Prediction of Antimicrobial Potential of a Chemically Modified Peptide From Its Tertiary Structure , 2018, Front. Microbiol..

[43]  Eleazar Eskin,et al.  The Spectrum Kernel: A String Kernel for SVM Protein Classification , 2001, Pacific Symposium on Biocomputing.

[44]  Andrei Gabrielian,et al.  DBAASP v.2: an enhanced database of structure and antimicrobial/cytotoxic activity of natural and synthetic peptides , 2015, Nucleic acids research.

[45]  Andrew Y. Ng,et al.  Zero-Shot Learning Through Cross-Modal Transfer , 2013, NIPS.

[46]  Peng Qiu,et al.  Long short-term memory recurrent neural networks for antibacterial peptide identification , 2017, 2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[47]  Venkatesh Saligrama,et al.  Zero-Shot Learning via Semantic Similarity Embedding , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[48]  Jessica M. A. Blair A climate for antibiotic resistance , 2018, Nature Climate Change.

[49]  M. Martins,et al.  Clinical Application of AMPs. , 2019, Advances in experimental medicine and biology.

[50]  Tianqi Chen,et al.  XGBoost: A Scalable Tree Boosting System , 2016, KDD.

[51]  Ashutosh Kumar Singh,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction , 2010 .