StackCPPred: a stacking and pairwise energy content-based prediction of cell-penetrating peptides and their uptake efficiency

MOTIVATION Cell-penetrating peptides (CPPs) are a vehicle for transporting into living cells pharmacologically active molecules, such as short interfering RNAs, nanoparticles, plasmid DNAs, and small peptides, thus offering great potential as future therapeutics. Existing experimental techniques for identifying CPPs are time-consuming and expensive. Thus, the prediction of CPPs from peptide sequences by using computational methods can be useful to annotate and guide the experimental process quickly. Many machine learning-based methods have recently emerged for identifying CPPs. Although considerable progress has been made, existing methods still have low feature representation capabilities, thereby limiting further performance improvements. RESULTS We propose a method called StackCPPred, which proposes three feature methods on the basis of the pairwise energy content of the residue as follows: RECM-composition, PseRECM, and RECM-DWT. These features are used to train stacking-based machine learning methods to effectively predict CPPs. On the basis of the CPP924 and CPPsite3 datasets with jackknife validation, StackDPPred achieved 94.5% and 78.3% accuracy, which was 2.9% and 5.8% higher than the state-of-the-art CPP predictors, respectively. StackCPPred can be a powerful tool for predicting CPPs and their uptake efficiency, facilitating hypothesis-driven experimental design and accelerating their applications in clinical therapy. AVAILABILITY Source code and data can be downloaded from https://github.com/Excelsior511/StackCPPred. SUPPLEMENTARY INFORMATION Supplementary data are available online at Bioinformatics.

[1]  Mark J. Shensa,et al.  The discrete wavelet transform: wedding the a trous and Mallat algorithms , 1992, IEEE Trans. Signal Process..

[2]  K. Chou Prediction of protein cellular attributes using pseudo‐amino acid composition , 2001, Proteins.

[3]  P. Tompa,et al.  The pairwise energy content estimated from amino acid composition discriminates between folded and intrinsically unstructured proteins. , 2005, Journal of molecular biology.

[4]  Chris H. Q. Ding,et al.  Minimum Redundancy Feature Selection from Microarray Gene Expression Data , 2005, J. Bioinform. Comput. Biol..

[5]  Kuo-Chen Chou,et al.  MemType-2L: a web server for predicting membrane proteins and their types by incorporating evolution information through Pse-PSSM. , 2007, Biochemical and biophysical research communications.

[6]  Ű. Langel,et al.  Predicting cell-penetrating peptides. , 2008, Advanced drug delivery reviews.

[7]  Tarmo Tamm,et al.  Prediction of Cell-Penetrating Peptides Using Artificial Neural Networks. , 2010, Current computer-aided drug design.

[8]  Susan M. Bridges,et al.  Prediction of Cell Penetrating Peptides by Support Vector Machines , 2011, PLoS Comput. Biol..

[9]  Loris Nanni,et al.  Wavelet images and Chou’s pseudo amino acid composition for protein classification , 2011, Amino Acids.

[10]  Jörg Gsponer,et al.  Intrinsically disordered proteins: regulation and disease. , 2011, Current opinion in structural biology.

[11]  F. Milletti,et al.  Cell-penetrating peptides: classes, origin, and current landscape. , 2012, Drug discovery today.

[12]  Tianyue Jiang,et al.  Dual-functional liposomes based on pH-responsive cell-penetrating peptide and hyaluronic acid for tumor-targeted anticancer drug delivery. , 2012, Biomaterials.

[13]  Kumardeep Chaudhary,et al.  Cell Penetrating Peptides , 2016 .

[14]  M. Tanimoto,et al.  Tumour lineage-homing cell-penetrating peptides as anticancer molecular delivery systems , 2012, Nature Communications.

[15]  D. Selwood,et al.  siRNA Delivery: From Lipids to Cell‐penetrating Peptides and Their Mimics , 2012, Chemical biology & drug design.

[16]  Gianluca Pollastri,et al.  CPPpred: prediction of cell penetrating peptides , 2013, Bioinform..

[17]  S. Reissmann Cell penetration: scope and limitations by the application of cell‐penetrating peptides , 2014, Journal of peptide science : an official publication of the European Peptide Society.

[18]  Shengrong Guo,et al.  Recent progress of cell-penetrating peptides as new carriers for intracellular cargo delivery. , 2014, Journal of controlled release : official journal of the Controlled Release Society.

[19]  Loris Nanni,et al.  An Empirical Study of Different Approaches for Protein Classification , 2014, TheScientificWorldJournal.

[20]  K. Chou,et al.  iRNA-Methyl: Identifying N(6)-methyladenosine sites using pseudo nucleotide composition. , 2015, Analytical biochemistry.

[21]  P. Boisguérin,et al.  Delivery of therapeutic oligonucleotides with cell penetrating peptides☆ , 2015, Advanced Drug Delivery Reviews.

[22]  E. Ruoslahti,et al.  Neuropilin-1 and heparan sulfate proteoglycans cooperate in cellular uptake of nanoparticles functionalized by cationic cell-penetrating peptides , 2015, Science Advances.

[23]  G. Radicioni,et al.  Characterization of the cell penetrating properties of a human salivary proline-rich peptide. , 2015, Biochimica et biophysica acta.

[24]  Chen Chu,et al.  Prediction and analysis of cell-penetrating peptides using pseudo-amino acid composition and random forest models , 2015, Amino Acids.

[25]  Dariusz Mrozek,et al.  Scaling Ab Initio Predictions of 3D Protein Structures in Microsoft Azure Cloud , 2015, Journal of Grid Computing.

[26]  David Zhang,et al.  Feature selection and analysis on correlated gas sensor data with recursive feature elimination , 2015 .

[27]  Yue-Wern Huang,et al.  Delivery of Nucleic Acids and Nanomaterials by Cell-Penetrating Peptides: Opportunities and Challenges , 2015, BioMed research international.

[28]  Sumaiya Iqbal,et al.  DisPredict: A Predictor of Disordered Protein Using Optimized RBF Kernel , 2015, PloS one.

[29]  D. Raucher,et al.  Cell-penetrating peptides: strategies for anticancer treatment. , 2015, Trends in molecular medicine.

[30]  James G. Lyons,et al.  Predict Gram-Positive and Gram-Negative Subcellular Localization via Incorporating Evolutionary Information and Physicochemical Features Into Chou's General PseAAC , 2015, IEEE Transactions on NanoBioscience.

[31]  David T. Jones,et al.  DISOPRED3: precise disordered region predictions with annotated protein-binding activity , 2014, Bioinform..

[32]  Gabriel del Rio,et al.  Effective Design of Multifunctional Peptides by Combining Compatible Functions , 2016, PLoS Comput. Biol..

[33]  Wei Chen,et al.  Prediction of cell-penetrating peptides with feature selection techniques. , 2016, Biochemical and biophysical research communications.

[34]  Yaoqi Zhou,et al.  sDFIRE: Sequence‐specific statistical energy function for protein structure prediction by decoy selections , 2016, J. Comput. Chem..

[35]  R. Batchu,et al.  Pancreatic Cancer Cell Lysis by Cell-Penetrating Peptide-MAGE-A3-Induced Cytotoxic T Lymphocytes. , 2016, JAMA surgery.

[36]  N. Kamei,et al.  Applicability and Limitations of Cell-Penetrating Peptides in Noncovalent Mucosal Drug or Carrier Delivery Systems. , 2016, Journal of pharmaceutical sciences.

[37]  Q. Zou,et al.  SkipCPP-Pred: an improved and promising sequence-based predictor for predicting cell-penetrating peptides , 2017, BMC Genomics.

[38]  Hexokinase II–derived cell-penetrating peptide targets mitochondria and triggers apoptosis in cancer cells , 2017, FASEB journal : official publication of the Federation of American Societies for Experimental Biology.

[39]  Gaotao Shi,et al.  CPPred-RF: A Sequence-based Predictor for Identifying Cell-Penetrating Peptides and Their Uptake Efficiency. , 2017, Journal of proteome research.

[40]  Avdesh Mishra,et al.  Three-Dimensional Ideal Gas Reference State based Energy Function , 2017 .

[41]  Balachandran Manavalan,et al.  MLACP: machine-learning-based prediction of anticancer peptides , 2017, Oncotarget.

[42]  Zsuzsanna Dosztányi,et al.  IUPred2A: context-dependent prediction of protein disorder as a function of redox state and protein binding , 2018, Nucleic Acids Res..

[43]  Nithin V. George,et al.  KELM-CPPpred: Kernel Extreme Learning Machine Based Prediction Model for Cell-Penetrating Peptides. , 2018, Journal of proteome research.

[44]  Zhen Cao,et al.  The lncLocator: a subcellular localization predictor for long non‐coding RNAs based on a stacked ensemble classifier , 2018, Bioinform..

[45]  Balachandran Manavalan,et al.  Machine-Learning-Based Prediction of Cell-Penetrating Peptides and Their Uptake Efficiency with Improved Accuracy. , 2018, Journal of proteome research.

[46]  Sumaiya Iqbal,et al.  RBSURFpred: Modeling protein accessible surface area in real and binary space using regularized and optimized regression. , 2018, Journal of theoretical biology.

[47]  Justin M. Wolfe,et al.  Machine Learning To Predict Cell-Penetrating Peptides for Antisense Delivery , 2018, ACS central science.

[48]  Xiujun Gong,et al.  A Model Stacking Framework for Identifying DNA Binding Proteins by Orchestrating Multi-View Features and Classifiers , 2018, Genes.

[49]  Lihong Peng,et al.  Improved DNA-Binding Protein Identification by Incorporating Evolutionary Information Into the Chou’s PseAAC , 2018, IEEE Access.

[50]  Balachandran Manavalan,et al.  iGHBP: Computational identification of growth hormone binding proteins from sequences using extremely randomised tree , 2018, Computational and structural biotechnology journal.

[51]  Sumaiya Iqbal,et al.  PBRpredict-Suite: a suite of models to predict peptide-recognition domain residues from protein sequence , 2018, Bioinform..

[52]  Avdesh Mishra,et al.  StackDPPred: a stacking based prediction of DNA‐binding protein from sequence , 2018, Bioinform..

[53]  Jie Hu,et al.  Empirical comparison and analysis of web-based cell-penetrating peptide prediction tools , 2019, Briefings Bioinform..

[54]  Ran Su,et al.  CPPred-FL: a sequence-based predictor for large-scale identification of cell-penetrating peptides by feature representation learning , 2018, Briefings Bioinform..