PrAS: Prediction of amidation sites using multiple feature extraction

Amidation plays an important role in a variety of pathological processes and serious diseases like neural dysfunction and hypertension. However, identification of protein amidation sites through traditional experimental methods is time consuming and expensive. In this paper, we proposed a novel predictor for Prediction of Amidation Sites (PrAS), which is the first software package for academic users. The method incorporated four representative feature types, which are position-based features, physicochemical and biochemical properties features, predicted structure-based features and evolutionary information features. A novel feature selection method, positive contribution feature selection was proposed to optimize features. PrAS achieved AUC of 0.96, accuracy of 92.1%, sensitivity of 81.2%, specificity of 94.9% and MCC of 0.76 on the independent test set. PrAS is freely available at https://sourceforge.net/p/praspkg.

[1]  Gary Walsh,et al.  Post-translational modifications in the context of therapeutic proteins , 2006, Nature Biotechnology.

[2]  Christodoulos A. Floudas,et al.  Proteome-wide post-translational modification statistics: frequency analysis and curation of the swiss-prot database , 2011, Scientific reports.

[3]  J. Simiand,et al.  SR146131: a new potent, orally active, and selective nonpeptide cholecystokinin subtype 1 receptor agonist. II. In vivo pharmacological characterization. , 1999, The Journal of pharmacology and experimental therapeutics.

[4]  Chihiro Nakajima,et al.  A new approach for detecting C‐terminal amidation of proteins and peptides by mass spectrometry in conjunction with chemical derivatization , 2009, Proteomics.

[5]  W. Atchley,et al.  Solving the protein sequence metric problem. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[6]  A. F. Bradbury,et al.  Enzyme‐catalysed peptide amidation , 1987 .

[7]  D T Jones,et al.  Protein secondary structure prediction based on position-specific scoring matrices. , 1999, Journal of molecular biology.

[8]  M. Finnie,et al.  Mechanism of C-terminal amide formation by pituitary enzymes , 1982, Nature.

[9]  P. Tamburini,et al.  Peptide substrate specificity of the alpha-amidating enzyme isolated from rat medullary thyroid CA-77 cells. , 2009, International journal of peptide and protein research.

[10]  Pierre Baldi,et al.  SCRATCH: a protein structure and structural feature prediction server , 2005, Nucleic Acids Res..

[11]  A Bairoch,et al.  High-throughput mass spectrometric discovery of protein post-translational modifications. , 1999, Journal of molecular biology.

[12]  Ning Zhang,et al.  Prediction of protein amidation sites by feature selection and analysis , 2013, Molecular Genetics and Genomics.

[13]  M. Eiden,et al.  C-terminal amidation of PACAP-38 and PACAP-27 is dispensable for biological activity at the PAC1 receptor , 2016, Peptides.

[14]  R. Mains,et al.  Peptide alpha-amidation. , 1988, Annual review of physiology.

[15]  R. C. Johnson,et al.  Neuropeptide Amidation in Drosophila: Separate Genes Encode the Two Enzymes Catalyzing Amidation , 1997, The Journal of Neuroscience.

[16]  Fuhui Long,et al.  Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy , 2003, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17]  S. Henikoff,et al.  Amino acid substitution matrices from protein blocks. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[18]  D. Merkler,et al.  C-terminal amidated peptides: production by the in vitro enzymatic amidation of glycine-extended peptides and the importance of the amide to bioactivity. , 1994, Enzyme and microbial technology.

[19]  A. Edison,et al.  Conformational Ensembles: The Role of Neuropeptide Structures in Receptor Binding , 1999, The Journal of Neuroscience.

[20]  Hiroyuki Ogata,et al.  AAindex: Amino Acid Index Database , 1999, Nucleic Acids Res..

[21]  Toshifumi Takao,et al.  Peptidomic Identification and Biological Validation of Neuroendocrine Regulatory Peptide-1 and -2* , 2007, Journal of Biological Chemistry.

[22]  Yong-Zi Chen,et al.  Prediction of Ubiquitination Sites by Using the Composition of k-Spaced Amino Acid Pairs , 2011, PloS one.

[23]  Johannes Söding,et al.  The MPI bioinformatics Toolkit as an integrative platform for advanced protein sequence and structure analysis , 2016, Nucleic Acids Res..

[24]  E. Koonin,et al.  A universal trend of amino acid gain and loss in protein evolution , 2005, Nature.

[25]  R. Mains,et al.  The biosynthesis of neuropeptides: peptide alpha-amidation. , 1992, Annual review of neuroscience.

[26]  Bernard F. Buxton,et al.  The DISOPRED server for the prediction of protein disorder , 2004, Bioinform..

[27]  G P Mueller,et al.  Differential regulation of peptide alpha-amidation by dexamethasone and disulfiram. , 1999, Molecular pharmacology.

[28]  Sarah Rachel Dennison,et al.  The effect of C-terminal amidation on the efficacy and selectivity of antimicrobial and anticancer peptides , 2009, Molecular and Cellular Biochemistry.

[29]  William C. Wetsel,et al.  Interactions of peptide amidation and copper: Novel biomarkers and mechanisms of neural dysfunction , 2010, Neurobiology of Disease.