Protein motifs retrieval by SS terns occurrences

This paper describes a new approach to the analysis of protein 3D structure based on the Secondary Structure (SS) representation. The focus is here on structural motif retrieval. The strategy is derived from the Generalized Hough Transform (GHT), but considering as structural primitive element, the triplet of SSs. The triplet identity is evaluated on the triangle having the vertices on the SS midpoints, and is represented by the three midpoints distances. The motif is characterized by the complete set of triplets, so the Reference Table (RT) has a tuple for each triplet. Tuples contain, beside the discriminant component (the three edge lengths), the mapping rule, i.e. the Reference Point (RP) location referred to the triplet. In the macromolecule to be analyzed, each possible triplet is searched in the RT and every match gives a contribution to a candidate location of the RP. Presence and location of the searched motif are certified by the collection of a number of contribution equal (obviously in absence of noise and ambiguities) to the RT cardinality (i.e. the number of motif triplets). The approach is tested on twenty proteins selected randomly from the PDB, but having a different number of SSs ranging from 14 to 46. The retrieval of all possible structural blocks composed by three, four and five SSs (very compact and completely distributed) have been conducted. The results show valuable performances for precision and computation time.

[1]  Yi Zhong,et al.  Searching for three-dimensional secondary structural patterns in proteins with ProSMoS , 2007, Bioinform..

[2]  Nick V. Grishin,et al.  ProSMoS server: a pattern-based search using interaction matrix representation of protein structures , 2009, Nucleic Acids Res..

[3]  K. Chou,et al.  Design Novel Dual Agonists for Treating Type-2 Diabetes by Targeting Peroxisome Proliferator-Activated Receptors with Core Hopping Approach , 2012, PloS one.

[4]  Kuo-Chen Chou,et al.  Insights from Modeling the 3D Structure of New Delhi Metallo-β-Lactamse and Its Binding Interactions with Antibiotic Drugs , 2011, PloS one.

[5]  Kuo-Chen Chou,et al.  Coupling interaction between thromboxane A2 receptor and alpha-13 subunit of guanine nucleotide-binding protein. , 2005, Journal of proteome research.

[6]  K. Chou,et al.  iLoc-Hum: using the accumulation-label scale to predict subcellular locations of human proteins with both single and multiple sites. , 2012, Molecular bioSystems.

[7]  K. Chou,et al.  iLoc-Gpos: a multi-layer classifier for predicting the subcellular localization of singleplex and multiplex Gram-positive bacterial proteins. , 2012, Protein and peptide letters.

[8]  W Graham Richards,et al.  Classification of proteins based on similarity of two-dimensional protein maps. , 2008, Biophysical chemistry.

[9]  M. Esmaeili,et al.  Using the concept of Chou's pseudo amino acid composition for risk type prediction of human papillomaviruses. , 2010, Journal of theoretical biology.

[10]  K. Chou Some remarks on protein attribute prediction and pseudo amino acid composition , 2010, Journal of Theoretical Biology.

[11]  A. Giuliani,et al.  A computational approach identifies two regions of Hepatitis C Virus E1 protein as interacting domains involved in viral fusion process , 2009, BMC Structural Biology.

[12]  Lianping Yang,et al.  Use of information discrepancy measure to compare protein secondary structures , 2009 .

[13]  Yuan-Fang Wang,et al.  CTSS: a robust and efficient method for protein structure alignment based on local geometrical and biological features , 2003, Computational Systems Bioinformatics. CSB2003. Proceedings of the 2003 IEEE Bioinformatics Conference. CSB2003.

[14]  Kuo-Chen Chou,et al.  Insights into the Mutation-Induced HHH Syndrome from Modeling Human Mitochondrial Ornithine Transporter-1 , 2012, PloS one.

[15]  K. Chou Prediction of protein cellular attributes using pseudo‐amino acid composition , 2001, Proteins.

[16]  M. S. Madhusudhan,et al.  Biological insights from topology independent comparison of protein 3D structures , 2011, Nucleic acids research.

[17]  Jianxiu Guo,et al.  Predicting protein folding rates using the concept of Chou's pseudo amino acid composition , 2011, Journal of computational chemistry.

[18]  K. Chou,et al.  Prediction of protein structural classes. , 1995, Critical reviews in biochemistry and molecular biology.

[19]  Suyu Mei,et al.  Multi-kernel transfer learning based on Chou's PseAAC formulation for protein submitochondria localization. , 2012, Journal of theoretical biology.

[20]  Dongsheng Zou,et al.  Supersecondary structure prediction using Chou's pseudo amino acid composition , 2011, J. Comput. Chem..

[21]  Maqsood Hayat,et al.  Discriminating outer membrane proteins with Fuzzy K-nearest Neighbor algorithms based on the general form of Chou's PseAAC. , 2012, Protein and peptide letters.

[22]  K. Chou,et al.  Prediction of the tertiary structure and substrate binding site of caspase‐8 , 1997, FEBS letters.

[23]  K. Chou,et al.  Novel Inhibitor Design for Hemagglutinin against H1N1 Influenza Virus by Core Hopping Method , 2011, PloS one.

[24]  Chao Chen,et al.  Dual-layer wavelet SVM for predicting protein structural class via the general form of Chou's pseudo amino acid composition. , 2012, Protein and peptide letters.

[25]  K. Chou,et al.  REVIEW : Recent advances in developing web-servers for predicting protein attributes , 2009 .

[26]  Adam Godzik,et al.  Using an alignment of fragment strings for comparing protein structures , 2007, Bioinform..

[27]  K Henrick,et al.  Electronic Reprint Biological Crystallography Secondary-structure Matching (ssm), a New Tool for Fast Protein Structure Alignment in Three Dimensions Biological Crystallography Secondary-structure Matching (ssm), a New Tool for Fast Protein Structure Alignment in Three Dimensions , 2022 .

[28]  Chi-Ren Shyu,et al.  A fast protein structure retrieval system using image-based distance matrices and multidimensional index , 2004, Proceedings. Fourth IEEE Symposium on Bioinformatics and Bioengineering.

[29]  Ambuj K. Singh,et al.  PSI: indexing protein structures for fast similarity search , 2003, ISMB.

[30]  Kian-Lee Tan,et al.  Augmenting SSEs with structural properties for rapid protein structure comparison , 2003, Third IEEE Symposium on Bioinformatics and Bioengineering, 2003. Proceedings..

[31]  K. Chou Prediction of protein cellular attributes using pseudo‐amino acid composition , 2001 .