Graph Signal Processing on protein residue networks helps in studying its biophysical properties

Understanding the physical and chemical properties of proteins is vital, and many efforts have been made to study the emergent properties of the macro-molecules as a combination of long chains of amino acids. Here, we present a graph signal processing based approach to model the biophysical property of proteins. For each protein inter-residue proximity-based network is used as basis graph and the respective amino acid properties are used as node-signals. Signals on nodes are decomposed on network’s Laplacian eigenbasis using graph Fourier transformations. We found that the intensity in low-frequency components of graph signals of residue features could be used to model few biophysical properties of proteins. Specifically, using our approach, we could model protein folding-rate, globularity and fraction of alpha-helices and beta-sheets. Our approach also allows amalgamation of different types of chemical and graph theoretic properties of residue to be used together in a multi-variable regression model to predict biophysical properties.

[1]  N. Go RESPECTIVE ROLES OF SHORT- AND LONG-RANGE INTERACTIONS IN PROTEIN FOLDING , 1981 .

[2]  P. Sen,et al.  Introduction to bivariate and multivariate analysis , 1981 .

[3]  R. Doolittle,et al.  A simple method for displaying the hydropathic character of a protein. , 1982, Journal of molecular biology.

[4]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[5]  Roger Sauter,et al.  Introduction to Probability and Statistics for Engineers and Scientists , 2005, Technometrics.

[6]  Ganesh Bagler,et al.  Network properties of protein structures , 2004, q-bio/0408009.

[7]  M. Michael Gromiha,et al.  FOLD-RATE: prediction of protein folding rates from amino acid sequence , 2006, Nucleic Acids Res..

[8]  H. Scheraga,et al.  The role of hydrophobic interactions in initiation and propagation of protein folding , 2006, Proceedings of the National Academy of Sciences.

[9]  Ganesh Bagler,et al.  Assortative mixing in Protein Contact Networks and protein folding kinetics , 2007, Bioinform..

[10]  Juan Liu,et al.  Respective roles of short-and long-range interactions in protein folding , 2004, Wuhan University Journal of Natural Sciences.

[11]  C. Kuo-chen,et al.  FoldRate: A Web-Server for Predicting Protein Folding Rates from Primary Sequence , 2009 .

[12]  Jiangning Song,et al.  Prediction of protein folding rates from primary sequence by fusing multiple sequential features , 2009 .

[13]  Yi Xiao,et al.  Improvements of network approach for analysis of the folding free‐energy surface of peptides and proteins , 2010, J. Comput. Chem..

[14]  L. Greene Protein structure networks. , 2012, Briefings in functional genomics.

[15]  R. Nussinov,et al.  Send Orders of Reprints at Reprints@benthamscience.net Allo-network Drugs: Extension of the Allosteric Drug Concept to Protein- Protein Interaction and Signaling Networks , 2022 .

[16]  A. Finkelstein,et al.  Restrictions to protein folding determined by the protein size , 2013, FEBS letters.

[17]  Pascal Frossard,et al.  The emerging field of signal processing on graphs: Extending high-dimensional data analysis to networks and other irregular domains , 2012, IEEE Signal Processing Magazine.

[18]  X. Xiao,et al.  Swfoldrate: Predicting protein folding rates from amino acid sequence with sliding window method , 2013, Proteins.

[19]  José M. F. Moura,et al.  Discrete Signal Processing on Graphs: Frequency Analysis , 2013, IEEE Transactions on Signal Processing.

[20]  Bairong Shen,et al.  The construction of an amino acid network for understanding protein structure and function , 2014, Amino Acids.

[21]  Jiangning Song,et al.  Towards more accurate prediction of protein folding rates: a review of the existing web-based bioinformatics approaches , 2015, Briefings Bioinform..

[22]  Nita Parekh,et al.  NAPS: Network Analysis of Protein Structures , 2016, Nucleic Acids Res..

[23]  Yongdong Zhang,et al.  Boosted Near-miss Under-sampling on SVM ensembles for concept detection in large-scale imbalanced datasets , 2016, Neurocomputing.

[24]  U. Feige,et al.  Spectral Graph Theory , 2015 .