GPCRtm: An amino acid substitution matrix for the transmembrane region of class A G Protein-Coupled Receptors

Background
Protein sequence alignment and database search methods use standard scoring matrices calculated from amino acid substitution frequencies in general sets of proteins. These general-purpose matrices are not optimal for accurately aligning sequences with marked compositional biases, such as the hydrophobic transmembrane regions found in membrane proteins. In this work, an amino acid substitution matrix (GPCRtm) is calculated for the membrane-spanning segments of the G protein-coupled receptor (GPCR) rhodopsin family, one of the largest transmembrane protein families in humans and of great importance in health and disease.

Results
The GPCRtm matrix reveals the amino acid compositional bias distinctive of the GPCR rhodopsin family and differs from other standard substitution matrices. As expected, these membrane receptors are characterized by a high content of hydrophobic residues relative to globular proteins. On the other hand, polar and charged residues are more abundant than in average membrane proteins, and they are frequently replaced by one another.

Conclusions
Analysis of amino acid frequencies and of the values in the GPCRtm matrix reveals patterns of residue replacement that differ from those of other standard substitution matrices. In the transmembrane regions, GPCRs prioritize the reactivity properties of the amino acids over their bulkiness. A distinctive feature is that charged and polar residues seem to evolve at different rates than other amino acids. This observation is related to the role of the transmembrane bundle in ligand binding, which in many cases involves electrostatic and hydrogen bond interactions. This new matrix can be useful in database searches and for the construction of more accurate sequence alignments of GPCRs.
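As an illustration of how a substitution matrix can be derived from substitution frequencies, the following is a minimal sketch of a BLOSUM-style log-odds calculation over ungapped alignment columns. It is not necessarily the procedure used to build GPCRtm: the function name `log_odds_matrix`, the pseudocount, the half-bit scaling, and the toy alignment are all assumptions made for this example.

```python
import math
from collections import Counter
from itertools import combinations


def log_odds_matrix(alignment, pseudocount=1.0):
    """Return {(a, b): score} in half-bit units for aligned sequences.

    Hypothetical helper: BLOSUM-style log-odds, s_ab = 2 * log2(q_ab / e_ab),
    where q_ab is the observed pair frequency and e_ab the frequency
    expected from the background composition alone.
    """
    # Count unordered residue pairs within each alignment column.
    pair_counts = Counter()
    for column in zip(*alignment):
        residues = [r for r in column if r != "-"]  # skip gap characters
        for a, b in combinations(residues, 2):
            pair_counts[tuple(sorted((a, b)))] += 1

    alphabet = sorted({r for pair in pair_counts for r in pair})
    pairs = [(a, b) for i, a in enumerate(alphabet) for b in alphabet[i:]]

    # Observed pair frequencies q_ab, smoothed with a pseudocount (assumption).
    total = sum(pair_counts[p] + pseudocount for p in pairs)
    q = {p: (pair_counts[p] + pseudocount) / total for p in pairs}

    # Background frequency of each residue implied by the pair frequencies.
    freq = {a: 0.0 for a in alphabet}
    for (a, b), f in q.items():
        if a == b:
            freq[a] += f
        else:
            freq[a] += f / 2
            freq[b] += f / 2

    # Log-odds score, stored symmetrically.
    scores = {}
    for (a, b), f in q.items():
        expected = freq[a] ** 2 if a == b else 2 * freq[a] * freq[b]
        scores[(a, b)] = scores[(b, a)] = 2 * math.log2(f / expected)
    return scores


# Toy transmembrane-like fragment, invented for illustration only.
toy_alignment = ["LLVK", "LIVK", "LLVR", "LLIK"]
toy_scores = log_odds_matrix(toy_alignment)
```

Positive scores mark pairs observed more often than the composition alone would predict, and negative scores mark pairs observed less often. Deriving the counts from transmembrane segments only, rather than from general protein sets, is what would let such a matrix reflect a compositional bias of the kind described above.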
