Improving protein alignment algorithms using amino-acid hydrophobicities - Applications of TMATCH, A new algorithms

Motivation Sequence database search and matching algorithms are an important tool when trying to understand the structure (and so the function) of proteins. Proteins with similar structure and function often have very similar primary structure. There are however many cases where proteins with similar structure have very different primary structures. Substitution matrices (PAM, BLOSUM, Gonnett) can be used to identify proteins of similar structure, but they fail when the sequence similarity falls below about 25%. Results We have described a new algorithm for examining the the primary structure of proteins against a database of known proteins with a new hydrophobicity index. In this paper, we examine the ability of TMATCH to identify proteins of similar structure using sequence matching with the hydrophobicity index. We compare results from TMATCH with those obtained using FASTA and PSI-BLAST. We show that by using similarity patterns spread across the entire length of two proteins we get a more robust indicator of remote relatedness than relying upon high similarity scoring pair regions. Availability The program TMATCH is available on request Contact chitturk@uah.edu

[1]  Y. Kawarabayasi,et al.  Specific interactions of three proliferating cell nuclear antigens with replication‐related proteins in Aeropyrum pernix , 2007, Molecular microbiology.

[2]  Oliviero Carugo,et al.  Atom depth in protein structure and function. , 2003, Trends in biochemical sciences.

[3]  Haruki Nakamura,et al.  Announcing the worldwide Protein Data Bank , 2003, Nature Structural Biology.

[4]  David P. Cavanaugh,et al.  A hydrophobic proclivity index for protein alignments , 2015, F1000Research.

[5]  William R. Pearson,et al.  Identifying distantly related protein sequences , 1991, Comput. Appl. Biosci..

[6]  K. Palczewski,et al.  Crystal Structure of Rhodopsin: A G‐Protein‐Coupled Receptor , 2002, Chembiochem : a European journal of chemical biology.

[7]  R. Agarwala,et al.  Protein database searches using compositionally adjusted substitution matrices , 2005, The FEBS journal.

[8]  S. Bell,et al.  A heterotrimeric PCNA in the hyperthermophilic archaeon Sulfolobus solfataricus. , 2003, Molecular cell.

[9]  S. Vries,et al.  Mitochondrial cytochrome b: evolution and structure of the protein. , 1993, Biochimica et biophysica acta.

[10]  Krishnan K. Chittur,et al.  F1000Research TMATCH: A New Algorithm for Protein Alignments using amino-acid hydrophobicities , 2019 .

[11]  Y. Ishino,et al.  Stoichiometric complex formation by proliferating cell nuclear antigen (PCNA) and its interacting protein: purification and crystallization of the DNA polymerase and PCNA monomer mutant complex from Pyrococcus furiosus. , 2006, Acta crystallographica. Section F, Structural biology and crystallization communications.

[12]  P. Andrew Karplus,et al.  Hydrophobicity regained: Hydrophobicity regained , 1997 .

[13]  Takashi Itoh,et al.  Thermococcus thioreducens sp. nov., a novel hyperthermophilic, obligately sulfur-reducing archaeon from a deep-sea hydrothermal vent. , 2007, International journal of systematic and evolutionary microbiology.

[14]  Damien Marsic,et al.  New DNA polymerase from the hyperthermophilic marine archaeon Thermococcus thioreducens , 2008, Extremophiles.

[15]  S. Karlin,et al.  Methods for assessing the statistical significance of molecular sequence features by using general scoring schemes. , 1990, Proceedings of the National Academy of Sciences of the United States of America.

[16]  T K Attwood,et al.  Fingerprinting G-protein-coupled receptors. , 1994, Protein engineering.

[17]  Thomas L. Madden,et al.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. , 1997, Nucleic acids research.

[18]  Oliviero Carugo,et al.  Atom depth as a descriptor of the protein interior. , 2003, Biophysical journal.

[19]  Sequence homology and structural similarity between cytochrome b of mitochondrial complex III and the chloroplast b6-f complex: position of the cytochrome b hemes in the membrane. , 1984, Proceedings of the National Academy of Sciences of the United States of America.