Homology Searches Using Supersecondary Structure Code.

Supersecondary structure code (SSSC), which is represented as the combination of α-helix-type (SSSC: H), β-sheet-type (SSSC: S), the other (SSSC: T), and disorder residue or C-terminal (SSSC: D) patterns, has been produced by the developed concept of Ramachandran plot, in addition, with the ω angle and with the specification of positions of torsion angles in a protein by the registration of codes for torsion angles of each amino acid peptide unit, derived from the fuzzy search of structural code homology using the template patterns 3a5c4a (SSSC: H) and 6c4a4a (SSSC: S) with conformational codes. The DSSP (Dictionary of Secondary Structure in Proteins) method assigns the secondary structure including hydrogen bond well. In contrast, supersecondary structure code is very sensitive to the supersecondary structures of proteins. In this chapter, the protocol of homology search methods, the sequence alignment using supersecondary structure code, the assignment of supersecondary structure code T, the fuzzy search using supersecondary structure code, and the exact search using supersecondary structure code are described. Supersecondary structure code is variable with the conformational change. If possible, many Protein Data Bank (PDB) data of similar main chains of proteins should be used for the homology searches. The thorough check of SSSC sequences is also useful to reveal the role of target pattern.

[1]  Rodrigo Lopez,et al.  Clustal W and Clustal X version 2.0 , 2007, Bioinform..

[2]  Kazutaka Katoh,et al.  Application of the MAFFT sequence alignment program to large data—reexamination of the usefulness of chained guide trees , 2016, Bioinform..

[3]  W. Kabsch,et al.  Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features , 1983, Biopolymers.

[4]  A. Gustchina,et al.  On the supersecondary structure of acid proteases. , 1979, Biochemical and biophysical research communications.

[5]  G. N. Ramachandran,et al.  Stereochemistry of polypeptide chain configurations. , 1963, Journal of molecular biology.

[6]  Laurence A. Nafie,et al.  Data Mining of Supersecondary Structure Homology between Light Chains of Immunogloblins and MHC Molecules: Absence of the Common Conformational Fragment in the Human IgM Rheumatoid Factor , 2013, J. Chem. Inf. Model..

[7]  Axel T Brunger,et al.  Exploring the structural dynamics of the E.coli chaperonin GroEL using translation-libration-screw crystallographic refinement of intermediate states. , 2004, Journal of molecular biology.

[8]  Ian W. Davis,et al.  Structure validation by Cα geometry: ϕ,ψ and Cβ deviation , 2003, Proteins.

[9]  Bosco K. Ho,et al.  The Ramachandran plots of glycine and pre-proline , 2005, BMC Structural Biology.

[10]  Gert Vriend,et al.  Everyday , 2020, Oxford Research Encyclopedia of Literature.

[11]  Cecilia Bartolucci,et al.  Crystal structure of wild-type chaperonin GroEL. , 2005, Journal of molecular biology.

[12]  F. Richards,et al.  Identification of structural motifs from protein coordinate data: Secondary structure and first‐level supersecondary structure * , 1988, Proteins.

[13]  K. Katoh,et al.  MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. , 2002, Nucleic acids research.

[14]  L. Nafie,et al.  Three-Dimensional Chemical Structure Search Using the Conformational Code for Organic Molecules (CCOM) Program. , 2016, Chirality.

[15]  G J Kleywegt,et al.  Phi/psi-chology: Ramachandran revisited. , 1996, Structure.