Detection of gene duplication signals of Ig folds from their amino acid sequences

Protein folds may evolve from short peptide ancestors via gene duplication and fusion. For proteins with internal structural symmetry, this means that their sequences should be made up of identical repeats. However, many of these repeat signals can only be seen at the structural level yet. Motivated by the fact that proteins may have similar structures if their sequences have more than 25% identical amino acids, we suggest a method to detect the sequence repeats of proteins directly from their sequences. Using this method, we show that the internal repetitions of the immunoglobulin folds could be identified directly at the sequence level. Proteins 2007. © 2007 Wiley‐Liss, Inc.

[1]  A. Mclachlan,et al.  Repeated helical pattern in apolipoprotein-A-I , 1977, Nature.

[2]  Ming-Jing Hwang,et al.  Alternative alignments from comparison of protein structures , 2004, Proteins.

[3]  Yi Xiao,et al.  A common sequence-associated physicochemical feature for proteins of beta-trefoil family , 2005, Comput. Biol. Chem..

[4]  D. Mount Bioinformatics: Sequence and Genome Analysis , 2001 .

[5]  A. Mclachlan,et al.  Repeated folding pattern in copper–zinc superoxide dismutase , 1980, Nature.

[6]  A. Mclachlan Three-fold structural pattern in the soybean trypsin inhibitor (Kunitz). , 1979, Journal of molecular biology.

[7]  A. Mclachlan,et al.  Repeating sequences and gene duplication in proteins. , 1972, Journal of molecular biology.

[8]  C. Orengo,et al.  Correlation of observed fold frequency with the occurrence of local structural motifs. , 1999, Journal of molecular biology.

[9]  Alfredo Colosimo,et al.  Nonlinear signal analysis methods in the elucidation of protein sequence-structure relationships. , 2002, Chemical reviews.

[10]  A. Mclachlan,et al.  Sequence repeats in α-tropomyosin , 1975 .

[11]  A. Mclachlan Gene duplication and the origin of repetitive protein structures. , 1987, Cold Spring Harbor symposia on quantitative biology.

[12]  Nikolai A. Kudryashov,et al.  Identification of Amino Acid Latent Periodicity within 94 Protein Families , 2006, J. Comput. Biol..

[13]  Evidence for gene duplication in collagen. , 1976, Journal of molecular biology.

[14]  A. Mclachlan,et al.  Analysis of gene duplication repeats in the myosin rod. , 1983, Journal of molecular biology.

[15]  A. Mclachlan,et al.  Structural repeats and evolution of tobacco mosaic virus coat protein and RNA. , 1980, Journal of molecular biology.

[16]  Matthias Wilmanns,et al.  Structural Evidence for Evolution of the b / a Barrel Scaffold by Gene Duplication and Fusion , 2022 .

[17]  Liisa Holm,et al.  Rapid automatic detection and alignment of repeats in protein sequences , 2000, Proteins.

[18]  S Rackovsky,et al.  "Hidden" sequence periodicities and protein architecture. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[19]  Jaap Heringa,et al.  Tracking repeats using significance and transitivity , 2004, ISMB/ECCB.

[20]  J. Söding,et al.  More than the sum of their parts: On the evolution of proteins from peptides , 2003, BioEssays : news and reviews in molecular, cellular and developmental biology.

[21]  M Wilmanns,et al.  Structural evidence for evolution of the beta/alpha barrel scaffold by gene duplication and fusion. , 2000, Science.

[22]  Birte Höcker,et al.  Dissection of a (βα)8-barrel enzyme into two folded halves , 2001, Nature Structural Biology.

[23]  D. Eisenberg,et al.  Correlation of sequence hydrophobicities measures similarity in three-dimensional protein structure. , 1983, Journal of molecular biology.