Codon number shapes peptide redundancy in the universal proteome composition

The proteomes catalogued in the UniRef100 database were collected into a single proteome set and examined for actual versus theoretical pentapeptide occurrences. We found a highly diversified degree of pentapeptide redundancy. Numerically, 953 pentamers are expressed only once in the protein world, whereas 103 pentamers occur more than 50,000 times. Moreover, it seems that 417 potentially possible pentapeptides are not present in the protein world. On the whole, tracing the redundancy profile of the protein world as a function of pentapeptide occurrences reveals a quasi-Gaussian curve, with tails representing scarcely and repeatedly occurring 5-mers. Analysis of physico-chemical-biological parameters shows that codon number is the main factor influencing and favoring specific pentapeptide frequencies in the universal proteome composition. That is, when compared to the set of never-expressed 5-mers, the pentapeptides frequently represented in the universal proteome are endowed with a higher number of multi-codonic amino acids. In contrast, the bulkiness degree and the hydrophobicity level play a smaller role. Unexpectedly, the heat of formation of pentapeptide appears to have the least influence.

[1]  D. Kanduc,et al.  Enhanced expression of initiator tRNAMet in human gastric and colorectal carcinoma , 1997, Biochemistry and molecular biology international.

[2]  J. Berger The genetic code and the origin of life , 1976, Acta biotheoretica.

[3]  D. Kanduc,et al.  Modulation of HPV16 E7 translation by tRNAs in eukaryotic cell‐free translation systems , 1998, Biochemistry and molecular biology international.

[4]  Anthony Kusalik,et al.  Rare peptide segments are found significantly more often in proto-oncoproteins than control proteins: implications for immunology and oncology , 2009, Journal of The Royal Society Interface.

[5]  Xuhua Xia,et al.  What Amino Acid Properties Affect Protein Evolution? , 1998, Journal of Molecular Evolution.

[6]  Darja Kanduc,et al.  Immunogenicity in peptide-immunotherapy: from self/nonself to similar/dissimilar sequences. , 2008, Advances in experimental medicine and biology.

[7]  J. Wade,et al.  Overview of Solid Phase Synthesis of “Difficult Peptide” Sequences , 2007, Current protocols in protein science.

[8]  M. Nakano,et al.  Synthetic pentapeptides inhibiting autophosphorylation of insulin receptor in a non‐ATP‐competitive mechanism , 2009, Journal of peptide science : an official publication of the European Peptide Society.

[9]  S. Tanabe Epitope peptides and immunotherapy. , 2007, Current protein & peptide science.

[10]  B. Persson,et al.  Characterization of oligopeptide patterns in large protein sets , 2007, BMC Genomics.

[11]  F. Marincola,et al.  Widespread and ample peptide overlapping between HCV and Homo sapiens proteomes , 2007, Peptides.

[12]  Darja Kanduc,et al.  Peptimmunology: immunogenic peptides and sequence redundancy. , 2005, Current drug discovery technologies.

[13]  K. Takano,et al.  A new scale for side-chain contribution to protein stability based on the empirical stability analysis of mutant proteins. , 2001, Protein engineering.

[14]  B. Trost,et al.  Peptidology: short amino acid modules in cell biology and immunology , 2007, Amino Acids.

[15]  A. Kusalik,et al.  Massive peptide sharing between viral and human proteomes , 2008, Peptides.

[16]  R. Dummer,et al.  Non‐self‐discrimination as a driving concept in the identification of an immunodominant HMW‐MAA epitopic peptide sequence by autoantibodies from melanoma cancer patients , 2004, International journal of cancer.

[17]  Reinhard Dummer,et al.  Proteomic Scan for Tyrosinase Peptide Antigenic Pattern in Vitiligo and Melanoma: Role of Sequence Similarity and HLA-DR1 Affinity1 , 2005, The Journal of Immunology.

[18]  J. M. Zimmerman,et al.  The characterization of amino acid sequences in proteins by statistical methods. , 1968, Journal of theoretical biology.

[19]  Ashwinikumar K. Naik,et al.  The Evolutionary History of the Translation Machinery , 2004 .

[20]  Darja Kanduc,et al.  Proteome-based epitopic peptide scanning along PSA. , 2009, Experimental and molecular pathology.

[21]  Romeu Cardoso Guimarães,et al.  Chapter 8 – Genetic Code: a Self-Referential and Functional Model , 2004 .

[22]  G. Pályi,et al.  Progress in biological chirality , 2004 .

[23]  A. Sigalov Multichain immune recognition receptor signaling from spatiotemporal organization to human disease. Preface. , 2008, Advances in experimental medicine and biology.