Surface antigens and potential virulence factors from parasites detected by comparative genomics of perfect amino acid repeats

BackgroundMany parasitic organisms, eukaryotes as well as bacteria, possess surface antigens with amino acid repeats. Making up the interface between host and pathogen such repetitive proteins may be virulence factors involved in immune evasion or cytoadherence. They find immunological applications in serodiagnostics and vaccine development. Here we use proteins which contain perfect repeats as a basis for comparative genomics between parasitic and free-living organisms.ResultsWe have developed Reptile http://reptile.unibe.ch, a program for proteome-wide probabilistic description of perfect repeats in proteins. Parasite proteomes exhibited a large variance regarding the proportion of repeat-containing proteins. Interestingly, there was a good correlation between the percentage of highly repetitive proteins and mean protein length in parasite proteomes, but not at all in the proteomes of free-living eukaryotes. Reptile combined with programs for the prediction of transmembrane domains and GPI-anchoring resulted in an effective tool for in silico identification of potential surface antigens and virulence factors from parasites.ConclusionSystemic surveys for perfect amino acid repeats allowed basic comparisons between free-living and parasitic organisms that were directly applicable to predict proteins of serological and parasitological importance. An on-line tool is available at http://genomics.unibe.ch/dora.

[1]  D. Eisenberg,et al.  A census of protein repeats. , 1999, Journal of molecular biology.

[2]  A. Masuda,et al.  DNA cloning of Plasmodium falciparum circumsporozoite gene: amino acid sequence of repetitive epitope. , 1984, Science.

[3]  Liisa Holm,et al.  Rapid automatic detection and alignment of repeats in protein sequences , 2000, Proteins.

[4]  C. Suquet,et al.  Parasite defense mechanisms for evasion of host attack; a review. , 1987, Veterinary parasitology.

[5]  E. Handman,et al.  Leucine-rich repeats in host-pathogen interactions. , 2004, Archivum immunologiae et therapiae experimentalis.

[6]  William R Taylor,et al.  Toward the detection and validation of repeats in protein structure , 2004, Proteins.

[7]  T. Ilg Proteophosphoglycans of Leishmania. , 2000, Parasitology today.

[8]  C. Ponting,et al.  Protein repeats: structures, functions, and evolution. , 2001, Journal of structural biology.

[9]  Dinesh Gupta,et al.  ProtRepeatsDB: a database of amino acid repeats in genomes , 2006, BMC Bioinformatics.

[10]  Fabienne Thomarat,et al.  Genome sequence and gene compaction of the eukaryote parasite Encephalitozoon cuniculi , 2001, Nature.

[11]  Rolf Apweiler,et al.  The Integr8 project - a resource for genomic and proteomic data , 2004, Silico Biol..

[12]  C. Ponting,et al.  Homology-based method for identification of protein repeats using statistical significance estimates. , 2000, Journal of molecular biology.

[13]  R. Hernández-Pando,et al.  The PGRS domain of Mycobacterium tuberculosis PE_PGRS Rv1759c antigen is an efficient subunit vaccine to prevent reactivation in a murine model of chronic tuberculosis. , 2007, Vaccine.

[14]  P. Monaghan,et al.  EtMIC4: a microneme protein from Eimeria tenella that contains tandem arrays of epidermal growth factor-like repeats and thrombospondin type-I repeats. , 2001, International journal for parasitology.

[15]  Mann-Whitney Test , 1987 .

[16]  S. Makhzami,et al.  Enterococcal Leucine-Rich Repeat-Containing Protein Involved in Virulence and Host Inflammatory Response , 2007, Infection and Immunity.

[17]  P. T. Englund,et al.  Multiple procyclin isoforms are expressed differentially during the development of insect forms of Trypanosoma brucei. , 2001, Journal of molecular biology.

[18]  Srinivasan Ramachandran,et al.  MAAP: Malarial adhesins and adhesin‐like proteins predictor , 2008, Proteins.

[19]  Gregory S. Douglas,et al.  Quantification of Hydrocarbon Biodegradation Using Internal Markers , 2005 .

[20]  L. Toledo-Pereyra Trust , 2006, Mediation Behaviour.

[21]  N. Day,et al.  Virulent Combinations of Adhesin and Toxin Genes in Natural Populations of Staphylococcus aureus , 2002, Infection and Immunity.

[22]  L. Rénia,et al.  The vaccine is dead--long live the vaccine. , 2007, Trends in parasitology.

[23]  Jaap Heringa,et al.  Tracking repeats using significance and transitivity , 2004, ISMB/ECCB.

[24]  S. Hoffman,et al.  Diagnosis of malaria by detection of Plasmodium falciparum HRP-2 antigen with a rapid dipstick antigen-capture assay , 1994, The Lancet.

[25]  Niklaus Fankhauser,et al.  Identification of GPI anchor attachment signals by a Kohonen self-organizing map , 2005, Bioinform..

[26]  Titu Andreescu,et al.  Inclusion-Exclusion Principle , 2004 .

[27]  J. Jensen,et al.  The gene product of the Plasmodium falciparum 11.1 locus is a protein larger than one megadalton. , 1990, Molecular and biochemical parasitology.

[28]  Sean R. Eddy,et al.  Multiple Alignment Using Hidden Markov Models , 1995, ISMB.

[29]  Karl Popper,et al.  The REPRO server : finding protein internal sequence repeats through the Web , 2000 .

[30]  A. Krogh,et al.  A combined transmembrane topology and signal peptide prediction method. , 2004, Journal of molecular biology.

[31]  U. Gophna,et al.  The formation of Escherichia coli curli amyloid fibrils is mediated by prion-like peptide repeats. , 2005, Journal of molecular biology.

[32]  J. de la Fuente,et al.  Adhesion of outer membrane proteins containing tandem repeats of Anaplasma and Ehrlichia species (Rickettsiales: Anaplasmataceae) to tick cells. , 2004, Veterinary microbiology.

[33]  M. V. Katti,et al.  Amino acid repeat patterns in protein sequences: Their diversity and structural‐functional implications , 2000, Protein science : a publication of the Protein Society.

[34]  M. Carrington,et al.  Expression of a polypeptide containing a dipeptide repeat is confined to the insect stage of Trypanosoma brucei , 1987, Nature.

[35]  E. Marcotte,et al.  A fast algorithm for genome‐wide analysis of proteins with repeated sequences , 1999, Proteins.

[36]  B. Eikmanns,et al.  The Surface Protein Srr-1 of Streptococcus agalactiae Binds Human Keratin 4 and Promotes Adherence to Epithelial HEp-2 Cells , 2007, Infection and Immunity.

[37]  Martine Guillerm,et al.  Neglected tests for neglected patients , 2006, Nature.

[38]  Markus Gruber,et al.  REPPER—repeats and their periodicities in fibrous proteins , 2005, Nucleic Acids Res..

[39]  Daniel P. Depledge,et al.  RepSeq – A database of amino acid repeats present in lower eukaryotic pathogens , 2007 .