Gene survey of the pathogenic protozoan Trypanosoma cruzi.

We have performed a survey of the active genes in the important human pathogen Trypanosoma cruzi by analyzing 5013 expressed sequence tags (ESTs) generated from a normalized epimastigote cDNA library. Clustering of all sequences resulted in 771 clusters, comprising 54% of the ESTs. In total, the ESTs corresponded to 3054 transcripts that might represent one-fourth of the total gene repertoire in T. cruzi. About 33% of the T. cruzi transcripts showed similarity to sequences in the public databases, and a large number of hitherto undiscovered genes predicted to be involved in transcription, cell cycle control, cell division, signal transduction, secretion, and metabolism were identified. More than 140 full-length gene sequences were derived from the ESTs. Comparisons with all open reading frames in yeast and in Caenorhabditis elegans showed that only 12% of the T. cruzi transcripts were shared among diverse eukaryotic organisms. Comparison with other kinetoplastid sequences identified 237 orthologous genes that are shared between these evolutionarily divergent organisms. The generated data are a useful resource for further studies of the biology of the parasite and for development of new means to combat Chagas' disease.

[1]  M. Soares,et al.  Normalization and subtraction: two approaches to facilitate gene discovery. , 1996, Genome research.

[2]  Venter Jc Identification of new human receptor and transporter genes by high throughput cDNA (EST) sequencing. , 1993 .

[3]  J. M. Requena,et al.  Genomic repetitive DNA elements of Trypanosoma cruzi. , 1996, Parasitology today.

[4]  U. Pettersson,et al.  Complete sequence of a 93.4-kb contig from chromosome 3 of Trypanosoma cruzi containing a strand-switch region. , 1998, Genome research.

[5]  A. Frasch,et al.  Comparison of the genes coding for the common 5' terminal sequence of messenger RNAs in three trypanosome species. , 1984, Nucleic Acids Research.

[6]  P. Green,et al.  Base-calling of automated sequencer traces using phred. I. Accuracy assessment. , 1998, Genome research.

[7]  M. Levin,et al.  A short interspersed repetitive element provides a new 3' acceptor site for trans-splicing in certain ribosomal P2 beta protein genes of Trypanosoma cruzi. , 1994, Molecular and biochemical parasitology.

[8]  R. Verdun,et al.  Gene Discovery through Expressed Sequence Tag Sequencing in Trypanosoma cruzi , 1998, Infection and Immunity.

[9]  Mark Dubnick,et al.  Btab - a Blast output parser , 1992, Comput. Appl. Biosci..

[10]  M. Soares,et al.  Construction of a Normalized cDNA Library for the Trypanosoma cruzi Genome Project , 1999, The Journal of eukaryotic microbiology.

[11]  Rolf Apweiler,et al.  The SWISS-PROT protein sequence data bank and its supplement TrEMBL , 1997, Nucleic Acids Res..

[12]  G C Overton,et al.  Gene discovery by EST sequencing in Toxoplasma gondii reveals sequences restricted to the Apicomplexa. , 1998, Genome research.

[13]  D. Paslier,et al.  The Trypanosoma cruzi genome initiative. , 1997, Parasitology today.

[14]  André Goffeau,et al.  The yeast genome directory. , 1997, Nature.

[15]  A. Frasch,et al.  The Trypanosoma cruzi Mucin Family Is Transcribed from Hundreds of Genes Having Hypervariable Regions* , 1998, The Journal of Biological Chemistry.

[16]  K Matsubara,et al.  Complementary DNA sequence (EST) collections and the expression information of the human genome , 1997, FEBS letters.

[17]  F. Bringaud,et al.  Conserved organization of genes in trypanosomatids. , 1998, Molecular and biochemical parasitology.

[18]  J. Blackwell Progress in the Leishmania genome project , 1997 .

[19]  S. Melville Parasite genome analysis. Genome research in Trypanosoma brucei: chromosome size polymorphism and its relevance to genome mapping and analysis. , 1997, Transactions of the Royal Society of Tropical Medicine and Hygiene.

[20]  P Green,et al.  Base-calling of automated sequencer traces using phred. II. Error probabilities. , 1998, Genome research.

[21]  T. Urményi,et al.  Identification of transcribed sequences (ESTs) in the Trypanosoma cruzi genome project. , 1997, Memorias do Instituto Oswaldo Cruz.