Messenger RNA processing sites in Trypanosoma brucei.

In Kinetoplastids, protein-coding genes are transcribed polycistronically by RNA polymerase II. Individual mature mRNAs are generated from polycistronic precursors by 5' trans splicing of a 39-nt capped leader RNA and 3' polyadenylation. It was previously known that trans splicing generally occurs at an AG dinucleotide downstream of a polypyrimidine tract, and that polyadenylation is coupled to downstream trans splicing. The few polyadenylation sites that had been examined were 100-400 nt upstream of the polypyrimidine tract which marked the adjacent trans splice site. We wished to define the sequence requirements for trypanosome mRNA processing more tightly and to generate a predictive algorithm. By scanning all available Trypanosoma brucei cDNAs for splicing and polyadenylation sites, we found that trans splicing generally occurs at the first AG following a polypyrimidine tract of 8-25 nt, giving rise to 5'-UTRs of a median length of 68 nt. We also found that in general, polyadenylation occurs at a position with one or more A residues located between 80 and 140 nt from the downstream polypyrimidine tract. These data were used to calibrate free parameters in a grammar model with distance constraints, enabling prediction of polyadenylation and trans splice sites for most protein-coding genes in the trypanosome genome. The data from the genome analysis and the program are available from: .

[1]  G. Church,et al.  Finding DNA regulatory motifs within unaligned noncoding sequences clustered by whole-genome mRNA quantitation , 1998, Nature Biotechnology.

[2]  E. Ullu,et al.  Temporal order of RNA-processing reactions in trypanosomes , 1993 .

[3]  Matthew R. Pocock,et al.  The Bioperl toolkit: Perl modules for the life sciences. , 2002, Genome research.

[4]  Shulamit Michaeli,et al.  MINIREVIEWS trans and cis Splicing in Trypanosomatids : Mechanism , Factors , and Regulation , 2003 .

[5]  K. Matthews,et al.  tbCPSF30 Depletion by RNA Interference Disrupts Polycistronic RNA Processing in Trypanosoma brucei* , 2003, Journal of Biological Chemistry.

[6]  E. Ullu,et al.  Polygene transcripts are precursors to calmodulin mRNAs in trypanosomes. , 1988, The EMBO journal.

[7]  C. Clayton,et al.  Characterisation of the growth and differentiation in vivo and in vitro-of bloodstream-form Trypanosoma brucei strain TREU 927. , 2001, Molecular and biochemical parasitology.

[8]  S. Sather,et al.  Identification of a small rna containing the trypanosome spliced leader: A donor of shared 5′ sequences of trypanosomatid mRNAs? , 1984, Cell.

[9]  S. Beverley,et al.  Coupling of poly(A) site selection and trans-splicing in Leishmania. , 1993, Genes & development.

[10]  B. Séraphin,et al.  The spliceosomal snRNP core complex of Trypanosoma brucei: cloning and functional analysis reveals seven Sm protein constituents. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[11]  P. Borst,et al.  Post-transcriptional control of the differential expression of phosphoglycerate kinase genes in Trypanosoma brucei. , 1988, Journal of molecular biology.

[12]  G. Crooks,et al.  WebLogo: a sequence logo generator. , 2004, Genome research.

[13]  C. Clayton,et al.  Degradation of the unstable EP1 mRNA in Trypanosoma brucei involves initial destruction of the 3'-untranslated region. , 2001, Nucleic acids research.

[14]  J. Donelson,et al.  Post-transcriptional Elements Regulating Expression of mRNAs from the Amastin/Tuzin Gene Cluster of Trypanosoma cruzi(*) , 1995, The Journal of Biological Chemistry.

[15]  E. Patzelt,et al.  Mapping of branch sites in trans-spliced pre-mRNAs of Trypanosoma brucei , 1989, Molecular and cellular biology.

[16]  Kenneth Stuart,et al.  Transcription of Leishmania major Friedlin chromosome 1 initiates in both directions within a single region. , 2003, Molecular cell.

[17]  Larry Wall,et al.  Programming Perl , 1991 .

[18]  R Braun,et al.  Control of polyadenylation and alternative splicing of transcripts from adjacent genes in a procyclin expression site: a dual role for polypyrimidine tracts in trypanosomes? , 1994, Nucleic acids research.

[19]  C. Clayton,et al.  Effect of multiple downstream splice sites on polyadenylation in Trypanosoma brucei. , 1998, Molecular and biochemical parasitology.

[20]  D. Sherman,et al.  A possible role for the 3'-untranslated region in developmental regulation in Trypanosoma brucei. , 1993, Molecular and biochemical parasitology.

[21]  N. Agabian,et al.  Trypanosoma brucei spliced-leader RNA methylations are required for trans splicing in vivo , 1992, Molecular and cellular biology.

[22]  K. P. Watkins,et al.  Trypanosome mRNAs have unusual "cap 4" structures acquired by addition of a spliced leader. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[23]  B. Barrell,et al.  The DNA sequence of chromosome I of an African trypanosome: gene content, chromosome organisation, recombination and polymorphism. , 2003, Nucleic acids research.

[24]  M. L. Muhich,et al.  Heat-shock disruption of trans-splicing in trypanosomes: effect on Hsp70, Hsp85 and tubulin mRNA synthesis. , 1989, Gene.

[25]  G. Cross,et al.  Discontinuously synthesized mRNA from Trypanosoma brucei contains the highly methylated 5' cap structure, m7GpppA*A*C(2'-O)mU*A. , 1988, The Journal of biological chemistry.

[26]  R Braun,et al.  Accurate polyadenylation of procyclin mRNAs in Trypanosoma brucei is determined by pyrimidine-rich elements in the intergenic regions , 1994, Molecular and cellular biology.

[27]  William Arbuthnot Sir Lane,et al.  Biochemical and functional characterization of the cis-spliceosomal U1 small nuclear RNP from Trypanosoma brucei. , 2002, Molecular and biochemical parasitology.

[28]  T. D. Schneider,et al.  Sequence logos: a new way to display consensus sequences. , 1990, Nucleic acids research.

[29]  P. Tebabi,et al.  Trypanosoma brucei: enrichment by UV of intergenic transcripts from the variable surface glycoprotein gene expression site , 1989, Molecular and cellular biology.

[30]  L. V. D. van der Ploeg,et al.  Requirement of a polypyrimidine tract for trans‐splicing in trypanosomes: discriminating the PARP promoter from the immediately adjacent 3′ splice acceptor site. , 1991, The EMBO journal.

[31]  C. Clayton,et al.  Tests of heterologous promoters and intergenic regions in Leishmania major. , 2000, Molecular and biochemical parasitology.

[32]  S. Sunkin,et al.  Leishmania major Friedlin chromosome 1 has an unusual distribution of protein-coding genes. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[33]  C. Clayton,et al.  Hierarchies of RNA-processing signals in a trypanosome surface antigen mRNA precursor , 1994, Molecular and cellular biology.

[34]  E. Ullu,et al.  Exonic Sequences in the 5′ Untranslated Region of α-Tubulin mRNA Modulate trans Splicing inTrypanosoma brucei , 1998, Molecular and Cellular Biology.

[35]  Steven Salzberg,et al.  The sequence and analysis of Trypanosoma brucei chromosome II. , 2003, Nucleic acids research.

[36]  M. G. Lee,et al.  RNA Polymerase I Transcribes Procyclin Genes and Variant Surface Glycoprotein Gene Expression Sites in Trypanosoma brucei , 2003, Eukaryotic Cell.

[37]  P. Myler,et al.  The unusual gene organization of Leishmania major chromosome 1 may reflect novel transcription processes. , 2000, Nucleic acids research.

[38]  E. Ullu,et al.  A common pyrimidine-rich motif governs trans-splicing and polyadenylation of tubulin polycistronic pre-mRNA in trypanosomes. , 1994, Genes & development.

[39]  E. Ullu,et al.  Trans splicing in trypanosomes requires methylation of the 5' end of the spliced leader RNA. , 1991, Proceedings of the National Academy of Sciences of the United States of America.

[40]  T. Gaasterland,et al.  An organism-specific method to rank predicted coding regions in Trypanosoma brucei. , 2003, Nucleic acids research.