Application of learning techniques to splicing site recognition.

Most genes of eukaryotic genomes are disrupted by introns. The application of a learning technique which uses both statistic and syntactic analysis lead to the establishment of logical rules enabling the recognition of intron/exon junctions between uncoding and coding sequences. The rules were tested on rat actin gene sequences containing some or all of the introns and 50 exon nucleotides on either side of the intron. The results show good recognition of the excision site. This recognition is more ambiguous when the sequence is short; for the acceptor sequence it presents a good selection. The learning achieved with both the donor and acceptor sequence does not lead to recognition. This result indicates that it is not the relationship between donor and acceptor sites in the same intron which determines sequence selection or the splicing mechanism.

[1]  Stephen M. Mount,et al.  A catalogue of splice junction sequences. , 1982, Nucleic acids research.

[2]  Michael R. Green,et al.  Excision of an intact intron as a novel lariat structure during pre-mRNA splicing in vitro , 1984, Cell.

[3]  D. Givol,et al.  Nucleotide sequence of the rat skeletal muscle actin gene , 1982, Nature.

[4]  P. Gruss,et al.  Rescue of a splicing defective mutant by insertion of an heterologous intron , 1980, Nature.

[5]  Stephen M. Mount,et al.  The U1 small nuclear RNA-protein complex selectively binds a 5′ splice site in vitro , 1983, Cell.

[6]  Stanley N Cohen,et al.  Structure and genomic organization of the mouse dihydrofolate reductase gene , 1980, Cell.

[7]  B. Wieringa,et al.  A minimal intron length but no specific internal sequence is required for splicing the large rabbit β-globin intron , 1984, Cell.

[8]  F. Baas,et al.  The human thyroglobulin gene contains two 15-17 kb introns near its 3'-end. , 1983, Nucleic acids research.

[9]  E. Lewis,et al.  Molecular Genetics of the Bithorax Complex in Drosophila melanogaster , 1983, Science.

[10]  J. Abelson RNA processing and the intervening sequence problem. , 1979, Annual review of biochemistry.

[11]  L. Grivell,et al.  One gene's intron is another gene's exon , 1981, Nature.

[12]  R. Lührmann,et al.  The 5′ terminus of the RNA moiety of U1 small nuclear ribonucleoprotein particles is required for the splicing of messenger RNA precursors , 1984, Cell.

[13]  Tom Maniatis,et al.  Specific transcription and RNA splicing defects in five cloned β-thalassaemia genes , 1983, Nature.