Localization of the initiation of translation in messenger RNAs of prokaryotes by learning techniques.

Learning processes are applied to the recognition of protein-coding regions in prokaryotes. Non-contradictory statistical and logical rules are deduced from a set of known examples of coding sequences. These rules make it possible to build characteristic patterns on the mRNA upstream of the initiation codon. The rules are successfully applied to recognize more than 180 coding sequences and to detect and/or eliminate hypothetical reading frames or unknown genes.
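To make the idea of deducing statistical rules from known examples more concrete, here is a minimal Python sketch. It is not the method of the paper, whose rules are not given in this abstract. It swaps in a simple position-specific base-frequency profile with log-odds scoring, learned from the region upstream of known initiation codons. The function names, the window width, the uniform background of 0.25 per base, the add-one smoothing, and the set of start codons are all assumptions made for illustration.

```python
import math
from collections import Counter

# Assumption: start codons considered by the scan (illustrative only).
START_CODONS = {"AUG", "GUG", "UUG"}
BASES = "ACGU"


def learn_profile(examples, width):
    """Learn per-position base frequencies upstream of known start codons.

    examples: list of (mrna, start_index) pairs, where start_index is the
    position of the first base of the initiation codon (hypothetical format).
    Returns a list of dicts, one per upstream position, with smoothed
    base probabilities.
    """
    counts = [Counter() for _ in range(width)]
    used = 0
    for mrna, start in examples:
        if start < width:
            continue  # not enough upstream sequence for a full window
        window = mrna[start - width:start]
        for pos, base in enumerate(window):
            counts[pos][base] += 1
        used += 1
    # Add-one smoothing so unseen bases do not get probability zero.
    return [{b: (c[b] + 1) / (used + len(BASES)) for b in BASES} for c in counts]


def score(profile, window):
    """Log-odds of the window under the profile versus a uniform background."""
    # Symbols outside ACGU are treated as neutral (contribute 0).
    return sum(math.log(p.get(b, 0.25) / 0.25) for p, b in zip(profile, window))


def candidate_starts(mrna, profile, threshold=0.0):
    """Return (index, score) for start codons whose upstream window passes."""
    width = len(profile)
    hits = []
    for i in range(width, len(mrna) - 2):
        if mrna[i:i + 3] in START_CODONS:
            s = score(profile, mrna[i - width:i])
            if s >= threshold:
                hits.append((i, s))
    return hits
```

In use, the profile would be learned from a set of annotated sequences and then applied to candidate reading frames, keeping only the start codons whose upstream region resembles the learned pattern. The threshold is a free parameter and would need to be tuned on real data.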
