Physics-based gene identification: proof of concept for Plasmodium falciparum

The ab initio prediction of new genes in eukaryotic genomes represents a difficult task, notably for the identification of complex split genes. A Physics-Based Gene Identification (PBGI) method was formulated recently (Yeramian, Gene, 255, 139-150, 151-168, 2000a,b) to address this problem, taking as a model the Plasmodium falciparum genome. Here, the predictive power of this method is put under experimental test for this genome. The presented results demonstrate the usefulness of the PBGI as a gene-identification tool for P. falciparum, notably for the discovery of new genes with no homology to known genes. Perspectives opened by this new method for other eukaryotic genomes are also mentioned.

[1]  Samuel Karlin,et al.  Genomics: Annotation of the Drosophila genome , 2001, Nature.

[2]  Steven L. Salzberg,et al.  Bioinformatics: Finding genes in Plasmodium falciparum , 2000, Nature.

[3]  E. Yeramian,et al.  Genes and the physics of the DNA double-helix. , 2000, Gene.

[4]  R. Gwilliam,et al.  The complete nucleotide sequence of chromosome 3 of Plasmodium falciparum , 1999, Nature.

[5]  Douglas Poland,et al.  Theory of helix-coil transitions in biopolymers , 1970 .

[6]  E V Koonin,et al.  Chromosome 2 sequence of the human malaria parasite Plasmodium falciparum. , 1998, Science.

[7]  E. Yeramian,et al.  The physics of DNA and the annotation of the Plasmodium falciparum genome. , 2000, Gene.

[8]  S. Salzberg,et al.  Interpolated Markov models for eukaryotic gene finding. , 1999, Genomics.

[9]  E. Yeramian,et al.  Complexity and Tractability. Statistical Mechanics of Helix-Coil Transitions in Circular DNA as a Model-Problem , 1994 .

[10]  D J Rawlings,et al.  A novel 40-kDa membrane-associated EF-hand calcium-binding protein in Plasmodium falciparum. , 1992, The Journal of biological chemistry.

[11]  Henri Buc,et al.  An optimal formulation of the matrix method in statistical mechanics of one‐dimensional interacting units: Efficient iterative algorithmic procedures , 1990 .

[12]  P. Claverie,et al.  Analysis of multiexponential functions without a hypothesis as to the number of components , 1987, Nature.

[13]  M. Fixman,et al.  Theory of DNA melting curves , 1977, Biopolymers.