An Algorithm for Gene Prediction Based on the Z Curve

The methods based on digital signal processing have been applied to analyze and identify the genes in a DNA sequence. At first, we discussed the Voss mapping and the Z curve representation which can map the DNA alphabetic sequence into the digital sequences. The mathematic expression and biological meaning of the two mappings were studied. This scheme was based on the Z curve representation because of its clear biological meaning. According to period-3 property of protein coding regions, a kind of adaptive filter was proposed to predict the exons in a DNA sequence. The prediction curve of the exons was obtained with the Recursive Least Squares (RLS) adaptive algorithm. It is shown that the RLS algorithm is better than the existing multistage filter for gene prediction by comparing the simulation curves.

[1]  M. Yan,et al.  A new fourier transform approach for protein coding measure based on the format of the Z curve , 1998, Bioinform..

[2]  D. Falconer,et al.  Steady-state behavior of RLS adaptive algorithms , 1985, ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[3]  Dimitris Anastassiou,et al.  Genomic signal processing , 2001, IEEE Signal Process. Mag..

[4]  David D. Falconer,et al.  Tracking properties and steady-state performance of RLS adaptive filter algorithms , 1986, IEEE Trans. Acoust. Speech Signal Process..

[5]  E. Trifonov,et al.  The pitch of chromatin DNA is reflected in its nucleotide sequence. , 1980, Proceedings of the National Academy of Sciences of the United States of America.

[6]  A. Antoniou,et al.  Application of parametric window functions to the STDFT method for gene prediction , 2005, PACRIM. 2005 IEEE Pacific Rim Conference on Communications, Computers and signal Processing, 2005..

[7]  R. Voss,et al.  Evolution of long-range fractal correlations and 1/f noise in DNA base sequences. , 1992, Physical review letters.

[8]  P. Vaidyanathan Genomics and proteomics: a signal processor's tour , 2004, IEEE Circuits and Systems Magazine.

[9]  Mahmood Akhtar,et al.  Gene and exon prediction using time domain algorithms , 2005, Proceedings of the Eighth International Symposium on Signal Processing and Its Applications, 2005..