Denoising the 3-Base Periodicity Walks of DNA Sequences in Gene Finding

A nonlinear tracking-differentiator is a one-input, two-output system that generates a smooth approximation of a measured signal together with an estimate of its derivative. In this work, the nonlinear tracking-differentiator is used to denoise the 3-base periodicity walks of DNA sequences and to generate their derivatives. Building on this, an improved gene finding algorithm is presented that exploits the 3-base periodicity of coding regions: the 3-base periodicity DNA walks are denoised and tracked with the nonlinear tracking-differentiator. Case studies demonstrate that the nonlinear tracking-differentiator is an effective method for improving the accuracy of the gene finding algorithm.
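To make the one-input, two-output idea concrete, the following is a minimal sketch of a second-order nonlinear tracking-differentiator in its classical bang-bang form, x1' = x2, x2' = -r sign(x1 - v + x2|x2|/(2r)), discretised with a plain Euler step. It is an illustration under stated assumptions, not the paper's implementation: the function name `tracking_differentiator`, the gain `r`, the step size `h`, and the Euler scheme are all chosen here for demonstration, and the abstract does not specify how the 3-base periodicity walk itself is constructed, so the input is treated as a generic sampled signal.

```python
import math


def tracking_differentiator(signal, r=10.0, h=0.01):
    """Hypothetical Euler-discretised nonlinear tracking-differentiator.

    signal : sequence of samples v[k] (e.g. a noisy walk signal)
    r      : assumed speed (gain) parameter; larger tracks faster
    h      : assumed sampling step
    Returns (tracked, derivative): the smoothed signal x1 and the
    derivative estimate x2, each with the same length as `signal`.
    """
    x1, x2 = float(signal[0]), 0.0  # start on the first sample, at rest
    tracked, derivative = [], []
    for v in signal:
        # Switching function of the time-optimal (bang-bang) law.
        a = x1 - v + x2 * abs(x2) / (2.0 * r)
        u = -r * math.copysign(1.0, a) if a != 0 else 0.0
        # Euler update of the two states using the previous x2.
        x1 += h * x2
        x2 += h * u
        tracked.append(x1)
        derivative.append(x2)
    return tracked, derivative
```

Calling `tracking_differentiator(walk, r, h)` on a sampled walk returns the tracked signal and its derivative estimate as two lists. Because the sign function is discretised directly, the derivative output chatters around its steady-state value with an amplitude on the order of `r * h`, so in practice `r` trades tracking speed against noise suppression.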
