Advanced DNA Mapping Schemes for Exon Prediction Using Digital Filters

Genomic signal processing (GSP) is the engineering area concerned with genomic data analysis using digital signal processing techniques by conversion of the genomic sequence into numerical one as a first step. One of the central issues in GSP is maximizing the accuracy of protein coding region prediction in a given DNA sequence. In this study advanced DNA numerical representations (genetic code context, 2-bit binary and EIIP) were compared in terms of their sensitivity, specificity and correlation coefficient for maximizing the accuracy of the prediction of protein coding region. Digital filters based technique has been applied to extract the period 3 components and removing the undesired noise from the DNA sequence. Results from implementation of the technique on 8 human genes showed that the 2-bit binary representation scheme associated with the used filtering technique has the maximum accuracy compared to the other tested schemes. These findings suggests that the 2-bit binary representation scheme greatly enhances the prediction accuracy of the protein coding region using digital filters opening a new avenue to use this scheme in different applications.

[1]  R. Guigó,et al.  Evaluation of gene structure prediction programs. , 1996, Genomics.

[2]  Mai S. Mabrouk,et al.  A Study of the Potential of EIIP Mapping Method in Exon Prediction Using the Frequency Domain Techniques , 2012 .

[3]  Alan K. Mackworth,et al.  Evaluation of gene-finding programs on mammalian sequences. , 2001, Genome research.

[4]  D. Goldsack,et al.  Contribution of the free energy of mixing of hydrophobic side chains to the stability of the tertiary structure of proteins. , 1973, Journal of theoretical biology.

[5]  Vasile Palade,et al.  A neural network based multi-classifier system for gene identification in DNA sequences , 2004, Neural Computing & Applications.

[6]  Leonidas D. Iasemidis,et al.  Autoregressive Modeling and Feature Analysis of DNA Sequences , 2004, EURASIP J. Adv. Signal Process..

[7]  I. Cosic Macromolecular bioactivity: is it resonant interaction between macromolecules?-theory and applications , 1994, IEEE Transactions on Biomedical Engineering.

[8]  J. Lazovic,et al.  Selection of amino acid parameters for Fourier transform-based analysis of proteins , 1996, Comput. Appl. Biosci..

[9]  Dimitris Anastassiou,et al.  Genomic signal processing , 2001, IEEE Signal Process. Mag..

[10]  Mahmood Akhtar,et al.  Signal Processing in Sequence Analysis: Advances in Eukaryotic Gene Prediction , 2008, IEEE Journal of Selected Topics in Signal Processing.

[11]  Andreas Antoniou,et al.  Identification of Hot-Spot Locations in Proteins Using Digital Filters , 2008, IEEE Journal of Selected Topics in Signal Processing.

[12]  P. Argos,et al.  Structural prediction of membrane-bound proteins. , 2005, European journal of biochemistry.

[13]  M InbamalarT,et al.  Study of DNA Sequence Analysis Using DSP Techniques , 2013 .

[14]  A. Antoniou Digital Signal Processing: Signals, Systems, and Filters , 2005 .

[15]  Andreas Antoniou,et al.  Location of exons in DNA sequences using digital filters , 2009, 2009 IEEE International Symposium on Circuits and Systems.

[16]  Mohammed Abo-Zahhad,et al.  Integrated Model of DNA Sequence Numerical Representation and Artificial Neural Network for Human Donor and Acceptor Sites Prediction , 2014 .

[17]  Hon Keung Kwan,et al.  Numerical representation of DNA sequences , 2009, 2009 IEEE International Conference on Electro/Information Technology.

[18]  Changchuan Yin,et al.  Numerical representation of DNA sequences based on genetic code context and its applications in periodicity analysis of genomes , 2008, 2008 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology.

[19]  Bruce Alberts,et al.  Essential Cell Biology , 1983 .

[20]  H. Saberkari,et al.  Prediction of protein coding regions in DNA sequences using signal processing methods , 2012, 2012 IEEE Symposium on Industrial Electronics and Applications.

[21]  P.P. Vaidyanathan,et al.  Digital filters for gene prediction applications , 2002, Conference Record of the Thirty-Sixth Asilomar Conference on Signals, Systems and Computers, 2002..