A novel DNA mapping scheme for improved exon prediction using digital filters

It has become a challenging issue for the researchers to predict protein coding regions in context of Signal Processing and bioinformatics. This paper presents a novel mapping technique and a hybrid approach combining Gauss windowing and FFT techniques for improved gene prediction. Mapping scheme plays an important role in accuracy of exon prediction. Some mapping scheme works well in a particular gene set but may not perform better in some other gene predictions. The three base periodicity properties is used to identify the exons location. In addition to this a threshold formulation has been used to investigate the performance of the algorithm in terms of various parameters like accuracy, sensitivity, specificity, geometric-mean and execution time. Analysis is performed in both nucleotides as well as exon level detection. The performance of the proposed mapping is compared with numerous existing mappings and also tested on NCBI genes which prove the efficiency of the algorithm.

[1]  N. Rao,et al.  Detection of 3-periodicity for small genomic sequences based on AR technique , 2004, 2004 International Conference on Communications, Circuits and Systems (IEEE Cat. No.04EX914).

[2]  R. M. C. Junior,et al.  Identification of Protein Coding Regions Using the Modified Gabor-Wavelet Transform , 2008, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[3]  Rajiv Saxena,et al.  An Adaptive Window Length Strategy for Eukaryotic CDS Prediction , 2013, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[4]  M InbamalarT,et al.  Study of DNA Sequence Analysis Using DSP Techniques , 2013 .

[5]  Ganapati Panda,et al.  Identification of Protein-Coding Regions in DNA Sequences Using A Time-Frequency Filtering Approach , 2011, Genom. Proteom. Bioinform..

[6]  Andreas Antoniou,et al.  Filter-Based Methodology for the Location of Hot Spots in Proteins and Exons in DNA , 2012, IEEE Transactions on Biomedical Engineering.

[7]  S. Biswas PREDICTION OF CANCER CELL USING DIGITAL SIGNAL PROCESSING , 2011 .

[8]  Mai S. Mabrouk,et al.  Advanced DNA Mapping Schemes for Exon Prediction Using Digital Filters , 2016 .

[9]  Mohammed Abo-Zahhad,et al.  Genomic Analysis and Classification of Exon and Intron Sequences Using DNA Numerical Mapping Techniques , 2012 .

[10]  Stefan C. Kremer,et al.  Prediction of Protein Coding Regions Using a Wide-Range Wavelet Window Method , 2016, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[11]  Guangchen Liu,et al.  Identification of Protein Coding Regions in the Eukaryotic DNA Sequences Based on Marple Algorithm and Wavelet Packets Transform , 2014 .

[12]  E. Ambikairajah,et al.  Detection of period-3 behavior in genomic sequences using singular value decomposition , 2005, Proceedings of the IEEE Symposium on Emerging Technologies, 2005..

[13]  Mai S. Mabrouk,et al.  A Study of the Potential of EIIP Mapping Method in Exon Prediction Using the Frequency Domain Techniques , 2012 .

[14]  Soma Barman,et al.  Improved gene prediction by principal component analysis based autoregressive Yule-Walker method. , 2016, Gene.

[15]  Low Tang Jung,et al.  From DNA to protein: Why genetic code context of nucleotides for DNA signal processing? A review , 2017, Biomed. Signal Process. Control..

[16]  Ibrahim Türkoglu,et al.  A novel numerical mapping method based on entropy for digitizing DNA sequences , 2018, Neural Computing and Applications.

[17]  P. P. Vaidyanathan,et al.  The role of signal-processing concepts in genomics and proteomics , 2004, J. Frankl. Inst..

[18]  Dimitris Anastassiou,et al.  Genomic signal processing , 2001, IEEE Signal Process. Mag..

[19]  M. Sujatha,et al.  Analysis of Genomic and Proteomic Sequence Using Fir Filter , 2014 .

[20]  R. Voss,et al.  Evolution of long-range fractal correlations and 1/f noise in DNA base sequences. , 1992, Physical review letters.