HMACA: Towards Proposing a Cellular Automata Based Tool for Protein Coding, Promoter Region Identification and Protein Structure Prediction

Human body consists of lot of cells, each cell consist of DeOxaRibo Nucleic Acid (DNA). Identifying the genes from the DNA sequences is a very difficult task. But identifying the coding regions is more complex task compared to the former. Identifying the protein which occupy little place in genes is a really challenging issue. For understating the genes coding region analysis plays an important role. Proteins are molecules with macro structure that are responsible for a wide range of vital biochemical functions, which includes acting as oxygen, cell signaling, antibody production, nutrient transport and building up muscle fibers. Promoter region identification and protein structure prediction has gained a remarkable attention in recent years. Even though there are some identification techniques addressing this problem, the approximate accuracy in identifying the promoter region is closely 68% to 72%. We have developed a Cellular Automata based tool build with hybrid multiple attractor cellular automata (HMACA) classifier for protein coding region, promoter region identification and protein structure prediction which predicts the protein and promoter regions with an accuracy of 76%. This tool also predicts the structure of protein with an accuracy of 80%.

[1]  R Abagyan,et al.  Homology modeling with internal coordinate mechanics: Deformation zone mapping and improvements of models via conformational search , 1997, Proteins.

[2]  Santanu Chattopadhyay,et al.  Highly regular, modular, and cascadable design of cellular automata-based pattern classifier , 2000, IEEE Trans. Very Large Scale Integr. Syst..

[3]  Tommaso Toffoli,et al.  Reversible Computing , 1980, ICALP.

[4]  N. Manolios,et al.  Identification and characterization of polymorphisms in the promoter region of the human Apo-1/Fas (CD95) gene. , 1997, Molecular immunology.

[5]  N. Abraham,et al.  Identification of binding sites for transcription factors NF-kappa B and AP-2 in the promoter region of the human heme oxygenase 1 gene. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[6]  Nicola Santoro,et al.  Convergence and aperiodicity in fuzzy cellular automata: Revisiting rule 90 , 1998 .

[7]  J. Barrett,et al.  Cloning and characterization of the promoter region of human telomerase reverse transcriptase gene. , 1999, Cancer research.

[8]  T. Arinami,et al.  Identification of a polymorphism in the promoter region of DRD4associated with the human novelty seeking personality trait , 2000, Molecular Psychiatry.

[9]  Robin Milner,et al.  On Observing Nondeterminism and Concurrency , 1980, ICALP.

[10]  J. Fickett Recognition of protein coding regions in DNA sequences. , 1982, Nucleic acids research.

[11]  B. Blaisdell,et al.  A prevalent persistent global nonrandomness that distinguishes coding and non-coding eucaryotic nuclear DNA sequences , 2006, Journal of Molecular Evolution.

[12]  G. Vichniac Simulating physics with cellular automata , 1984 .

[13]  E. Snyder,et al.  Identification of coding regions in genomic DNA sequences: an application of dynamic programming and neural networks. , 1993, Nucleic acids research.

[14]  S. Hamilton-Dutoit,et al.  Sequence analysis of the Epstein-Barr virus (EBV) latent membrane protein-1 gene and promoter region: identification of four variants among wild-type EBV isolates. , 1997, Blood.

[15]  E. Snyder,et al.  Identification of protein coding regions in genomic DNA. , 1995, Journal of molecular biology.

[16]  C. Langton Self-reproduction in cellular automata , 1984 .

[17]  Parimal Pal Chaudhuri,et al.  FMACA: A Fuzzy Cellular Automata Based Pattern Classifier , 2004, DASFAA.

[18]  Ramesh Babu,et al.  Identification of Promoter Region in Genomic DNA Using Cellular Automata Based Text Clustering , 2010, Int. Arab J. Inf. Technol..

[19]  C. Bauer,et al.  Analysis of the Rhodobacter capsulatus puf operon. Location of the oxygen-regulated promoter region and the identification of an additional puf-encoded gene. , 1988, The Journal of biological chemistry.

[20]  F. Ruddle,et al.  Use of a protein-blotting procedure and a specific DNA probe to identify nuclear proteins that recognize the promoter region of the transferrin receptor gene. , 1985, Proceedings of the National Academy of Sciences of the United States of America.

[21]  Martin G. Reese,et al.  Application of a Time-delay Neural Network to Promoter Annotation in the Drosophila Melanogaster Genome , 2001, Comput. Chem..