N-W Algorithm and ANFIS Modeling on Alignment Similarity of Triplex Capsid Protein of Human Herpes Simplex Virus

Optimal sequence similarity of triplex capsid proteins of human herpes simplex virus (HHV) is a complex bioinformatics problem, which is controlled by alignment algorithms, substitution matrix, gap penalty and gap extension. A precise choice of mutation matrix is required to optimal the alignment similarity and appropriate computational approach required for similarity search. The present paper uses Adaptive Neuro-Fuzzy Inference System (ANFIS) approach to model and simulate the alignment similarity for PAM and Blosum substitution matrices. Mutation matrix and sequences of HHV-I and HHV-II were taken as model’s input parameters. The model is the combination of fuzzy inference, artificial neural network, and set of fuzzy rules has been developed directly from computational analysis using N-W algorithm. The proposed modeling approach is verified by comparing the expected results with the observed practical results obtained by computational analysis under specific conditions. The application of ANFIS test shows that the substitution matrix predicted by a proposed model is fully in agreement with the experimental values at 0.5% level of significance.

[1]  Ying Huang,et al.  Prediction of protein subcellular locations using fuzzy k-NN method , 2004, Bioinform..

[2]  S. Katebi,et al.  Protein Superfamily Classification Using Fuzzy Rule-Based Classifier , 2009, IEEE Transactions on NanoBioscience.

[3]  Shoudan Liang,et al.  cWINNOWER algorithm for finding fuzzy DNA motifs , 2003, Computational Systems Bioinformatics. CSB2003. Proceedings of the 2003 IEEE Bioinformatics Conference. CSB2003.

[4]  Maximilian Schlosshauer,et al.  A novel approach to local reliability of sequence alignments , 2002, Bioinform..

[5]  Juan J. Nieto,et al.  The fuzzy polynucleotide space: basic properties , 2003, Bioinform..

[6]  Irena Roterman-Konieczna,et al.  Prediction of Functional Sites Based on the Fuzzy Oil Drop Model , 2007, PLoS Comput. Biol..

[7]  Wei Zhang,et al.  A new validity measure for a correlation-based fuzzy c-means clustering algorithm , 2009, 2009 Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[8]  Francisco-Javier Lopez,et al.  FISim: A new similarity measure between transcription factor binding sites based on the fuzzy integral , 2009, BMC Bioinformatics.

[9]  Mattias Ohlsson,et al.  Matching protein structures with fuzzy alignments , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[10]  Shiow-Fen Hwang,et al.  Accurate prediction of enzyme subfamily class using an adaptive fuzzy k-nearest neighbor method , 2007, Biosyst..

[11]  Liisa Holm,et al.  Sensitive pattern discovery with 'fuzzy' alignments of distantly related proteins , 2003, ISMB.

[12]  Nicos Maglaveras,et al.  Fuzzy Hidden Markov Models: A New Approach In Multiple Sequence Alignment , 2006, MIE.

[13]  Kuo-Chen Chou,et al.  Using supervised fuzzy clustering to predict protein structural classes. , 2005, Biochemical and biophysical research communications.

[14]  Marc A. Martí-Renom,et al.  Prediction of enzyme function by combining sequence similarity and protein interactions , 2008, BMC Bioinformatics.

[15]  Chris Upton,et al.  Base-By-Base: Single nucleotide-level analysis of whole viral genome alignments , 2004, BMC Bioinformatics.

[16]  Joarder Kamruzzaman,et al.  Hidden Markov Models Incorporating Fuzzy Measures and Integrals for Protein Sequence Identification and Alignment , 2008, Genom. Proteom. Bioinform..

[17]  Enrique Querol,et al.  Prediction of Protein Function Improving Sequence Remote Alignment Search by a Fuzzy Logic Algorithm , 2008, The protein journal.

[18]  James M. Keller,et al.  Fuzzy Measures on the Gene Ontology for Gene Product Similarity , 2006, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[19]  Andrew D Fernandes,et al.  Sequence signatures and the probabilistic identification of proteins in the Myc-Max-Mad network. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[20]  Mireille Régnier,et al.  Short fuzzy tandem repeats in genomic sequences, identification, and possible role in regulation of gene expression , 2006, Bioinform..

[21]  Thomas Bäck,et al.  Characteristic amino acid combinations in olfactory G protein‐coupled receptors , 2007, Proteins.

[22]  Nicos Maglaveras,et al.  Enhancing the Quality of Phylogenetic Analysis using Fuzzy Hidden Markov Model Alignments , 2007, MedInfo.

[23]  K. N. Ramachandran Nair,et al.  A fuzzy guided genetic algorithm for operon prediction , 2005, Bioinform..