Protein Sequence Alignment Based on Fuzzy Arithmetic and Genetic Algorithm

In this paper a novel way to construct pairwise alignment of protein sequence is proposed. Currently in protein sequence alignment the vital problem is having too many uncertain factors and causes significant data loss while using crisp data. For this important premise, fuzzy concept is introduced and fuzziness is implemented in the matrix for 250 point accepted mutations (PAMs) in sequence aligning, and integrated with the genetic algorithm (GA). The purpose for this implementation is to reduce the effects of uncertain factor, avoid making use of crisp values or weights resulting in significant data loss, and increase solution accuracy and method suitability. Experimental results of fuzzy matrix for 250 PAM can find more continuous and identical protein sequence after sequence alignment by GA.

[1]  Liisa Holm,et al.  Sensitive pattern discovery with 'fuzzy' alignments of distantly related proteins , 2003, ISMB.

[2]  M. Yasunaga,et al.  Aligning multiple protein sequences by parallel hybrid genetic algorithm. , 2002, Genome informatics. International Conference on Genome Informatics.

[3]  Thomas L. Madden,et al.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. , 1997, Nucleic acids research.

[4]  M. O. Dayhoff A model of evolutionary change in protein , 1978 .

[5]  Ying Huang,et al.  Prediction of protein subcellular locations using fuzzy k-NN method , 2004, Bioinform..

[6]  A. V. Aho,et al.  On Computing All Suboptimal Alignments 1 , 1997 .

[7]  Olivier Poch,et al.  BAliBASE: a benchmark alignment database for the evaluation of multiple alignment programs , 1999, Bioinform..

[8]  Moritoshi Yasunaga,et al.  A parallel hybrid genetic algorithm for multiple protein sequence alignment , 2002, Proceedings of the 2002 Congress on Evolutionary Computation. CEC'02 (Cat. No.02TH8600).

[9]  Didier Dubois,et al.  Fuzzy sets and systems ' . Theory and applications , 2007 .

[10]  S. B. Needleman,et al.  A general method applicable to the search for similarities in the amino acid sequence of two proteins. , 1970, Journal of molecular biology.

[11]  Juan J. Nieto,et al.  The fuzzy polynucleotide space: basic properties , 2003, Bioinform..

[12]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[13]  Lotfi A. Zadeh,et al.  Fuzzy Sets , 1996, Inf. Control..

[14]  S. Henikoff,et al.  Amino acid substitution matrices from protein blocks. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[15]  Kun-Mao Chao,et al.  Linear-space algorithms that build local alignments from fragments , 1995, Algorithmica.

[16]  S F Altschul,et al.  Weights for data related by a tree. , 1989, Journal of molecular biology.

[17]  Kun-Mao Chao,et al.  Recent Developments in Linear-Space Alignment Methods: A Survey , 1994, J. Comput. Biol..

[18]  W. Miller,et al.  A time-efficient, linear-space local similarity algorithm , 1991 .

[19]  Christus,et al.  A General Method Applicable to the Search for Similarities in the Amino Acid Sequence of Two Proteins , 2022 .

[20]  Juan J. Nieto,et al.  Midpoints for fuzzy sets and their application in medicine , 2003, Artif. Intell. Medicine.

[21]  Ping-Teng Chang,et al.  Fuzzy strategic replacement analysis , 2005, Eur. J. Oper. Res..

[22]  Etienne E. Kerre,et al.  Defuzzification: criteria and classification , 1999, Fuzzy Sets Syst..

[23]  Mattias Ohlsson,et al.  Matching protein structures with fuzzy alignments , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[24]  F. Rodriguez,et al.  Simulating complex traits influenced by genes with fuzzy-valued effects in pedigreed populations , 2003, Bioinform..

[25]  Madan M. Gupta,et al.  Fuzzy mathematical models in engineering and management science , 1988 .