The graphical representation of protein sequences based on the physicochemical properties and its applications

Based on the chaos game representation, a 2D graphical representation of protein sequences is introduced in which the 20 amino acids are arranged in a cyclic order according to their physicochemical properties. The Euclidean distances between corresponding amino acids in the 2D graphical representations of two proteins are computed to find matching (or conserved) fragments between them. A cumulative distance between 2D graphical representations is then defined to compare the similarity of proteins. An examination of the similarity among the ND5 protein sequences of nine species illustrates the utility of our approach. © 2010 Wiley Periodicals, Inc. J Comput Chem, 2010
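
The pipeline described in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract gives neither the cyclic order of the amino acids nor the exact definition of the cumulative distance, so `CYCLIC_ORDER` is a placeholder (plain alphabetical order), the midpoint rule is the conventional chaos game step, and the cumulative distance is assumed to be the sum of position-wise Euclidean distances over the shorter sequence. All function names are hypothetical.

```python
import math

# Placeholder cyclic order (alphabetical). The paper orders the 20 amino
# acids by physicochemical properties; that ordering is not given in the
# abstract, so this one is an assumption for illustration only.
CYCLIC_ORDER = "ACDEFGHIKLMNPQRSTVWY"


def vertices(order=CYCLIC_ORDER):
    """Place each amino acid at a vertex of a regular polygon on the unit circle."""
    n = len(order)
    return {
        aa: (math.cos(2 * math.pi * i / n), math.sin(2 * math.pi * i / n))
        for i, aa in enumerate(order)
    }


def cgr_points(seq, order=CYCLIC_ORDER):
    """Chaos game walk: start at the origin and move halfway to each residue's vertex."""
    verts = vertices(order)
    x, y = 0.0, 0.0
    points = []
    for aa in seq:
        vx, vy = verts[aa]
        x, y = (x + vx) / 2, (y + vy) / 2
        points.append((x, y))
    return points


def pointwise_distances(seq_a, seq_b, order=CYCLIC_ORDER):
    """Euclidean distance between corresponding points of the two 2D curves.

    Positions beyond the end of the shorter sequence are ignored (assumption).
    """
    pa, pb = cgr_points(seq_a, order), cgr_points(seq_b, order)
    return [math.hypot(p[0] - q[0], p[1] - q[1]) for p, q in zip(pa, pb)]


def cumulative_distance(seq_a, seq_b, order=CYCLIC_ORDER):
    """Assumed similarity score: sum of the position-wise distances."""
    return sum(pointwise_distances(seq_a, seq_b, order))
```

In this sketch, runs of near-zero values in `pointwise_distances` would mark candidate conserved fragments, and a smaller `cumulative_distance` would indicate greater similarity between the two sequences.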
