Analysis of similarity/dissimilarity of protein sequences

On the basis of a selected pair of physicochemical properties of amino acids, we introduce a dynamic 2D graphical representation of protein sequences. We then propose two numerical characterizations of the resulting protein graphs, compare them as descriptors, and use them to analyze the similarity/dissimilarity of nine ND5 proteins. The approach is simple, convenient, and fast. Proteins 2008. © 2008 Wiley‐Liss, Inc.
