On 3-D Graphical Representation of DNA Primary Sequences and Their Numerical Characterization

In this article we (1) outline the construction of a 3-D "graphical" representation of DNA primary sequences, illustrated on a portion of the human beta globin gene; (2) describe a particular scheme that transforms the above 3-D spatial representation of DNA into a numerical matrix representation; (3) illustrate construction of matrix invariants for DNA sequences; and (4) suggest a data reduction based on statistical analysis of matrix invariants generated for DNA. Each of the four contributions represents a novel development that we hope will facilitate comparative studies of DNA and open new directions for representation and characterization of DNA primary sequences.

[1]  Milan Randić,et al.  On characterization of DNA primary sequences by a condensed matrix , 2000 .

[2]  Milan Randic,et al.  On Characterization of Chemical Structure , 1997, J. Chem. Inf. Comput. Sci..

[3]  Sonja Nikolic,et al.  The Detour Matrix in Chemistry , 1997, J. Chem. Inf. Comput. Sci..

[4]  M. Randic,et al.  Comparison of sequences as a method for evaluation of the molecular similarity , 1986, Journal of computational chemistry.

[5]  H. Coxeter Self-dual configurations and regular graphs , 1950 .

[6]  D. Lipman,et al.  Improved tools for biological sequence comparison. , 1988, Proceedings of the National Academy of Sciences of the United States of America.

[7]  A. Nandy,et al.  A new graphical representation and analysis of DNA sequence structure. I: Methodology and application to globin genes , 1994 .

[8]  E Hamori Graphic representation of long DNA sequences by the method of H curves--current results and future aspects. , 1989, BioTechniques.

[9]  Milan Randic,et al.  On Characterization of Cyclic Structures , 1997, J. Chem. Inf. Comput. Sci..

[10]  Temple F. Smith,et al.  Comparison of biosequences , 1981 .

[11]  M S Waterman,et al.  Identification of common molecular subsequences. , 1981, Journal of molecular biology.

[12]  Chandan Raychaudhury,et al.  Indexing Scheme and Similarity Measures for Macromolecular Sequences , 1999, J. Chem. Inf. Comput. Sci..

[13]  Milan Randić On structural ordering and branching of acyclic saturated hydrocarbons , 1998 .

[14]  Goran Krilov,et al.  ON A CHARACTERIZATION OF THE FOLDING OF PROTEINS , 1999 .

[15]  Tomaz Pisanski,et al.  On Numerical Characterization of Cyclicity , 2000, J. Chem. Inf. Comput. Sci..

[16]  Milan Randic,et al.  On the Similarity of DNA Primary Sequences , 2000, J. Chem. Inf. Comput. Sci..

[17]  A. Nandy GRAPHICAL ANALYSIS OF DNA SEQUENCE STRUCTURE : III. INDICATIONS OF EVOLUTIONARY DISTINCTIONS AND CHARACTERISTICS OF INTRONS AND EXONS , 1996 .

[18]  Michael S. Waterman,et al.  General methods of sequence comparison , 1984 .

[19]  H. Wiener Structural determination of paraffin boiling points. , 1947, Journal of the American Chemical Society.

[20]  D Sankoff,et al.  Matching sequences under deletion-insertion constraints. , 1972, Proceedings of the National Academy of Sciences of the United States of America.

[21]  Milan Randic Condensed Representation of DNA Primary Sequences , 2000, J. Chem. Inf. Comput. Sci..

[22]  Milan Randic,et al.  Distance/Distance Matrixes , 1994, J. Chem. Inf. Comput. Sci..