A Novel Visualization of DNA Sequences, Reflecting GC-Content

Significant progresses of visualization technology of DNA sequences have been made by solving visual effect problems of degeneracy, loss of information, difficulty in multidimensional space and difficulty for long DNA sequences. Different from traditional models focusing on visualization effect problems, we propose a novel visualization tool — GC-Curve, which not only solves all the visual effect problems mentioned above, but also can show the GC-content of a DNA. GC-content is an important feature of DNA, which is related to the stability of DNA, the density of genes, natural selection, mutational bias, etc. So the visualization reflecting GC-content has great potential in many applications. The applications of GC-Curve on similarity analysis, GC-content analysis, stability analysis and melting temperature prediction are presented. A software of GC-Curve is available at https://www.box.com/s/g872v3pq4kuz86sj5coq.

[1]  Marjan Vracko,et al.  Compact 2-D graphical representation of DNA , 2003 .

[2]  Xiao Sun,et al.  TN curve: A novel 3D graphical representation of DNA sequence based on trinucleotides and its applications , 2009, Journal of Theoretical Biology.

[3]  M. Frank-Kamenetskii,et al.  Base-stacking and base-pairing contributions into thermal stability of the DNA double helix , 2006, Nucleic acids research.

[4]  Zhao-Hui Qi,et al.  PN-curve: A 3D graphical representation of DNA sequences and their numerical characterization , 2007 .

[5]  Alexandru T Balaban,et al.  Graphical representation of proteins. , 2011, Chemical reviews.

[6]  H. J. Jeffrey Chaos game representation of gene structure. , 1990, Nucleic acids research.

[7]  Dorota Bielińska-Wa̧ż Four-component spectral representation of DNA sequences , 2009 .

[8]  M. A. GATES,et al.  Simpler DNA sequence representations , 1985, Nature.

[9]  Renfa Li,et al.  A group of 3D graphical representation of DNA sequences based on dual nucleotides , 2008 .

[10]  Milan Randic,et al.  A novel 2-D graphical representation of DNA sequences of low degeneracy , 2001 .

[11]  Xiao Sun,et al.  Analysis of Similarities/Dissimilarities of DNA Sequences Based on a Novel Graphical Representation , 2010 .

[12]  Yu-hua Yao,et al.  Analysis of similarity/dissimilarity of DNA sequences based on a 3-D graphical representation , 2005 .

[13]  J. Shaffer,et al.  Hybridization of synthetic oligodeoxyribonucleotides to ΦX 174 DNA: the effect of single base pair mismatch , 1979 .

[14]  XiaoChan Tang,et al.  On the similarity/dissimilarity of DNA sequences based on 4D graphical representation , 2010 .

[15]  Alan Wee-Chung Liew,et al.  DB-Curve: a novel 2D method of DNA sequence visualization and representation , 2003 .

[16]  Dejan Plavšić,et al.  Novel 2-D graphical representation of DNA sequences and their numerical characterization , 2003 .

[17]  Kequan Ding,et al.  Novel 4D numerical representation of DNA sequences , 2005 .

[18]  Amir Niknejad,et al.  DNA sequence representation without degeneracy. , 2003, Nucleic acids research.

[19]  EUGENE HAMORI,et al.  Novel DNA sequence representations , 1985, Nature.

[20]  Shankar Subramaniam,et al.  Classification studies based on a spectral representation of DNA. , 2010, Journal of theoretical biology.

[21]  R Zhang,et al.  Z curves, an intutive tool for visualizing and analyzing the DNA sequences. , 1994, Journal of biomolecular structure & dynamics.

[22]  Tao Song,et al.  ColorSquare: A Colorful Square Visualization of DNA Sequences , 2012 .

[23]  Qi Dai,et al.  Analysis of similarity/dissimilarity of DNA sequences based on a class of 2D graphical representation , 2008, J. Comput. Chem..

[24]  Matteo Fumagalli,et al.  Both selective and neutral processes drive GC content evolution in the human genome , 2008, BMC Evolutionary Biology.

[25]  Milan Randić,et al.  Graphical representations of DNA as 2-D map , 2004 .

[26]  Renfa Li,et al.  A 3D graphical representation of DNA sequence based on numerical coding method , 2010 .

[27]  Kequan Ding,et al.  On 2D graphical representation of DNA sequence of nondegeneracy , 2005 .

[28]  Bo Liao,et al.  A 3D graphical representation of DNA sequences and its application , 2006, Theor. Comput. Sci..

[29]  Zheng Zhang,et al.  Spectral representation of DNA sequences and its application , 2010, 2010 IEEE Fifth International Conference on Bio-Inspired Computing: Theories and Applications (BIC-TA).

[30]  Guohua Huang,et al.  H–L curve: A novel 2D graphical representation for DNA sequences , 2008 .

[31]  Zhongxi Mo,et al.  Three 3D graphical representations of DNA primary sequences based on the classifications of DNA bases and their applications , 2011, Journal of Theoretical Biology.

[32]  Yu-hua Yao,et al.  A class of new 2-D graphical representation of DNA sequences and their application , 2004 .

[33]  Dejan Plavšić,et al.  Four-color map representation of DNA or RNA sequences and their numerical characterization , 2005 .

[34]  Tianming Wang,et al.  PNN-curve: a new 2D graphical representation of DNA sequences and its application. , 2006, Journal of theoretical biology.

[35]  Ling Li,et al.  Using Huffman coding method to visualize and analyze DNA sequences , 2011, J. Comput. Chem..

[36]  Zhu-Jin Zhang DV-Curve: a novel intuitive tool for visualizing and analyzing DNA sequences , 2009, Bioinform..

[37]  Zhao-Hui Qi,et al.  New 3D graphical representation of DNA sequence based on dual nucleotides , 2007, Journal of Theoretical Biology.

[38]  Tianming Wang,et al.  New graphical representation of a DNA sequence based on the ordered dinucleotides and its application to sequence analysis , 2012 .

[39]  G. Bernardi,et al.  The vertebrate genome: isochores and evolution. , 1993, Molecular biology and evolution.

[40]  E. Hamori,et al.  H curves, a novel method of representation of nucleotide series especially suited for long DNA sequences. , 1983, The Journal of biological chemistry.

[41]  John A Birdsell,et al.  Integrating genomics, bioinformatics, and classical genetics to study the effects of recombination on genome evolution. , 2002, Molecular biology and evolution.

[42]  Zhao-Hui Qi,et al.  Novel 2D graphical representation of DNA sequence based on dual nucleotides , 2007 .

[43]  Ren Zhang,et al.  The Z curve database: a graphic representation of genome sequences , 2003, Bioinform..

[44]  Kequan Ding,et al.  A 4D representation of DNA sequences and its application , 2005 .