PQuad: visualization of predicted peptides and proteins

New high-throughput proteomic techniques generate data faster than biologists can analyze it. Hidden within this massive and complex data are answers to basic questions about how cells function. The data afford an opportunity to take a global or systems approach studying whole proteomes comprising all the proteins in an organism. However, the tremendous size and complexity of the high-throughput data make it difficult to process and interpret. Existing tools for studying a few proteins at a time are not suitable for global analysis. Visualization provides powerful analysis capabilities for enormous, complex data at multiple resolutions. We developed a novel interactive visualization tool, PQuad, for the visual analysis of proteins and peptides identified from high-throughput data on biological samples. PQuad depicts the peptides in the context of their source protein and DNA, thereby integrating proteomic and genomic information. A wrapped line metaphor is applied across key resolutions of the data, from a compressed view of an entire chromosome to the actual nucleotide sequence. PQuad provides a difference visualization for comparing peptides from samples prepared under different experimental conditions. We describe the requirements for such a visual analysis tool, the design decisions, and the novel aspects of PQuad.

[1]  David W. Speicher,et al.  Proteomics: an infinite problem with infinite potential. (Opinion) , 2002 .

[2]  P. Hegde,et al.  The Institute for Genomic Research , 1998, Current Biology.

[3]  H. Griffin,et al.  The European Bioinformatics Institute , 1995 .

[4]  Chris North,et al.  Evolving visual metaphors and dynamic tools for bioinformatics visualization , 2002, VIS '02.

[5]  Ben Shneiderman,et al.  Readings in information visualization - using vision to think , 1999 .

[6]  Kim Rutherford,et al.  Artemis: sequence visualization and annotation , 2000, Bioinform..

[7]  Ben Shneiderman,et al.  Image-Browser Taxonomy and Guidelines for Designers , 1995, IEEE Softw..

[8]  J. Yates,et al.  An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database , 1994, Journal of the American Society for Mass Spectrometry.

[9]  Jacob D. Jaffe,et al.  Proteogenomic mapping as a complementary method to perform genome annotation , 2004, Proteomics.

[10]  Mudita Singhal,et al.  Integrating Evolving Tools for Proteomics Research , 2004, METMBS.

[11]  J. Yates,et al.  Method to correlate tandem mass spectra of modified peptides to amino acid sequences in the protein database. , 1995, Analytical chemistry.

[12]  James T. Enns,et al.  Large Datasets at a Glance: Combining Textures and Colors in Scientific Visualization , 1999, IEEE Trans. Vis. Comput. Graph..

[13]  John T. Stasko,et al.  The information mural: a technique for displaying and navigating large information spaces , 1995, Proceedings of Visualization 1995 Conference.

[14]  Naren Ramakrishnan,et al.  The Emerging Landscape of Bioinformatics Software Systems , 2002, Computer.

[15]  Pak Chung Wong,et al.  Global Visualization and Alignments of Whole Bacterial Genomes , 2003, IEEE Trans. Vis. Comput. Graph..

[16]  J. Avise,et al.  Evolving genomic metaphors: a new look at the language of DNA. , 2001, Science.

[17]  Daniel B. Carr,et al.  Some visualization challenges , 2001 .

[18]  M. Wilm,et al.  Error-tolerant identification of peptides in sequence databases by peptide sequence tags. , 1994, Analytical chemistry.