Visual analysis of gel-free proteome data

We present a visual exploration system supporting protein analysis when using gel-free data acquisition methods. The data to be analyzed is obtained by coupling liquid chromatography (LC) with mass spectrometry (MS). LC-MS data have the properties of being nonequidistantly distributed in the time dimension (measured by LC) and being scattered in the mass-to-charge ratio dimension (measured by MS). We describe a hierarchical data representation and visualization method for large LC-MS data. Based on this visualization, we have developed a tool that supports various data analysis steps. Our visual tool provides a global understanding of the data, intuitive detection and classification of experimental errors, and extensions to LC-MS/MS, LC/LC-MS, and LC/LC-MS/MS data analysis. Due to the presence of randomly occurring rare isotopes within the same protein molecule, several intensity peaks may be detected that all refer to the same peptide. We have developed methods to unite such intensity peaks. This deisotoping step is visually documented by our system, such that misclassification can be detected intuitively. For differential protein expression analysis, we compute and visualize the differences in protein amounts between experiments. In order to compute the differential expression, the experimental data need to be registered. For registration, we perform a nonrigid warping step based on landmarks. The landmarks can be assigned automatically using protein identification methods. We evaluate our methods by comparing protein analysis with and without our interactive visualization-based exploration tool.

[1]  J. Bernhardt,et al.  Dual channel imaging of two‐dimensional electropherograms in Bacillus subtilis , 1999, Electrophoresis.

[2]  J. van Helden,et al.  Interactive visualization and exploration of relationships between biological objects. , 2000, Trends in biotechnology.

[3]  D. Hochstrasser,et al.  From Proteins to Proteomes: Large Scale Protein Identification by Two-Dimensional Electrophoresis and Arnino Acid Analysis , 1996, Bio/Technology.

[4]  Mark A. Duchaineau,et al.  ROAMing terrain: Real-time Optimally Adapting Meshes , 1997, Proceedings. Visualization '97 (Cat. No. 97CB36155).

[5]  J. Yates,et al.  Large-scale analysis of the yeast proteome by multidimensional protein identification technology , 2001, Nature Biotechnology.

[6]  M. Mann,et al.  Electrospray ionization for mass spectrometry of large biomolecules. , 1989, Science.

[7]  Hugues Hoppe Smooth view-dependent level-of-detail control and its application to terrain rendering , 1998, Proceedings Visualization '98 (Cat. No.98CB36276).

[8]  Rong Wang,et al.  The need for a public proteomics repository , 2004, Nature Biotechnology.

[9]  Lars Linsen,et al.  Differential protein expression analysis via liquid-chromatography/mass-spectrometry data visualization , 2005, VIS 05. IEEE Visualization, 2005..

[10]  Paolo Cignoni,et al.  Multiresolution modeling and visualization of volume data , 1997 .

[11]  Paolo Cignoni,et al.  Representation and visualization of terrain surfaces at variable resolution , 1997, The Visual Computer.

[12]  Patrick G. A. Pedrioli,et al.  A tool to visualize and evaluate data obtained by liquid chromatography-electrospray ionization-mass spectrometry. , 2004, Analytical chemistry.

[13]  M. Tyers,et al.  From genomics to proteomics , 2003, Nature.

[14]  Frank Losasso,et al.  Geometry clipmaps , 2004, ACM Trans. Graph..

[15]  Daniel Cohen-Or,et al.  Temporal continuity of levels of detail in Delaunay triangulated terrain , 1996, Proceedings of Seventh Annual IEEE Visualization '96.

[16]  M. Karas,et al.  Laser desorption ionization of proteins with molecular masses exceeding 10,000 daltons. , 1988, Analytical chemistry.

[17]  S. Jacobsson,et al.  Data preprocessing by wavelets and genetic algorithms for enhanced multivariate analysis of LC peptide mapping. , 2004, Journal of pharmaceutical and biomedical analysis.

[18]  Aidong Zhang,et al.  Interactive visualization and analysis for gene expression data , 2002, Proceedings of the 35th Annual Hawaii International Conference on System Sciences.

[19]  D. N. Perkins,et al.  Probability‐based protein identification by searching sequence databases using mass spectrometry data , 1999, Electrophoresis.

[20]  Thomas Ertl,et al.  The multilevel finite element method for adaptive mesh optimization and visualization of volume data , 1997, Proceedings. Visualization '97 (Cat. No. 97CB36155).

[21]  J. Bernhardt,et al.  Using standard positions and image fusion to create proteome maps from collections of two‐dimensional gel electrophoresis images , 2003, Proteomics.

[22]  Paolo Cignoni,et al.  Multiresolution Representation and Visualization of Volume Data , 1997, IEEE Trans. Vis. Comput. Graph..

[23]  William Ribarsky,et al.  Real-time, continuous level of detail rendering of height fields , 1996, SIGGRAPH.

[24]  Tosiyasu L. Kunii,et al.  Unconstrained Automatic Image Matching Using Multiresolutional Critical-Point Filters , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[25]  Bernd Hamann,et al.  GeneBox: Interactive Visualization of Microarray Data Sets , 2003, METMBS.

[26]  Bernd Hamann,et al.  An octree-based multiresolution approach supporting interactive rendering of very large volume data sets , 2001 .

[27]  David Salesin,et al.  Wavelets for computer graphics: theory and applications , 1996 .

[28]  Xiaoyu Yang,et al.  Characterizing complex peptide mixtures using a multi-dimensional liquid chromatography-mass spectrometry system: Saccharomyces cerevisiae as a model system. , 2004, Journal of chromatography. B, Analytical technologies in the biomedical and life sciences.

[29]  R. Aebersold,et al.  Automated statistical analysis of protein abundance ratios from data generated by stable-isotope dilution and tandem mass spectrometry. , 2003, Analytical chemistry.

[30]  Bernd Hamann,et al.  Wavelet-Based Multiresolution with , 2003, Computing.

[31]  Per Olof Edlund,et al.  Pharmaceutical and biomedical analysis: A report on the 5th International Symposium on Pharmaceutical and Biomedical Analysis held in Stockholm, Sweden, September 21–24, 1994 , 1995 .

[32]  Bernd Hamann,et al.  Wavelet-based multiresolution with n-th-root-of-2 Subdivision , 2004 .

[33]  Jörg Bernhardt,et al.  Bacillus subtilis during feast and famine: visualization of the overall regulation of protein synthesis during glucose starvation by proteome analysis. , 2003, Genome research.

[34]  Hanspeter Pfister,et al.  Hardware-accelerated 3D visualization of mass spectrometry data , 2005, VIS 05. IEEE Visualization, 2005..