On-line analytical framework for the 2-DE based proteome information

The integration of proteome information is one of the key issues in proteomics research. Several databases, such as Swiss-Prot, PDB, and SRS, currently provide proteome information. However, each of these systems supports only simple queries on the information in its own database. To enhance analysis support for proteome information, this paper proposes a data warehouse system that builds proteome data by integrating diverse protein information with clinical and experimental information produced by various methods. OLAP and exception discovery queries are carried out on this proteome data warehouse, which makes complex multidimensional analysis feasible for highly systematized proteome data. Furthermore, the system provides various analysis results that integrate the experiment information, clinical information, image information, and spot information of proteins.
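As a rough illustration of the kind of query the abstract describes, the sketch below rolls up a small fact table of 2-DE spot measurements and then flags cells that deviate from their expected value. Everything in it is an assumption made for illustration: the schema (clinical group, gel, spot, normalized volume), the sample values, the function names `roll_up` and `find_exceptions`, and the fold-change rule, which is a much simpler stand-in for the model-based exception measures used in discovery-driven OLAP exploration. It is not the system proposed in the paper.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical fact rows: (clinical_group, gel_id, spot_id, normalized_volume).
# The values are invented for this sketch.
FACTS = [
    ("control", "gel01", "spot_A", 1.0),
    ("control", "gel02", "spot_A", 1.5),
    ("control", "gel01", "spot_B", 2.0),
    ("control", "gel02", "spot_B", 2.5),
    ("patient", "gel03", "spot_A", 1.0),
    ("patient", "gel04", "spot_A", 1.5),
    ("patient", "gel03", "spot_B", 6.0),
    ("patient", "gel04", "spot_B", 6.5),
]

GROUP, GEL, SPOT, VOLUME = range(4)


def roll_up(facts, dims):
    """Aggregate the mean volume over the given dimension indices."""
    cells = defaultdict(list)
    for row in facts:
        key = tuple(row[i] for i in dims)
        cells[key].append(row[VOLUME])
    return {key: mean(values) for key, values in cells.items()}


def find_exceptions(facts, fold=1.4):
    """Flag (group, spot) cells whose mean differs from the spot-wide mean
    by at least `fold` in either direction (a simplified exception rule)."""
    by_group_spot = roll_up(facts, (GROUP, SPOT))
    by_spot = roll_up(facts, (SPOT,))
    flagged = {}
    for (group, spot), value in by_group_spot.items():
        ratio = value / by_spot[(spot,)]
        if ratio >= fold or ratio <= 1.0 / fold:
            flagged[(group, spot)] = ratio
    return flagged


if __name__ == "__main__":
    for cell, ratio in sorted(find_exceptions(FACTS).items()):
        print(cell, round(ratio, 2))
```

Rolling up over `(GROUP, SPOT)` drops the gel dimension, and rolling up over `(SPOT,)` drops the clinical group as well; comparing the two levels is what lets the sketch single out spots whose volume differs between groups.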
