Construction of an open-access database that integrates cross-reference information from the transcriptome and proteome of immune cells

MOTIVATION Although a huge amount of mammalian genomic data does become publicly available, there are still hurdles for biologists to overcome before such data can be fully exploited. One of the challenges for gaining biological insight from genomic data has been the inability to cross-reference transcriptomic and proteomic data using a single informational platform. To address this, we constructed an open-access database that enabled us to cross-reference transcriptomic and proteomic data obtained from immune cells. RESULTS The database, named RefDIC (Reference genomics Database of Immune Cells), currently contains: (i) quantitative mRNA profiles for human and mouse immune cells/tissues obtained using Affymetrix GeneChip technology; (ii) quantitative protein profiles for mouse immune cells obtained using two-dimensional gel electrophoresis (2-DE) followed by image analysis and mass spectrometry and (iii) various visualization tools to cross-reference the mRNA and protein profiles of immune cells. RefDIC is the first open-access database for immunogenomics and serves as an important information-sharing platform, enabling a focused genomic approach in immunology. AVAILABILITY All raw data and information can be accessed from http://refdic.rcai.riken.jp/. The microarray data is also available at http://cibex.nig.ac.jp/ under CIBEX accession no. CBX19, and http://www.ebi.ac.uk/pride/ under PRIDE accession numbers 2354-2378 and 2414.

[1]  S. de Vos,et al.  Transcriptional Profiling of Human Hematopoiesis During In Vitro Lineage‐Specific Differentiation , 2005, Stem cells.

[2]  S. Batalov,et al.  A gene atlas of the mouse and human protein-encoding transcriptomes. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[3]  R. Myers,et al.  Evolving gene/transcript definitions significantly alter the interpretation of GeneChip data , 2005, Nucleic acids research.

[4]  T. Speed,et al.  Summaries of Affymetrix GeneChip probe level data. , 2003, Nucleic acids research.

[5]  Tatiana A. Tatusova,et al.  Entrez Gene: gene-centered information at NCBI , 2004, Nucleic Acids Res..

[6]  Tatiana Tatusova,et al.  NCBI Reference Sequence (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins , 2004, Nucleic Acids Res..

[7]  Atsushi Hijikata,et al.  Construction of quantitative proteome reference maps of mouse spleen and lymph node based on two‐dimensional gel electrophoresis , 2006, Proteomics.

[8]  Neil R Smalheiser Informatics and hypothesis‐driven research , 2002, EMBO reports.

[9]  Wei-Min Liu,et al.  Robust estimators for expression analysis , 2002, Bioinform..

[10]  Riitta Lahesmaa,et al.  Proteome characterization of human T helper 1 and 2 cells , 2004, Proteomics.

[11]  G. Schuler Pieces of the puzzle: expressed sequence tags and the catalog of human genes , 1997, Journal of Molecular Medicine.

[12]  Kathleen F. Kerr,et al.  Standardizing global gene expression analysis between laboratories and across platforms , 2005, Nature Methods.

[13]  E. Birney,et al.  The International Protein Index: An integrated database for proteomics experiments , 2004, Proteomics.

[14]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[15]  Sean R. Eddy,et al.  Profile hidden Markov models , 1998, Bioinform..

[16]  P. Cutler Protein arrays: The current state‐of‐the‐art , 2003, Proteomics.

[17]  Alexander R. Abbas,et al.  Immune response in silico (IRIS): immune-specific genes identified from a compendium of microarray expression data , 2005, Genes and Immunity.

[18]  Luc Negroni,et al.  Assessing factors for reliable quantitative proteomics based on two‐dimensional gel electrophoresis , 2004, Proteomics.

[19]  M Giehl,et al.  Gene expression profiling of CD34+ cells identifies a molecular signature of chronic myeloid leukemia blast crisis , 2006, Leukemia.

[20]  M. Rudemo,et al.  Statistical exploration of variation in quantitative two‐dimensional gel electrophoresis data , 2004, Proteomics.

[21]  S. Enkemann,et al.  A sequence-based identification of the genes detected by probesets on the Affymetrix U133 plus 2.0 array , 2005, Nucleic acids research.

[22]  John Quackenbush,et al.  Synchronous global assessment of gene and protein expression in colorectal cancer progression. , 2005, Genomics.

[23]  P Dupree,et al.  Quantitative and reproducible two‐dimensional gel analysis using Phoretix 2D Full , 2001, Electrophoresis.

[24]  Kenneth H Buetow,et al.  Detecting false expression signals in high-density oligonucleotide arrays by an in silico approach. , 2005, Genomics.