RAD (RNA abundance database): an infrastructure for array data analysis

Analysis of array-based gene expression experiments is challenging particularly when multiple experiments are involved and presents challenges in data management as well. Selecting well-measured spots and normalizing raw data are basic steps required for subsequent analyses, especially those involved with comparisons over a collection of experiments. Other preprocessing steps might also be needed for certain analyses. The most appropriate criteria for spot selection, for normalization, and for other data transformations depend on the experiments under study and on the questions investigated. Furthermore, comparing experiments appropriately requires knowledge of how the experiments were performed and the samples that were used in sufficient detail to understand their degree of similarity. Approaches taken in RAD to address these issues will be presented. These include the storage of raw and processed data along with history and parameter tracking and the use of ontologies to provide precise consistent experimental descriptions.

[1]  Ronald W. Davis,et al.  Quantitative Monitoring of Gene Expression Patterns with a Complementary DNA Microarray , 1995, Science.

[2]  Janan T. Eppig,et al.  GXD: a Gene Expression Database for the laboratory mouse: current status and recent enhancements , 2000, Nucleic Acids Res..

[3]  D. Botstein,et al.  Cluster analysis and display of genome-wide expression patterns. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[4]  C. Müller,et al.  Large-scale clustering of cDNA-fingerprinting data. , 1999, Genome research.

[5]  D. Botstein,et al.  A gene expression database for the molecular pharmacology of cancer , 2000, Nature Genetics.

[6]  J. Mesirov,et al.  Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. , 1999, Science.

[7]  Trevor Hastie,et al.  Gene Shaving: a new class of clustering methods for expression arrays , 2000 .

[8]  Ji Huang,et al.  [Serial analysis of gene expression]. , 2002, Yi chuan = Hereditas.

[9]  Patrik D'haeseleer,et al.  Genetic network inference: from co-expression clustering to reverse engineering , 2000, Bioinform..

[10]  N. Sampas,et al.  Molecular classification of cutaneous malignant melanoma by gene expression profiling , 2000, Nature.

[11]  D Haussler,et al.  Knowledge-based analysis of microarray gene expression data by using support vector machines. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[12]  Y Sakaki,et al.  High-density cDNA filter analysis: a novel approach for large-scale, quantitative analysis of gene expression. , 1995, Gene.

[13]  S. Dudoit,et al.  Comparison of Discrimination Methods for the Classification of Tumors Using Gene Expression Data , 2002 .

[14]  Val Tannen,et al.  K2/Kleisli and GUS: Experiments in integrated access to genomic data sources , 2001, IBM Syst. J..

[15]  Christina Kendziorski,et al.  On Differential Variability of Expression Ratios: Improving Statistical Inference about Gene Expression Changes from Microarray Data , 2001, J. Comput. Biol..

[16]  D. Botstein,et al.  Cluster analysis and display of genome-wide expression patterns. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[17]  U. Alon,et al.  Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[18]  L. Lazzeroni Plaid models for gene expression data , 2000 .

[19]  Edward R. Dougherty,et al.  Simulator for gene expression networks , 2001, SPIE BiOS.

[20]  Y. Chen,et al.  Ratio-based decisions and the quantitative analysis of cDNA microarray images. , 1997, Journal of biomedical optics.

[21]  Marcel J. T. Reinders,et al.  Genetic network models: a comparative study , 2001, SPIE BiOS.

[22]  J. Mesirov,et al.  Interpreting patterns of gene expression with self-organizing maps: methods and application to hematopoietic differentiation. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[23]  Jonathan Crabtree,et al.  A relational schema for both array-based and SAGE gene expression experiments , 2001, Bioinform..

[24]  G S Michaels,et al.  Cluster analysis and data visualization of large-scale gene expression data. , 1998, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.

[25]  J. Claverie,et al.  The significance of digital gene expression profiles. , 1997, Genome research.

[26]  Emmanuel Spanakis,et al.  Discrimination of fibroblast subtypes by multivariate analysis of gene expression , 1997, International journal of cancer.

[27]  G. Church,et al.  Systematic determination of genetic network architecture , 1999, Nature Genetics.

[28]  Ash A. Alizadeh,et al.  Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling , 2000, Nature.

[29]  S. Dudoit,et al.  STATISTICAL METHODS FOR IDENTIFYING DIFFERENTIALLY EXPRESSED GENES IN REPLICATED cDNA MICROARRAY EXPERIMENTS , 2002 .

[30]  Christian A. Rees,et al.  Systematic variation in gene expression patterns in human cancer cell lines , 2000, Nature Genetics.

[31]  Satoru Miyano,et al.  Inferring qualitative relations in genetic networks and metabolic pathways , 2000, Bioinform..

[32]  Gregory R. Grant,et al.  Generation of patterns from gene expression data by assigning confidence to differentially expressed genes , 2000, Bioinform..

[33]  Ron Shamir,et al.  Clustering Gene Expression Patterns , 1999, J. Comput. Biol..

[34]  D. Lockhart,et al.  Expression monitoring by hybridization to high-density oligonucleotide arrays , 1996, Nature Biotechnology.