M@IA: A Modular Open-Source Application for Microarray Workflow and Integrative Datamining

Microarray technology is a widely used approach to gene expression analysis. Many tools for microarray management and data analysis have been developed, and recently new methods have been proposed for deciphering biological pathways by integrating microarray data with other data sources. However, to improve microarray analysis and provide meaningful gene interaction networks, integrated software solutions are still needed. Therefore, we developed M@IA, an environment for DNA microarray data analysis allowing gene network reconstruction. M@IA is a microarray integrated application which includes all of the steps of a microarray study, from MIAME-compliant raw data storage and processing gene expression analysis. Furthermore, M@IA allows automatic gene annotation based on ontology, metabolic/signalling pathways, protein interaction, miRNA and transcriptional factor associations, as well as integrative analysis of gene interaction networks. Statistical and graphical methods facilitate analysis, yielding new hypotheses on gene expression data. To illustrate our approach, we applied M@IA modules to microarray data taken from an experiment on liver tissue. We integrated differentially expressed genes with additional biological information, thus identifying new molecular interaction networks that are associated with fibrogenesis. M@IA is a new application for microarray management and data analysis, offering functional insights into microarray data by the combination of gene expression data and biological knowledge annotation based on interactive graphs. M@IA is an interactive multi-user interface based on a flexible modular architecture and it is freely available for academic users at http://maia.genouest.org.

[1]  Steven C. Lawlor,et al.  GenMAPP, a new tool for viewing and analyzing microarray data on biological pathways , 2002, Nature Genetics.

[2]  P. Shannon,et al.  Cytoscape: a software environment for integrated models of biomolecular interaction networks. , 2003, Genome research.

[3]  Jean YH Yang,et al.  Bioconductor: open software development for computational biology and bioinformatics , 2004, Genome Biology.

[4]  Dominic Widdows,et al.  Using Curvature and Markov Clustering in Graphs for Lexical Acquisition and Word Sense Discrimination , 2004 .

[5]  S. Gruvberger,et al.  BioArray Software Environment (BASE): a platform for comprehensive management and analysis of microarray data , 2002, Genome Biology.

[6]  Zlatko Trajanoski,et al.  CARMAweb: comprehensive R- and bioconductor-based web service for microarray data analysis , 2006, Nucleic Acids Res..

[7]  Priyanka Gupta,et al.  BioWarehouse: a bioinformatics database warehouse toolkit , 2006, BMC Bioinformatics.

[8]  Gary D Bader,et al.  BIND--The Biomolecular Interaction Network Database. , 2001, Nucleic acids research.

[9]  David Botstein,et al.  SOURCE: a unified genomic resource of functional annotations, ontologies, and gene expression data , 2003, Nucleic Acids Res..

[10]  Zhenjun Hu,et al.  VisANT: an online visualization and analysis tool for biological interaction data , 2004, BMC Bioinformatics.

[11]  David Botstein,et al.  The Stanford Microarray Database , 2001, Nucleic Acids Res..

[12]  Stijn van Dongen,et al.  miRBase: microRNA sequences, targets and gene nomenclature , 2005, Nucleic Acids Res..

[13]  Robert E. W. Hancock,et al.  ArrayPipe: a flexible processing pipeline for microarray data , 2004, Nucleic Acids Res..

[14]  Aedín C. Culhane,et al.  Expression Profiler: next generation - an online platform for analysis of microarray data , 2004, Nucleic Acids Res..

[15]  D. Glaise,et al.  Upregulation of the tumor suppressor gene menin in hepatocellular carcinomas and its significance in fibrogenesis , 2006, Hepatology.

[16]  Ming Yi,et al.  WholePathwayScope: a comprehensive pathway-based analysis tool for high-throughput data , 2006, BMC Bioinformatics.

[17]  Ramón Díaz-Uriarte,et al.  IDconverter and IDClight: Conversion and annotation of gene and protein IDs , 2007, BMC Bioinformatics.

[18]  Sergio Contrino,et al.  ArrayExpress—a public repository for microarray gene expression data at the EBI , 2004, Nucleic Acids Res..

[19]  Joaquín Dopazo,et al.  GEPAS: a web-based resource for microarray gene expression data analysis , 2003, Nucleic Acids Res..

[20]  Susumu Goto,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 2000, Nucleic Acids Res..

[21]  S. Dudoit,et al.  Normalization for cDNA microarray data: a robust composite method addressing single and multiple slide systematic variation. , 2002, Nucleic acids research.

[22]  R. Tibshirani,et al.  Significance analysis of microarrays applied to the ionizing radiation response , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[23]  Gary D. Bader,et al.  cPath: open source software for collecting, storing, and querying biological pathways , 2006, BMC Bioinformatics.

[24]  Ioannis Xenarios,et al.  DIP: the Database of Interacting Proteins , 2000, Nucleic Acids Res..

[25]  Jason E. Stewart,et al.  Minimum information about a microarray experiment (MIAME)—toward standards for microarray data , 2001, Nature Genetics.