GEMMA - A Grid environment for microarray management and analysis in bone marrow stem cells experiments

Microarray techniques are successfully used to investigate thousands gene expression profiling in a variety of genomic analyses such as gene identification, drug discovery and clinical diagnosis, providing a large amount of genomic data for the overall research community. A Grid based Environment for distributed Microarray data Management and Analysis (GEMMA) is being built. This platform is planned to provide shared, standardized and reliable tools for managing and analyzing biological data related to bone marrow stem cell cultures, in order to maximize the results of distributed experiments. Different microarray analysis algorithms may be offered to the end-user, through a web interface. A set of modular and independent applications may be published on the portal, and either single algorithms or a combination of them might be invoked by the user, through a workflow strategy. Services may be implemented within an existing Grid computing infrastructure to solve problems concerning both large datasets storage (data intensive problem) and large computational times (computing intensive problem). Moreover, experimental data annotation may be collected according to the same rules and stored through the Grid portal, by using a metadata schema, which allows a comprehensive and replicable sharing of microarray experiments among different researchers. The environment has been tested, so far, as regards performance results concerning Grid parallelization of a microarray based gene expression analysis. First results show a very promising speedup ratio.

[1]  A. Pulvirenti,et al.  GENIUS: a web portal for the grid , 2003 .

[2]  M. Zago,et al.  The Profile of Gene Expression of Human Marrow Mesenchymal Stem Cells , 2003, Stem cells.

[3]  Stephen R Quake,et al.  Significance and statistical errors in the analysis of DNA microarray data , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[4]  Joaquín Dopazo,et al.  FatiGO: a web tool for finding significant associations of Gene Ontology terms with groups of genes , 2004, Bioinform..

[5]  J. Mesirov,et al.  Interpreting patterns of gene expression with self-organizing maps: methods and application to hematopoietic differentiation. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[6]  Jason E. Stewart,et al.  Minimum information about a microarray experiment (MIAME)—toward standards for microarray data , 2001, Nature Genetics.

[7]  D. Botstein,et al.  Cluster analysis and display of genome-wide expression patterns. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[8]  F. Rademakers,et al.  ROOT — An object oriented data analysis framework , 1997 .

[9]  Steven Tuecke,et al.  Internet X.509 Public Key Infrastructure (PKI) Proxy Certificate Profile , 2004, RFC.

[10]  G. Kopen,et al.  MicroSAGE Analysis of 2,353 Expressed Genes in a Single Cell‐Derived Colony of Undifferentiated Human Mesenchymal Stem Cells Reveals mRNAs of Multiple Cell Lineages , 2001, Stem cells.

[11]  Jean YH Yang,et al.  Bioconductor: open software development for computational biology and bioinformatics , 2004, Genome Biology.

[12]  Ivan Martin,et al.  Three‐Dimensional Perfusion Culture of Human Bone Marrow Cells and Generation of Osteoinductive Grafts , 2005, Stem cells.

[13]  Joachim Geiler,et al.  Workflow-based Grid applications , 2006, Future Gener. Comput. Syst..

[14]  Jason E. Stewart,et al.  Design and implementation of microarray gene expression markup language (MAGE-ML) , 2002, Genome Biology.

[15]  Mark Pollitt,et al.  Exploration , 2006, J. Digit. Forensic Pract..

[16]  Rafael A Irizarry,et al.  Exploration, normalization, and summaries of high density oligonucleotide array probe level data. , 2003, Biostatistics.

[17]  Terence P. Speed,et al.  A comparison of normalization methods for high density oligonucleotide array data based on variance and bias , 2003, Bioinform..

[18]  M. Mastrogiacomo,et al.  Proliferation kinetics and differentiation potential of ex vivo expanded human bone marrow stromal cells: Implications for their use in cell therapy. , 2000, Experimental hematology.

[19]  C. Li,et al.  Model-based analysis of oligonucleotide arrays: expression index computation and outlier detection. , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[20]  D. Wendt,et al.  Oscillating perfusion of cell suspensions through three‐dimensional scaffolds enhances cell seeding efficiency and uniformity , 2003, Biotechnology and bioengineering.

[21]  Birger Koblitz,et al.  Performance comparison of the LCG2 and gLite file catalogues , 2006, IEEE Nuclear Science Symposium Conference Record, 2005.

[22]  Jonathan W. Essex,et al.  BioSimGrid: Grid-enabled biomolecular simulation data storage and analysis , 2006, Future Gener. Comput. Syst..

[23]  Andrew C. Simpson,et al.  Securing web services for deployment in health grids , 2006, Future Gener. Comput. Syst..

[24]  Flavia Donno,et al.  The INFN-Grid Testbed , 2005, Future Gener. Comput. Syst..

[25]  Giancarlo Mauri,et al.  Network integration of data and analysis of oncology interest , 2006, J. Integr. Bioinform..

[26]  Hubert Hackl,et al.  MARS: Microarray analysis, retrieval, and storage system , 2005, BMC Bioinformatics.

[27]  Ivan Martin,et al.  Engineering of osteoinductive grafts by isolation and expansion of ovine bone marrow stromal cells directly on 3D ceramic scaffolds , 2006, Biotechnology and bioengineering.

[28]  Jemal H. Abawajy Special section: Parallel input/output management techniques (PIOMT) in cluster and grid computing , 2006, Future Gener. Comput. Syst..

[29]  Ross Ihaka,et al.  Gentleman R: R: A language for data analysis and graphics , 1996 .

[30]  Alvis Brazma,et al.  On the Importance of Standardisation in Life Sciences , 2001, Bioinform..

[31]  R. Barbera The GENIUS Grid Portal , 2003 .

[32]  Lorenzo Bruzzone,et al.  BioDCV : a Distributed Computing System for the Complete Validation of Gene Profiles , 2005 .

[33]  A. Friedenstein,et al.  Heterotopic of bone marrow. Analysis of precursor cells for osteogenic and hematopoietic tissues. , 1968, Transplantation.