CellFinder's Molecular Database and its Application to Stem Cell Research

CellFinder is a freely available on-line resource for cell-based data in mammalian cells and in vitro cell lines. CellFinder was developed to provide easier access to different types of cell-based data, which include anatomical and microscopic images, protein expression and whole genome mRNA expression profiles. In this short communication, we describe the collection, processing and storage of gene expression data for the CellFinder resource. Studies involving the use of normal cells and/or production of pluripotent stem cells or their engineered cell types, were chosen from public data repositories of mRNA expression data. Samples in each study were manually curated with ontology terms, and the data was further processed to yield normalized expression data. Sample, experiment and molecular data were stored in a postgreSQL database. The resulting molecular database currently contains 1588 samples from 56 studies, covering 42 cell types in human and mouse. The molecular database is undergoing continuous development and will serve as a source of standardized, processed and expertly annotated data for CellFinder analysis tools such as MarkerTool and CompareTool.

[1]  S. Yamanaka,et al.  Induction of Pluripotent Stem Cells from Mouse Embryonic and Adult Fibroblast Cultures by Defined Factors , 2006, Cell.

[2]  Hirokazu Chiba,et al.  CELLPEDIA: a repository for human cell information for cell studies and differentiation analyses , 2011, Database J. Biol. Databases Curation.

[3]  Khadija El Amrani,et al.  MGFM: a novel tool for detection of tissue and cell specific marker genes from microarray gene expression data , 2015, BMC Genomics.

[4]  Pan Du,et al.  lumi: a pipeline for processing Illumina microarray , 2008, Bioinform..

[5]  C. Wells,et al.  YuGene: a simple approach to scale gene expression data derived from different platforms for integrated analyses. , 2014, Genomics.

[6]  Rowland Mosbergen,et al.  Stemformatics: visualisation and sharing of stem cell gene expression. , 2013, Stem cell research.

[7]  Mariana L. Neves,et al.  CellFinder: a cell data repository , 2013, Nucleic Acids Res..

[8]  Ulf Leser,et al.  CELDA - an ontology for the comprehensive representation of cells in complex systems , 2013, BMC Bioinformatics.

[9]  Andreas Kurtz,et al.  Semantic Body Browser: graphical exploration of an organism and spatially resolved expression data visualization , 2015, Bioinform..

[10]  Benjamin M. Bolstad,et al.  affy - analysis of Affymetrix GeneChip data at the probe level , 2004, Bioinform..

[11]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[12]  篠原 隆司,et al.  Induction of pluripotent stem cell cells from germ cells , 2012 .