Basic approaches to gene expression analysis of stem cells by microarrays.

This chapter covers gene expression analysis by microarray to study and characterize stem cells. In a case-study scenario, we describe basic bioinformatic methodologies used to answer common questions in microarray experiments involving one or more stem cell populations. Service providers or departmental core labs usually carry out sample preparation, hybridization, and scanning of microarrays. Therefore, in this chapter, we focus on the state-of-the-art data analysis that avoids common pitfalls and introduces the reader to important controls that yield robust biologically relevant results. We describe evaluation of differentially expressed genes, clustering methods, gene-set enrichment analysis, and gene network discovery methods that can be used to formulate meaningful biological insights as well as suggest new wet lab experiments.

[1]  D. Botstein,et al.  Cluster analysis and display of genome-wide expression patterns. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[2]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[3]  Michael G. Barnes,et al.  Experimental comparison and cross-validation of the Affymetrix and Illumina gene expression analysis platforms , 2005, Nucleic acids research.

[4]  Jean YH Yang,et al.  Bioconductor: open software development for computational biology and bioinformatics , 2004, Genome Biology.

[5]  Robert Gentleman,et al.  Bioconductor Case Studies , 2008 .

[6]  A I Saeed,et al.  TM4: a free, open-source system for microarray data management and analysis. , 2003, BioTechniques.

[7]  Dennis B. Troup,et al.  NCBI GEO: archive for high-throughput functional genomic data , 2008, Nucleic Acids Res..

[8]  Pablo Tamayo,et al.  Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[9]  Pablo Tamayo,et al.  Metagenes and molecular pattern discovery using matrix factorization , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[10]  D. Botstein,et al.  Singular value decomposition for genome-wide expression data processing and modeling. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[11]  Robert Gentleman,et al.  The ALL Dataset , 2008 .

[12]  Terence P. Speed,et al.  A comparison of normalization methods for high density oligonucleotide array data based on variance and bias , 2003, Bioinform..

[13]  R. Shamir,et al.  Regulatory networks define phenotypic classes of human stem cell lines , 2008, Nature.

[14]  Zhen Jiang,et al.  Bioconductor Project Bioconductor Project Working Papers Year Paper Extensions to Gene Set Enrichment , 2013 .

[15]  J. Mesirov,et al.  GenePattern 2.0 , 2006, Nature Genetics.

[16]  Karl Pearson F.R.S. LIII. On lines and planes of closest fit to systems of points in space , 1901 .

[17]  Ann M. Hess,et al.  which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Filtering for increased power for microarray data analysis , 2008 .

[18]  Ron Shamir,et al.  Identification of functional modules using network topology and high-throughput data , 2007, BMC Systems Biology.

[19]  T. Speed,et al.  Summaries of Affymetrix GeneChip probe level data. , 2003, Nucleic acids research.

[20]  C. Ball,et al.  Repeatability of published microarray gene expression analyses , 2009, Nature Genetics.

[21]  Pan Du,et al.  lumi: a pipeline for processing Illumina microarray , 2008, Bioinform..

[22]  Ron Shamir,et al.  EXPANDER – an integrative program suite for microarray data analysis , 2005, BMC Bioinformatics.

[23]  D. Allison,et al.  Microarray data analysis: from disarray to consolidation and consensus , 2006, Nature Reviews Genetics.