Identification of regulatory modules by co-clustering latent variable models: stem cell differentiation

MOTIVATION: An important issue in stem cell biology is understanding how to direct differentiation towards a specific cell type. To elucidate the mechanism, previous studies have focused on identifying the responsible gene regulators; these studies, however, have failed to provide a systemic view of regulatory modules. To obtain a unified description of the regulatory modules, we characterized major stem cell species by employing a co-clustering latent variable model (LVM). The LVM-based method allowed us to elucidate the cell type-specific transcription factors, using genomic sequences as well as expression profiles.

RESULTS: We used a list of genes enriched in each of 21 stem cell subpopulations, together with their upstream genomic sequences. The LVM-based study uncovered the regulatory modules for each stem cell cluster, e.g. GABP and E2F for the proliferation phase, and Ap2alpha and Ap2gamma for the quiescence phase. Furthermore, the identities of the stem cell clusters were clearly revealed by the constituent genes directly targeted by the modules. Our analytical framework was thus shown to be useful through a detailed case study of stem cell differentiation, and it can be applied to problems with similar characteristics.
