Genome-wide targets identification of "core" pluripotency transcription factors with integrated features in human embryonic stem cells.

Embryonic stem cells (ESCs) play an important role in developmental biology which is still lacking clear molecular mechanisms. The "core" transcription factors (TFs) including OCT4, SOX2 and NANOG are essential for maintaining the stemness of ESCs. But the downstream targets of these "core" TFs are still ambiguous. Based on support vector machine (SVM) technology, this study develops a label method algorithm (LMA) for genome-wide target identification of "core" TFs in humans, which eliminates the need for negative training samples. This method integrates histone modifications and TF binding motifs as identification features. Compared with a previous mapping-convergence (M-C) algorithm, the LMA can provide more stable and reliable predictions. 4796, 3166 and 4384 target genes of OCT4, SOX2 and NANOG, respectively, were identified with the LMA model. Then verifications of the predicted targets were carried out based on their functional consistency and their connection degree in networks from a computational system biology perspective. The results showed that the targets of "core" TFs present higher gene functional similarity and closer connection distance than background levels.

[1]  Nadezhda T. Doncheva,et al.  Topological analysis and interactive visualization of biological networks and protein structures , 2012, Nature Protocols.

[2]  M. Esteller Cancer epigenomics: DNA methylomes and histone-modification maps , 2007, Nature Reviews Genetics.

[3]  Shinya Yamanaka,et al.  Fbx15 Is a Novel Target of Oct3/4 but Is Dispensable for Embryonic Stem Cell Self-Renewal and Mouse Development , 2003, Molecular and Cellular Biology.

[4]  Mingzhi Liao,et al.  Synergistic Transcriptional and Post-Transcriptional Regulation of ESC Characteristics by Core Pluripotency Transcription Factors in Protein-Protein Interaction Networks , 2014, PloS one.

[5]  M. Tada,et al.  Pluripotential competence of cells associated with Nanog activity , 2005, Mechanisms of Development.

[6]  Fidel Ramírez,et al.  Computing topological parameters of biological networks , 2008, Bioinform..

[7]  Alexei A. Sharov,et al.  Pluripotency governed by Sox2 via regulation of Oct3/4 expression in mouse embryonic stem cells , 2007, Nature Cell Biology.

[8]  Jun Fu,et al.  OY-TES-1 may regulate the malignant behavior of liver cancer via NANOG, CD9, CCND2 and CDCA3: a bioinformatic analysis combine with RNAi and oligonucleotide microarray. , 2015, Oncology reports.

[9]  Christie S. Chang,et al.  The BioGRID interaction database: 2013 update , 2012, Nucleic Acids Res..

[10]  G. Martin,et al.  Isolation of a pluripotent cell line from early mouse embryos cultured in medium conditioned by teratocarcinoma stem cells. , 1981, Proceedings of the National Academy of Sciences of the United States of America.

[11]  M. Grunstein,et al.  Acetylated histone H3K56 interacts with Oct4 to promote mouse embryonic stem cell pluripotency , 2013, Proceedings of the National Academy of Sciences.

[12]  Michael J. Ziller,et al.  Reference Maps of Human ES and iPS Cell Variation Enable High-Throughput Characterization of Pluripotent Cell Lines , 2011, Cell.

[13]  A. Regev,et al.  An embryonic stem cell–like gene expression signature in poorly differentiated aggressive human tumors , 2008, Nature Genetics.

[14]  Sandhya Rani,et al.  Human Protein Reference Database—2009 update , 2008, Nucleic Acids Res..

[15]  J. Kononen,et al.  POU5F1 (OCT3/4) identifies cells with pluripotent potential in human germ cell tumors. , 2003, Cancer research.

[16]  Aung Ko Win,et al.  Identification of susceptibility loci for colorectal cancer in a genome-wide meta-analysis. , 2014, Human molecular genetics.

[17]  Austin G Smith,et al.  Signalling, cell cycle and pluripotency in embryonic stem cells. , 2002, Trends in cell biology.

[18]  Lee E. Edsall,et al.  Human DNA methylomes at base resolution show widespread epigenomic differences , 2009, Nature.

[19]  D. Botstein,et al.  Genomic binding sites of the yeast cell-cycle transcription factors SBF and MBF , 2001, Nature.

[20]  T. Mikkelsen,et al.  The NIH Roadmap Epigenomics Mapping Consortium , 2010, Nature Biotechnology.

[21]  Trey Ideker,et al.  Cytoscape 2.8: new features for data integration and network visualization , 2010, Bioinform..

[22]  Holger Fröhlich,et al.  GOSim – an R-package for computation of information theoretic GO similarities between terms and gene products , 2007, BMC Bioinformatics.

[23]  Megan F. Cole,et al.  Core Transcriptional Regulatory Circuitry in Human Embryonic Stem Cells , 2005, Cell.

[24]  R. Lovell-Badge,et al.  Multipotent cell lineages in early mouse development depend on SOX2 function. , 2003, Genes & development.

[25]  D. Armant,et al.  Inner cell mass localization of NANOG precedes OCT3/4 in rhesus monkey blastocysts. , 2009, Stem cells and development.

[26]  Dekang Lin,et al.  An Information-Theoretic Definition of Similarity , 1998, ICML.

[27]  Justine R. Smith,et al.  Sequence- and target-independent angiogenesis suppression by siRNA via TLR3 , 2008, Nature.

[28]  Ash A. Alizadeh,et al.  Prediction of survival in diffuse large-B-cell lymphoma based on the expression of six genes. , 2004, The New England journal of medicine.

[29]  Y. Kluger,et al.  Comparative FAIRE‐seq Analysis Reveals Distinguishing Features of the Chromatin Structure of Ground State‐ and Primed‐Pluripotent Cells , 2015, Stem cells.

[30]  B Marshall,et al.  Gene Ontology Consortium: The Gene Ontology (GO) database and informatics resource , 2004, Nucleic Acids Res..

[31]  S. Orkin,et al.  Chipping away at the Embryonic Stem Cell Network , 2005, Cell.

[32]  X. Ke,et al.  Mitogen-activated protein kinase binding protein 1 (MAPKBP1) is an unfavorable prognostic biomarker in cytogenetically normal acute myeloid leukemia , 2015, Oncotarget.

[33]  Wei Wang,et al.  Predicting the Human Epigenome from DNA Motifs , 2014, Nature Methods.

[34]  J. Bruix,et al.  Focus on hepatocellular carcinoma. , 2004, Cancer cell.

[35]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[36]  Hui Liu,et al.  Genome-wide identification of Polycomb target genes in human embryonic stem cells. , 2013, Gene.

[37]  J. Brickman,et al.  Conserved roles for Oct4 homologues in maintaining multipotency during early vertebrate development , 2006, Development.

[38]  Eran Segal,et al.  Module map of stem cell genes guides creation of epithelial cancer stem cells. , 2008, Cell stem cell.

[39]  Raja Jothi,et al.  Genome-wide identification of in vivo protein–DNA binding sites from ChIP-Seq data , 2008, Nucleic acids research.

[40]  D. Broomhead,et al.  Radial Basis Functions, Multi-Variable Functional Interpolation and Adaptive Networks , 1988 .

[41]  M. Kaufman,et al.  Establishment in culture of pluripotential cells from mouse embryos , 1981, Nature.

[42]  Simon Kasif,et al.  Genomewide Analysis of PRC1 and PRC2 Occupancy Identifies Two Classes of Bivalent Domains , 2008, PLoS genetics.

[43]  Lee E. Edsall,et al.  Distinct epigenomic landscapes of pluripotent and lineage-committed human cells. , 2010, Cell stem cell.

[44]  J. Sedgwick,et al.  Gene knock-out technology: a methodological overview for the interested novice. , 1995, Journal of immunological methods.