CLIP-GENE: a web service of the condition specific context-laid integrative analysis for gene prioritization in mouse TF knockout experiments

Motivation: Transcriptome data from gene knockout experiments in mice are widely used to investigate gene functions and their relationship to phenotypes. When a gene is knocked out, it is important to identify which genes are affected by the knockout. Existing methods, including differentially expressed gene (DEG) methods, can be used for this analysis. However, they require cutoff values to select candidate genes, which can produce either too many false positives or too many false negatives. This hurdle can be addressed by improving the accuracy of gene selection, by providing a method to rank candidate genes effectively, or both. Prioritization of candidate genes should consider the goals or context of the knockout experiment. As of now, there are no tools designed for both selecting and prioritizing genes from mouse knockout data. Hence the need for a new tool.

Results: In this study, we present CLIP-GENE, a web service that selects gene markers by utilizing differentially expressed genes, a mouse transcription factor (TF) network, and single nucleotide variant information. Then, a protein-protein interaction network and literature information are used to find genes that are relevant to the phenotypic differences. One novel feature is that researchers can specify their contexts or hypotheses as a set of keywords, and genes are ranked according to the contexts the user specifies. We believe that CLIP-GENE will be useful in characterizing functions of TFs in mouse experiments.

Availability: http://epigenomics.snu.ac.kr/CLIP-GENE

Reviewers: This article was reviewed by Dr. Lee and Dr. Pongor.
