Analysis of Gene Regulatory Networks Inferred from ChIP-seq Data

Computational network biology aims to understand cell behavior through complex network analysis. The Chromatin ImmunoPrecipitation sequencing (ChIP-seq) technique allows interrogating the physical binding interactions between proteins and DNA using Next-Generation Sequencing. Taking advantage of this technique, in this study we propose a computational framework to analyze gene regulatory networks built from ChIP-seq data. We focus on two different cell lines: GM12878, a normal lymphoblastoid cell line, and K562, an immortalised myelogenous leukemia cell line. In the proposed framework, we preprocessed the data, derived network relationships in the data, analyzed their network properties, and identified differences between the two cell lines through network comparison analysis. Throughout our analysis, we identified known cancer genes and other genes that may play important roles in chronic myelogenous leukemia.

[1]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[2]  Gesine Reinert,et al.  New powerful statistics for alignment-free sequence comparison under a pattern transfer model. , 2011, Journal of theoretical biology.

[3]  B. Corominas-Murtra,et al.  On the basic computational structure of gene regulatory networks. , 2009, Molecular bioSystems.

[4]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[5]  C. Lozzio,et al.  Human chronic myelogenous leukemia cell-line with positive Philadelphia chromosome. , 1975, Blood.

[6]  B. Scheijen,et al.  The many faces of IKZF1 in B-cell precursor acute lymphoblastic leukemia , 2018, Haematologica.

[7]  Tin Wee Tan,et al.  In silico grouping of peptide/HLA class I complexes using structural interaction characteristics , 2007, Bioinform..

[8]  J. Aster,et al.  Modulation of gene expression via overlapping binding sites exerted by ZNF143, Notch1 and THAP11 , 2013, Nucleic acids research.

[9]  Gesine Reinert,et al.  Alignment-free protein interaction network comparison , 2014, Bioinform..

[10]  Stefano Ceri,et al.  Exploring Genomic Datasets: from Batch to Interactive and Back , 2018, ExploreDB@SIGMOD/PODS.

[11]  M. Gerstein,et al.  Getting connected: analysis and principles of biological networks. , 2007, Genes & development.

[12]  Marco Masseroli,et al.  Processing of big heterogeneous genomic datasets for tertiary analysis of Next Generation Sequencing data , 2018, Bioinform..

[13]  Christos Faloutsos,et al.  oddball: Spotting Anomalies in Weighted Graphs , 2010, PAKDD.

[14]  Carsten Wiuf,et al.  From evidence to inference: Probing the evolution of protein interaction networks , 2009, HFSP journal.

[15]  S. Horvath,et al.  Weighted gene coexpression network analysis strategies applied to mouse weight , 2007, Mammalian Genome.

[16]  F. Uchiumi,et al.  The hematopoietic regulator, ELF-1, enhances the transcriptional response to Interferon-β of the OAS1 anti-viral gene , 2015, Scientific Reports.

[17]  Taira Maekawa,et al.  Non-steady-state hematopoiesis regulated by the C/EBPβ transcription factor , 2015, Cancer science.

[18]  D. Koh,et al.  KAISO, a critical regulator of p53-mediated transcription of CDKN1A and apoptotic genes , 2014, Proceedings of the National Academy of Sciences.

[19]  Sing-Hoi Sze,et al.  Path Matching and Graph Matching in Biological Networks , 2007, J. Comput. Biol..

[20]  S. Horvath,et al.  Statistical Applications in Genetics and Molecular Biology , 2011 .

[21]  Sangsoo Kim,et al.  Gene expression Differential coexpression analysis using microarray data and its application to human cancer , 2005 .

[22]  Mihai Udrescu,et al.  Statistical fidelity: a tool to quantify the similarity between multi-variable entities with application in complex networks , 2017, Int. J. Comput. Math..

[23]  Natasa Przulj,et al.  Biological network comparison using graphlet degree distribution , 2007, Bioinform..

[24]  Bronwen L. Aken,et al.  GENCODE: The reference human genome annotation for The ENCODE Project , 2012, Genome research.

[25]  William Stafford Noble,et al.  Sequence features and chromatin structure around the genomic regions bound by 119 human transcription factors , 2012, Genome research.

[26]  Thomas V. O'brien,et al.  Pharmacological Inhibition of the Histone Lysine Demethylase KDM1A Suppresses the Growth of Multiple Acute Myeloid Leukemia Subtypes. , 2016, Cancer research.

[27]  A. Verma,et al.  Activation of the p38 Map kinase pathway is essential for the antileukemic effects of dasatinib , 2009, Leukemia & lymphoma.

[28]  Aric Hagberg,et al.  Exploring Network Structure, Dynamics, and Function using NetworkX , 2008 .

[29]  Michael J. E. Sternberg,et al.  PINALOG: a novel approach to align protein interaction networks—implications for complex detection and function prediction , 2012, Bioinform..