HCVS: Pinpointing Chromatin States Through Hierarchical Clustering and Visualization Scheme

Specific combinations of Histone Modifications (HMs) contributing towards histone code hypothesis lead to various biological functions. HMs combinations have been utilized by various studies to divide the genome into different regions. These study regions have been classified as chromatin states. Mostly Hidden Markov Model (HMM) based techniques have been utilized for this purpose. In case of chromatin studies, data from Next Generation Sequencing (NGS) platforms is being used. Chromatin states based on histone modification combinatorics are annotated by mapping them to functional regions of the genome. The number of states being predicted so far by the HMM tools have been justified biologically till now. The present study aimed at providing a computational scheme to identify the underlying hidden states in the data under consideration. Methods: We proposed a computational scheme HCVS based on hierarchical clustering and visualization strategy in order to achieve the objective of study. We tested our proposed scheme on a real data set of nine cell types comprising of nine chromatin marks. The approach successfully identified the state numbers for various possibilities. The results have been compared with one of the existing models as well which showed quite good correlation. The HCVS model not only helps in deciding the optimal state numbers for a particular data but it also justifies the results biologically thereby correlating the computational and biological aspects.

[1]  Satoru Miyano,et al.  Open source clustering software , 2004 .

[2]  B. Bernstein,et al.  Charting histone modifications and the functional organization of mammalian genomes , 2011, Nature Reviews Genetics.

[3]  Manolis Kellis,et al.  Discovery and characterization of chromatin states for systematic annotation of the human genome , 2010, Nature Biotechnology.

[4]  William Stafford Noble,et al.  Unsupervised pattern discovery in human chromatin structure through genomic segmentation , 2012, Nature Methods.

[5]  Guo-Cheng Yuan,et al.  Epigenetic domains found in mouse embryonic stem cells via a hidden Markov model , 2010, BMC Bioinformatics.

[6]  T. Kouzarides Chromatin Modifications and Their Function , 2007, Cell.

[7]  Lovelace J. Luquette,et al.  Comprehensive analysis of the chromatin landscape in Drosophila , 2010, Nature.

[8]  Michael Grunstein,et al.  Genome-wide patterns of histone modifications in yeast , 2006, Nature Reviews Molecular Cell Biology.

[9]  Muhammad Abdul Qadir,et al.  ChromBiSim: Interactive chromatin biclustering using a simple approach. , 2017, Genomics.

[10]  Megan F. Cole,et al.  Genome-wide Map of Nucleosome Acetylation and Methylation in Yeast , 2005, Cell.

[11]  Michael Q. Zhang,et al.  High-resolution human core-promoter prediction with CoreBoost_HM. , 2009, Genome research.

[12]  Guo-Cheng Yuan,et al.  Chromatin States Accurately Classify Cell Differentiation Stages , 2012, PloS one.

[13]  Marcel Brun,et al.  Clustering Algorithms: On Learning, Validation, Performance, and Applications to Genomics , 2009, Current genomics.

[14]  Sylvain Arlot,et al.  A survey of cross-validation procedures for model selection , 2009, 0907.4728.

[15]  S. Schreiber,et al.  Signaling Network Model of Chromatin , 2002, Cell.

[16]  N. Friedman,et al.  Single-Nucleosome Mapping of Histone Modifications in S. cerevisiae , 2005, PLoS biology.

[17]  James D Watson Celebrating the genetic jubilee: a conversation with James D. Watson. Interviewed by John Rennie. , 2003, Scientific American.

[18]  H. Akaike,et al.  Information Theory and an Extension of the Maximum Likelihood Principle , 1973 .

[19]  Sündüz Keles,et al.  Normalization of ChIP-seq data with control , 2012, BMC Bioinformatics.

[20]  T. Mikkelsen,et al.  Genome-wide maps of chromatin state in pluripotent and lineage-committed cells , 2007, Nature.

[21]  Bing Ren,et al.  ChromaSig: A Probabilistic Approach to Finding Common Chromatin Signatures in the Human Genome , 2008, PLoS Comput. Biol..

[22]  Jacob D. Jaffe,et al.  Plasticity in patterns of histone modifications and chromosomal proteins in Drosophila heterochromatin. , 2011, Genome research.

[23]  A. Rechtsteiner,et al.  Broad chromosomal domains of histone modification patterns in C. elegans. , 2011, Genome research.

[24]  C. Allis,et al.  Translating the Histone Code , 2001, Science.

[25]  Stéphane Robin,et al.  Integrative epigenomic mapping defines four main chromatin states in Arabidopsis , 2011, The EMBO journal.

[26]  Gos Micklem,et al.  Supporting Online Material Materials and Methods Figs. S1 to S50 Tables S1 to S18 References Identification of Functional Elements and Regulatory Circuits by Drosophila Modencode , 2022 .

[27]  Raymond K. Auerbach,et al.  Integrative Analysis of the Caenorhabditis elegans Genome by the modENCODE Project , 2010, Science.

[28]  Timothy J. Durham,et al.  Systematic analysis of chromatin state dynamics in nine human cell types , 2011, Nature.

[29]  Manolis Kellis,et al.  ChromHMM: automating chromatin-state discovery and characterization , 2012, Nature Methods.

[30]  Bing Ren,et al.  Prediction of regulatory elements in mammalian genomes using chromatin signatures , 2008, BMC Bioinformatics.

[31]  Nathaniel D. Heintzman,et al.  Distinct and predictive chromatin signatures of transcriptional promoters and enhancers in the human genome , 2007, Nature Genetics.

[32]  Zoubin Ghahramani,et al.  Bayesian correlated clustering to integrate multiple datasets , 2012, Bioinform..

[33]  Guillaume J. Filion,et al.  Systematic Protein Location Mapping Reveals Five Principal Chromatin Types in Drosophila Cells , 2010, Cell.

[34]  Jie Wang,et al.  Unsupervised pattern discovery in human chromatin structure through genomic segmentation , 2013, BCB.

[35]  C. Allis,et al.  The language of covalent histone modifications , 2000, Nature.

[36]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[37]  Muhammad Abdul Qadir,et al.  ChromClust: A semi-supervised chromatin clustering toolkit for mining histone modifications interplay. , 2015, Genomics.