Gene Regulatory Elements Extraction in Breast Cancer by Hi-C Data Using a Meta-Heuristic Method

Detection of gene regulatory elements and the interactions among them may be conducive to diagnosis and treatment of many diseases and maladies as cancer. In this regards, Hi-C techniques such as HiCUP, HiC-Pro, HiC-bench, etc., are capable of detecting the gene regulatory elements and their interactions. Extracting only the fixed length gene regulatory elements, these techniques are not able to detect many of gene regulatory elements as they may be of variable-lengths. In this research, we intend to use a two-objective Meta heuristic method based on simulation annealing to provide a method capable of detecting and extracting sequences of variable-length regulators from the genome, and also calculating the interactions between them. In fact, these gene regulatory elements can be potential promoters/enhancers that could play a significant role in the incidence and exacerbation of cancer. To measure the performance and effectiveness of the suggested method, the proposed method is implemented on Hi-C data regarding patients with breast cancer in two blood cells GM12878 and CD34+. Then, the results of implementing the proposed method are compared with the HiCUP and HiC-Pro methods. The results show that the proposed method has a better performance than the HiCUP and HiC-Pro methods. In addition, the proposed method has been investigated for the detection and extraction of gene regulatory elements involved in the occurrence and exacerbation of this type of cancer. Experimental studies have shown that the two promoters BLC6 and HOTTIP discovered by the proposed method have had a significant effect on the incidence and severity of breast cancer in the both genetically engineered blood cells GM12878 and CD34+.

[1]  Wing-Kin Sung,et al.  Learning Chromatin Interaction Using Hi-C Datasets , 2019, Encyclopedia of Bioinformatics and Computational Biology.

[2]  Philip A. Ewels,et al.  HiCUP: pipeline for mapping and processing Hi-C data , 2015, F1000Research.

[3]  A. Jemal,et al.  Global Cancer Statistics , 2011 .

[4]  Ram Sarkar,et al.  Genetic algorithm based cancerous gene identification from microarray data using ensemble of filter methods , 2018, Medical & Biological Engineering & Computing.

[5]  Philip A. Ewels,et al.  HiCUP: pipeline for mapping and processing Hi-C data , 2015, F1000Research.

[6]  Sheng Zhong,et al.  GITAR: An Open Source Tool for Analysis and Visualization of Hi-C Data , 2018, bioRxiv.

[7]  Philip A. Ewels,et al.  Mapping long-range promoter contacts in human cells with high-resolution capture Hi-C , 2015, Nature Genetics.

[8]  Helga Thorvaldsdóttir,et al.  Juicebox.js Provides a Cloud-Based Visualization System for Hi-C Data , 2017, bioRxiv.

[9]  L. Mirny,et al.  Iterative Correction of Hi-C Data Reveals Hallmarks of Chromosome Organization , 2012, Nature Methods.

[10]  Dursun Delen,et al.  Predicting breast cancer survivability: a comparison of three data mining methods , 2005, Artif. Intell. Medicine.

[11]  O. Troyanskaya,et al.  Predicting effects of noncoding variants with deep learning–based sequence model , 2015, Nature Methods.

[12]  K. Zhao,et al.  Characterization of genome-wide enhancer-promoter interactions reveals co-expression of interacting genes and modes of higher order chromatin organization , 2012, Cell Research.

[13]  Jiale Li,et al.  Polymorphisms in the FOXO gene are associated with growth traits in the Sanmen breeding population of the razor clam Sinonovacula constricta , 2018, Aquaculture and Fisheries.

[14]  Brian D. Gregory,et al.  High-throughput identification of long-range regulatory elements and their target promoters in the human genome , 2013, Nucleic acids research.

[15]  Jean-Philippe Vert,et al.  HiC-Pro: an optimized and flexible pipeline for Hi-C data processing , 2015, Genome Biology.

[16]  D. Du,et al.  Genetic effects of historical anthropogenic disturbance on a long-lived endangered tropical tree Vatica mangachapoi , 2018, Journal of Forestry Research.

[17]  Feng Liu,et al.  PEDLA: predicting enhancers with a deep learning-based algorithmic framework , 2016, Scientific Reports.

[18]  Aristotelis Tsirigos,et al.  HiC-bench: comprehensive and reproducible Hi-C data analysis designed for parameter exploration and benchmarking , 2016, BMC Genomics.

[19]  Yan Li,et al.  A high-resolution map of three-dimensional chromatin interactome in human cells , 2013, Nature.

[20]  Antonella Lisa,et al.  An Association Rule Mining Approach to Discover lncRNAs Expression Patterns in Cancer Datasets , 2015, BioMed research international.

[21]  Aristotelis Tsirigos,et al.  Detecting community structures in Hi-C genomic data , 2015, 2016 Annual Conference on Information Science and Systems (CISS).

[22]  Yuehjen E. Shao,et al.  Mining the breast cancer pattern using artificial neural networks and multivariate adaptive regression splines , 2004, Expert Syst. Appl..

[23]  Xiaowu Chen,et al.  Fish red blood cells express immune genes and responses. , 2018 .

[24]  David T. W. Jones,et al.  Signatures of mutational processes in human cancer , 2013, Nature.