We propose a computational approach for modeling the progression of Oral Squamous Cell Carcinoma (OSCC) through Dynamic Bayesian Network (DBN) models. RNA-Seq transcriptomics data, available from public functional genomics data repositories, are exploited to find genes related to disease progression (i.e. recurrence or no recurrence). Our primary aim is to perform a computational analysis based on the differentially expressed genes identified. More specifically, a search for putative transcription factor binding sites (TFBSs), in the promoters of the input gene set, as well as an analysis of the pathways of the suggested transcription factors is conducted. Activities of transcription factors which are regulated by upstream signaling cascades are further discovered. These activities converge in certain nodes, representing molecules which are potential regulators of OSCC progression. The resulting gene list is further exploited for the inference of their causal relationships and for disease classification in terms of DBN models. The structure and the parameters of the models are defined subsequently, revealing the changes in gene-gene interactions with reference to disease recurrence after surgery. The objectives of the proposed methodology are to: (i) accurately estimate OSCC progression, and (ii) provide better insights into the regulatory mechanisms of the disease. Moreover, we can conjecture about the interactions among genes based on the inferred network models. The proposed approach implies that the resulting regulatory molecules along with the differentially expressed genes extracted, can be considered as new targets, and are candidates for further experimental and in silico validation.
[1]
Benjamin Haibe-Kains,et al.
A network model for angiogenesis in ovarian cancer
,
2015,
BMC Bioinformatics.
[2]
D. Hanahan,et al.
Hallmarks of Cancer: The Next Generation
,
2011,
Cell.
[3]
Gregory D. Schuler,et al.
Database resources of the National Center for Biotechnology Information: update
,
2004,
Nucleic acids research.
[4]
Matthew E. Ritchie,et al.
limma powers differential expression analyses for RNA-sequencing and microarray studies
,
2015,
Nucleic acids research.
[5]
Gregory D. Schuler,et al.
Database resources of the National Center for Biotechnology Information
,
2012,
Nucleic Acids Res..
[6]
Serdar Bozdag,et al.
A Canonical Correlation Analysis-Based Dynamic Bayesian Network Prior to Infer Gene Regulatory Networks from Multiple Types of Biological Data
,
2015,
J. Comput. Biol..
[7]
Dennis B. Troup,et al.
NCBI GEO: mining tens of millions of expression profiles—database and tools update
,
2006,
Nucleic Acids Res..
[8]
Kuldip K. Paliwal,et al.
A Gene Selection Algorithm using Bayesian Classification Approach
,
2012
.
[9]
Sylvain Arlot,et al.
A survey of cross-validation procedures for model selection
,
2009,
0907.4728.
[10]
Alexander E. Kel,et al.
GeneXplain — Identification of Causal Biomarkers and Drug Targets in Personalized Cancer Pathways
,
2011
.
[11]
Seungjin Choi,et al.
Inference of dynamic networks using time-course data
,
2014,
Briefings Bioinform..