Novel Methodology for CRC Biomarkers Detection with Leave-One-Out Bayesian Classification

In our previous research we developed a methodology for extracting significant genes that indicate colorectal cancer (CRC). By using those biomarker genes we proposed an intelligent modelling of their gene expression distributions and used them in the Bayes’ theorem in order to achieve highly precise classification of patients in one of the classes carcinogenic, or healthy. The main objective of our new research is to subside the biomarkers set without degrading the sensitivity and specificity of the classifier. We want to eliminate the biomarkers that do not play an important role in the classification process. To achieve this goal, we propose a novel approach for biomarkers detection based on iterative Bayesian classification. The new Leave-one-out method aims to extract the biomarkers essential for the classification process, i.e. if they are left-out, the classification shows remarkably degraded results. Taking into account only the reduced set of biomarkers, we produced an improved version of our Bayesian classifier when classifying new patients. Another advantage of our approach is using the new biomarkers set in the Gene Ontology (GO) analysis in order to get more precise information on the colorectal cancer’s biomarkers’ biological and molecular functions.

[1]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[2]  B Marshall,et al.  Gene Ontology Consortium: The Gene Ontology (GO) database and informatics resource , 2004, Nucleic Acids Res..

[3]  Yadong Wang,et al.  Constructing disease-specific gene networks using pair-wise relevance metric: Application to colon cancer identifies interleukin 8, desmin and enolase 1 as the central elements , 2008, BMC Systems Biology.

[4]  Qi Zheng,et al.  GOEAST: a web-based software toolkit for Gene Ontology enrichment analysis , 2008, Nucleic Acids Res..

[5]  Steven J. M. Jones,et al.  Meta-analysis of Colorectal Cancer Gene Expression Profiling Studies Identifies Consistently Reported Candidate Biomarkers , 2008, Cancer Epidemiology Biomarkers & Prevention.

[6]  Andrew J. Bulpitt,et al.  From gene expression to gene regulatory networks in Arabidopsis thaliana , 2009, BMC Systems Biology.

[7]  H. Hakonarson,et al.  Analysing biological pathways in genome-wide association studies , 2010, Nature Reviews Genetics.

[8]  Zhijin Wu,et al.  Subset Quantile Normalization Using Negative Control Features , 2010, J. Comput. Biol..

[9]  Lu Xie,et al.  DigOut: viewing differential expression genes as outliers. , 2010, Journal of bioinformatics and computational biology.

[10]  K. Hemminki,et al.  Consensus Pathways Implicated in Prognosis of Colorectal Cancer Identified Through Systematic Enrichment Analysis of Gene Expression Profiling Studies , 2011, PloS one.

[11]  Monika Simjanoska,et al.  Bayesian posterior probability classification of colorectal cancer probed with Affymetrix microarray technology , 2013, 2013 36th International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO).

[12]  Li Yang,et al.  Gene Expression Analysis of Peripheral Blood Cells Reveals Toll-Like Receptor Pathway Deregulation in Colorectal Cancer , 2013, PloS one.

[13]  Ana Madevska Bogdanova,et al.  Bayesian Multiclass Classification of Gene Expression Colorectal Cancer Stages , 2013, ICT Innovations.

[14]  Ana Madevska Bogdanova,et al.  Gene Ontology Analysis of Colorectal Cancer Biomarkers Probed with Affymetrix and Illumina Microarrays , 2013, IJCCI.