Identification of Genetic Pathway for Cervical Cancer Development Using Rough and Bayesian Theory

Suitable analysis of microarray dataset can unlock the mystery of the origin of many dreaded disease like cancer which can subsequently be investigated for its rectification, resulting into search for drug design. A critical challenge of the post-genomic era is to find out the cancer causing genes that induce changes in gene expression profiles in the microarray dataset. Various algorithms based on SVM, Data Mining Techniques, Information theory based investigations, Clustering Techniques etc. were used by previous researchers. In this paper, Rough Set Theory and Bayesian Network based techniques have been applied for the same purpose. Rough Set has been used to isolate genes from microarray dataset responsible for cervical cancer. Bayesian approach has been used for extracting the Gene Regulating Network using the isolated genes. The same has been repeated for a normal healthy person. By superimposing these two networks, it is possible to find out the distinct cellular pathway for development of cancer from the departure of directed edges of the two networks. The results obtained in this work are quite satisfactory.

[1]  Satoru Miyano,et al.  Identification of Genetic Networks from a Small Number of Gene Expression Patterns Under the Boolean Network Model , 1998, Pacific Symposium on Biocomputing.

[2]  Sushmita Mitra,et al.  Evolutionary Rough Feature Selection in Gene Expression Data , 2007, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[3]  Igor Cima,et al.  Identification of a Gene Expression Signature Common to Distinct Cancer Pathways , 2012, Cancer informatics.

[4]  Julie M. David,et al.  Machine Learning Approach for Prediction of Learning Disabilities in School-Age Children , 2010 .

[5]  Jin-Kao Hao,et al.  A Hybrid GA/SVM Approach for Gene Selection and Classification of Microarray Data , 2006, EvoWorkshops.

[6]  Satoru Miyano,et al.  Estimation of Genetic Networks and Functional Structures Between Genes by Using Bayesian Networks and Nonparametric Regression , 2001, Pacific Symposium on Biocomputing.

[7]  Aboul Ella Hassanien,et al.  Rough Set Approach for Generation of Classification Rules of Breast Cancer Data , 2004, Informatica.

[8]  Yong Xu,et al.  Neuro-Fuzzy Ensemble Approach for Microarray Cancer Gene Expression Data Analysis , 2006, 2006 International Symposium on Evolving Fuzzy Systems.

[9]  James C. Bezdek,et al.  Fuzzy models—What are they, and why? [Editorial] , 1993, IEEE Transactions on Fuzzy Systems.

[10]  Goutam Saha,et al.  An Approach towards Automated Disease Diagnosis & Drug Design Using Hybrid Rough-Decision Tree from Microarray Dataset , 2013 .

[11]  Azadeh Mohammadi,et al.  Identification of disease-causing genes using microarray data mining and Gene Ontology , 2011, BMC Medical Genomics.

[12]  Edward R. Dougherty,et al.  Probabilistic Boolean networks: a rule-based uncertainty model for gene regulatory networks , 2002, Bioinform..

[13]  Nir Friedman,et al.  Learning Bayesian Networks with Local Structure , 1996, UAI.

[14]  Jan Komorowski,et al.  Classification of Gene Expression Data in an Ontology , 2001, ISMDA.

[15]  Ting Chen,et al.  Modeling Gene Expression with Differential Equations , 1998, Pacific Symposium on Biocomputing.

[16]  Adnan Darwiche Bayesian networks , 2010, Commun. ACM.

[17]  Lucila Ohno-Machado,et al.  Small, fuzzy and interpretable gene expression based classifiers , 2005, Bioinform..

[18]  Jan Komorowski,et al.  Learning Rough Set Classifiers from Gene Expressions and Clinical Data , 2002, Fundam. Informaticae.

[19]  Michal Linial,et al.  Using Bayesian Networks to Analyze Expression Data , 2000, J. Comput. Biol..

[20]  Satoru Miyano,et al.  Bayesian network and nonparametric heteroscedastic regression for nonlinear modeling of genetic network , 2003, Proceedings. IEEE Computer Society Bioinformatics Conference.

[21]  Satoru Miyano,et al.  Inferring Gene Regulatory Networks from Time-Ordered Gene Expression Data of Bacillus Subtilis Using Differential Equations , 2002, Pacific Symposium on Biocomputing.

[22]  Chih-Hung Hsieh,et al.  Interpretable gene expression classifier with an accurate and compact fuzzy rule base for microarray data analysis. , 2006, Bio Systems.

[23]  Qinghua Hu,et al.  Rule learning for classification based on neighborhood covering reduction , 2011, Inf. Sci..

[24]  Julie M. David,et al.  Prediction of Key Symptoms of Learning Disabilities in School-Age Children Using Rough Sets , 2011 .

[25]  Fuzzy Models-What Are They , and Why ? , 2004 .