Identification of Co-regulated Signature Genes in Pancreas Cancer- A Data Mining Approach

Pancreas cancer is one of the most fatal among the cancers. The mortality rate is high due to the lack of tools for proper diagnosis and effective therapeutics. Identification of changes in gene expression in pancreas cancer may lead to the development of novel tools for diagnosis and effective treatment methodology. In this paper we present an association rule mining approach to identify the association between the genes that are either over expressed or under expressed in pancreas cancer compared to normal pancreas. We have used the SAGE data related to pancreas cancer. It is expected that the results will help in developing better treatment methodology for pancreas cancer and also for designing a low cost microarray chip for diagnosing pancreas cancer. The results have been validated in terms of Gene Ontology and the signature genes have been identified that match with published data.

[1]  C. Becquet,et al.  Strong-association-rule mining for large-scale gene-expression data analysis: a case study on human SAGE data , 2002, Genome Biology.

[2]  Arun N. Swami,et al.  Set-oriented mining for association rules in relational databases , 1995, Proceedings of the Eleventh International Conference on Data Engineering.

[3]  T. Mcintosh,et al.  High Confidence Rule Mining for Microarray Analysis , 2007, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[4]  K. R. Seeja,et al.  A Closed Frequent Itemset Mining Algorithm for Gene Expression Databases , 2008, BCBGC.

[5]  Ramakrishnan Srikant,et al.  Fast Algorithms for Mining Association Rules in Large Databases , 1994, VLDB.

[6]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[7]  Gholamhossein Dastghaibyfard,et al.  Parallel Mining of Association Rules from Gene Expression Databases , 2007, Fourth International Conference on Fuzzy Systems and Knowledge Discovery (FSKD 2007).

[8]  Liang Chen,et al.  A statistical method for identifying differential gene-gene co-expression patterns , 2004, Bioinform..

[9]  Michael Watson,et al.  CoXpress: differential co-expression in gene expression data , 2006, BMC Bioinformatics.

[10]  K. Kinzler,et al.  Serial Analysis of Gene Expression , 1995, Science.

[11]  Le Gruenwald,et al.  Microarray gene expression data association rules mining based on JG-Tree , 2003, 14th International Workshop on Database and Expert Systems Applications, 2003. Proceedings..

[12]  Jean-François Boulicaut,et al.  Mining Concepts from Large SAGE Gene Expression Matrices , 2003, KDID.

[13]  Jan Mollenhauer,et al.  Differentially expressed genes in pancreatic ductal adenocarcinomas identified through serial analysis of gene expression , 2004, Cancer biology & therapy.

[14]  Rainer Spang,et al.  Finding disease specific alterations in the co-expression of genes , 2004, ISMB/ECCB.

[15]  José María Carazo,et al.  Integrated analysis of gene expression by association rules discovery , 2006, BMC Bioinformatics.

[16]  R. Bals,et al.  Identification of disease genes by expression profiling. , 2001, The European respiratory journal.

[17]  Karuturi R. Krishna Murthy,et al.  Significance Analysis and Improved Discovery of Differentially Co-expressed Gene Sets in Microarray Data , 2006, Sixth IEEE International Conference on Data Mining - Workshops (ICDMW'06).

[18]  Narendra Tuteja,et al.  Serial Analysis of Gene Expression: Applications in Human Studies , 2004, Journal of biomedicine & biotechnology.

[19]  David R. Gilbert,et al.  Ismb/eccb 2004 , 2004, ISMB/ECCB.

[20]  Giovanni Parmigiani,et al.  Relationships and differentially expressed genes among pancreatic cancers examined by large-scale serial analysis of gene expression. , 2002, Cancer research.

[21]  Chad Creighton,et al.  Mining gene expression databases for association rules , 2003, Bioinform..

[22]  Carolina Ruiz,et al.  Hypothesis-Driven Specialization of Gene Expression Association Rules , 2007, 2007 IEEE International Conference on Bioinformatics and Biomedicine (BIBM 2007).

[23]  Kimberly Walter,et al.  Discovery of novel tumor markers of pancreatic cancer using global gene expression technology. , 2002, The American journal of pathology.