Mining Discriminant Sequential Patterns for Aging Brain

Discovering new information about groups of genes implied in a disease is still challenging. Microarrays are a powerful tool to analyse gene expression. In this paper, we propose a new approach outlining relationships between genes based on their ordered expressions. Our contribution is twofold. First, we propose to use a new material, called sequential patterns, to be investigated by biologists. Secondly, due to the expression matrice density, extracting sequential patterns from microarray datasets is far away from being easy. The aim of our proposal is to provide the biological experts with an efficient approach based on discriminant sequential patterns. Results of various experiments on real biological data highlight the relevance of our proposal.

[1]  Jinyan Li,et al.  Efficient mining of emerging patterns: discovering trends and differences , 1999, KDD '99.

[2]  Werner Dubitzky,et al.  Knowledge Exploration in Life Science Informatics , 2004, Lecture Notes in Computer Science.

[3]  Anthony K. H. Tung,et al.  Carpenter: finding closed patterns in long biological datasets , 2003, KDD '03.

[4]  Anthony K. H. Tung,et al.  FARMER: finding interesting rule groups in microarray datasets , 2004, SIGMOD '04.

[5]  M. Ringnér,et al.  Classification and diagnostic prediction of cancers using gene expression profiling and artificial neural networks , 2001, Nature Medicine.

[6]  Maguelonne Teisseire,et al.  GeneMining: Identification, Visualization, and Interpretation of Brain Ageing Signatures , 2009, MIE.

[7]  Blaz Zupan,et al.  Towards knowledge-based gene expression data mining , 2007, J. Biomed. Informatics.

[8]  Riccardo Bellazzi,et al.  Precedence Temporal Networks to represent temporal relationships in gene expression data , 2007, J. Biomed. Informatics.

[9]  Brigitte Trousse,et al.  Extracting Sequential Patterns for Gene Regulatory Expressions Profiles , 2004, KELSI.

[10]  R. Tibshirani,et al.  Significance analysis of microarrays applied to the ionizing radiation response , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[11]  Thomas A. Darden,et al.  Gene selection for sample classification based on gene expression data: study of sensitivity to choice of parameters of the GA/KNN method , 2001, Bioinform..

[12]  Gregory Piatetsky-Shapiro,et al.  Microarray data mining: facing the challenges , 2003, SKDD.

[13]  Ramakrishnan Srikant,et al.  Mining sequential patterns , 1995, Proceedings of the Eleventh International Conference on Data Engineering.

[14]  Carlo Combi,et al.  Data mining with Temporal Abstractions: learning rules from time series , 2007, Data Mining and Knowledge Discovery.

[15]  D. Botstein,et al.  Cluster analysis and display of genome-wide expression patterns. , 1998, Proceedings of the National Academy of Sciences of the United States of America.