Gene expression array exploration using K-formal concept analysis

DNA micro-arrays are a mechanism for eliciting gene expression values, the concentration of the transcription products of a set of genes, under different chemical conditions. The phenomena of interest-- up-regulation, down-regulation and co-regulation--are hypothesized to stem from the functional relationships among transcription products. In [1,2,3] a generalisation of Formal Concept Analysis was developed with data mining applications in mind, κ-Formal Concept Analysis, where incidences take values in certain kinds of semirings, instead of the usual Boolean carrier set. In this paper, we use (Rmin, +)- and (Rmax,+)- Formal Concept Analysis to analyse gene expression data for Arabidopsis thaliana. We introduce the mechanism to render the data in the appropriate algebra and profit by the wealth of different Galois Connections available in Generalized Formal Concept Analysis to carry different analysis for up- and down-regulated genes.

[1]  Amedeo Napoli,et al.  Two FCA-Based Methods for Mining Gene Expression Data , 2009, ICFCA.

[2]  Christian V. Forst,et al.  Identifying Genes of Gene Regulatory Networks Using Formal Concept Analysis , 2008, J. Comput. Biol..

[3]  Francisco J. Valverde-Albacete,et al.  Further Galois Connections between Semimodules over Idempotent Semirings , 2007, CLA.

[4]  Bernhard Ganter,et al.  Pattern Structures and Their Projections , 2001, ICCS.

[5]  Amedeo Napoli,et al.  Mining gene expression data with pattern structures in formal concept analysis , 2011, Inf. Sci..

[6]  Francisco J. Valverde-Albacete,et al.  Towards a Generalisation of Formal Concept Analysis for Data Mining Purposes , 2006, ICFCA.

[7]  T. Speed,et al.  Summaries of Affymetrix GeneChip probe level data. , 2003, Nucleic acids research.

[8]  Bernhard Ganter,et al.  Formal Concept Analysis , 2013 .

[9]  Ruggero G. Pensa,et al.  Towards Fault-Tolerant Formal Concept Analysis , 2005, AI*IA.

[10]  Ruggero G. Pensa,et al.  A Methodology for Biologically Relevant Pattern Discovery from Gene Expression Data , 2004, Discovery Science.

[11]  R. Stoughton Applications of DNA microarrays in biology. , 2005, Annual review of biochemistry.

[12]  Francisco J. Valverde-Albacete,et al.  Extending conceptualisation modes for generalised Formal Concept Analysis , 2011, Inf. Sci..

[13]  Hideki Takahashi,et al.  Transcriptome analyses give insights into selenium-stress responses and selenium tolerance mechanisms in Arabidopsis. , 2007, Physiologia plantarum.

[14]  Susanne Motameny,et al.  Formal Concept Analysis for the Identification of Combinatorial Biomarkers in Breast Cancer , 2008, ICFCA.

[15]  Rafael A. Irizarry,et al.  Bioinformatics and Computational Biology Solutions using R and Bioconductor , 2005 .