Frequent Itemsets for Genomic Profiling

Frequent itemset mining is a promising approach to the study of genomic profiling data. Here a dataset consists of real numbers describing the relative level at which each clone occurs in human DNA for given patient samples. One can then mine, for example, for sets of samples that share some common behavior on the clones, i.e., gains or losses. Frequent itemsets show promising biological expressiveness, can be computed efficiently, and are very flexible. Their visualization provides the biologist with useful information for the discovery of patterns. It also turns out that the use of frequent itemsets, particularly larger ones, tends to filter out noise.
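
As an illustration of the kind of mining described above, the following is a minimal sketch, not the authors' implementation. It assumes a matrix of relative levels with one row per clone and one column per sample, treats each clone as a transaction containing the samples that show a gain (or loss) at that clone, and runs a simple level-wise, Apriori-style search. The threshold of 0.2, the function names, and the toy data are all hypothetical choices made for this example.

```python
from itertools import combinations


def clone_transactions(ratios, samples, threshold=0.2, kind="gain"):
    """Build one transaction per clone: the samples showing `kind` there.

    `threshold` is an assumed cut-off on the relative level, not a value
    taken from the source.
    """
    transactions = []
    for row in ratios:
        if kind == "gain":
            items = {s for s, v in zip(samples, row) if v > threshold}
        else:
            items = {s for s, v in zip(samples, row) if v < -threshold}
        transactions.append(frozenset(items))
    return transactions


def frequent_itemsets(transactions, min_support):
    """Level-wise (Apriori-style) search; returns {itemset: support count}."""
    items = sorted({i for t in transactions for i in t})
    current = [frozenset([i]) for i in items]
    result = {}
    while current:
        # Count in how many transactions (clones) each candidate occurs.
        counts = {c: sum(1 for t in transactions if c <= t) for c in current}
        frequent = {c: n for c, n in counts.items() if n >= min_support}
        result.update(frequent)
        # Join frequent k-sets into (k+1)-candidates and keep only those
        # whose k-subsets are all frequent (the Apriori pruning step).
        k = len(next(iter(frequent))) + 1 if frequent else 0
        candidates = {a | b for a in frequent for b in frequent if len(a | b) == k}
        current = [
            c for c in candidates
            if all(frozenset(s) in frequent for s in combinations(c, k - 1))
        ]
    return result


# Toy data (hypothetical): 4 clones (rows) measured on 3 samples (columns).
samples = ["S1", "S2", "S3"]
ratios = [
    [0.5, 0.4, 0.0],
    [0.3, 0.6, -0.4],
    [0.1, 0.5, 0.3],
    [0.4, 0.3, 0.35],
]

gains = clone_transactions(ratios, samples, threshold=0.2, kind="gain")
itemsets = frequent_itemsets(gains, min_support=2)
for itemset, support in sorted(itemsets.items(), key=lambda kv: (len(kv[0]), sorted(kv[0]))):
    print(sorted(itemset), support)
```

On this toy input, samples S1 and S2 show a gain together on three of the four clones, so the pair is reported as frequent, while S1 and S3 coincide on only one clone and are discarded. Requiring larger itemsets demands agreement across more samples at once, which is one way to read the remark that larger itemsets tend to filter out noise.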
