Mining frequent itemsets for protein kinase regulation

Protein kinases, a family of enzymes, have been viewed as an important signaling intermediary by living organisms for regulating critical biological processes such as memory, hormone response and cell growth. The unbalanced kinases are known to cause cancer and other diseases. With the increasing efforts to collect, store and disseminate information about the entire kinase family, it not only leads to valuable data set to understand cell regulation but also poses a big challenge to extract valuable knowledge about metabolic pathway from the data. Data mining techniques that have been widely used to find frequent patterns in large datasets can be extended and adapted to kinase data as well. This paper proposes a framework for mining frequent itemsets from the collected kinase dataset. An experiment using AMPK regulation data demonstrates that our approaches are useful and efficient in analyzing kinase regulation data.

[1]  Chengqi Zhang,et al.  Association Rule Mining , 2002, Lecture Notes in Computer Science.

[2]  Jian Pei,et al.  Mining frequent patterns without candidate generation , 2000, SIGMOD 2000.

[3]  Lawrence Hunter,et al.  Artificial Intelligence and Molecular Biology , 1992, AI Mag..

[4]  Shichao Zhang,et al.  Association Rule Mining: Models and Algorithms , 2002 .

[5]  D. Hardie,et al.  Effects of endurance training on activity and expression of AMP-activated protein kinase isoforms in rat muscles. , 2002, American journal of physiology. Endocrinology and metabolism.

[6]  Z. Beg,et al.  Modulation of 3-hydroxy-3-methylglutaryl coenzyme A reductase activity with cAMP and wth protein fractions of rat liver cytosol. , 1973, Biochemical and biophysical research communications.

[7]  O. Ljungqvist,et al.  AMP-activated protein kinase (AMPK) is activated in muscle of subjects with type 2 diabetes during exercise. , 2001, Diabetes.

[8]  Nicos Maglaveras,et al.  Mining Association Rules from Clinical Databases: An Intelligent Diagnostic Process in Healthcare , 2001, MedInfo.

[9]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[10]  Kei Sakamoto,et al.  Deficiency of LKB1 in skeletal muscle prevents AMPK activation and glucose uptake during contraction , 2005, The EMBO journal.

[11]  D C Torney,et al.  Discovery of association rules in medical data , 2001, Medical informatics and the Internet in medicine.

[12]  C. Carlson,et al.  Regulation of hepatic acetyl coenzyme A carboxylase by phosphorylation and dephosphorylation. , 1973, The Journal of biological chemistry.

[13]  M. Boguski,et al.  Functional genomics: it's all how you read it. , 1997, Science.

[14]  M. P. Quinlan,et al.  While E1A can facilitate epithelial cell transformation by several dominant oncogenes, the C-terminus seems only to regulate rac and cdc42 function, but in both epithelial and fibroblastic cells. , 2000, Virology.