Towards Mining Frequent Patterns in Genome Wide Association

Frequent pattern mining (FPM) is a popular method in discovering knowledge through generating associations between the attributes in database. Frequent patterns (FPs) represent interesting relationship that could improve understanding the data. High throughput genomic datasets normally consists of a lot of hidden relationships that can be discovered. It is challenging to explore the relationship between genomic factors and the complex disease. To face the challenges of high dimensional dataset, we perform preliminary study in genome wide association data using frequent pattern mining.

[1]  Ed Keedwell,et al.  Ant colony optimisation to identify genetic variant association with type 2 diabetes , 2011, Inf. Sci..

[2]  Manish Gupta,et al.  Mining Low-Support Discriminative Patterns from Dense and High-Dimensional Data , 2012, IEEE Transactions on Knowledge and Data Engineering.

[3]  M. Steinbach,et al.  High-Order SNP Combinations Associated with Complex Diseases: Efficient Discovery, Statistical Power and Functional Interactions , 2012, PloS one.

[4]  Shuzlina Abdul Rahman,et al.  Towards Applying Associative Classifier for Genetic Variants , 2012, ICONIP.

[5]  V. McKusick Mendelian Inheritance in Man and Its Online Version, OMIM , 2007, The American Journal of Human Genetics.

[6]  M. Anandhavalli,et al.  Association Rule Mining in Genomics , 2010 .

[7]  Lei Zhu,et al.  The Research on Association Rules Mining with Co-evolution Algorithm in High Dimensional Data , 2012, AsiaSim.

[8]  Jason H. Moore,et al.  BIOINFORMATICS REVIEW , 2005 .

[9]  Mingzhu Zhang,et al.  Survey on Association Rules Mining Algorithms , 2010 .

[10]  Das Amrita,et al.  Mining Association Rules between Sets of Items in Large Databases , 2013 .

[11]  Ling Guo,et al.  GA-Based Data Mining Applied to Genetic Data for the Diagnosis of Complex Diseases , 2010 .

[12]  Park,et al.  Open Access Research Article Identification of Type 2 Diabetes-associated Combination of Snps Using Support Vector Machine , 2022 .

[13]  Claus Weihs,et al.  Local Analysis of SNP Data , 2010 .

[14]  Yan V. Sun,et al.  Machine learning in genome‐wide association studies , 2009, Genetic epidemiology.

[15]  Simon C. Potter,et al.  Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls , 2007, Nature.

[16]  H. Cordell Detecting gene–gene interactions that underlie human diseases , 2009, Nature Reviews Genetics.

[17]  P. Krishna Reddy,et al.  Novel techniques to reduce search space in multiple minimum supports-based frequent pattern mining algorithms , 2011, EDBT/ICDT '11.

[18]  M. McCarthy,et al.  Replication of Genome-Wide Association Signals in UK Samples Reveals Risk Loci for Type 2 Diabetes , 2007, Science.

[19]  Jesús S. Aguilar-Ruiz,et al.  Gene association analysis: a survey of frequent pattern mining from gene expression data , 2010, Briefings Bioinform..

[20]  T. Mcintosh,et al.  High Confidence Rule Mining for Microarray Analysis , 2007, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[21]  Jiawei Han,et al.  Frequent pattern mining: current status and future directions , 2007, Data Mining and Knowledge Discovery.

[22]  G. Abecasis,et al.  A Genome-Wide Association Study of Type 2 Diabetes in Finns Detects Multiple Susceptibility Variants , 2007, Science.

[23]  Anthony K. H. Tung,et al.  Carpenter: finding closed patterns in long biological datasets , 2003, KDD '03.

[24]  J. Stengård,et al.  Genes, Environment, and Cardiovascular Disease , 2003, Arteriosclerosis, thrombosis, and vascular biology.

[25]  Christian Borgelt,et al.  New algorithms for finding approximate frequent item sets , 2012, Soft Comput..

[26]  Ben Goertzel,et al.  Classifier Ensemble Based Analysis of a Genome-Wide SNP Dataset Concerning Late-Onset Alzheimer Disease , 2010, Int. J. Softw. Sci. Comput. Intell..

[27]  Bruce R. Korf,et al.  Human Genetics and Genomics , 2006 .

[28]  Athanasios V. Vasilakos,et al.  Computational Intelligence in Bioinformatics: SNP/Haplotype Data in Genetic Association Study for Common Diseases , 2009, IEEE Transactions on Information Technology in Biomedicine.