ADI-Minebio: A Graph Mining Algorithm for Biomedical Data

Pablo 0 0 2010-02-26T16:45:00Z 2010-02-26T16:45:00Z 1 128 734 USP 6 1 861 14.0 Normal 0 false false false EN-US JA X-NONE /* Style Definitions */ table.MsoNormalTable {mso-style-name:"Table Normal"; mso-tstyle-rowband-size:0; mso-tstyle-colband-size:0; mso-style-noshow:yes; mso-style-priority:99; mso-style-parent:""; mso-padding-alt:0cm 5.4pt 0cm 5.4pt; mso-para-margin:0cm; mso-para-margin-bottom:.0001pt; mso-pagination:widow-orphan; font-size:10.0pt; font-family:Calibri; mso-ansi-language:EN-US;} Graph mining is concerned with mining frequent subgraph patterns over a collection of graphs, aiming to find novel and useful knowledge. It has being used to analyze data from different domains, sometimes using algorithms tailored for a specific area of knowledge. In this paper, we propose a graph-mining algorithm and its application in the biomedical domain. We introduce the ADI-bio structure, which organizes data from a database with information of a disease’s patient, and also the ADI-Minebio algorithm, which performs a search on the proposed ADI-bio structure to find frequent subgraphs. Our approach is based on the ADI (adjacency index) structure and the ADI-Mine algorithm, but specifies a different structure and hence a new way of analyzing data through this structure. We also present a performance study to show the feasibility of our approach.

[1]  Keith C. C. Chan,et al.  A Graph Mining Algorithm for Classifying Chemical Compounds , 2008, 2008 IEEE International Conference on Bioinformatics and Biomedicine.

[2]  Cristina Dutra de Aguiar Ciferri,et al.  An Environment for Data Analysis in Biomedical Domain: Information Extraction for Decision Support Systems , 2010, IEA/AIE.

[3]  Su-Shing Chen,et al.  Item sets based graph mining algorithm and application in genetic regulatory networks , 2006, 2006 IEEE International Conference on Granular Computing.

[4]  Jiong Yang,et al.  SPIN: mining maximal frequent subgraphs from graph databases , 2004, KDD.

[5]  Huajun Chen,et al.  Semantic web for integrated network analysis in biomedicine , 2009, Briefings Bioinform..

[6]  R. Suganya,et al.  Data Mining Concepts and Techniques , 2010 .

[7]  Lawrence B. Holder,et al.  Frequent subgraph mining on a single large graph using sampling techniques , 2010, MLG '10.

[8]  Ramakrishnan Srikant,et al.  Fast Algorithms for Mining Association Rules in Large Databases , 1994, VLDB.

[9]  Mehdi Heydari,et al.  A graph-based web usage mining method considering client side data , 2009, 2009 International Conference on Electrical Engineering and Informatics.

[10]  Christian Borgelt,et al.  Mining molecular fragments: finding relevant substructures of molecules , 2002, 2002 IEEE International Conference on Data Mining, 2002. Proceedings..

[11]  R. Biswas,et al.  Metagraph-Based Substructure Pattern Mining , 2008, 2008 International Conference on Advanced Computer Theory and Engineering.

[12]  Takashi Washio,et al.  An Apriori-Based Algorithm for Mining Frequent Substructures from Graph Data , 2000, PKDD.

[13]  Jiawei Han,et al.  gSpan: graph-based substructure pattern mining , 2002, 2002 IEEE International Conference on Data Mining, 2002. Proceedings..

[14]  Jun Zhang,et al.  FOGGER: an algorithm for graph generator discovery , 2009, EDBT '09.

[15]  George Karypis,et al.  Frequent subgraph discovery , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[16]  Jian Pei,et al.  Pattern-growth methods for frequent pattern mining , 2002 .

[17]  Lawrence B. Holder,et al.  Mining Graph Data , 2006 .

[18]  Chen Wang,et al.  Scalable mining of large disk-based graph databases , 2004, KDD.

[19]  Jiawei Han,et al.  CloseGraph: mining closed frequent graph patterns , 2003, KDD '03.

[20]  Keith C. C. Chan,et al.  Analyzing web layout structures using graph mining , 2008, 2008 IEEE International Conference on Granular Computing.