A Cooperative Multi-agent Data Mining Model and Its Application to Medical Data on Diabetes

We present CoLe, a model for cooperative agents for mining knowledge from heterogeneous data. CoLe allows for the cooperation of different mining agents and the combination of the mined knowledge into knowledge structures that no individual mining agent can produce alone. CoLe organizes the work in rounds so that knowledge discovered by one mining agent can help others in the next round. We implemented a multi-agent system based on CoLe for mining diabetes data, including an agent using a genetic algorithm for mining event sequences, an agent with improvements to the PART algorithm for our problem and a combination agent with methods to produce hybrid rules containing conjunctive and sequence conditions. In our experiments, the CoLe-based system outperformed the individual mining algorithms, with better rules and more rules of a certain quality. From the medical perspective, our system confirmed hypertension has a tight relation to diabetes, and it also suggested connections new to medical doctors.

[1]  Herna L. Viktor,et al.  Data mining in practice: from data to knowledge using a hybrid mining approach , 2000, Int. J. Comput. Syst. Signals.

[2]  金田 重郎,et al.  C4.5: Programs for Machine Learning (書評) , 1995 .

[3]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[4]  Ramasamy Uthurusamy,et al.  Evolving data into mining solutions for insights , 2002, CACM.

[5]  Jörg Denzinger,et al.  Cooperation of Heterogeneous Provers , 1999, IJCAI.

[6]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques with Java implementations , 2002, SGMD.

[7]  Ian H. Witten,et al.  Generating Accurate Rule Sets Without Global Optimization , 1998, ICML.

[8]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[9]  P. Loy International Classification of Diseases--9th revision. , 1978, Medical record and health care information journal.

[10]  Jörg Denzinger Conflict handling in collaborative search , 2001 .

[11]  Laurent Chaudron,et al.  Conflicting agents: conflict management in multi-agent systems , 2001 .

[12]  Hongjun Lu,et al.  Toward Multidatabase Mining: Identifying Relevant Databases , 2001, IEEE Trans. Knowl. Data Eng..

[13]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[14]  Ramasamy Uthurusamy,et al.  EVOLVING DATA MINING INTO SOLUTIONS FOR INSIGHTS , 2002 .

[15]  Alberto Maria Segre,et al.  Programs for Machine Learning , 1994 .