Cooperation between expert knowledge and data mining discovered knowledge: Lessons learned

Expert systems are built from knowledge traditionally elicited from the human expert. It is precisely knowledge elicitation from the expert that is the bottleneck in expert system construction. On the other hand, a data mining system, which automatically extracts knowledge, needs expert guidance on the successive decisions to be made in each of the system phases. In this context, expert knowledge and data mining discovered knowledge can cooperate, maximizing their individual capabilities: data mining discovered knowledge can be used as a complementary source of knowledge for the expert system, whereas expert knowledge can be used to guide the data mining process. This article summarizes different examples of systems where there is cooperation between expert knowledge and data mining discovered knowledge and reports our experience of such cooperation gathered from a medical diagnosis project called Intelligent Interpretation of Isokinetics Data, which we developed. From that experience, a series of lessons were learned throughout project development. Some of these lessons are generally applicable and others pertain exclusively to certain project types.

[1]  Donald K. Wedding,et al.  Discovering Knowledge in Data, an Introduction to Data Mining , 2005, Inf. Process. Manag..

[2]  Juan Pedro Caraça-Valente,et al.  Functions, rules and models: three complementary techniques for analyzing strength data , 2000, SAC '00.

[3]  Jianchao Han,et al.  A Knowledge-Based System Implementation of Intrusion Detection Rules , 2010, 2010 Seventh International Conference on Information Technology: New Generations.

[4]  Lourdes Mattos Brasil,et al.  Integration of Data Mining and Hybrid Expert System , 2002, FLAIRS Conference.

[5]  Loïc Martínez,et al.  An incremental solution for developing knowledge-based software: its application to an expert system for isokinetics interpretation , 2000 .

[6]  Ingoo Han,et al.  Knowledge-based data mining of news information on the Internet using cognitive maps and neural networks , 2002, Expert Syst. Appl..

[7]  S. B. Needleman,et al.  A general method applicable to the search for similarities in the amino acid sequence of two proteins. , 1970, Journal of molecular biology.

[8]  Juan Pedro Caraça-Valente,et al.  Discovering Similar Patterns for Characterizing Time Series in a Medical Domain , 2003, Knowledge and Information Systems.

[9]  Fernando Alonso,et al.  Symbol Extraction Method and Symbolic Distance for Analysing Medical Time Series , 2006, ISBMDA.

[10]  Nguyen Hoang Phuong,et al.  Approach to generating rules for expert systems using rough set theory , 2001, Proceedings Joint 9th IFSA World Congress and 20th NAFIPS International Conference (Cat. No. 01TH8569).

[11]  Sholom M. Weiss,et al.  Knowledge-based data mining , 2003, KDD '03.

[12]  Yong Shi,et al.  Data Mining Integrated with Domain Knowledge , 2009 .

[13]  Andrew Kusiak,et al.  Data-mining-based system for prediction of water chemistry faults , 2006, IEEE Transactions on Industrial Electronics.

[14]  Nikolaos M. Avouris,et al.  The Role of Domain Knowledge in a Large Scale Data Mining Project , 2002, SETN.

[15]  Yi Peng,et al.  A Domain Knowledge-Driven Framework for Multi-Criteria Optimization-Based Data Mining Methods , 2008, 2008 Fourth International Conference on Networked Computing and Advanced Information Management.

[16]  Gediminas Adomavicius,et al.  Expert-Driven Validation of Rule-Based User Models in Personalization Applications , 2004, Data Mining and Knowledge Discovery.

[17]  Hakikur Rahman,et al.  Data Mining Applications for Empowering Knowledge Societies , 2008 .

[18]  Han van Dissel,et al.  Risk management based on expert rules and data-mining : A case study in insurance , 2002, ECIS.

[19]  Wu Jing,et al.  Using expert system and KDD in optimization of mobile network , 2001, 2001 International Conferences on Info-Tech and Info-Net. Proceedings (Cat. No.01EX479).

[20]  Juan Pedro Caraça-Valente,et al.  Discovering Similar Patterns for Characterizing Time Series in a Medical Domain , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[21]  Lawrence O. Hall,et al.  Mining for Implications in Medical Data , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[22]  Andreas Al-Kinani,et al.  Integrating Data Mining and Expert Knowledge for an Artificial Lift Advisory System , 2010 .

[23]  Michael K. Ng,et al.  Medical Document Clustering Using Ontology-Based Term Similarity Measures , 2008, Int. J. Data Warehous. Min..

[24]  N P Gleeson,et al.  The Utility of Isokinetic Dynamometry in the Assessment of Human Muscle Function , 1996, Sports medicine.

[25]  Evelina Lamma,et al.  Artificial intelligence techniques for monitoring dangerous infections , 2006, IEEE Transactions on Information Technology in Biomedicine.

[26]  Abraham Bernstein,et al.  Towards Intelligent Assistance for a Data Mining Process , 2005 .

[27]  Guillermo Rodriguez-Ortiz,et al.  Obtaining expert system rules using data mining tools from a power generation database , 1998 .

[28]  Jozef Zurada,et al.  Discovering Patterns and Reference Models in the Medical Domain of Isokinetics , 2005 .

[29]  Frans Coenen,et al.  Mining Allocating Patterns in Investment Portfolios , 2009, Database Technologies: Concepts, Methodologies, Tools, and Applications.

[30]  Yongmei Liu,et al.  Designing and Realization of Intelligent Data Mining System Based on Expert Knowledge , 2006, 2006 IEEE International Conference on Management of Innovation and Technology.

[31]  David Taniar,et al.  Strategic Advancements in Utilizing Data Mining and Warehousing Technologies: New Concepts and Developments , 2009, Strategic Advancements in Utilizing Data Mining and Warehousing Technologies.

[32]  Renato J. O. Figueiredo,et al.  Application classification through monitoring and learning of resource consumption patterns , 2006, Proceedings 20th IEEE International Parallel & Distributed Processing Symposium.

[33]  Sally Jo Cunningham,et al.  Using data mining to support the construction and maintenance of expert systems , 1993, Proceedings 1993 The First New Zealand International Two-Stream Conference on Artificial Neural Networks and Expert Systems.

[34]  Stephan Kudyba,et al.  Managing Data Mining: Advice from Experts , 2004 .

[35]  Norberto F. Ezquerra,et al.  Validating expert system rule confidences using data mining of myocardial perfusion SPECT databases , 2000, Computers in Cardiology 2000. Vol.27 (Cat. 00CH37163).

[36]  Bart Baesens,et al.  Domain knowledge integration in data mining using decision tables: case studies in churn prediction , 2009, J. Oper. Res. Soc..

[37]  Russ Danstrom,et al.  The Healthcare Cost Dilemma: What Health Insurance Companies Can Do to Mitigate Unsustainable Premium Increases , 2004 .

[38]  Ping Liu,et al.  A self-learning expert system for diagnosis in traditional Chinese medicine , 2004, Expert Syst. Appl..