Towards cooperative planning of data mining workflows

A major challenge for third generation data mining and knowledge discovery systems is the integration of different data mining tools and services for data understanding, data integration, data preprocessing, data mining, evaluation and deployment, which are distributed across the network of computer systems. In this paper we outline how an intelligent assistant that is intended to support end-users in the difficult and time consuming task of designing KDD-Workflows out of these distributed services can be built. The assistant should support the user in checking the correctness of workflows, understanding the goals behind given workflows, enumeration of AI planner generated workflow completions, storage, retrieval, adaptation and repair of previous workflows. It should also be an open easy extendable system. This is reached by basing the system on a data mining ontology (DMO) in which all the services (operators) together with their in-/output, pre-/postconditions are described. This description is compatible with OWL-S and new operators can be added importing their OWL-S specification and classifying it into the operator ontology.

[1]  T. Euler,et al.  Using Ontologies in a KDD Workbench , 2004 .

[2]  Matthias Klusch,et al.  Semantic Web Service Composition Planning with OWLS-Xplan , 2005, AAAI Fall Symposium: Agents and the Semantic Web.

[3]  Paolo Traverso,et al.  Automated Planning: Theory & Practice , 2004 .

[4]  Gregory Piatetsky-Shapiro,et al.  The KDD process for extracting useful knowledge from volumes of data , 1996, CACM.

[5]  James F. Allen,et al.  TRAINS-95: Towards a Mixed-Initiative Planning Assistant , 1996, AIPS.

[6]  James A. Hendler,et al.  HTN planning for Web Service composition using SHOP2 , 2004, J. Web Semant..

[7]  Rüdiger Wirth,et al.  Towards Process-Oriented Tool Support for Knowledge Discovery in Databases , 1997, PKDD.

[8]  Katharina Morik,et al.  The MiningMart Approach to Knowledge Discovery in Databases , 2004 .

[9]  H. Lan,et al.  SWRL : A semantic Web rule language combining OWL and ruleML , 2004 .

[10]  Abraham Bernstein,et al.  Towards Intelligent Assistance for a Data Mining Process , 2005 .

[11]  Dana S. Nau,et al.  Control Strategies in HTN Planning: Theory Versus Practice , 1998, AAAI/IAAI.

[12]  Abraham Bernstein,et al.  The NExT System: Towards True Dynamic Adaptations of Semantic Web Service Compositions , 2007, ESWC.

[13]  Jerry R. Hobbs,et al.  DAML-S: Semantic Markup for Web Services , 2001, SWWS.

[14]  Karl T. Ulrich,et al.  Product Design and Development , 1995 .

[15]  James A. Hendler,et al.  HTN Planning: Complexity and Expressivity , 1994, AAAI.