DMPML: Data Mining Preparation Markup Language

In this paper we propose the language DMPML as an option for standardizing the data preparation phase of a KDD (knowledge discovery in databases) process. DMPML is based on XML and uses XSL transformations (XSLT) to map raw data into processed data. Its features, such as extensibility, robustness, and platform independence, support the efficient exchange of data preparation projects among DMPML producers. This promotes the reuse of work and the sharing of experience across similar projects.
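
To make the idea of mapping raw data into processed data with XSL transformations concrete, the following is a minimal sketch in plain XSLT 1.0. It is not DMPML syntax, which this abstract does not show. The input layout (`record` elements containing an `age` element) and the placeholder value `unknown` are assumptions made only for illustration.

```xml
<?xml version="1.0" encoding="UTF-8"?>
<!-- Illustrative only: plain XSLT 1.0, not DMPML.
     Assumes hypothetical input such as:
       <records>
         <record><age>34</age></record>
         <record><age/></record>
       </records> -->
<xsl:stylesheet version="1.0"
    xmlns:xsl="http://www.w3.org/1999/XSL/Transform">

  <xsl:output method="xml" indent="yes"/>

  <!-- Identity template: copy every node and attribute unchanged by default. -->
  <xsl:template match="@*|node()">
    <xsl:copy>
      <xsl:apply-templates select="@*|node()"/>
    </xsl:copy>
  </xsl:template>

  <!-- Preparation step: replace an empty <age> element with a placeholder.
       The predicate gives this template higher priority than the identity rule. -->
  <xsl:template match="age[normalize-space(.) = '']">
    <age>unknown</age>
  </xsl:template>

</xsl:stylesheet>
```

Because the stylesheet is itself an XML document, a preparation step written this way can be stored, exchanged, and re-applied by any conforming XSLT processor, which is the kind of platform independence the abstract refers to.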
