Reclassification of Linearly Classified Data Using Constraint Databases

In many problems the raw data is already classified according to a variety of features using some linear classification algorithm but needs to be reclassified. We introduce a novel reclassification method that creates new classes by combining in a flexible way the existing classes without requiring access to the raw data. The flexibility is achieved by representing the results of the linear classifications in a linear constraint database and using the full query capabilities of a constraint database system. We implemented this method based on the MLPQ constraint database system. We also tested the method on a data that was already classified using a decision tree algorithm.

[1]  Yonghui Wang,et al.  The MLPQ/GIS constraint database system , 2000, SIGMOD '00.

[2]  Emden R. Gansner,et al.  Graphviz and Dynagraph – Static and Dynamic Graph Drawing Tools , 2003 .

[3]  Ingolf Geist,et al.  A framework for data mining and KDD , 2002, SAC '02.

[4]  Laks V. S. Lakshmanan,et al.  The 3W Model and Algebra for Unified Data Mining , 2000, VLDB.

[5]  Michael Jünger,et al.  Graph Drawing Software , 2003, Graph Drawing Software.

[6]  Bart Kuijpers,et al.  Introduction to constraint databases , 2002, SGMD.

[7]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[8]  Stéphane Grumbach,et al.  Constraint Databases , 1999, JFPLC.

[9]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.