Loss optimal monotone relabeling of noisy multi-criteria data sets

A method to relabel noisy multi-criteria data sets is presented, taking advantage of the transitivity of the non-monotonicity relation to formulate the problem as an efficiently solvable maximum independent set problem. A framework and an algorithm for general loss functions are presented, and the flexibility of the approach is indicated by some examples, showcasing the ease with which the method can handle application-specific loss functions. Both didactical examples and real-life applications are provided, using the zero-one, the L1 and the squared loss functions, as well as combinations thereof.

[1]  Bernard De Baets,et al.  On the Role of Maximal Independent Sets in Cleaning Data Sets for Supervised Ranking , 2006, 2006 IEEE International Conference on Fuzzy Systems.

[2]  H.C.M. de Swart,et al.  Relational methods in computer science : 6th International Conference, RelMiCS 2001 and 1st Workshop of COST Action 274 TARSKI, Oisterwijk, The Netherlands, October 16-21, 2001 : revised papers , 2002 .

[3]  Denis Bouyssou,et al.  Building Criteria: A Prerequisite for MCDA , 1990 .

[4]  H. Daniels,et al.  Derivation of monotone decision models from noisy data , 2006, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[5]  Bernard De Baets,et al.  On the random generation of monotone data sets , 2008, Inf. Process. Lett..

[6]  William L. Maxwell,et al.  Establishing Consistent and Realistic Reorder Intervals in Production-Distribution Systems , 1985, Oper. Res..

[7]  R. Möhring Algorithmic Aspects of Comparability Graphs and Interval Graphs , 1985 .

[8]  A. J. Feelders,et al.  Nearest Neighbour Classification with Monotonicity Constraints , 2008, ECML/PKDD.

[9]  Bernard De Baets,et al.  Supervised ranking in the weka environment , 2010, Inf. Sci..

[10]  Carlos A. Bana e Costa,et al.  Readings in Multiple Criteria Decision Aid , 2011 .

[11]  Jan C. Bioch,et al.  Decision trees for ordinal classification , 2000, Intell. Data Anal..

[12]  Bernard De Baets,et al.  On the Definition and Representation of a Ranking , 2001, RelMiCS.

[13]  Carla E. Brodley,et al.  Identifying and Eliminating Mislabeled Training Instances , 1996, AAAI/IAAI, Vol. 1.

[14]  Arie Ben-David,et al.  Automatic Generation of Symbolic Multiattribute Ordinal Knowledge‐Based DSSs: Methodology and Applications* , 1992 .

[15]  Patrick Meyer,et al.  Sorting multi-attribute alternatives: The TOMASO method , 2005, Comput. Oper. Res..

[16]  Bernard De Baets,et al.  A probabilistic framework for the design of instance-based supervised ranking algorithms in an ordinal setting , 2008, Ann. Oper. Res..

[17]  Boros Edre,et al.  On the number of vertices belonging to all maximum stable sets of a graph , 1999 .

[18]  A. J. Feelders,et al.  Nonparametric Monotone Classification with MOCA , 2008, 2008 Eighth IEEE International Conference on Data Mining.