An Optimization-Based Classification Approach with the Non-additive Measure

Optimization-based classification approaches have well been used for decision making problems, such as classification in data mining. It considers that the contributions from all the attributes for the classification model equals to the joint individual contribution from each attribute. However, the impact from the interactions among attributes is ignored because of linearly or equally aggregation of attributes. Thus, we introduce the generalized Choquet integral with respect to the non-additive measure as the attributes aggregation tool to the optimization-based approaches in classification problem. Also, the boundary for classification is optimized in our proposed model compared with previous optimization-based models. The experimental result of two real life data sets shows the significant improvement of using the non-additive measure in data mining.

[1]  Zhengxin Chen,et al.  Classification with Choquet Integral with Respect to Signed Non-additive Measure , 2007 .

[2]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[3]  R. Fisher THE USE OF MULTIPLE MEASUREMENTS IN TAXONOMIC PROBLEMS , 1936 .

[4]  Bernhard Schölkopf,et al.  A tutorial on support vector regression , 2004, Stat. Comput..

[5]  Kwong-Sak Leung,et al.  Classification by nonlinear integral projections , 2003, IEEE Trans. Fuzzy Syst..

[6]  Ned Freed,et al.  EVALUATING ALTERNATIVE LINEAR PROGRAMMING MODELS TO SOLVE THE TWO-GROUP DISCRIMINANT PROBLEM , 1986 .

[7]  Jian Pei,et al.  Data Mining: Concepts and Techniques, 3rd edition , 2006 .

[8]  Robert Bosch,et al.  Separating Hyperplanes and the Authorship of the Disputed Federalist Papers , 1998 .

[9]  Zhenyuan Wang,et al.  A new genetic algorithm for nonlinear multiregressions based on generalized Choquet integrals , 2003, The 12th IEEE International Conference on Fuzzy Systems, 2003. FUZZ '03..

[10]  Michel Grabisch,et al.  A new algorithm for identifying fuzzy measures and its application to pattern recognition , 1995, Proceedings of 1995 IEEE International Conference on Fuzzy Systems..

[11]  G. Klir,et al.  Fuzzy Measure Theory , 1993 .

[12]  Zhengxin Chen,et al.  Nonlinear Classification by Linear Programming with Signed Fuzzy Measures , 2006, 2006 IEEE International Conference on Fuzzy Systems.

[13]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[14]  John C. Platt,et al.  Fast training of support vector machines using sequential minimal optimization, advances in kernel methods , 1999 .

[15]  F. Glover,et al.  Simple but powerful goal programming models for discriminant problems , 1981 .

[16]  S. Sathiya Keerthi,et al.  Improvements to Platt's SMO Algorithm for SVM Classifier Design , 2001, Neural Computation.

[17]  Michio Sugeno,et al.  Separated Hierarchical Decomposition of the Choquet Integral , 1997, Int. J. Uncertain. Fuzziness Knowl. Based Syst..

[18]  G. Choquet Theory of capacities , 1954 .

[19]  Kwong-Sak Leung,et al.  Applying fuzzy measures and nonlinear integrals in data mining , 2005, Fuzzy Sets Syst..