Nonlinear integrals with polynomial kernel and its applications

Nonlinear integrals (NIs) are useful integration tools. It can get a set of virtual values by projecting original data onto a virtual space for classification purpose using NIs. The classical NIs implement projection along a line with respect to the features. But, in many cases, the linear projection cannot achieve good performance for classification or regression due to the limitation of the integrand. The linear function used for the integrand is just a special type of function with respect to the features. In this paper, we propose a nonlinear integrals with polynomial kernel (NIPK). A polynomial function with respect to the features is used as the integrand of NIs. It enables the projection to be along different types of curves to the virtual space so that the virtual values gotten by NIs can be better regularized and have higher separation power for classification. We use genetic algorithm to learn the fuzzy measures so that a larger solution space can be searched. To test the capability of the NIPK, we apply it to classification on several benchmark datasets and a bioinformatics project. Experiments show that there is evident improvement on performance for the NIPK compared to classical NIs. © 2011 Wiley Periodicals, Inc.

[1]  Kwong-Sak Leung,et al.  Data Mining on DNA Sequences of Hepatitis B Virus , 2011, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[2]  G. Klir,et al.  Fuzzy Measure Theory , 1993 .

[3]  Geoffrey J. McLachlan,et al.  Discriminant Analysis and Statistical Pattern Recognition: McLachlan/Discriminant Analysis & Pattern Recog , 2005 .

[4]  M. Sugeno,et al.  Fuzzy Measures and Integrals: Theory and Applications , 2000 .

[5]  Hans-Jürgen Zimmermann,et al.  Improved feature selection and classification by the 2-additive fuzzy measure , 1999, Fuzzy Sets Syst..

[6]  Kwong-Sak Leung,et al.  Classification by nonlinear integral projections , 2003, IEEE Trans. Fuzzy Syst..

[7]  M. Sugeno,et al.  Non-monotonic fuzzy measures and the Choquet integral , 1994 .

[8]  Kwong-Sak Leung,et al.  A genetic algorithm for determining nonadditive set functions in information fusion , 1999, Fuzzy Sets Syst..

[9]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[10]  Catherine Blake,et al.  UCI Repository of machine learning databases , 1998 .

[11]  J. R. Quinlan,et al.  Data Mining Tools See5 and C5.0 , 2004 .

[12]  Harry Zhang,et al.  The Optimality of Naive Bayes , 2004, FLAIRS.

[13]  Vicenç Torra,et al.  Modeling decisions - information fusion and aggregation operators , 2007 .

[14]  Michel Grabisch,et al.  Classification by fuzzy integral: performance and tests , 1994, CVPR 1994.

[15]  Michel Grabisch,et al.  The representation of importance and interaction of features by fuzzy measures , 1996, Pattern Recognit. Lett..

[16]  Kwong-Sak Leung,et al.  Polynomial Nonlinear Integrals , 2008, ISNN.

[17]  菅野 道夫,et al.  Theory of fuzzy integrals and its applications , 1975 .

[18]  James M. Keller,et al.  Possibility expectation and its decision making algorithm , 1992, [1992 Proceedings] IEEE International Conference on Fuzzy Systems.

[19]  G. McLachlan Discriminant Analysis and Statistical Pattern Recognition , 1992 .

[20]  George J. Klir,et al.  Genetic algorithms for determining fuzzy measures from data , 1998, J. Intell. Fuzzy Syst..