Rectangular Basis Functions Applied to Imbalanced Datasets

Rectangular Basis Functions Networks (RecBFN) come from RBF Networks, and are composed by a set of Fuzzy Points which describe the network. In this paper, a set of characteristics of the RecBF are proposed to be used in imbalanced datasets, especially the order of the training patterns. We will demonstrate that it is an important factor to improve the generalization of the solution, which is the main problem in imbalanced datasets. Finally, this solution is compared with other important methods to work with imbalanced datasets, showing our method works well with this type of datasets and that an understandable set of rules can be extracted.

[1]  Dino Pedreschi,et al.  Machine Learning: ECML 2004 , 2004, Lecture Notes in Computer Science.

[2]  Herna L. Viktor,et al.  Learning from imbalanced data sets with boosting and data generation: the DataBoost-IM approach , 2004, SKDD.

[3]  Nathalie Japkowicz,et al.  The class imbalance problem: A systematic study , 2002, Intell. Data Anal..

[4]  Alberto Prieto,et al.  Computational intelligence and bioinspired systems , 2007, Neurocomputing.

[5]  José Salvador Sánchez,et al.  Strategies for learning in class imbalance problems , 2003, Pattern Recognit..

[6]  Michael R. Berthold,et al.  Constructing fuzzy graphs from examples , 1999, Intell. Data Anal..

[7]  Jesús Cerquides,et al.  Imbalanced Datasets Classification by Fuzzy Rule Extraction and Genetic Algorithms , 2006, Sixth IEEE International Conference on Data Mining - Workshops (ICDMW'06).

[8]  Panayiotis E. Pintelas,et al.  Mixture of Expert Agents for Handling Imbalanced Data Sets , 2003 .

[9]  Stephen Kwek,et al.  Applying Support Vector Machines to Imbalanced Datasets , 2004, ECML.

[10]  Alberto Sanfeliu,et al.  Progress in Pattern Recognition, Speech and Image Analysis , 2003, Lecture Notes in Computer Science.

[11]  Stan Matwin,et al.  Addressing the Curse of Imbalanced Training Sets: One-Sided Selection , 1997, ICML.

[12]  Marta Prim,et al.  Fuzzy Rule Extraction Using Recombined RecBF for Very-Imbalanced Datasets , 2005, ICANN.

[13]  José Salvador Sánchez,et al.  Restricted Decontamination for the Imbalanced Training Sample Problem , 2003, CIARP.

[14]  José R. Dorronsoro,et al.  Discriminant Parallel Perceptrons , 2005, ICANN.

[15]  Edward Y. Chang,et al.  KBA: kernel boundary alignment considering imbalanced data distribution , 2005, IEEE Transactions on Knowledge and Data Engineering.

[16]  José R. Dorronsoro,et al.  Balanced Boosting with Parallel Perceptrons , 2005, IWANN.