Fuzzy min–max neural networks for categorical data: application to missing data imputation

The fuzzy min–max neural network classifier is a supervised learning method that combines neural networks with fuzzy systems. Every input variable in the network must be continuously valued, which is a significant constraint in the many real-world situations where the data are categorical as well as quantitative. The usual way of dealing with a categorical variable is to replace its categories with numerical values and treat them as if they were continuously valued. This implicitly defines a metric on the categories, and that metric may be unsuitable. A number of different procedures have been proposed to tackle the problem. In this article, we present a new method that extends the input of the fuzzy min–max neural network to categorical variables by introducing new fuzzy sets, a new operation, and a new architecture. This gives the network greater flexibility and a wider range of application. The proposed method is then applied to missing data imputation in voting intention polls. The micro data of this type of poll (the set of the respondents' individual answers to the questions) are especially suited to evaluating the method, since they include a large number of numerical and categorical attributes.
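The point about the implicit metric can be made concrete with a minimal sketch. It is not taken from the article: the category names and the helper functions `integer_codes` and `pairwise_distances` are hypothetical and chosen only for illustration. The sketch shows that replacing categories with integers makes some pairs of categories "closer" than others, and that these distances change with the arbitrary order in which the codes are assigned.

```python
from itertools import combinations

# Hypothetical categorical attribute (names are illustrative only).
parties = ["party_A", "party_B", "party_C"]


def integer_codes(categories):
    """Assign consecutive integers to categories in the order given."""
    return {category: index for index, category in enumerate(categories)}


def pairwise_distances(codes):
    """Absolute difference between the integer codes of each pair."""
    return {(a, b): abs(codes[a] - codes[b]) for a, b in combinations(codes, 2)}


# Coding in the order A, B, C places A and C twice as far apart as A and B.
distances = pairwise_distances(integer_codes(parties))

# Coding the same categories in the order B, A, C changes the distances,
# although nothing about the categories themselves has changed.
reordered = pairwise_distances(integer_codes(["party_B", "party_A", "party_C"]))

for pair, distance in distances.items():
    print(pair, distance)
```

Under the first coding, `party_A` and `party_C` are at distance 2; under the second, the same pair is at distance 1. A classifier that treats the codes as continuous values inherits whichever ordering happened to be chosen.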
